...
The search engine is not a default part of the XperienCentral application; it runs in a separate (Java) process, therefore it has to be started by running either a batch file (Windows) or a shell script (Unix). Because the search engine process is executed in a separate process, it needs to have a basic form of authentication. This is configured in the credentials.xml
file. Open credentials.xml
in a text editor. It is stored in the search engine conf
directory:
- For normal GX WebManager XperienCentral installations on Windows computers this is
<xperiencentral-root>\webmanager-searchengine\target\classes\conf
. - For UNIX servers this is normally something like similar to
/vol/www/search/indexer-mywebsite/conf
.
...
The First Site Index
There are two main ways to index the website: manually and automatically. The second is configured by running the search engine at a scheduled time from a cron a Cron job or scheduled service. How to configure this is covered in the installation manuals for Windows and Linux product environments. This chapter will describe how to index a website manually, and also on local computers where the indexing is not scheduled.
...
- To manually index your website, navigate to the Setup Tool. Login in as Administrator and then click the Search Tools tab.
If the search engine is running correctly then the lower part of the page under ‘Server Status’ contains , the number of indexed pages plus some other information appears in the lower part of the page under "Server Status" . Normally when the search engine has been started for the first time than the number of pages will be 0, as starting the search engine does not automatically mean that the indexing is started:Right above the "Search status" information is the "Index URL" pane. This pane contains three fields that tell the search engine
- where to start crawling the website (the "
URL
" field). - which domains are allowed (the "
host
" field). This should contain at least the front-end hostname , and the backend hostname. Optionally this can contain front-end and backend alias hostnames. The Multiple hostnames must be separated by a comma. - how deep the crawler goes. Normally this is set to 1, which means the crawler will index the page in the ‘URL’
URL
field plus all the pages that are linked from the "URL
" page.
By default the "
URL
" field contains the URL of the so-called indexer page. This indexer page contains links to all accessible XperienCentral pages, documents and articles. The URL is normally something likehttp://<domain>:<port>/web/webmanager?id=39016
. To see all the URLs that will be indexed, open this URL in a separate window. - where to start crawling the website (the "
Click on the [Index] button to start the indexing process. The search engine crawler will open the indexer page and start indexing all the pages and documents. This process will take anywhere from several seconds for a small website to several hours for a large website. For a detailed description of the indexing process see *** Crawling.
In the upper part of the ‘Search Tools’ ‘"Search Tools page the search engine can be tested by entering one or more search terms and hitting ‘<Enter>’ <Enter> or by clicking on the [Relevance] or [Date] buttons:
In a normal situation this should return results. If there are zero documents in the index, or if there are documents in the index that are not returned in the search field, or if there is a certain document that’s missing, see ***troubleshooting Search Engine Troubleshooting.
...
Search Element and Search Results
...