Archive: SearchingUsing Spelling SuggestionsBy Yusuf Mohsinally, Your users have probably misspelled a search query on more than one occassion. It shouldn't matter. For example, an employee at a semiconductor company who types in "heaflouroethane" into an intranet search box should be asked if he meant to search for "hexafluoroethane." Similarly, a user in the company's American manufacturing facility who searches for the same compound's "vapor pressure" should be asked if she also wanted to search for the term "vapour pressure", which is found in documents written at the company's R&D lab in the United Kingdom. There's little doubt that spelling suggestions can be enormously helpful to users on your public website or intranet. Ultraseek's spelling suggestions use dynamic context-sensitive algorithms that are based on documents that have been indexed. This means users receive alternate spellings that occur within the document base, which helps if a particular word has been misspelled within the documents themselves. But, to avoid flooding a user with unhelpful suggestions, the suggested term must appear in at least three indexed documents. If you would like to suggest known alternate terms that may be related to the user's search, you can use the thesaurus feature. For example, you may want to suggest the search term "Ultraseek" when a user searches for "Verity". Or, use the QuickLinks feature to provide a link to the "best known" document for a particular search term. Activating/Deactivating Spelling Suggestions To activate (active by default) the spelling suggestion feature for a particular user interface style, select the "Show Spelling Suggestions" checkbox found under the Interface > Query tab in the admin interface. Posted August 25, 2005 by editor Thesaurus versus User DictionaryBy Ryan Weisenberger Manager, Software Development You can also use the <noshow> element if you do not want the term suggested as an alternative, but want it to display the other terms if searched for. After making these changes you'll need to restart Ultraseek. Stemming and the User Dictionary The user dictionary, on the other hand, is used to tell the indexer that two words should be treated the same. While this sounds a lot like synonyms, there is a subtle difference. The user dictionary makes the connection between the terms invisible to the user, so you are in essence tricking them. They think they are searching for one term, but you give them the results from another. This can be a little confusing, and it should not be used instead of the thesaurus. So when should you use the user dictionary? The user dictionary is meant to supplement the linguistic capabilities of Ultraseek. For example, the search engine can automatically map a plural word to its singular form. This way, a search for "geese" will match documents that contain "goose." You may have a word on your site that can be plural, but does not appear in a standard dictionary. A good example of this is a product name. In that case, you can enter the plural form of the word in the user dictionary, along with its singular counterpart, so that Ultraseek knows to treat these words as the same. To add a word to the English user dictionary, edit the en.usr in the /language directory. The correct format is WORD,ROOT:w. The entry below maps the plural of webserver to the correct singular form. webservers,webserver:w After making this change, you must restart Ultraseek, and reindex your content, before you will see it fully take effect. Posted August 02, 2005 by editor Relevance and User SatisfactionSearch relevance is usually thought of as a statistic that measures whether the search results match the query. That is useful in the lab, but not as useful for a search installation. When search is part of a site, we need to understand how it helps the users of that site. Can they find things quickly? Are they comfortable with the search? Focusing on user satisfaction helps avoid manager centered design, but you also need to know how the search engine helps your users. There are two main aspects of this: effectiveness and trust. You change different things to improve each of these. In order to improve relevance, you must be very clear about what it is, and what it means to make it better. You might end up tweaking the engine, changing what content is indexed, adding editorial results (“Best Bets” or “Quick Links”), or changing the presentation. I look at relevance two ways. UI Effectiveness: Relevant results reduce the number of clicks before visitors reach their goal. With every click, you lose visitors, maybe as many as 10%. Relevant results at the top mean fewer clicks. Ultraseek can measure the number of clicks per result page and report that. Fewer clicks is better, though zero clicks is not good, because it means the visitor left without visiting any results. To put specific results at the top, use Quick Links. But make sure this is based on user behavior, not on the org chart or datasheets. Quick Links must be more relevant than the first result. Transparency and Trust: When users have some clue about why the results are presented, they trust the engine more. This is a transparency issue, and I think it is the biggest advantage of passage-based summaries. The passages are the engine explaining, ‘this is why I’m showing you this document.’ It makes a huge difference in how comfortable visitors are. Relevance also increases trust. Irrelevant Quick Links will decrease trust, so be careful. By Walter Underwood Principal Software Architect Posted June 22, 2005 by editor |
CategoriesArchivesRecent EntriesThesaurus versus User Dictionary Relevance and User Satisfaction Related ForumHow do I remove the Highlight link under the Find Similar link. How can I obtain the value for number of search results (hits)? Limiting number of results displayed on results page Resources |