Sharepoint Thesaurus Editor


SharePoint Kenza is an editing tool for Microsoft® Search Thesaurus files used in SharePoint® 2010, 2007; Microsoft® Search Service 2010; Microsoft Search Server 2010 Express; Microsoft Search Server 2008 Express and SQL iFTS (SQL Server™ Integrated Full Text Search 2008).

The Problem

Without search term expansion, search results will only contain results where the words match exactly those used in user search queries; to overcome this problem SharePoint includes thesaurus files. The thesaurus files are in XML format and normally only system administrators can update, modify, or delete the thesaurus files. Editing the files in Windows Notepad or an XML editor is a slow and error prone process, any errors in the file format or encoding can lead to search working incorrectly; furthermore it is not easy for the work to be spread among several users.

The Solution

Kenza is designed for ease of use and can be run directly on the SharePoint server for use by the SharePoint Administrator or on any PC running Windows 7 or XP for use by non-technical staff. Kenza can create multiple thesaurus XML files and includes a merge feature so that files created by different people can easily be combined. Users can type or copy & paste from other documents or web pages for fast error free production.

Kenza ensures that the file is in the correct format and takes care of empty entries, unwanted white space, punctuation and other special characters automatically. Automatic de-duplication, text cleanup and noise word highlighting assist the user to produce correctly formatted files in record time. 

30 08 2012 Stop Word Highlight

30 08 2012 Select File

Apart from expanding search queries the thesaurus feature in Microsoft® Search can be used to replace user's search queries, where one or more patterns (a word or phrase) in a search query is replaced by one or more word or phrase substitutions.

UTP100 Sharepoint New Dialog 29 03 2012   Kenza lets you easily create and edit Expansion sets and both one-to-many or many-to-one Replacement sets.

It automatically takes care of removing duplicates and white space that may be accidently entered when cutting and pasting text, and ensures that files are saved in the correct Unicode XML format; Kenza also preserves backups of your original thesaurus files for peace of mind.

Kenza comes with example thesaurus XML files, currently these include examples of expansion and replacements sets, and Months and Days in over 20 languages.