Home

Getting Started

Utilities

Indexing

Omnidex

Development

Tutorials

Quick Links

 

OMNIDEX

Omnidex Text

Filtering the Thesaurus

WordNet License

 

Managed Synonyms

Synonym Searches

$CONTAINS

 

Omnidex Text

Thesaurus

A thesaurus search looks for synonyms based on the English language. A thesaurus search is broad and flexible and good for general searches where synonyms of general speech words should be used as criteria.

For example, when searches for records with references to a home or house, a thesaurus search could find records containing house, home, domicile, dwelling, abode, habitation, villa, etc..., with only a single criteria keyword.

Thesaurus searches are good for broad textual searches, but are not appropriate for more specific situations. A thesaurus will not contain most proper names, brand names, abbreviations and jargon. Use Managed Synonyms for these scenarios.

 

Filtering the Thesaurus

The thesaurus used in Omnidex Text contains a large number of words in common English usage. This includes words of various dialects, words found more commonly in British English or Canadian English, idiomatic phrases and common slang. The latter category includes common and uncommon swear words as well as other scatological language.

Some applications may require unfiltered searches while others might require the filtering of such words, insuring that these words do not appear as a result of a synonym search, misspelling search or suggestion.

A file named filter.exc, located in $OMNIDEX_HOME/config/english/thesaurus, contains some of the most common, objectionable words that are filtered by default. You can edit this file, adding or removing words, to suit your application needs. The contents are structured as one word or phrase per line with spaces represented by underscores.

If upgrading Omnidex to a new version, be sure to make a copy of this file before installing the new version, to avoid losing your changes when the file is overwritten.

 

WordNet® License

The following license agreement is found on the WordNet website (http://www.cogsci.princeton.edu/~wn/license.shtml):

Commercial Use of WordNet

WordNet® is unencumbered, and may be used in commercial applications in accordance with the following license agreement. An attorney representing the commercial interest should review this WordNet license with respect to the intended use.


--------------------------------------------------------------------------------

This software and database is being provided to you, the LICENSEE, by Princeton University under the following license. By obtaining, using and/or copying this software and database, you agree that you have read, understood, and will comply with these terms and conditions.:

Permission to use, copy, modify and distribute this software and database and its documentation for any purpose and without fee or royalty is hereby granted, provided that you agree to comply with the following copyright notice and statements, including the disclaimer, and that the same appear on ALL copies of the software, database and documentation, including modifications that you make for internal use or for distribution.

WordNet 2.0 Copyright © 2003 by Princeton University. All rights reserved.

THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT INFRINGE ANY THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR OTHER RIGHTS.

The name of Princeton University or Princeton may not be used in advertising or publicity pertaining to distribution of the software and/or database. Title to copyright in this software, database and any associated documentation shall at all times remain with Princeton University and LICENSEE agrees to preserve same.

 

Top