About
Resources
Download
|
Lucene Sandbox
|
Lucene project also contains a workspace, Lucene Sandbox, that is open to all Lucene committers, as well
as a few other developers. The purpose of the Sandbox is to host various third party contributions,
and to serve as a place to try out new ideas and prepare them for inclusion into the core Lucene
distribution.
Users are free to experiment with the components developed in the Sandbox, but Sandbox components will
not necessarily be maintained, particularly in their current state.
You can access the Lucene Sandbox repository at
http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/.
WordNet/Synonyms
|
The Lucene WordNet code consists of a single class which parses a prolog file
from the WordNet site that contains a list of English words and synonyms.
The class builds a Lucene index from the synonyms file. Your querying code could
hit this index to build up a set of synonyms for the terms in the
search query.
More information on the Lucene WordNet package.
WordNet is an online database of English language words that contains
synonyms, definitions, and various relationships between synonym sets.
The
repository for the WordNet module.
|
|
High Frequency Terms
|
The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines
what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go
into a custom stop word list for better search results.
The
repository for miscellaneous classes.
|
|
|
|
|