<?xml version="1.0"?>
<document>
<properties>
<author>Otis Gospodnetic</author>
<title>Lucene Sandbox</title>
</properties>
<body>
<section name="Lucene Sandbox">
<p>
The Lucene project also contains a workspace, the Lucene Sandbox, that is open to all Lucene committers as well
as to a few other developers. The Sandbox hosts various third-party contributions
and serves as a place to try out new ideas and prepare them for inclusion in the core Lucene
distribution.<br/>
Users are free to experiment with the components developed in the Sandbox, but those components will
not necessarily be maintained, particularly in their current state.
</p>
<p>
You can access the Lucene Sandbox repository at
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/">http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/</a>.
</p>
<subsection name="Snowball Stemmers for Lucene">
<p>
This project provides pre-compiled versions of the Snowball stemmers
for Lucene.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/snowball">The
repository for the Snowball contribution.</a>
</p>
<p>
<a href="http://snowball.tartarus.org/">Background information on Snowball</a>,
which is a language for stemmers developed by Martin Porter.
</p>
</subsection>
<subsection name="Analyzers, Tokenizers, Filters">
<p>
Contributed Analyzers, Tokenizers, and Filters for various languages.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/analyzers/">The
repository for the Analyzers contribution.</a>
</p>
</subsection>
<subsection name="Ant">
<p>
The Ant contribution provides an Ant task that creates a Lucene index from an Ant fileset. It also
contains an example HTML parser that uses JTidy.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/ant/">The
repository for the Ant contribution.</a>
</p>
</subsection>
<subsection name="WordNet/Synonyms">
<p>
The Lucene WordNet code consists of a single class that parses a Prolog file
from the WordNet site containing a list of English words and their synonyms.
The class builds a Lucene index from the synonyms file. Your querying code can
then consult this index to build a set of synonyms for the terms in the
search query.
</p>
<p>
More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>.
<a href="http://wordnet.princeton.edu/">WordNet</a> is an online database of English language words that contains
synonyms, definitions, and various relationships between synonym sets.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/wordnet/">The
repository for the WordNet module.</a>
</p>
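<p>
The synonym lookup described in this subsection can be sketched in plain Java. This is an
illustration only, not the contributed class: it keeps the synonyms in memory instead of in a
Lucene index, the class name <code>SynonymTable</code> and its methods are hypothetical, the
synset numbers in the sample data are made up, and the line shape of the Prolog facts is an
assumption about the WordNet synonyms file.
</p>
<source><![CDATA[
```java
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical in-memory stand-in for a synonym index built from Prolog facts. */
final class SynonymTable {
    // Assumed line shape: s(synset_id,w_num,'word',ss_type,sense_number,tag_count).
    private static final Pattern FACT =
            Pattern.compile("^s\\((\\d+),\\d+,'(.*)',[a-z],\\d+,\\d+\\)\\.$");

    private final Map<String, Set<String>> wordsBySynset = new HashMap<>();
    private final Map<String, Set<String>> synsetsByWord = new HashMap<>();

    SynonymTable(List<String> prologLines) {
        for (String line : prologLines) {
            Matcher m = FACT.matcher(line.trim());
            if (!m.matches()) {
                continue; // skip anything that is not a synonym fact
            }
            String synset = m.group(1);
            // Prolog escapes a quote inside a quoted atom by doubling it.
            String word = m.group(2).replace("''", "'").toLowerCase(Locale.ROOT);
            wordsBySynset.computeIfAbsent(synset, k -> new TreeSet<>()).add(word);
            synsetsByWord.computeIfAbsent(word, k -> new TreeSet<>()).add(synset);
        }
    }

    /** Returns every word that shares a synset with the term, excluding the term itself. */
    Set<String> synonymsOf(String term) {
        String key = term.toLowerCase(Locale.ROOT);
        Set<String> result = new TreeSet<>();
        for (String synset : synsetsByWord.getOrDefault(key, Set.of())) {
            result.addAll(wordsBySynset.get(synset));
        }
        result.remove(key);
        return result;
    }

    public static void main(String[] args) {
        SynonymTable table = new SynonymTable(List.of(
                "s(100000001,1,'car',n,1,0).",
                "s(100000001,2,'auto',n,1,0).",
                "s(100000001,3,'automobile',n,1,0).",
                "s(100000002,1,'car',n,2,0).",
                "s(100000002,2,'railcar',n,1,0)."));
        System.out.println(table.synonymsOf("car")); // prints [auto, automobile, railcar]
    }
}
```
]]></source>
<p>
Querying code could add the returned words to the original query as optional terms. Storing the
same mapping in a Lucene index, as the contributed class does, avoids holding it all in memory.
</p>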
</subsection>
<subsection name="Lucli - Lucene Command-line Interface">
<p>
The Lucli application allows index manipulation from the
command line.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/lucli/">The
repository for the Lucli contribution.</a>
</p>
</subsection>
<subsection name="Term Highlighter">
<p>
A small set of classes for highlighting matching terms in
search results.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/highlighter/">The
repository for the Highlighter contribution.</a>
</p>
</subsection>
<subsection name="Javascript Query Constructor">
<p>
A Javascript library that supports client-side query building. It provides support for a user interface similar to
<a href="http://www.google.com.sg/advanced_search">Google's Advanced Search</a>.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/javascript/queryConstructor/">The
repository for the Javascript Query Constructor files.</a>
</p>
</subsection>
<subsection name="Javascript Query Validator">
<p>
A Javascript library that supports client-side query validation. Lucene does not accept malformed queries and tends to
throw ParseExceptions, which are often difficult to interpret and pass on to the user. This library aims to
alleviate that problem.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/javascript/queryValidator/">The
repository for the Javascript Query Validator files.</a>
</p>
</subsection>
<subsection name="High Frequency Terms">
<p>
The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines
which terms occur most often in a Lucene index. This can help identify terms that may belong
in a custom stop word list for better search results.
</p>
<p>
<a href="http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/miscellaneous/">The
repository for miscellaneous classes.</a>
</p>
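<p>
The idea behind finding high-frequency terms can be sketched in plain Java. This is an
illustration only, not the contributed class: the class name <code>TopTerms</code> and the method
<code>topTerms</code> are hypothetical, and the terms come from an in-memory list, whereas a
real implementation would read term statistics from a Lucene index.
</p>
<source><![CDATA[
```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper that ranks terms by how often they occur. */
final class TopTerms {
    /** Returns the n most frequent terms, most frequent first; ties are broken alphabetically. */
    static List<String> topTerms(List<String> terms, int n) {
        // Count the occurrences of each term.
        Map<String, Integer> counts = new HashMap<>();
        for (String term : terms) {
            counts.merge(term, 1, Integer::sum);
        }
        // Sort by descending count, then by term so the order is deterministic.
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        // Keep only the first n terms.
        List<String> result = new ArrayList<>();
        for (int i = 0; i < Math.min(n, entries.size()); i++) {
            result.add(entries.get(i).getKey());
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> terms = List.of("the", "quick", "the", "fox", "the", "quick");
        System.out.println(topTerms(terms, 2)); // prints [the, quick]
    }
}
```
]]></source>
<p>
Terms that dominate such a ranking without carrying much meaning are natural candidates for a
custom stop word list.
</p>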
</subsection>
</section>
</body>
</document>