- Added information about other Lucene Sandbox contributions.

Submitted by: Jell Linwood


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@149944 13f79535-47bb-0310-9956-ffa450edef68
This commit is contained in:
Otis Gospodnetic 2003-01-31 19:42:30 +00:00
parent 552bfaf4b1
commit a85ab1febc
2 changed files with 122 additions and 24 deletions

View File

@ -236,18 +236,26 @@ which is a language for stemmers developed by Martin Porter.
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#828DA6">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Ant"><strong>Ant</strong></a>
<a name="Lucene Service for Fulcrum"><strong>Lucene Service for Fulcrum</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also
contains an example HTML parser that uses JTidy.
Lucene can be run as a service inside <a href="http://jakarta.apache.org/turbine/fulcrum/index.html">Fulcrum</a>,
which is the services framework from the
<a href="http://jakarta.apache.org/turbine/">Turbine</a> project.</p>
<p>
The implementation consists of a SearchService interface, a LuceneSearchSearchService implementation, and a
SearchResults object that gets an array of Document objects from a Hits object. Calls to the search methods on
the service return the SearchResults object.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The
CVS repository for the Ant contribution.</a>
The service supports querying, but does not support indexing.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/fulcrum/">
CVS repository for the Fulcrum Service.</a>
</p>
</blockquote>
</td></tr>
@ -256,20 +264,70 @@ CVS repository for the Ant contribution.</a>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#828DA6">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="SearchBean"><strong>SearchBean</strong></a>
<a name="WordNet/Synonyms"><strong>WordNet/Synonyms</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
SearchBean is a UI component that can be used to browse through the results of a Lucene search.
The SearchBean searches the index for a given query string, retrieves the hits, and then brings
them into the HitsIterator class, which can be used for paging and sorting through search results.
The Lucene WordNet code consists of a single class which parses a prolog file
from the WordNet site that contains a list of English words and synonyms.
The class builds a Lucene index from the synonyms file. Your querying code could
hit this index to build up a set of synonyms for the terms in the
search query.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchBean/">The
CVS repository for the SearchBean contribution.</a>
More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>.
<a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains
synonyms, definitions, and various relationships between synonym sets.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/">
CVS for the WordNet module.</a>
</p>
</blockquote>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#828DA6">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="SAX/DOM XML Indexing demo"><strong>SAX/DOM XML Indexing demo</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates
a new Document object for each file, and then populates the Document with a Field for each XML element, recursively.
There are examples included for both SAX and DOM.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/">
CVS for the XML Indexing Demo.</a>
</p>
</blockquote>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#828DA6">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="High Frequency Terms"><strong>High Frequency Terms</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines
what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go
into a custom stop word list for better search results.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/">
CVS for miscellaneous classes.</a>
</p>
</blockquote>
</td></tr>

View File

@ -84,27 +84,67 @@ which is a language for stemmers developed by Martin Porter.
</subsection>
<subsection name="Ant">
<subsection name="Lucene Service for Fulcrum">
<p>
The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also
contains an example HTML parser that uses JTidy.
Lucene can be run as a service inside <a href="http://jakarta.apache.org/turbine/fulcrum/index.html">Fulcrum</a>,
which is the services framework from the
<a href="http://jakarta.apache.org/turbine/">Turbine</a> project.</p>
<p>
The implementation consists of a SearchService interface, a LuceneSearchSearchService implementation, and a
SearchResults object that gets an array of Document objects from a Hits object. Calls to the search methods on
the service return the SearchResults object.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The
CVS repository for the Ant contribution.</a>
The service supports querying, but does not support indexing.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/fulcrum/">
CVS repository for the Fulcrum Service.</a>
</p>
</subsection>
<subsection name="SearchBean">
<subsection name="WordNet/Synonyms">
<p>
SearchBean is a UI component that can be used to browse through the results of a Lucene search.
The SearchBean searches the index for a given query string, retrieves the hits, and then brings
them into the HitsIterator class, which can be used for paging and sorting through search results.
The Lucene WordNet code consists of a single class which parses a prolog file
from the WordNet site that contains a list of English words and synonyms.
The class builds a Lucene index from the synonyms file. Your querying code could
hit this index to build up a set of synonyms for the terms in the
search query.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchBean/">The
CVS repository for the SearchBean contribution.</a>
More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>.
<a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains
synonyms, definitions, and various relationships between synonym sets.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/">
CVS for the WordNet module.</a>
</p>
</subsection>
<subsection name="SAX/DOM XML Indexing demo">
<p>
This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates
a new Document object for each file, and then populates the Document with a Field for each XML element, recursively.
There are examples included for both SAX and DOM.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/">
CVS for the XML Indexing Demo.</a>
</p>
</subsection>
<subsection name="High Frequency Terms">
<p>
The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines
what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go
into a custom stop word list for better search results.
</p>
<p>
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/">
CVS for miscellaneous classes.</a>
</p>
</subsection>