mirror of https://github.com/apache/lucene.git
- Added information about other Lucene Sandbox contributions.
Submitted by: Jell Linwood git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@149944 13f79535-47bb-0310-9956-ffa450edef68
This commit is contained in:
parent
552bfaf4b1
commit
a85ab1febc
|
@ -236,18 +236,26 @@ which is a language for stemmers developed by Martin Porter.
|
|||
<table border="0" cellspacing="0" cellpadding="2" width="100%">
|
||||
<tr><td bgcolor="#828DA6">
|
||||
<font color="#ffffff" face="arial,helvetica,sanserif">
|
||||
<a name="Ant"><strong>Ant</strong></a>
|
||||
<a name="Lucene Service for Fulcrum"><strong>Lucene Service for Fulcrum</strong></a>
|
||||
</font>
|
||||
</td></tr>
|
||||
<tr><td>
|
||||
<blockquote>
|
||||
<p>
|
||||
The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also
|
||||
contains an example HTML parser that uses JTidy.
|
||||
Lucene can be run as a service inside <a href="http://jakarta.apache.org/turbine/fulcrum/index.html">Fulcrum</a>,
|
||||
which is the services framework from the
|
||||
<a href="http://jakarta.apache.org/turbine/">Turbine</a> project.</p>
|
||||
<p>
|
||||
The implementation consists of a SearchService interface, a LuceneSearchSearchService implementation, and a
|
||||
SearchResults object that gets an array of Document objects from a Hits object. Calls to the search methods on
|
||||
the service return the SearchResults object.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The
|
||||
CVS repository for the Ant contribution.</a>
|
||||
The service supports querying, but does not support indexing.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/fulcrum/">
|
||||
CVS repository for the Fulcrum Service.</a>
|
||||
</p>
|
||||
</blockquote>
|
||||
</td></tr>
|
||||
|
@ -256,20 +264,70 @@ CVS repository for the Ant contribution.</a>
|
|||
<table border="0" cellspacing="0" cellpadding="2" width="100%">
|
||||
<tr><td bgcolor="#828DA6">
|
||||
<font color="#ffffff" face="arial,helvetica,sanserif">
|
||||
<a name="SearchBean"><strong>SearchBean</strong></a>
|
||||
<a name="WordNet/Synonyms"><strong>WordNet/Synonyms</strong></a>
|
||||
</font>
|
||||
</td></tr>
|
||||
<tr><td>
|
||||
<blockquote>
|
||||
<p>
|
||||
SearchBean is a UI component that can be used to browse through the results of a Lucene search.
|
||||
The SearchBean searches the index for a given query string, retrieves the hits, and then brings
|
||||
them into the HitsIterator class, which can be used for paging and sorting through search results.
|
||||
|
||||
The Lucene WordNet code consists of a single class which parses a prolog file
|
||||
from the WordNet site that contains a list of English words and synonyms.
|
||||
The class builds a Lucene index from the synonyms file. Your querying code could
|
||||
hit this index to build up a set of synonyms for the terms in the
|
||||
search query.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchBean/">The
|
||||
CVS repository for the SearchBean contribution.</a>
|
||||
More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>.
|
||||
<a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains
|
||||
synonyms, definitions, and various relationships between synonym sets.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/">
|
||||
CVS for the WordNet module.</a>
|
||||
</p>
|
||||
</blockquote>
|
||||
</td></tr>
|
||||
<tr><td><br/></td></tr>
|
||||
</table>
|
||||
<table border="0" cellspacing="0" cellpadding="2" width="100%">
|
||||
<tr><td bgcolor="#828DA6">
|
||||
<font color="#ffffff" face="arial,helvetica,sanserif">
|
||||
<a name="SAX/DOM XML Indexing demo"><strong>SAX/DOM XML Indexing demo</strong></a>
|
||||
</font>
|
||||
</td></tr>
|
||||
<tr><td>
|
||||
<blockquote>
|
||||
<p>
|
||||
This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates
|
||||
a new Document object for each file, and then populates the Document with a Field for each XML element, recursively.
|
||||
There are examples included for both SAX and DOM.
|
||||
</p>
|
||||
<p>
|
||||
|
||||
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/">
|
||||
CVS for the XML Indexing Demo.</a>
|
||||
</p>
|
||||
</blockquote>
|
||||
</td></tr>
|
||||
<tr><td><br/></td></tr>
|
||||
</table>
|
||||
<table border="0" cellspacing="0" cellpadding="2" width="100%">
|
||||
<tr><td bgcolor="#828DA6">
|
||||
<font color="#ffffff" face="arial,helvetica,sanserif">
|
||||
<a name="High Frequency Terms"><strong>High Frequency Terms</strong></a>
|
||||
</font>
|
||||
</td></tr>
|
||||
<tr><td>
|
||||
<blockquote>
|
||||
<p>
|
||||
The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines
|
||||
what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go
|
||||
into a custom stop word list for better search results.
|
||||
</p>
|
||||
<p>
|
||||
|
||||
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/">
|
||||
CVS for miscellaneous classes.</a>
|
||||
</p>
|
||||
</blockquote>
|
||||
</td></tr>
|
||||
|
|
|
@ -84,27 +84,67 @@ which is a language for stemmers developed by Martin Porter.
|
|||
|
||||
</subsection>
|
||||
|
||||
<subsection name="Ant">
|
||||
<subsection name="Lucene Service for Fulcrum">
|
||||
<p>
|
||||
The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also
|
||||
contains an example HTML parser that uses JTidy.
|
||||
Lucene can be run as a service inside <a href="http://jakarta.apache.org/turbine/fulcrum/index.html">Fulcrum</a>,
|
||||
which is the services framework from the
|
||||
<a href="http://jakarta.apache.org/turbine/">Turbine</a> project.</p>
|
||||
<p>
|
||||
The implementation consists of a SearchService interface, a LuceneSearchSearchService implementation, and a
|
||||
SearchResults object that gets an array of Document objects from a Hits object. Calls to the search methods on
|
||||
the service return the SearchResults object.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The
|
||||
CVS repository for the Ant contribution.</a>
|
||||
The service supports querying, but does not support indexing.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/fulcrum/">
|
||||
CVS repository for the Fulcrum Service.</a>
|
||||
</p>
|
||||
</subsection>
|
||||
|
||||
<subsection name="SearchBean">
|
||||
<subsection name="WordNet/Synonyms">
|
||||
<p>
|
||||
SearchBean is a UI component that can be used to browse through the results of a Lucene search.
|
||||
The SearchBean searches the index for a given query string, retrieves the hits, and then brings
|
||||
them into the HitsIterator class, which can be used for paging and sorting through search results.
|
||||
|
||||
The Lucene WordNet code consists of a single class which parses a prolog file
|
||||
from the WordNet site that contains a list of English words and synonyms.
|
||||
The class builds a Lucene index from the synonyms file. Your querying code could
|
||||
hit this index to build up a set of synonyms for the terms in the
|
||||
search query.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchBean/">The
|
||||
CVS repository for the SearchBean contribution.</a>
|
||||
More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>.
|
||||
<a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains
|
||||
synonyms, definitions, and various relationships between synonym sets.
|
||||
</p>
|
||||
<p>
|
||||
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/">
|
||||
CVS for the WordNet module.</a>
|
||||
</p>
|
||||
</subsection>
|
||||
|
||||
<subsection name="SAX/DOM XML Indexing demo">
|
||||
<p>
|
||||
This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates
|
||||
a new Document object for each file, and then populates the Document with a Field for each XML element, recursively.
|
||||
There are examples included for both SAX and DOM.
|
||||
</p>
|
||||
<p>
|
||||
|
||||
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/">
|
||||
CVS for the XML Indexing Demo.</a>
|
||||
</p>
|
||||
</subsection>
|
||||
|
||||
<subsection name="High Frequency Terms">
|
||||
<p>
|
||||
The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines
|
||||
what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go
|
||||
into a custom stop word list for better search results.
|
||||
</p>
|
||||
<p>
|
||||
|
||||
<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/">
|
||||
CVS for miscellaneous classes.</a>
|
||||
</p>
|
||||
</subsection>
|
||||
|
||||
|
|
Loading…
Reference in New Issue