<?xml version="1.0"?>
<document >
<properties >
<author email= "carlson@apache.org" >
Peter Carlson
</author>
<title >
Contributions - Jakarta Lucene
</title>
</properties>
<body >
<section name= "Overview" >
<p > This page lists external resources for Lucene. If you have written something that should be included, please post all relevant information to one of the mailing lists. Nothing listed here is directly supported by the Lucene developers; if you encounter problems with a contribution, please use the contact information listed for it. </p>
</section>
<section name= "Lucene Documents" >
<p >
Lucene requires the information you want to index to be converted into a Document object. The contributions below provide Document classes for various formats.
</p>
<subsection name= "RTF->XML->Lucene" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.tetrasix.com/" >
http://www.tetrasix.com/
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
N/A
</td>
</tr>
</table>
</subsection>
<subsection name= "XML Document #1" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://marc.theaimsgroup.com/?l=lucene-dev&amp;m=100723333506246&amp;w=2" >
http://marc.theaimsgroup.com/?l=lucene-dev&amp;m=100723333506246&amp;w=2
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Philip Ogren - ogren@mayo.edu
</td>
</tr>
</table>
</subsection>
<subsection name= "XML Document #2" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg00346.html" >
http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg00346.html
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Peter Carlson - carlson@bookandhammer.com
</td>
</tr>
</table>
</subsection>
<subsection name= "XML Document #3" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.isogen.com/papers/lucene_xml_indexing.html" >
http://www.isogen.com/papers/lucene_xml_indexing.html
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
W. Eliot Kimber - eliot@isogen.com
</td>
</tr>
</table>
</subsection>
<subsection name= "XPDF - PDF Document Conversion" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.foolabs.com/xpdf" >
http://www.foolabs.com/xpdf
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
N/A
</td>
</tr>
</table>
</subsection>
<subsection name= "PJ - PDF Document Conversion" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.etymon.com/pj/" >
http://www.etymon.com/pj/
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
N/A
</td>
</tr>
</table>
</subsection>
<subsection name= "PDF Parser" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.csh.rit.edu/~ben/projects/pdfparser/" >
http://www.csh.rit.edu/~ben/projects/pdfparser/
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Ben Litchfield - ben@csh.rit.edu
</td>
</tr>
</table>
</subsection>
</section>
<section name= "Lucene Analyzers" >
<p >
</p>
<subsection name= "Chinese Analyzer, Tokenizer, Filter" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://marc.theaimsgroup.com/?l=lucene-dev&amp;m=100705753831746&amp;w=2" >
http://marc.theaimsgroup.com/?l=lucene-dev&amp;m=100705753831746&amp;w=2
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Yiyi Sun - yiyisun@yahoo.com
</td>
</tr>
</table>
</subsection>
</section>
<section name= "Misc" >
<p >
</p>
<subsection name= "Term Highlighter" >
<p >
</p>
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.iq-computing.de/lucene/highlight.htm" >
http://www.iq-computing.de/lucene/highlight.htm
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Maik Schreiber - info@iq-computing.de
</td>
</tr>
</table>
</subsection>
<subsection name= "Chainable Filter" >
<p >
</p>
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg01168.html" >
http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg01168.html
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Kelvin Tan - kelvin@relevanz.com
</td>
</tr>
</table>
</subsection>
<subsection name= "Multiple Field Searching" >
<p >
</p>
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg00775.html" >
http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg00775.html
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Kelvin Tan - kelvin@relevanz.com
</td>
</tr>
</table>
</subsection>
<subsection name= "Lucene Tutorial" >
<p >
</p>
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.darksleep.com/puff/lucene/lucene.html" >
http://www.darksleep.com/puff/lucene/lucene.html
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
Steven J. Owens - puff@darksleep.com
</td>
</tr>
</table>
</subsection>
<subsection name= "HTML Syntax Checker and Pretty Printer" >
<p >
</p>
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://lempinen.net/sami/jtidy/" >
http://lempinen.net/sami/jtidy/
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
N/A
</td>
</tr>
</table>
</subsection>
<subsection name= "JavaCC" >
<table >
<tr >
<th >
URL
</th>
<td >
<a href= "http://www.webgain.com/products/java_cc/" >
http://www.webgain.com/products/java_cc/
</a>
</td>
</tr>
<tr >
<th >
author
</th>
<td >
N/A
</td>
</tr>
</table>
</subsection>
</section>
</body>
</document>