Initial revision

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@149565 13f79535-47bb-0310-9956-ffa450edef68
This commit is contained in:
Sam Ruby 2001-09-11 21:44:36 +00:00
parent a0e7ee9d0d
commit 5a2615650e
30 changed files with 1910 additions and 0 deletions

174
Attic/api/overview.html Normal file
View File

@ -0,0 +1,174 @@
<!doctype html public "-//w3c//dtd html 4.0 transitional//en">
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<meta name="Author" content="Doug Cutting">
<meta name="Author" content="Ted Husted">
<meta name="GENERATOR" content="Mozilla/4.72 [en] (Win98; U) [Netscape]">
<title>Jakarta Lucene API Documentation</title>
</head>
<body>
<h1>Jakarta Lucene API Documentation</h1>
The <a href="http://jakarta.apache.org/lucene">Jakarta Lucene</a> API is divided into several
packages:
<ul>
<li>
<b><a href="org/apache/lucene/util/package-summary.html">com.lucene.util</a></b>
contains a few handy data structures, e.g., <a href="org/apache/lucene/util/BitVector.html">BitVector</a>
and <a href="org/apache/lucene/util/PriorityQueue.html">PriorityQueue</a>.</li>
<li>
<b><a href="org/apache/lucene/store/package-summary.html">com.lucene.store</a></b>
defines an abstract class for storing persistent data, the <a href="org/apache/lucene/store/Directory.html">Directory</a>,
a collection of named files written by an <a href="org/apache/lucene/store/OutputStream.html">OutputStream</a>
and read by an <a href="org/apache/lucene/store/InputStream.html">InputStream</a>.&nbsp;
Two implementations are provided, <a href="org/apache/lucene/store/FSDirectory.html">FSDirectory</a>,
which uses a file system directory to store files, and <a href="org/apache/lucene/store/RAMDirectory.html">RAMDirectory</a>
which implements files as memory-resident data structures.</li>
<li>
<b><a href="org/apache/lucene/document/package-summary.html">com.lucene.document</a></b>
provides a simple <a href="org/apache/lucene/document/Document.html">Document</a>
class.&nbsp; A document is simply a set of named <a href="org/apache/lucene/document/Field.html">Field</a>'s,
whose values may be strings or instances of <a href="http://java.sun.com/products/jdk/1.2/docs/api/java/io/Reader.html">java.io.Reader</a>.</li>
<li>
<b><a href="org/apache/lucene/analysis/package-summary.html">com.lucene.analysis</a></b>
defines an abstract <a href="org/apache/lucene/analysis/Analyzer.html">Analyzer</a>
API for converting text from a <a href="http://java.sun.com/products/jdk/1.2/docs/api/java/io/Reader.html">java.io.Reader</a>
into a <a href="org/apache/lucene/analysis/TokenStream.html">TokenStream</a>,
an enumeration of&nbsp; <a href="org/apache/lucene/analysis/Token.html">Token</a>'s.&nbsp;
A TokenStream is composed by applying <a href="org/apache/lucene/analysis/TokenFilter.html">TokenFilter</a>'s
to the output of a <a href="org/apache/lucene/analysis/Tokenizer.html">Tokenizer</a>.&nbsp;
A few simple implemenations are provided, including <a href="org/apache/lucene/analysis/StopAnalyzer.html">StopAnalyzer</a>
and the grammar-based <a href="org/apache/lucene/analysis/standard/StandardAnalyzer.html">StandardAnalyzer</a>.</li>
<li>
<b><a href="org/apache/lucene/index/package-summary.html">com.lucene.index</a></b>
provides two primary classes: <a href="org/apache/lucene/index/IndexWriter.html">IndexWriter</a>,
which creates and adds documents to indices; and <a href="org/apache/lucene/index/IndexReader.html">IndexReader</a>,
which accesses the data in the index.</li>
<li>
<b><a href="org/apache/lucene/search/package-summary.html">com.lucene.search</a></b>
provides data structures to represent queries (<a href="org/apache/lucene/search/TermQuery.html">TermQuery</a>
for individual words, <a href="org/apache/lucene/search/PhraseQuery.html">PhraseQuery</a>
for phrases, and <a href="org/apache/lucene/search/BooleanQuery.html">BooleanQuery</a>
for boolean combinations of queries) and the abstract <a href="org/apache/lucene/search/Searcher.html">Searcher</a>
which turns queries into <a href="org/apache/lucene/search/Hits.html">Hits</a>.
<a href="org/apache/lucene/search/IndexSearcher.html">IndexSearcher</a>
implements search over a single IndexReader.</li>
<li>
<b><a href="org/apache/lucene/queryParser/package-summary.html">com.lucene.queryParser</a></b>
uses <a href="http://www.suntest.com/JavaCC/">JavaCC</a> to implement a
<a href="org/apache/lucene/queryParser/QueryParser.html">QueryParser</a>.</li>
</ul>
To use Lucene, an application should:
<ol>
<li>
Create <a href="org/apache/lucene/document/Document.html">Document</a>'s by
adding
<a href="org/apache/lucene/document/Field.html">Field</a>'s.</li>
<li>
Create an <a href="org/apache/lucene/index/IndexWriter.html">IndexWriter</a>
and add documents to to it with <a href="org/apache/lucene/index/IndexWriter.html#addDocument(com.lucene.document.Document)">addDocument()</a>;</li>
<li>
Call <a href="org/apache/lucene/queryParser/QueryParser.html#parse(java.lang.String)">QueryParser.parse()</a>
to build a query from a string; and</li>
<li>
Create an <a href="org/apache/lucene/search/IndexSearcher.html">IndexSearcher</a>
and pass the query to it's <a href="org/apache/lucene/search/Searcher.html#search(com.lucene.search.Query)">search()</a>
method.</li>
</ol>
Some simple examples of code which does this are:
<ul>
<li>
&nbsp;<a href="../demo/FileDocument.java">FileDocument.java</a> contains
code to create a Document for a file.</li>
<li>
&nbsp;<a href="../demo/IndexFiles.java">IndexFiles.java</a> creates an
index for all the files contained in a directory.</li>
<li>
&nbsp;<a href="../demo/DeleteFiles.java">DeleteFiles.java</a> deletes some
of these files from the index.</li>
<li>
&nbsp;<a href="../demo/SearchFiles.java">SearchFiles.java</a> prompts for
queries and searches an index.</li>
</ul>
To demonstrate these, try:
<blockquote><tt>F:\> <b>java demo.IndexFiles rec.food.recipes\soups</b></tt>
<br><tt>adding rec.food.recipes\soups\abalone-chowder</tt>
<br><tt>&nbsp; </tt>[ ... ]
<p><tt>F:\> <b>java demo.SearchFiles</b></tt>
<br><tt>Query: <b>chowder</b></tt>
<br><tt>Searching for: chowder</tt>
<br><tt>34 total matching documents</tt>
<br><tt>0. rec.food.recipes\soups\spam-chowder</tt>
<br><tt>&nbsp; </tt>[ ... thirty-four documents contain the word "chowder",
"spam-chowder" with the greatest density.]
<p><tt>Query: <b>path:chowder</b></tt>
<br><tt>Searching for: path:chowder</tt>
<br><tt>31 total matching documents</tt>
<br><tt>0. rec.food.recipes\soups\abalone-chowder</tt>
<br><tt>&nbsp; </tt>[ ... only thrity-one have "chowder" in the "path"
field. ]
<p><tt>Query: <b>path:"clam chowder"</b></tt>
<br><tt>Searching for: path:"clam chowder"</tt>
<br><tt>10 total matching documents</tt>
<br><tt>0. rec.food.recipes\soups\clam-chowder</tt>
<br><tt>&nbsp; </tt>[ ... only ten have "clam chowder" in the "path" field.
]
<p><tt>Query: <b>path:"clam chowder" AND manhattan</b></tt>
<br><tt>Searching for: +path:"clam chowder" +manhattan</tt>
<br><tt>2 total matching documents</tt>
<br><tt>0. rec.food.recipes\soups\clam-chowder</tt>
<br><tt>&nbsp; </tt>[ ... only two also have "manhattan" in the contents.
]
<br>&nbsp;&nbsp;&nbsp; [ Note: "+" and "-" are canonical, but "AND", "OR"
and "NOT" may be used. ]</blockquote>
The <a href="../demo/IndexHTML.java">IndexHtml</a> demo is more sophisticated.&nbsp;
It incrementally maintains an index of HTML files, adding new files as
they appear, deleting old files as they disappear and re-indexing files
as they change.
<blockquote><tt>F:\><b>java demo.IndexHTML -create java\jdk1.1.6\docs\relnotes</b></tt>
<br><tt>adding java/jdk1.1.6/docs/relnotes/SMICopyright.html</tt>
<br><tt>&nbsp; </tt>[ ... create an index containing all the relnotes ]
<p><tt>F:\><b>del java\jdk1.1.6\docs\relnotes\smicopyright.html</b></tt>
<p><tt>F:\><b>java demo.IndexHTML java\jdk1.1.6\docs\relnotes</b></tt>
<br><tt>deleting java/jdk1.1.6/docs/relnotes/SMICopyright.html</tt></blockquote>
HTML indexes are searched using SUN's <a href="http://jserv.javasoft.com/products/webserver/index.html">JavaWebServer</a>
(JWS) and <a href="../demo/Search.jhtml">Search.jhtml</a>.&nbsp; To use
this:
<ul>
<li>
copy <tt>Search.html</tt> and <tt>Search.jhtml</tt> to JWS's <tt>public_html</tt>
directory;</li>
<li>
copy lucene.jar to JWS's lib directory;</li>
<li>
create and maintain your indexes with demo.IndexHTML in JWS's top-level
directory;</li>
<li>
launch JWS, with the <tt>demo</tt> directory on CLASSPATH (only one class
is actually needed);</li>
<li>
visit <a href="../demo/Search.html">Search.html</a>.</li>
</ul>
Note that indexes can be updated while searches are going on.&nbsp; <tt>Search.jhtml</tt>
will re-open the index when it is updated so that the latest version is
immediately available.
<br>&nbsp;
</body>
</html>

9
build.bat Executable file
View File

@ -0,0 +1,9 @@
@echo off
for %%i in (.\lib\*.jar) do call cpappend.bat %%i
echo CLASSPATH="%_CP%"
java -classpath "%_CP%" org.apache.tools.ant.Main -Dant.home=%_AH% %1 %2 %3
SET _CP=

36
build.sh Executable file
View File

@ -0,0 +1,36 @@
#!/bin/sh
if [ "$JAVA_HOME" = "" ] ; then
echo You must set JAVA_HOME to point at your Java Development Kit directory
exit 1
fi
# convert the existing path to unix
if [ "$OSTYPE" = "cygwin32" ] || [ "$OSTYPE" = "cygwin" ] ; then
CLASSPATH=`cygpath --path --unix "$CLASSPATH"`
fi
# Add in your .jar files first
for i in ./lib/*.jar
do
CLASSPATH=$CLASSPATH:"$i"
done
# Add in the jakarta-site2 library files
for i in ../jakarta-site2/lib/*.jar
do
CLASSPATH=$CLASSPATH:"$i"
done
# convert the unix path to windows
if [ "$OSTYPE" = "cygwin32" ] || [ "$OSTYPE" = "cygwin" ] ; then
CLASSPATH=`cygpath --path --windows "$CLASSPATH"`
fi
BUILDFILE=build.xml
#echo $CLASSPATH
java $ANT_OPTS -classpath "$CLASSPATH" org.apache.tools.ant.Main \
-Dant.home=$ANT_HOME \
-buildfile ${BUILDFILE} \
"$@"

47
build.xml Normal file
View File

@ -0,0 +1,47 @@
<project name="build-site" default="docs" basedir=".">
<!-- Initialization properties -->
<property name="project.name" value="site"/>
<property name="docs.src" value="./xdocs"/>
<property name="docs.dest" value="./docs"/>
<target name="prepare">
<available classname="org.apache.velocity.anakia.AnakiaTask"
property="AnakiaTask.present"/>
</target>
<target depends="prepare" name="prepare-error" unless="AnakiaTask.present">
<echo>
AnakiaTask is not present! Please check to make sure that
velocity.jar is in your classpath.
</echo>
</target>
<target name="docs" depends="prepare-error" if="AnakiaTask.present">
<taskdef name="anakia" classname="org.apache.velocity.anakia.AnakiaTask"/>
<anakia basedir="${docs.src}" destdir="${docs.dest}/"
extension=".html" style="./site.vsl"
projectFile="stylesheets/project.xml"
excludes="**/stylesheets/** empty.xml"
includes="**/*.xml"
lastModifiedCheck="true"
templatePath="../jakarta-site2/xdocs/stylesheets"
>
</anakia>
<copy todir="${docs.dest}/images" filtering="no">
<fileset dir="${docs.src}/images">
<include name="**/*.gif"/>
<include name="**/*.jpeg"/>
<include name="**/*.jpg"/>
</fileset>
</copy>
<!-- In case we have CSS someday
<copy todir="${docs.dest}" filtering="no">
<fileset dir="${docs.src}">
<include name="**/*.css"/>
</fileset>
</copy>
-->
</target>
</project>

1
cpappend.bat Executable file
View File

@ -0,0 +1 @@
set _CP=%1;%_CP%

169
docs/applications.html Normal file
View File

@ -0,0 +1,169 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!-- Content Stylesheet for Site -->
<!-- start the processing -->
<!-- ====================================================================== -->
<!-- Main Page Section -->
<!-- ====================================================================== -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<meta name="author" value="Ted Husted">
<meta name="email" value="husted@apache.org">
<title>Jakarta Lucene - Applications - Jakarta Lucene</title>
</head>
<body bgcolor="#ffffff" text="#000000" link="#525D76">
<table border="0" width="100%" cellspacing="0">
<!-- TOP IMAGE -->
<tr>
<td align="left">
<a href="http://jakarta.apache.org"><img src="http://jakarta.apache.org/images/jakarta-logo.gif" border="0"/></a>
</td>
<td align="right">
<a href="http://jakarta.apache.org/lucene/docs/"><img src="./images/lucene-logo.gif" alt="Jakarta Lucene" border="0"/></a>
</td>
</tr>
</table>
<table border="0" width="100%" cellspacing="4">
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr>
<!-- LEFT SIDE NAVIGATION -->
<td width="20%" valign="top" nowrap="true">
<p><strong>About</strong></p>
<ul>
<li> <a href="./index.html">Overview</a>
</li>
<li> <a href="./features.html">Features</a>
</li>
<li> <a href="./applications.html">Applications</a>
</li>
<li> <a href="./background.html">Background</a>
</li>
</ul>
<p><strong>Documentation</strong></p>
<ul>
<li> <a href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi">FAQs</a>
</li>
<li> <a href="../api/overview.html">Javadoc</a>
</li>
</ul>
<p><strong>Download</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/binindex.html">Binaries</a>
</li>
<li> <a href="http://jakarta.apache.org/site/sourceindex.html">Source Code</a>
</li>
<li> <a href="http://jakarta.apache.org/site/cvsindex.html">CVS Repositories</a>
</li>
</ul>
<p><strong>Community</strong></p>
<ul>
<li> <a href="./whoweare.html">Who We Are</a>
</li>
<li> <a href="./powered.html">Powered by Lucene</a>
</li>
<li> <a href="./resources.html">Resources</a>
</li>
<li> <a href="http://jakarta.apache.org/site/mail.html">Mailing Lists</a>
</li>
<li> <a href="http://jakarta.apache.org/site/bugs.html">Bugs</a>
</li>
<li> <a href="http://jakarta.apache.org/site/getinvolved.html">Get Involved</a>
</li>
</ul>
<p><strong>Jakarta</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/acknowledgements.html">Acknowledgements</a>
</li>
<li> <a href="http://jakarta.apache.org/site/contact.html">Contact</a>
</li>
<li> <a href="http://jakarta.apache.org/site/legal.html">Legal</a>
</li>
</ul>
</td>
<td width="80%" align="left" valign="top">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="APPLICATIONS"><strong>APPLICATIONS</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<h3>Lucene can be used for a wide range of applications</h3>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Example Applications"><strong>Example Applications</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>Lucene is designed to be used in a wide range of applications--from small,
desktop applications with a few hundred documents, to large internet server-based
applications with a few million documents.</p>
<h4>Searchable E-Mail</h4>
<ul>
<li>Search large e-mail archives instantly; update index as new messages
arrive.<br /></li>
</ul>
<h4>CD-ROM-based Online Documentation Search</h4>
<ul>
<li>Search large publications quickly with platform-independent system.<br /></li>
</ul>
<h4>Search Previously-Visited Web Pages</h4>
<ul>
<li>Relocate a page seen weeks or months ago.<br /></li>
</ul>
<h4>Web Site Searching</h4>
<ul>
<li>Let users search all the pages on your website.<br /></li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
</td>
</tr>
<!-- FOOTER -->
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr><td colspan="2">
<div align="center"><font color="#525D76" size="-1"><em>
Copyright &#169; 1999-2001, Apache Software Foundation
</em></font></div>
</td></tr>
</table>
</body>
</html>
<!-- end the processing -->

177
docs/background.html Normal file
View File

@ -0,0 +1,177 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!-- Content Stylesheet for Site -->
<!-- start the processing -->
<!-- ====================================================================== -->
<!-- Main Page Section -->
<!-- ====================================================================== -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<meta name="author" value="Ted Husted">
<meta name="email" value="husted@apache.org">
<title>Jakarta Lucene - Background - Jakarta Lucene</title>
</head>
<body bgcolor="#ffffff" text="#000000" link="#525D76">
<table border="0" width="100%" cellspacing="0">
<!-- TOP IMAGE -->
<tr>
<td align="left">
<a href="http://jakarta.apache.org"><img src="http://jakarta.apache.org/images/jakarta-logo.gif" border="0"/></a>
</td>
<td align="right">
<a href="http://jakarta.apache.org/lucene/docs/"><img src="./images/lucene-logo.gif" alt="Jakarta Lucene" border="0"/></a>
</td>
</tr>
</table>
<table border="0" width="100%" cellspacing="4">
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr>
<!-- LEFT SIDE NAVIGATION -->
<td width="20%" valign="top" nowrap="true">
<p><strong>About</strong></p>
<ul>
<li> <a href="./index.html">Overview</a>
</li>
<li> <a href="./features.html">Features</a>
</li>
<li> <a href="./applications.html">Applications</a>
</li>
<li> <a href="./background.html">Background</a>
</li>
</ul>
<p><strong>Documentation</strong></p>
<ul>
<li> <a href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi">FAQs</a>
</li>
<li> <a href="../api/overview.html">Javadoc</a>
</li>
</ul>
<p><strong>Download</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/binindex.html">Binaries</a>
</li>
<li> <a href="http://jakarta.apache.org/site/sourceindex.html">Source Code</a>
</li>
<li> <a href="http://jakarta.apache.org/site/cvsindex.html">CVS Repositories</a>
</li>
</ul>
<p><strong>Community</strong></p>
<ul>
<li> <a href="./whoweare.html">Who We Are</a>
</li>
<li> <a href="./powered.html">Powered by Lucene</a>
</li>
<li> <a href="./resources.html">Resources</a>
</li>
<li> <a href="http://jakarta.apache.org/site/mail.html">Mailing Lists</a>
</li>
<li> <a href="http://jakarta.apache.org/site/bugs.html">Bugs</a>
</li>
<li> <a href="http://jakarta.apache.org/site/getinvolved.html">Get Involved</a>
</li>
</ul>
<p><strong>Jakarta</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/acknowledgements.html">Acknowledgements</a>
</li>
<li> <a href="http://jakarta.apache.org/site/contact.html">Contact</a>
</li>
<li> <a href="http://jakarta.apache.org/site/legal.html">Legal</a>
</li>
</ul>
</td>
<td width="80%" align="left" valign="top">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="BACKGROUND"><strong>BACKGROUND</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<h3><img src="./images/dcutting.gif" align="right" alt="Doug Cutting" />How Lucene came to be</h3>
<p>Lucene is the brainchild of Doug Cutting (pictured), who has been working in the
field of information retrieval for over a decade.</p>
<p>Beginning in 1988, Doug spent five years at Xerox's Palo Alto Research
Center (PARC) developing novel
approaches to information access. These included a high-performance
retrieval engine, several innovative search paradigms, advanced linguistic
analysis methods, and high-quality text summarization algorithms.
This work resulted in seven <a href="publications.html">publications</a>
and six issued patents. Some of these technologies are now marketed by
<a href="http://www.inxight.com/">Inxight</a>.
</p>
<p>In 1993, Doug moved to Apple's Advanced Technology Group (ATG). There
he developed a state-of-the-art retrieval engine code-named V-Twin.
This engine was to be a part of the Copland
operating system, automatically indexing the content of all files as they
are created so that the the entire file system could be efficiently searched
at any time. Copland was cancelled, but V-Twin has been used in several
other Apple products.
</p>
<p>In April of 1996, Doug left Apple and joined <a href="http://www.excite.com/">Excite</a>.
Here he took over development of the core search technology. This included
growing Excite's web index from two million to fifty million
pages; substantially optimizing Excite's search performance; adding phrase-searching
capabilities; and creating a thesaurus-like feature which suggests related
terms to add to queries.
</p>
<p>In the fall of 1997, Doug reduced his commitment at Excite to
part-time so that he could write <a href="http://jakarta.apache.org/lucene/">Lucene</a>,
an efficient, full-featured text search engine written in Java. In early 1998 he
returned to Excite full-time for two more years. Lucene sat on the shelf for
much of that time, and was made open-source in the spring of 2000.</p>
<p>Lucene quickly became recognized as the leading server-side searching
solution for Java, and attracted several other open source developers, eager
to help refine the Lucene codebase.</p>
<p>In the fall of 2001, Lucene joined the Apache Jakarta Project, where
the product is maintained by a team of volunteer developers.</p>
<p>Doug now works for <a href="http://www.grandcentral.com/">Grand
Central</a>, a web services network. In his spare time he still
tries to help out with Lucene.</p>
<p>Please do not email Doug directly about Lucene. Instead use
the <a href="http://jakarta.apache.org/site/mail.html">Jakarta-Lucene mailing lists</a>.</p>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
</td>
</tr>
<!-- FOOTER -->
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr><td colspan="2">
<div align="center"><font color="#525D76" size="-1"><em>
Copyright &#169; 1999-2001, Apache Software Foundation
</em></font></div>
</td></tr>
</table>
</body>
</html>
<!-- end the processing -->

225
docs/features.html Normal file
View File

@ -0,0 +1,225 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!-- Content Stylesheet for Site -->
<!-- start the processing -->
<!-- ====================================================================== -->
<!-- Main Page Section -->
<!-- ====================================================================== -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<meta name="author" value="Ted Husted">
<meta name="email" value="husted@apache.org">
<title>Jakarta Lucene - Features - Jakarta Lucene</title>
</head>
<body bgcolor="#ffffff" text="#000000" link="#525D76">
<table border="0" width="100%" cellspacing="0">
<!-- TOP IMAGE -->
<tr>
<td align="left">
<a href="http://jakarta.apache.org"><img src="http://jakarta.apache.org/images/jakarta-logo.gif" border="0"/></a>
</td>
<td align="right">
<a href="http://jakarta.apache.org/lucene/docs/"><img src="./images/lucene-logo.gif" alt="Jakarta Lucene" border="0"/></a>
</td>
</tr>
</table>
<table border="0" width="100%" cellspacing="4">
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr>
<!-- LEFT SIDE NAVIGATION -->
<td width="20%" valign="top" nowrap="true">
<p><strong>About</strong></p>
<ul>
<li> <a href="./index.html">Overview</a>
</li>
<li> <a href="./features.html">Features</a>
</li>
<li> <a href="./applications.html">Applications</a>
</li>
<li> <a href="./background.html">Background</a>
</li>
</ul>
<p><strong>Documentation</strong></p>
<ul>
<li> <a href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi">FAQs</a>
</li>
<li> <a href="../api/overview.html">Javadoc</a>
</li>
</ul>
<p><strong>Download</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/binindex.html">Binaries</a>
</li>
<li> <a href="http://jakarta.apache.org/site/sourceindex.html">Source Code</a>
</li>
<li> <a href="http://jakarta.apache.org/site/cvsindex.html">CVS Repositories</a>
</li>
</ul>
<p><strong>Community</strong></p>
<ul>
<li> <a href="./whoweare.html">Who We Are</a>
</li>
<li> <a href="./powered.html">Powered by Lucene</a>
</li>
<li> <a href="./resources.html">Resources</a>
</li>
<li> <a href="http://jakarta.apache.org/site/mail.html">Mailing Lists</a>
</li>
<li> <a href="http://jakarta.apache.org/site/bugs.html">Bugs</a>
</li>
<li> <a href="http://jakarta.apache.org/site/getinvolved.html">Get Involved</a>
</li>
</ul>
<p><strong>Jakarta</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/acknowledgements.html">Acknowledgements</a>
</li>
<li> <a href="http://jakarta.apache.org/site/contact.html">Contact</a>
</li>
<li> <a href="http://jakarta.apache.org/site/legal.html">Legal</a>
</li>
</ul>
</td>
<td width="80%" align="left" valign="top">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="FEATURES"><strong>FEATURES</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<h3>Lucene offers powerful features through a simple API</h3>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Scalable, High-Performance Indexing"><strong>Scalable, High-Performance Indexing</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<ul>
<li>over 200MB/hour on Pentium II/266<br /></li>
<li>incremental indexing as fast as batch indexing</li>
<li>small RAM requirements -- only 1MB heap</li>
<li>index size roughly 30% the size of text indexed</li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Powerful, Accurate and Efficient Search Algorithms"><strong>Powerful, Accurate and Efficient Search Algorithms</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<ul>
<li>ranked searching -- best results returned first</li>
<li>boolean and phrase queries</li>
<li>fielded searching (e.g., title, author, contents)</li>
<li>date-range searching</li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Simple API's allow developers to ..."><strong>Simple API's allow developers to ...</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<ul>
<li>incorporate new document types</li>
<li>localize for new languages (already handles most European languages)</li>
<li>develop new user interfaces</li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Cross-Platform Solution"><strong>Cross-Platform Solution</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<ul>
<li>100%-pure Java <i>(not yet certified)</i></li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Coming soon"><strong>Coming soon</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<ul>
<li><i>multiple-index searching with merged results</i></li>
<li><i>distributed searching over a network</i></li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
</td>
</tr>
<!-- FOOTER -->
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr><td colspan="2">
<div align="center"><font color="#525D76" size="-1"><em>
Copyright &#169; 1999-2001, Apache Software Foundation
</em></font></div>
</td></tr>
</table>
</body>
</html>
<!-- end the processing -->

BIN
docs/images/dcutting.gif Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 23 KiB

BIN
docs/images/lucene-logo.gif Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 5.3 KiB

BIN
docs/images/lucene.gif Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 12 KiB

222
docs/index.html Normal file
View File

@ -0,0 +1,222 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!-- Content Stylesheet for Site -->
<!-- start the processing -->
<!-- ====================================================================== -->
<!-- Main Page Section -->
<!-- ====================================================================== -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<meta name="author" value="Jon S. Stevens">
<meta name="email" value="jon@latchkey.com">
<meta name="author" value="Ted Husted">
<meta name="email" value="husted@apache.org">
<title>Jakarta Lucene - Overview - Jakarta Lucene</title>
</head>
<body bgcolor="#ffffff" text="#000000" link="#525D76">
<table border="0" width="100%" cellspacing="0">
<!-- TOP IMAGE -->
<tr>
<td align="left">
<a href="http://jakarta.apache.org"><img src="http://jakarta.apache.org/images/jakarta-logo.gif" border="0"/></a>
</td>
<td align="right">
<a href="http://jakarta.apache.org/lucene/docs/"><img src="./images/lucene-logo.gif" alt="Jakarta Lucene" border="0"/></a>
</td>
</tr>
</table>
<table border="0" width="100%" cellspacing="4">
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr>
<!-- LEFT SIDE NAVIGATION -->
<td width="20%" valign="top" nowrap="true">
<p><strong>About</strong></p>
<ul>
<li> <a href="./index.html">Overview</a>
</li>
<li> <a href="./features.html">Features</a>
</li>
<li> <a href="./applications.html">Applications</a>
</li>
<li> <a href="./background.html">Background</a>
</li>
</ul>
<p><strong>Documentation</strong></p>
<ul>
<li> <a href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi">FAQs</a>
</li>
<li> <a href="../api/overview.html">Javadoc</a>
</li>
</ul>
<p><strong>Download</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/binindex.html">Binaries</a>
</li>
<li> <a href="http://jakarta.apache.org/site/sourceindex.html">Source Code</a>
</li>
<li> <a href="http://jakarta.apache.org/site/cvsindex.html">CVS Repositories</a>
</li>
</ul>
<p><strong>Community</strong></p>
<ul>
<li> <a href="./whoweare.html">Who We Are</a>
</li>
<li> <a href="./powered.html">Powered by Lucene</a>
</li>
<li> <a href="./resources.html">Resources</a>
</li>
<li> <a href="http://jakarta.apache.org/site/mail.html">Mailing Lists</a>
</li>
<li> <a href="http://jakarta.apache.org/site/bugs.html">Bugs</a>
</li>
<li> <a href="http://jakarta.apache.org/site/getinvolved.html">Get Involved</a>
</li>
</ul>
<p><strong>Jakarta</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/acknowledgements.html">Acknowledgements</a>
</li>
<li> <a href="http://jakarta.apache.org/site/contact.html">Contact</a>
</li>
<li> <a href="http://jakarta.apache.org/site/legal.html">Legal</a>
</li>
</ul>
</td>
<td width="80%" align="left" valign="top">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="OVERVIEW"><strong>OVERVIEW</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<h3>Lucene: Better Searching Through Java</h3>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="*** UNDER CONSTRUCTION ***"><strong>*** UNDER CONSTRUCTION ***</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
This site will be the home of Jakarta Lucene. The product is still being transferred.
For now, the home of the Lucene project is still at SourceForge. See
<a href="http://www.lucene.com/">www.lucene.com</a> for those links.
</p>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="What is Lucene?"><strong>What is Lucene?</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
Jakarta Lucene is a high-performance, full-featured text search engine
written entirely in Java. It is a technology suitable for nearly any
application that requires full-text search, especially
cross-platform.
</p>
<p>
Jakarta Lucene is an open source project available for
<a href="http://jakarta.apache.org/site/binindex.html">free download</a> from Apache Jakarta.
Please use the links on the left to access Lucene.
</p>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Lucene News"><strong>Lucene News</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
<b>Lucene v1.02 released</b> - This release repackages Lucene as product
of the Apache Software Foundation. Download it
<a href="http://jakarta.apache.org/site/binindex.html">here</a>.
</p>
<p>
<b>Lucene Joins Jakarta</b> - The Lucene Team is happy to announce that
Lucene is now part of a member of the Apache Jakarta Project. This move will
help Lucene continue to grow, and enhance its position as the leading
server-side searching solution for Java.
</p>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="About Apache Jakarta"><strong>About Apache Jakarta</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<p>
The goal of the <a href="http://jakarta.apache.org/">Apache Jakarta Project</a>
is to provide commercial-quality server solutions based on the Java Platform that
are developed in an open and cooperative fashion.
</p>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
</td>
</tr>
<!-- FOOTER -->
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr><td colspan="2">
<div align="center"><font color="#525D76" size="-1"><em>
Copyright &#169; 1999-2001, Apache Software Foundation
</em></font></div>
</td></tr>
</table>
</body>
</html>
<!-- end the processing -->

141
docs/powered.html Normal file
View File

@ -0,0 +1,141 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!-- Content Stylesheet for Site -->
<!-- start the processing -->
<!-- ====================================================================== -->
<!-- Main Page Section -->
<!-- ====================================================================== -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<meta name="author" value="Ted Husted">
<meta name="email" value="husted@apache.org">
<title>Jakarta Lucene - Who We Are - Jakarta Lucene</title>
</head>
<body bgcolor="#ffffff" text="#000000" link="#525D76">
<table border="0" width="100%" cellspacing="0">
<!-- TOP IMAGE -->
<tr>
<td align="left">
<a href="http://jakarta.apache.org"><img src="http://jakarta.apache.org/images/jakarta-logo.gif" border="0"/></a>
</td>
<td align="right">
<a href="http://jakarta.apache.org/lucene/docs/"><img src="./images/lucene-logo.gif" alt="Jakarta Lucene" border="0"/></a>
</td>
</tr>
</table>
<table border="0" width="100%" cellspacing="4">
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr>
<!-- LEFT SIDE NAVIGATION -->
<td width="20%" valign="top" nowrap="true">
<p><strong>About</strong></p>
<ul>
<li> <a href="./index.html">Overview</a>
</li>
<li> <a href="./features.html">Features</a>
</li>
<li> <a href="./applications.html">Applications</a>
</li>
<li> <a href="./background.html">Background</a>
</li>
</ul>
<p><strong>Documentation</strong></p>
<ul>
<li> <a href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi">FAQs</a>
</li>
<li> <a href="../api/overview.html">Javadoc</a>
</li>
</ul>
<p><strong>Download</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/binindex.html">Binaries</a>
</li>
<li> <a href="http://jakarta.apache.org/site/sourceindex.html">Source Code</a>
</li>
<li> <a href="http://jakarta.apache.org/site/cvsindex.html">CVS Repositories</a>
</li>
</ul>
<p><strong>Community</strong></p>
<ul>
<li> <a href="./whoweare.html">Who We Are</a>
</li>
<li> <a href="./powered.html">Powered by Lucene</a>
</li>
<li> <a href="./resources.html">Resources</a>
</li>
<li> <a href="http://jakarta.apache.org/site/mail.html">Mailing Lists</a>
</li>
<li> <a href="http://jakarta.apache.org/site/bugs.html">Bugs</a>
</li>
<li> <a href="http://jakarta.apache.org/site/getinvolved.html">Get Involved</a>
</li>
</ul>
<p><strong>Jakarta</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/acknowledgements.html">Acknowledgements</a>
</li>
<li> <a href="http://jakarta.apache.org/site/contact.html">Contact</a>
</li>
<li> <a href="http://jakarta.apache.org/site/legal.html">Legal</a>
</li>
</ul>
</td>
<td width="80%" align="left" valign="top">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="POWERED BY LUCENE"><strong>POWERED BY LUCENE</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<h3>Several public applications are using Lucene</h3>
<ul>
<li><a href="http://eyebrowse.tigris.org/">Eyebrowse</a></li>
<li><a href="http://www.jivesoftware.com/">Jive Forums</a></li>
<li><a href="http://www.i2a.com/websearch/"> Web Search</a></li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
</td>
</tr>
<!-- FOOTER -->
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr><td colspan="2">
<div align="center"><font color="#525D76" size="-1"><em>
Copyright &#169; 1999-2001, Apache Software Foundation
</em></font></div>
</td></tr>
</table>
</body>
</html>
<!-- end the processing -->

140
docs/resources.html Normal file
View File

@ -0,0 +1,140 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!-- Content Stylesheet for Site -->
<!-- start the processing -->
<!-- ====================================================================== -->
<!-- Main Page Section -->
<!-- ====================================================================== -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<meta name="author" value="Ted Husted">
<meta name="email" value="husted@apache.org">
<title>Jakarta Lucene - Who We Are - Jakarta Lucene</title>
</head>
<body bgcolor="#ffffff" text="#000000" link="#525D76">
<table border="0" width="100%" cellspacing="0">
<!-- TOP IMAGE -->
<tr>
<td align="left">
<a href="http://jakarta.apache.org"><img src="http://jakarta.apache.org/images/jakarta-logo.gif" border="0"/></a>
</td>
<td align="right">
<a href="http://jakarta.apache.org/lucene/docs/"><img src="./images/lucene-logo.gif" alt="Jakarta Lucene" border="0"/></a>
</td>
</tr>
</table>
<table border="0" width="100%" cellspacing="4">
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr>
<!-- LEFT SIDE NAVIGATION -->
<td width="20%" valign="top" nowrap="true">
<p><strong>About</strong></p>
<ul>
<li> <a href="./index.html">Overview</a>
</li>
<li> <a href="./features.html">Features</a>
</li>
<li> <a href="./applications.html">Applications</a>
</li>
<li> <a href="./background.html">Background</a>
</li>
</ul>
<p><strong>Documentation</strong></p>
<ul>
<li> <a href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi">FAQs</a>
</li>
<li> <a href="../api/overview.html">Javadoc</a>
</li>
</ul>
<p><strong>Download</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/binindex.html">Binaries</a>
</li>
<li> <a href="http://jakarta.apache.org/site/sourceindex.html">Source Code</a>
</li>
<li> <a href="http://jakarta.apache.org/site/cvsindex.html">CVS Repositories</a>
</li>
</ul>
<p><strong>Community</strong></p>
<ul>
<li> <a href="./whoweare.html">Who We Are</a>
</li>
<li> <a href="./powered.html">Powered by Lucene</a>
</li>
<li> <a href="./resources.html">Resources</a>
</li>
<li> <a href="http://jakarta.apache.org/site/mail.html">Mailing Lists</a>
</li>
<li> <a href="http://jakarta.apache.org/site/bugs.html">Bugs</a>
</li>
<li> <a href="http://jakarta.apache.org/site/getinvolved.html">Get Involved</a>
</li>
</ul>
<p><strong>Jakarta</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/acknowledgements.html">Acknowledgements</a>
</li>
<li> <a href="http://jakarta.apache.org/site/contact.html">Contact</a>
</li>
<li> <a href="http://jakarta.apache.org/site/legal.html">Legal</a>
</li>
</ul>
</td>
<td width="80%" align="left" valign="top">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="RESOURCES"><strong>RESOURCES</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<h3>Further reading</h3>
<ul>
<li><a href="http://www.javaworld.com/javaworld/jw-09-2000/jw-0915-lucene.html">The Lucene search engine Powerful flexible and free</a><br /> - JavaWorld September 2000</li>
<li><a href="http://www.javaworld.com/javaworld/jw-12-2000/jw-1229-cooltools.html">Build your own languages with JavaCC</a><br /> - JavaWorld December 2000</li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
</td>
</tr>
<!-- FOOTER -->
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr><td colspan="2">
<div align="center"><font color="#525D76" size="-1"><em>
Copyright &#169; 1999-2001, Apache Software Foundation
</em></font></div>
</td></tr>
</table>
</body>
</html>
<!-- end the processing -->

178
docs/whoweare.html Normal file
View File

@ -0,0 +1,178 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!-- Content Stylesheet for Site -->
<!-- start the processing -->
<!-- ====================================================================== -->
<!-- Main Page Section -->
<!-- ====================================================================== -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<meta name="author" value="Ted Husted">
<meta name="email" value="husted@apache.org">
<title>Jakarta Lucene - Who We Are - Jakarta Lucene</title>
</head>
<body bgcolor="#ffffff" text="#000000" link="#525D76">
<table border="0" width="100%" cellspacing="0">
<!-- TOP IMAGE -->
<tr>
<td align="left">
<a href="http://jakarta.apache.org"><img src="http://jakarta.apache.org/images/jakarta-logo.gif" border="0"/></a>
</td>
<td align="right">
<a href="http://jakarta.apache.org/lucene/docs/"><img src="./images/lucene-logo.gif" alt="Jakarta Lucene" border="0"/></a>
</td>
</tr>
</table>
<table border="0" width="100%" cellspacing="4">
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr>
<!-- LEFT SIDE NAVIGATION -->
<td width="20%" valign="top" nowrap="true">
<p><strong>About</strong></p>
<ul>
<li> <a href="./index.html">Overview</a>
</li>
<li> <a href="./features.html">Features</a>
</li>
<li> <a href="./applications.html">Applications</a>
</li>
<li> <a href="./background.html">Background</a>
</li>
</ul>
<p><strong>Documentation</strong></p>
<ul>
<li> <a href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi">FAQs</a>
</li>
<li> <a href="../api/overview.html">Javadoc</a>
</li>
</ul>
<p><strong>Download</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/binindex.html">Binaries</a>
</li>
<li> <a href="http://jakarta.apache.org/site/sourceindex.html">Source Code</a>
</li>
<li> <a href="http://jakarta.apache.org/site/cvsindex.html">CVS Repositories</a>
</li>
</ul>
<p><strong>Community</strong></p>
<ul>
<li> <a href="./whoweare.html">Who We Are</a>
</li>
<li> <a href="./powered.html">Powered by Lucene</a>
</li>
<li> <a href="./resources.html">Resources</a>
</li>
<li> <a href="http://jakarta.apache.org/site/mail.html">Mailing Lists</a>
</li>
<li> <a href="http://jakarta.apache.org/site/bugs.html">Bugs</a>
</li>
<li> <a href="http://jakarta.apache.org/site/getinvolved.html">Get Involved</a>
</li>
</ul>
<p><strong>Jakarta</strong></p>
<ul>
<li> <a href="http://jakarta.apache.org/site/acknowledgements.html">Acknowledgements</a>
</li>
<li> <a href="http://jakarta.apache.org/site/contact.html">Contact</a>
</li>
<li> <a href="http://jakarta.apache.org/site/legal.html">Legal</a>
</li>
</ul>
</td>
<td width="80%" align="left" valign="top">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="WHO WE ARE"><strong>WHO WE ARE</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<h3>Lucene is maintained by a team of volunteer developers</h3>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Committers"><strong>Committers</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<ul>
<li><b>Doug Cutting</b> (cutting at apache.org)</li>
<li><b>Otis Gospodnetic</b> (otis at apache.org)</li>
<li><b>Brian Goetz</b> (briangoetz at apache.org)</li>
<li><b>Scott Ganyo</b> (scottganyo at apache.org)</li>
<li><b>Eugene Gluzberg</b> (drag0n at apache.org)</li>
<li><b>Matt Tucker</b> (mtucker at apache.org)</li>
<li><b>Cory Hubert</b> (clhubert at apache.org)</li>
<li><b>Dave Kor</b> (davekor at apache.org)</li>
<li><b>Jon Stevens</b> (jon at latchkey.com)</li>
<li><b>Tal Dayan</b> (zapta at apache.org)</li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr><td bgcolor="#525D76">
<font color="#ffffff" face="arial,helvetica,sanserif">
<a name="Other Contributors"><strong>Other Contributors</strong></a>
</font>
</td></tr>
<tr><td>
<blockquote>
<ul>
<li>Josh Bloch</li>
<li>Ted Husted</li>
</ul>
</blockquote>
</p>
</td></tr>
<tr><td><br/></td></tr>
</table>
</td>
</tr>
<!-- FOOTER -->
<tr><td colspan="2">
<hr noshade="" size="1"/>
</td></tr>
<tr><td colspan="2">
<div align="center"><font color="#525D76" size="-1"><em>
Copyright &#169; 1999-2001, Apache Software Foundation
</em></font></div>
</td></tr>
</table>
</body>
</html>
<!-- end the processing -->

2
lib/ant-1.3.jar Normal file
View File

@ -0,0 +1,2 @@
AnyObjectId[e4efab3ced62cc2c310add71f678bb899e2e51cd] was removed in git history.
Apache SVN contains full history.

2
lib/jdom-b7.jar Normal file
View File

@ -0,0 +1,2 @@
AnyObjectId[6be1f2de7126b91b14a11a4b00326859a85e5981] was removed in git history.
Apache SVN contains full history.

2
lib/velocity-1.2-dev.jar Normal file
View File

@ -0,0 +1,2 @@
AnyObjectId[5f8889968e8e6ebbee53e6425ccaca3130c12fef] was removed in git history.
Apache SVN contains full history.

2
lib/xerces-1.3.0.jar Normal file
View File

@ -0,0 +1,2 @@
AnyObjectId[311ab8d15990e3d0b0234e37afb89fd7f21f8eb8] was removed in git history.
Apache SVN contains full history.

57
velocity.log Normal file
View File

@ -0,0 +1,57 @@
Tue Sep 11 16:23:09 EDT 2001 [debug] AvalonLogSystem initialized using logfile D:\jakarta\CVS\jakarta-lucene\velocity.log
Tue Sep 11 16:23:09 EDT 2001 [info] RuntimeInstance v1.2x initializing : org.apache.velocity.runtime.RuntimeInstance@63b895
Tue Sep 11 16:23:09 EDT 2001 [info] Default Properties File: org\apache\velocity\runtime\defaults\velocity.properties
Tue Sep 11 16:23:09 EDT 2001 [info] Resource manager initializing.
Tue Sep 11 16:23:09 EDT 2001 [info] Resource Loader Instantiated: org.apache.velocity.runtime.resource.loader.FileResourceLoader
Tue Sep 11 16:23:09 EDT 2001 [info] FileResourceLoader : initialization starting.
Tue Sep 11 16:23:09 EDT 2001 [info] FileResourceLoader : adding path 'D:\jakarta\CVS\jakarta-site2\xdocs\stylesheets'
Tue Sep 11 16:23:09 EDT 2001 [info] FileResourceLoader : initialization complete.
Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Literal
Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Macro
Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Parse
Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Include
Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Foreach
Tue Sep 11 16:23:10 EDT 2001 [info] Created: 20 parsers.
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : initialization starting.
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : adding VMs from VM library template : VM_global_library.vm
Tue Sep 11 16:23:10 EDT 2001 [error] ResourceManager : unable to find resource 'VM_global_library.vm' in any resource loader.
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : error using VM library template VM_global_library.vm : org.apache.velocity.exception.ResourceNotFoundException: Unable to find resource 'VM_global_library.vm'
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : VM library template macro registration complete.
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : allowInline = true : VMs can be defined inline in templates
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : allowInlineToOverride = false : VMs defined inline may NOT replace previous VM definitions
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : allowInlineLocal = false : VMs defined inline will be global in scope if allowed.
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : messages on : VM system will output logging messages
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : autoload off : VM system will not automatically reload global library macros
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : initialization complete.
Tue Sep 11 16:23:10 EDT 2001 [info] Velocity successfully started.
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #table( table ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #tr( tr ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #td( value ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #th( value ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #projectanchor( name value ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #metaauthor( author email ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #image( value ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #source( value ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #subsection( subsection ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #section( section ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #makeProject( ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #getProjectImage( ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #document( ) : source = ./site.vsl
Tue Sep 11 16:23:10 EDT 2001 [error] VM #tr: error : too few arguments to macro. Wanted 1 got 0 -->
Tue Sep 11 16:23:10 EDT 2001 [error] VM #td: error : too few arguments to macro. Wanted 1 got 0 -->
Tue Sep 11 16:23:10 EDT 2001 [error] VM #th: error : too few arguments to macro. Wanted 1 got 0 -->
Tue Sep 11 16:23:11 EDT 2001 [info] ResourceManager : found ./site.vsl with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : table : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : tr : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : td : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : th : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : projectanchor : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : metaauthor : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : image : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : source : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : subsection : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : section : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : makeProject : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : getProjectImage : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : document : inline not allowed to replace existing VM
Tue Sep 11 16:23:11 EDT 2001 [info] ResourceManager : found ./site.vsl with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader

38
xdocs/applications.xml Normal file
View File

@ -0,0 +1,38 @@
<?xml version="1.0"?>
<document>
<properties>
<author email="husted@apache.org">Ted Husted</author>
<title>Applications - Jakarta Lucene</title>
</properties>
<body>
<section name="APPLICATIONS">
<h3>Lucene can be used for a wide range of applications</h3>
</section>
<section name="Example Applications">
<p>Lucene is designed to be used in a wide range of applications--from small,
desktop applications with a few hundred documents, to large internet server-based
applications with a few million documents.</p>
<h4>Searchable E-Mail</h4>
<ul>
<li>Search large e-mail archives instantly; update index as new messages
arrive.<br/></li>
</ul>
<h4>CD-ROM-based Online Documentation Search</h4>
<ul>
<li>Search large publications quickly with platform-independent system.<br/></li>
</ul>
<h4>Search Previously-Visited Web Pages</h4>
<ul>
<li>Relocate a page seen weeks or months ago.<br/></li>
</ul>
<h4>Web Site Searching</h4>
<ul>
<li>Let users search all the pages on your website.<br/></li>
</ul>
</section>
</body>
</document>

56
xdocs/background.xml Normal file
View File

@ -0,0 +1,56 @@
<?xml version="1.0"?>
<document>
<properties>
<author email="husted@apache.org">Ted Husted</author>
<title>Background - Jakarta Lucene</title>
</properties>
<body>
<section name="BACKGROUND">
<h3><img src="./images/dcutting.gif" align="right" alt="Doug Cutting"/>How Lucene came to be</h3>
<p>Lucene is the brainchild of Doug Cutting (pictured), who has been working in the
field of information retrieval for over a decade.</p>
<p>Beginning in 1988, Doug spent five years at Xerox's Palo Alto Research
Center (PARC) developing novel
approaches to information access. These included a high-performance
retrieval engine, several innovative search paradigms, advanced linguistic
analysis methods, and high-quality text summarization algorithms.
This work resulted in seven <a href="publications.html">publications</a>
and six issued patents. Some of these technologies are now marketed by
<a href="http://www.inxight.com/">Inxight</a>.
</p>
<p>In 1993, Doug moved to Apple's Advanced Technology Group (ATG). There
he developed a state-of-the-art retrieval engine code-named V-Twin.
This engine was to be a part of the Copland
operating system, automatically indexing the content of all files as they
are created so that the the entire file system could be efficiently searched
at any time. Copland was cancelled, but V-Twin has been used in several
other Apple products.
</p>
<p>In April of 1996, Doug left Apple and joined <a href="http://www.excite.com/">Excite</a>.
Here he took over development of the core search technology. This included
growing Excite's web index from two million to fifty million
pages; substantially optimizing Excite's search performance; adding phrase-searching
capabilities; and creating a thesaurus-like feature which suggests related
terms to add to queries.
</p>
<p>In the fall of 1997, Doug reduced his commitment at Excite to
part-time so that he could write <a href="http://jakarta.apache.org/lucene/">Lucene</a>,
an efficient, full-featured text search engine written in Java. In early 1998 he
returned to Excite full-time for two more years. Lucene sat on the shelf for
much of that time, and was made open-source in the spring of 2000.</p>
<p>Lucene quickly became recognized as the leading server-side searching
solution for Java, and attracted several other open source developers, eager
to help refine the Lucene codebase.</p>
<p>In the fall of 2001, Lucene joined the Apache Jakarta Project, where
the product is maintained by a team of volunteer developers.</p>
<p>Doug now works for <a href="http://www.grandcentral.com/">Grand
Central</a>, a web services network. In his spare time he still
tries to help out with Lucene.</p>
<p>Please do not email Doug directly about Lucene. Instead use
the <a href="http://jakarta.apache.org/site/mail.html">Jakarta-Lucene mailing lists</a>.</p>
</section>
</body>
</document>

54
xdocs/features.xml Normal file
View File

@ -0,0 +1,54 @@
<?xml version="1.0"?>
<document>
<properties>
<author email="husted@apache.org">Ted Husted</author>
<title>Features - Jakarta Lucene</title>
</properties>
<body>
<section name="FEATURES">
<h3>Lucene offers powerful features through a simple API</h3>
</section>
<section name="Scalable, High-Performance Indexing">
<ul>
<li>over 200MB/hour on Pentium II/266<br/></li>
<li>incremental indexing as fast as batch indexing</li>
<li>small RAM requirements -- only 1MB heap</li>
<li>index size roughly 30% the size of text indexed</li>
</ul>
</section>
<section name="Powerful, Accurate and Efficient Search Algorithms">
<ul>
<li>ranked searching -- best results returned first</li>
<li>boolean and phrase queries</li>
<li>fielded searching (e.g., title, author, contents)</li>
<li>date-range searching</li>
</ul>
</section>
<section name="Simple API's allow developers to ...">
<ul>
<li>incorporate new document types</li>
<li>localize for new languages (already handles most European languages)</li>
<li>develop new user interfaces</li>
</ul>
</section>
<section name="Cross-Platform Solution">
<ul>
<li>100%-pure Java <i>(not yet certified)</i></li>
</ul>
</section>
<section name="Coming soon">
<ul>
<li><i>multiple-index searching with merged results</i></li>
<li><i>distributed searching over a network</i></li>
</ul>
</section>
</body>
</document>

BIN
xdocs/images/dcutting.gif Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 23 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 5.3 KiB

60
xdocs/index.xml Normal file
View File

@ -0,0 +1,60 @@
<?xml version="1.0"?>
<document>
<properties>
<author email="jon@latchkey.com">Jon S. Stevens</author>
<author email="husted@apache.org">Ted Husted</author>
<title>Overview - Jakarta Lucene</title>
</properties>
<body>
<section name="OVERVIEW">
<h3>Lucene: Better Searching Through Java</h3>
</section>
<section name="*** UNDER CONSTRUCTION ***">
<p>
This site will be the home of Jakarta Lucene. The product is still being transferred.
For now, the home of the Lucene project is still at SourceForge. See
<a href="http://www.lucene.com/">www.lucene.com</a> for those links.
</p>
</section>
<section name="What is Lucene?">
<p>
Jakarta Lucene is a high-performance, full-featured text search engine
written entirely in Java. It is a technology suitable for nearly any
application that requires full-text search, especially
cross-platform.
</p>
<p>
Jakarta Lucene is an open source project available for
<a href="http://jakarta.apache.org/site/binindex.html">free download</a> from Apache Jakarta.
Please use the links on the left to access Lucene.
</p>
</section>
<section name="Lucene News">
<p>
<b>Lucene v1.02 released</b> - This release repackages Lucene as product
of the Apache Software Foundation. Download it
<a href="http://jakarta.apache.org/site/binindex.html">here</a>.
</p>
<p>
<b>Lucene Joins Jakarta</b> - The Lucene Team is happy to announce that
Lucene is now part of a member of the Apache Jakarta Project. This move will
help Lucene continue to grow, and enhance its position as the leading
server-side searching solution for Java.
</p>
</section>
<section name="About Apache Jakarta">
<p>
The goal of the <a href="http://jakarta.apache.org/">Apache Jakarta Project</a>
is to provide commercial-quality server solutions based on the Java Platform that
are developed in an open and cooperative fashion.
</p>
</section>
</body>
</document>

20
xdocs/powered.xml Normal file
View File

@ -0,0 +1,20 @@
<?xml version="1.0"?>
<document>
<properties>
<author email="husted@apache.org">Ted Husted</author>
<title>Who We Are - Jakarta Lucene</title>
</properties>
<body>
<section name="POWERED BY LUCENE">
<h3>Several public applications are using Lucene</h3>
<ul>
<li><a href="http://eyebrowse.tigris.org/">Eyebrowse</a></li>
<li><a href="http://www.jivesoftware.com/">Jive Forums</a></li>
<li><a href="http://www.i2a.com/websearch/"> Web Search</a></li>
</ul>
</section>
</body>
</document>

19
xdocs/resources.xml Normal file
View File

@ -0,0 +1,19 @@
<?xml version="1.0"?>
<document>
<properties>
<author email="husted@apache.org">Ted Husted</author>
<title>Who We Are - Jakarta Lucene</title>
</properties>
<body>
<section name="RESOURCES">
<h3>Further reading</h3>
<ul>
<li><a href="http://www.javaworld.com/javaworld/jw-09-2000/jw-0915-lucene.html">The Lucene search engine Powerful flexible and free</a><br/> - JavaWorld September 2000</li>
<li><a href="http://www.javaworld.com/javaworld/jw-12-2000/jw-1229-cooltools.html">Build your own languages with JavaCC</a><br/> - JavaWorld December 2000</li>
</ul>
</section>
</body>
</document>

View File

@ -0,0 +1,42 @@
<?xml version="1.0" encoding="ISO-8859-1"?>
<project name="Jakarta Lucene"
href="http://jakarta.apache.org/lucene/docs/">
<title>Jakarta Lucene</title>
<logo href="/images/lucene-logo.gif">Jakarta Lucene</logo>
<body>
<menu name="About">
<item name="Overview" href="/index.html"/>
<item name="Features" href="/features.html"/>
<item name="Applications" href="/applications.html"/>
<item name="Background" href="/background.html"/>
</menu>
<menu name="Documentation">
<item name="FAQs" href="http://www.lucene.com/cgi-bin/faq/faqmanager.cgi" target="_blank"/>
<item name="Javadoc" href="./api/overview.html"/>
</menu>
<menu name="Download">
<item name="Binaries" href="/site/binindex.html"/>
<item name="Source Code" href="/site/sourceindex.html"/>
<item name="CVS Repositories" href="/site/cvsindex.html"/>
</menu>
<menu name="Community">
<item name="Who We Are" href="/whoweare.html"/>
<item name="Powered by Lucene" href="/powered.html"/>
<item name="Resources" href="/resources.html"/>
<item name="Mailing Lists" href="/site/mail.html"/>
<item name="Bugs" href="/site/bugs.html"/>
<item name="Get Involved" href="/site/getinvolved.html"/>
</menu>
<menu name="Jakarta">
<item name="Acknowledgements" href="/site/acknowledgements.html"/>
<item name="Contact" href="/site/contact.html"/>
<item name="Legal" href="/site/legal.html"/>
</menu>
</body>
</project>

37
xdocs/whoweare.xml Normal file
View File

@ -0,0 +1,37 @@
<?xml version="1.0"?>
<document>
<properties>
<author email="husted@apache.org">Ted Husted</author>
<title>Who We Are - Jakarta Lucene</title>
</properties>
<body>
<section name="WHO WE ARE">
<h3>Lucene is maintained by a team of volunteer developers</h3>
</section>
<section name="Committers">
<ul>
<li><b>Doug Cutting</b> (cutting at apache.org)</li>
<li><b>Otis Gospodnetic</b> (otis at apache.org)</li>
<li><b>Brian Goetz</b> (briangoetz at apache.org)</li>
<li><b>Scott Ganyo</b> (scottganyo at apache.org)</li>
<li><b>Eugene Gluzberg</b> (drag0n at apache.org)</li>
<li><b>Matt Tucker</b> (mtucker at apache.org)</li>
<li><b>Cory Hubert</b> (clhubert at apache.org)</li>
<li><b>Dave Kor</b> (davekor at apache.org)</li>
<li><b>Jon Stevens</b> (jon at latchkey.com)</li>
<li><b>Tal Dayan</b> (zapta at apache.org)</li>
</ul>
</section>
<section name="Other Contributors">
<ul>
<li>Josh Bloch</li>
<li>Ted Husted</li>
</ul>
</section>
</body>
</document>