mirror of https://github.com/apache/lucene.git
129 lines
2.3 KiB
HTML
129 lines
2.3 KiB
HTML
|
<HTML>
|
||
|
<HEAD>
|
||
|
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
|
||
|
<META NAME="Author" CONTENT="Doug Cutting">
|
||
|
<META NAME="GENERATOR" CONTENT="Mozilla/4.04 [en] (Win95; U) [Netscape]">
|
||
|
<TITLE>Lucene: a full-text search engine in Java</TITLE>
|
||
|
</HEAD>
|
||
|
<BODY>
|
||
|
|
||
|
<H1>
|
||
|
Lucene</H1>
|
||
|
Lucene is a full-text search engine written in Java. It is efficient,
|
||
|
providing high-performance indexing and searching using few system resources.
|
||
|
State-of-the-art search algorithms produce highest-quality search results.
|
||
|
The use of Java allows easy integration with cross-platform applications.
|
||
|
<H2>
|
||
|
Potential Applications</H2>
|
||
|
|
||
|
<UL>
|
||
|
<LI>
|
||
|
<B>Searchable E-Mail</B></LI>
|
||
|
|
||
|
<BR>Search large e-mail archives instantly; update index as new messages
|
||
|
arrive.
|
||
|
<LI>
|
||
|
<B>CD-ROM-based Online Documentation Search</B></LI>
|
||
|
|
||
|
<BR>Search large publications quickly with platform-independent system.
|
||
|
<LI>
|
||
|
<B>Search Previously-Visited Web Pages</B></LI>
|
||
|
|
||
|
<BR>Relocate a page seen weeks or months ago.
|
||
|
<LI>
|
||
|
<B>Web Site Searching</B></LI>
|
||
|
|
||
|
<BR>Let users search all the pages on your website.</UL>
|
||
|
|
||
|
<H2>
|
||
|
Features</H2>
|
||
|
|
||
|
<UL>
|
||
|
<LI>
|
||
|
<B>Scalable, High-Performance Indexing</B></LI>
|
||
|
|
||
|
<DL>
|
||
|
<DL>
|
||
|
<LI>
|
||
|
over 200MB/hour on Pentium II/266</LI>
|
||
|
|
||
|
<LI>
|
||
|
incremental indexing as fast as batch indexing</LI>
|
||
|
|
||
|
<LI>
|
||
|
small RAM requirements -- only 1MB heap</LI>
|
||
|
|
||
|
<LI>
|
||
|
index size roughly 30% the size of text indexed</LI>
|
||
|
</DL>
|
||
|
</DL>
|
||
|
|
||
|
<LI>
|
||
|
<B>Powerful, Accurate and Efficient Search Algorithms</B></LI>
|
||
|
|
||
|
<DL>
|
||
|
<DL>
|
||
|
<LI>
|
||
|
ranked searching -- best results returned first</LI>
|
||
|
|
||
|
<LI>
|
||
|
boolean and phrase queries</LI>
|
||
|
|
||
|
<LI>
|
||
|
fielded searching (e.g., title, author, contents)</LI>
|
||
|
|
||
|
<LI>
|
||
|
date-range searching</LI>
|
||
|
|
||
|
<LI>
|
||
|
<B><I>coming soon:</I></B></LI>
|
||
|
|
||
|
<DL>
|
||
|
<DL>
|
||
|
<LI>
|
||
|
<I>multiple-index searching with merged results</I></LI>
|
||
|
|
||
|
<LI>
|
||
|
<I>distributed searching over a network</I></LI>
|
||
|
</DL>
|
||
|
</DL>
|
||
|
</DL>
|
||
|
</DL>
|
||
|
|
||
|
<LI>
|
||
|
<B>Simple API's allow developers to:</B></LI>
|
||
|
|
||
|
<DL>
|
||
|
<DL>
|
||
|
<LI>
|
||
|
incorporate new document types</LI>
|
||
|
|
||
|
<LI>
|
||
|
localize for new languages (already handles most European languages)</LI>
|
||
|
|
||
|
<LI>
|
||
|
develop new user interfaces</LI>
|
||
|
</DL>
|
||
|
</DL>
|
||
|
|
||
|
<LI>
|
||
|
<B>Cross-Platform Solution</B></LI>
|
||
|
|
||
|
<DL>
|
||
|
<DL>
|
||
|
<LI>
|
||
|
100%-pure Java <I>(not yet certified)</I></LI>
|
||
|
</DL>
|
||
|
</DL>
|
||
|
</UL>
|
||
|
|
||
|
<H2>
|
||
|
Contact</H2>
|
||
|
|
||
|
<UL><B>Douglass R. Cutting</B>
|
||
|
<BR>Email: cutting@lucene.com
|
||
|
<BR>Phone: 1 (510) 595-0232</UL>
|
||
|
|
||
|
</BODY>
|
||
|
</HTML>
|