<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<META NAME="Author" CONTENT="Doug Cutting">
<META NAME="GENERATOR" CONTENT="Mozilla/4.04 [en] (Win95; U) [Netscape]">
<TITLE>Lucene: a full-text search engine in Java</TITLE>
</HEAD>
<BODY>
<H1>
Lucene</H1>
Lucene is a full-text search engine written in Java.&nbsp; It is efficient,
providing high-performance indexing and searching while using few system resources.&nbsp;
Its state-of-the-art search algorithms produce the highest-quality search results.&nbsp;
Because it is written in Java, it integrates easily with cross-platform applications.
<H2>
Potential Applications</H2>
<UL>
<LI>
<B>Searchable E-Mail</B>
<BR>Search large e-mail archives instantly; update the index as new messages
arrive.</LI>
<LI>
<B>CD-ROM-based Online Documentation Search</B>
<BR>Search large publications quickly with a platform-independent system.</LI>
<LI>
<B>Search Previously-Visited Web Pages</B>
<BR>Find again a page you saw weeks or months ago.</LI>
<LI>
<B>Web Site Searching</B>
<BR>Let users search all the pages on your website.</LI>
</UL>
<H2>
Features</H2>
<UL>
<LI>
<B>Scalable, High-Performance Indexing</B>
<UL>
<LI>
over 200 MB/hour on a Pentium II/266</LI>
<LI>
incremental indexing as fast as batch indexing</LI>
<LI>
small RAM requirements -- only 1 MB of heap</LI>
<LI>
index size roughly 30% of the size of the text indexed</LI>
</UL>
</LI>
<LI>
<B>Powerful, Accurate and Efficient Search Algorithms</B>
<UL>
<LI>
ranked searching -- best results returned first</LI>
<LI>
boolean and phrase queries</LI>
<LI>
fielded searching (e.g., title, author, contents)</LI>
<LI>
date-range searching</LI>
<LI>
<B><I>coming soon:</I></B>
<UL>
<LI>
<I>multiple-index searching with merged results</I></LI>
<LI>
<I>distributed searching over a network</I></LI>
</UL>
</LI>
</UL>
</LI>
<LI>
<B>Simple APIs allow developers to:</B>
<UL>
<LI>
incorporate new document types</LI>
<LI>
localize for new languages (already handles most European languages)</LI>
<LI>
develop new user interfaces</LI>
</UL>
</LI>
<LI>
<B>Cross-Platform Solution</B>
<UL>
<LI>
100%-pure Java <I>(not yet certified)</I></LI>
</UL>
</LI>
</UL>
<H2>
Contact</H2>
<UL><B>Douglass R. Cutting</B>
<BR>Email:&nbsp;&nbsp;&nbsp; cutting@lucene.com
<BR>Phone:&nbsp;&nbsp; 1 (510) 595-0232</UL>
</BODY>
</HTML>