Lucene
Lucene is a full-text search engine written in Java. It is efficient,
providing high-performance indexing and searching using few system resources.
State-of-the-art search algorithms produce highest-quality search results.
The use of Java allows easy integration with cross-platform applications.
Potential Applications
-
Searchable E-Mail
Search large e-mail archives instantly; update index as new messages
arrive.
-
CD-ROM-based Online Documentation Search
Search large publications quickly with platform-independent system.
-
Search Previously-Visited Web Pages
Relocate a page seen weeks or months ago.
-
Web Site Searching
Let users search all the pages on your website.
Features
-
Scalable, High-Performance Indexing
-
over 200MB/hour on Pentium II/266
-
incremental indexing as fast as batch indexing
-
small RAM requirements -- only 1MB heap
-
index size roughly 30% the size of text indexed
-
Powerful, Accurate and Efficient Search Algorithms
-
ranked searching -- best results returned first
-
boolean and phrase queries
-
fielded searching (e.g., title, author, contents)
-
date-range searching
-
coming soon:
-
multiple-index searching with merged results
-
distributed searching over a network
-
Simple API's allow developers to:
-
incorporate new document types
-
localize for new languages (already handles most European languages)
-
develop new user interfaces
-
Cross-Platform Solution
-
100%-pure Java (not yet certified)
Contact
Douglass R. Cutting
Email: cutting@lucene.com
Phone: 1 (510) 595-0232