mirror of https://github.com/apache/lucene.git
bcfa0cbc60
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150766 13f79535-47bb-0310-9956-ffa450edef68 |
||
---|---|---|
.. | ||
legal | ||
lib | ||
src | ||
tools | ||
GETTING STARTED.txt | ||
README.txt | ||
appendcp.bat | ||
build.bat | ||
build.number | ||
build.sh | ||
build.xml | ||
index.html | ||
layout.xml | ||
module.xml | ||
patch | ||
properties.xml | ||
status.xml |
README.txt
This is the README file for a search framework contribution to Lucene Sandbox. It is an attempt at constructing a framework around the Lucene search API. (Can I have a name for it?) 3 interesting features of this framework are: datasource independence - through various datasource implementations, regardless of whether it is a database table, an object, a filesystem directory, or a website, these can all be indexed. complex datasource support - complex datasources are containers for what are potentially new datasources (a Zip archive, a HTML document containing links to other HTML documents, a Java object which contains references to other objects to be indexed, etc). The framework has basic support for complex datasources. pluggable file content handlers - content handlers which 'know' how to index various file formats (MS Word, Zip, Tar, etc) can be easily configured via an xml configuration file.