lucene/modules/analysis/README.txt

67 lines
1.8 KiB
Plaintext
Raw Normal View History

Analysis README file
INTRODUCTION
The Analysis Module provides analysis capabilities to Lucene and Solr
applications.
The Lucene web site is at:
http://lucene.apache.org/
Please join the Lucene-User mailing list by sending a message to:
java-user-subscribe@lucene.apache.org
FILES
lucene-analyzers-common-XX.jar
The primary analysis module library, containing general-purpose analysis
components and support for various languages.
lucene-analyzers-icu-XX.jar
An add-on analysis library that provides improved Unicode support via
International Components for Unicode (ICU). Note: this module depends on
the ICU4j jar file (version >= 4.6.0)
lucene-analyzers-kuromoji-XX.jar
An analyzer with morphological analysis for Japanese.
lucene-analyzers-morfologik-XX.jar
An analyzer using the Morfologik stemming library.
lucene-analyzers-phonetic-XX.jar
An add-on analysis library that provides phonetic encoders via Apache
Commons-Codec. Note: this module depends on the commons-codec jar
file (version >= 1.4)
lucene-analyzers-smartcn-XX.jar
An add-on analysis library that provides word segmentation for Simplified
Chinese.
lucene-analyzers-stempel-XX.jar
An add-on analysis library that contains a universal algorithmic stemmer,
including tables for the Polish language.
lucene-analyzers-uima-XX.jar
An add-on analysis library that contains tokenizers/analyzers using
Apache UIMA extracted annotations to identify tokens/types/etc.
common/src/java
icu/src/java
kuromoji/src/java
morfologik/src/java
phonetic/src/java
smartcn/src/java
stempel/src/java
uima/src/java
The source code for the libraries.
common/src/test
icu/src/test
kuromoji/src/test
morfologik/src/test
phonetic/src/test
smartcn/src/test
stempel/src/test
uima/src/test
Unit tests for the libraries.