mirror of https://github.com/apache/lucene.git
dd91f5ca82
Upgrade from icu 62.2 to 68.2, with Unicode 13 support. Modify GenerateUTR30DataFiles to take the release tag as a program argument. Gradle populates this automatically, removing a manual step from regeneration process. |
||
---|---|---|
.. | ||
common | ||
icu | ||
kuromoji | ||
morfologik | ||
nori | ||
opennlp | ||
phonetic | ||
smartcn | ||
stempel | ||
README.txt |
README.txt
Analysis README file INTRODUCTION The Analysis Module provides analysis capabilities to Lucene and Solr applications. The Lucene web site is at: http://lucene.apache.org/ Please join the Lucene-User mailing list by sending a message to: java-user-subscribe@lucene.apache.org FILES lucene-analysis-common-XX.jar The primary analysis module library, containing general-purpose analysis components and support for various languages. lucene-analysis-icu-XX.jar An add-on analysis library that provides improved Unicode support via International Components for Unicode (ICU). Note: this module depends on the ICU4j jar file (version >= 4.6.0) lucene-analysis-kuromoji-XX.jar An analyzer with morphological analysis for Japanese. lucene-analysis-morfologik-XX.jar An analyzer using the Morfologik stemming library. lucene-analysis-nori-XX.jar An analyzer with morphological analysis for Korean. lucene-analysis-opennlp-XX.jar An analyzer using the OpenNLP natural-language processing library. lucene-analysis-phonetic-XX.jar An add-on analysis library that provides phonetic encoders via Apache Commons-Codec. Note: this module depends on the commons-codec jar file lucene-analysis-smartcn-XX.jar An add-on analysis library that provides word segmentation for Simplified Chinese. lucene-analysis-stempel-XX.jar An add-on analysis library that contains a universal algorithmic stemmer, including tables for the Polish language. common/src/java icu/src/java kuromoji/src/java morfologik/src/java nori/src/java opennlp/src/java phonetic/src/java smartcn/src/java stempel/src/java The source code for the libraries. common/src/test icu/src/test kuromoji/src/test morfologik/src/test nori/src/test opennlp/src/test phonetic/src/test smartcn/src/test stempel/src/test Unit tests for the libraries.