lucene/lucene/analysis
Robert Muir dd91f5ca82
LUCENE-9773: upgrade icu to 68.2 (#2372)
Upgrade from icu 62.2 to 68.2, with Unicode 13 support.

Modify GenerateUTR30DataFiles to take the release tag as a program
argument. Gradle populates this automatically, removing a manual step
from regeneration process.
2021-02-15 14:56:13 -05:00
..
common LUCENE-9771: Hunspell: don't lookup word roots unnecessarily to check flags (#2369) 2021-02-15 20:21:44 +01:00
icu LUCENE-9773: upgrade icu to 68.2 (#2372) 2021-02-15 14:56:13 -05:00
kuromoji LUCENE-8626: Lucene standardize test naming part 3 and final (#2220) 2021-01-22 12:38:52 -05:00
morfologik LUCENE-9570: code reformatting [partial]. 2021-01-03 21:46:01 +01:00
nori LUCENE-8626: Lucene standardize test naming part 3 and final (#2220) 2021-01-22 12:38:52 -05:00
opennlp LUCENE-9570: code reformatting [partial]. 2021-01-03 21:46:01 +01:00
phonetic LUCENE-8626: Lucene standardize test naming part 3 and final (#2220) 2021-01-22 12:38:52 -05:00
smartcn LUCENE-9570: code reformatting [partial]. 2021-01-03 21:46:01 +01:00
stempel LUCENE-9570: code reformatting [partial]. 2021-01-03 21:46:01 +01:00
README.txt LUCENE-9562: All binary analysis packages (and corresponding Maven artifacts) with names containing '-analyzers-' have been renamed to '-analysis-'. (#1968) 2020-10-12 09:15:56 +02:00

README.txt

Analysis README file

INTRODUCTION

The Analysis Module provides analysis capabilities to Lucene and Solr
applications.

The Lucene web site is at:
  http://lucene.apache.org/

Please join the Lucene-User mailing list by sending a message to:
  java-user-subscribe@lucene.apache.org

FILES

lucene-analysis-common-XX.jar
  The primary analysis module library, containing general-purpose analysis
  components and support for various languages.

lucene-analysis-icu-XX.jar
  An add-on analysis library that provides improved Unicode support via
  International Components for Unicode (ICU). Note: this module depends on
  the ICU4j jar file (version >= 4.6.0)

lucene-analysis-kuromoji-XX.jar
  An analyzer with morphological analysis for Japanese.

lucene-analysis-morfologik-XX.jar
  An analyzer using the Morfologik stemming library.

lucene-analysis-nori-XX.jar
  An analyzer with morphological analysis for Korean.

lucene-analysis-opennlp-XX.jar
  An analyzer using the OpenNLP natural-language processing library.

lucene-analysis-phonetic-XX.jar
  An add-on analysis library that provides phonetic encoders via Apache
  Commons-Codec. Note: this module depends on the commons-codec jar 
  file
  
lucene-analysis-smartcn-XX.jar
  An add-on analysis library that provides word segmentation for Simplified
  Chinese.

lucene-analysis-stempel-XX.jar
  An add-on analysis library that contains a universal algorithmic stemmer,
  including tables for the Polish language.

common/src/java
icu/src/java
kuromoji/src/java
morfologik/src/java
nori/src/java
opennlp/src/java
phonetic/src/java
smartcn/src/java
stempel/src/java
  The source code for the libraries.

common/src/test
icu/src/test
kuromoji/src/test
morfologik/src/test
nori/src/test
opennlp/src/test
phonetic/src/test
smartcn/src/test
stempel/src/test
  Unit tests for the libraries.