lucene/lucene/analysis
Steve Rowe 629346febb LUCENE-7013: add licence header position checker to -validate-source-patterns, and fix the violations it found 2016-07-13 13:20:34 -04:00
..
common LUCENE-7013: add licence header position checker to -validate-source-patterns, and fix the violations it found 2016-07-13 13:20:34 -04:00
icu LUCENE-7318: graduate StandardAnalyzer and make it the default for IndexWriterConfig 2016-06-14 18:41:37 -04:00
kuromoji LUCENE-7355: Add Analyzer#normalize() and use it in query parsers. 2016-07-12 18:01:44 +02:00
morfologik LUCENE-7355: Add Analyzer#normalize() and use it in query parsers. 2016-07-12 18:01:44 +02:00
phonetic LUCENE-7319: remove unused imports 2016-06-13 12:35:49 +01:00
smartcn LUCENE-7355: Add Analyzer#normalize() and use it in query parsers. 2016-07-12 18:01:44 +02:00
stempel LUCENE-7355: Add Analyzer#normalize() and use it in query parsers. 2016-07-12 18:01:44 +02:00
uima LUCENE-7319: remove unused imports 2016-06-13 12:35:49 +01:00
README.txt LUCENE-3720: fix BeiderMorseFilter OOM issues 2012-09-17 15:32:46 +00:00
analysis-module-build.xml LUCENE-5593: speed up javadocs generation in build 2014-07-28 15:36:27 +00:00
build.xml LUCENE-6732: Remove tabs in JS and XML files 2015-08-11 23:00:39 +00:00

README.txt

Analysis README file

INTRODUCTION

The Analysis Module provides analysis capabilities to Lucene and Solr
applications.

The Lucene web site is at:
  http://lucene.apache.org/

Please join the Lucene-User mailing list by sending a message to:
  java-user-subscribe@lucene.apache.org

FILES

lucene-analyzers-common-XX.jar
  The primary analysis module library, containing general-purpose analysis
  components and support for various languages.

lucene-analyzers-icu-XX.jar
  An add-on analysis library that provides improved Unicode support via
  International Components for Unicode (ICU). Note: this module depends on
  the ICU4j jar file (version >= 4.6.0)

lucene-analyzers-kuromoji-XX.jar
  An analyzer with morphological analysis for Japanese.

lucene-analyzers-morfologik-XX.jar
  An analyzer using the Morfologik stemming library.

lucene-analyzers-phonetic-XX.jar
  An add-on analysis library that provides phonetic encoders via Apache
  Commons-Codec. Note: this module depends on the commons-codec jar 
  file
  
lucene-analyzers-smartcn-XX.jar
  An add-on analysis library that provides word segmentation for Simplified
  Chinese.

lucene-analyzers-stempel-XX.jar
  An add-on analysis library that contains a universal algorithmic stemmer,
  including tables for the Polish language.

lucene-analyzers-uima-XX.jar
  An add-on analysis library that contains tokenizers/analyzers using
  Apache UIMA extracted annotations to identify tokens/types/etc.

common/src/java
icu/src/java
kuromoji/src/java
morfologik/src/java
phonetic/src/java
smartcn/src/java
stempel/src/java
uima/src/java
  The source code for the libraries.

common/src/test
icu/src/test
kuromoji/src/test
morfologik/src/test
phonetic/src/test
smartcn/src/test
stempel/src/test
uima/src/test
  Unit tests for the libraries.