lucene/lucene/analysis
Koji Sekiguchi b3acba47e7 fix typo in javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1501639 13f79535-47bb-0310-9956-ffa450edef68
2013-07-10 03:39:13 +00:00
common LUCENE-5097: Analyzer now has an additional tokenStream(String fieldName, String text) method, so wrapping by StringReader for common use is no longer needed. This method uses an internal reuseable reader, which was previously only used by the Field class. 2013-07-08 17:55:48 +00:00
icu LUCENE-5097: Analyzer now has an additional tokenStream(String fieldName, String text) method, so wrapping by StringReader for common use is no longer needed. This method uses an internal reuseable reader, which was previously only used by the Field class. 2013-07-08 17:55:48 +00:00
kuromoji fix typo in javadoc 2013-07-10 03:39:13 +00:00
morfologik LUCENE-5097: Analyzer now has an additional tokenStream(String fieldName, String text) method, so wrapping by StringReader for common use is no longer needed. This method uses an internal reuseable reader, which was previously only used by the Field class. 2013-07-08 17:55:48 +00:00
phonetic LUCENE-4993: Fix BeiderMorseFilter to preserve custom attributes when inserting tokens with position increment 0. 2013-05-10 08:00:19 +00:00
smartcn LUCENE-5097: Analyzer now has an additional tokenStream(String fieldName, String text) method, so wrapping by StringReader for common use is no longer needed. This method uses an internal reuseable reader, which was previously only used by the Field class. 2013-07-08 17:55:48 +00:00
stempel LUCENE-4877: throw exception for invalid arguments in analysis factories 2013-04-01 16:30:00 +00:00
uima LUCENE-5097: Analyzer now has an additional tokenStream(String fieldName, String text) method, so wrapping by StringReader for common use is no longer needed. This method uses an internal reuseable reader, which was previously only used by the Field class. 2013-07-08 17:55:48 +00:00
README.txt LUCENE-3720: fix BeiderMorseFilter OOM issues 2012-09-17 15:32:46 +00:00
analysis-module-build.xml LUCENE-3977: remove javadoc duplication in the website/binary release 2012-04-21 07:51:46 +00:00
build.xml LUCENE-4758: 'ant jar', 'ant compile' and 'ant compile-test' should recurse 2013-02-06 17:57:53 +00:00

README.txt

Analysis README file

INTRODUCTION

The Analysis Module provides text analysis capabilities, such as tokenizers,
token filters, and analyzers, to Lucene and Solr applications.

The Lucene web site is at:
  http://lucene.apache.org/

Please join the Lucene-User mailing list by sending a message to:
  java-user-subscribe@lucene.apache.org

FILES

lucene-analyzers-common-XX.jar
  The primary analysis module library, containing general-purpose analysis
  components and support for various languages.

lucene-analyzers-icu-XX.jar
  An add-on analysis library that provides improved Unicode support via
  International Components for Unicode (ICU). Note: this module depends on
  the ICU4J jar file (version >= 4.6.0).

lucene-analyzers-kuromoji-XX.jar
  An analyzer with morphological analysis for Japanese.

lucene-analyzers-morfologik-XX.jar
  An analyzer using the Morfologik stemming library.

lucene-analyzers-phonetic-XX.jar
  An add-on analysis library that provides phonetic encoders via Apache
  Commons-Codec. Note: this module depends on the commons-codec jar file.

lucene-analyzers-smartcn-XX.jar
  An add-on analysis library that provides word segmentation for Simplified
  Chinese.

lucene-analyzers-stempel-XX.jar
  An add-on analysis library that contains a universal algorithmic stemmer,
  including tables for the Polish language.

lucene-analyzers-uima-XX.jar
  An add-on analysis library that contains tokenizers and analyzers that use
  annotations extracted by Apache UIMA to identify tokens, types, etc.

common/src/java
icu/src/java
kuromoji/src/java
morfologik/src/java
phonetic/src/java
smartcn/src/java
stempel/src/java
uima/src/java
  The source code for the libraries.

common/src/test
icu/src/test
kuromoji/src/test
morfologik/src/test
phonetic/src/test
smartcn/src/test
stempel/src/test
uima/src/test
  Unit tests for the libraries.