Analysis README file

INTRODUCTION

The Analysis Module provides analysis capabilities to Lucene and Solr
applications.

The Lucene web site is at:
  http://lucene.apache.org/

Please join the Lucene-User mailing list by sending a message to:
  java-user-subscribe@lucene.apache.org

FILES

lucene-analyzers-common-XX.jar
  The primary analysis module library, containing general-purpose analysis
  components and support for various languages.

lucene-analyzers-icu-XX.jar
  An add-on analysis library that provides improved Unicode support via
  International Components for Unicode (ICU). Note: this module depends on
  the ICU4J jar file (version >= 4.6.0).

lucene-analyzers-phonetic-XX.jar
  An add-on analysis library that provides phonetic encoders via Apache
  Commons-Codec. Note: this module depends on the commons-codec jar
  file (version >= 1.4).

lucene-analyzers-smartcn-XX.jar
  An add-on analysis library that provides word segmentation for Simplified
  Chinese.

lucene-analyzers-stempel-XX.jar
  An add-on analysis library that contains a universal algorithmic stemmer,
  including tables for the Polish language.

common/src/java
icu/src/java
phonetic/src/java
smartcn/src/java
stempel/src/java
  The source code for the five libraries.

common/src/test
icu/src/test
phonetic/src/test
smartcn/src/test
stempel/src/test
  Unit tests for the five libraries.
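
EXAMPLE

The icu and phonetic libraries listed under FILES need third-party jars
(ICU4J and commons-codec) on the classpath. The following is a minimal
sketch, not part of the module, of how an application might check for
those jars at startup. The class name DependencyCheck and the method
isOnClasspath are hypothetical. The two probed class names are assumed
to be present in the respective jars; finding them does not confirm that
the minimum versions noted above are met.

```java
/** Hypothetical helper: reports whether optional analysis dependencies are loadable. */
final class DependencyCheck {

    /** Returns true if the named class can be located, without initializing it. */
    static boolean isOnClasspath(String className) {
        try {
            Class.forName(className, false, DependencyCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Missing jar, or a jar that is present but cannot be linked.
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumed probe classes: one from ICU4J, one from commons-codec.
        String[] probes = {
            "com.ibm.icu.text.Normalizer2",
            "org.apache.commons.codec.language.DoubleMetaphone"
        };
        for (String probe : probes) {
            System.out.println(probe + ": "
                + (isOnClasspath(probe) ? "found" : "missing"));
        }
    }
}
```

Running the class with only lucene-analyzers-common-XX.jar on the
classpath should report both probes as missing; adding the ICU4J and
commons-codec jars should report them as found.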