lucene/solr/contrib/analysis-extras
koji 995a902d1a SOLR-12570: OpenNLPExtractNamedEntitiesUpdateProcessor cannot support multi fields because pattern replacement doesn't work correctly 2018-07-23 16:58:46 +09:00
..
src SOLR-12570: OpenNLPExtractNamedEntitiesUpdateProcessor cannot support multi fields because pattern replacement doesn't work correctly 2018-07-23 16:58:46 +09:00
README.txt LUCENE-2899: Add OpenNLP Analysis capabilities as a module 2017-12-15 11:24:18 -05:00
build.xml LUCENE-2899: Add OpenNLP Analysis capabilities as a module 2017-12-15 11:24:18 -05:00
ivy.xml LUCENE-8420: Upgrade OpenNLP to 1.9.0 2018-07-23 13:55:22 +09:00

README.txt

The analysis-extras plugin provides additional analyzers that rely
upon large dependencies/dictionaries.

It includes integration with ICU for multilingual support,
analyzers for Chinese and Polish, and integration with
OpenNLP for multilingual tokenization, part-of-speech tagging
lemmatization, phrase chunking, and named-entity recognition.

ICU relies upon lucene-libs/lucene-analyzers-icu-X.Y.jar
and lib/icu4j-X.Y.jar

Smartcn relies upon lucene-libs/lucene-analyzers-smartcn-X.Y.jar

Stempel relies on lucene-libs/lucene-analyzers-stempel-X.Y.jar

Morfologik relies on lucene-libs/lucene-analyzers-morfologik-X.Y.jar
and lib/morfologik-*.jar

OpenNLP relies on lucene-libs/lucene-analyzers-opennlp-X.Y.jar
and lib/opennlp-*.jar