lucene/solr/contrib/analysis-extras
Erick Erickson 217c2faa2c LUCENE-7788: fail precommit on unparameterised log messages and examine for wasted work/objects 2020-05-01 13:06:57 -04:00
..
src LUCENE-7788: fail precommit on unparameterised log messages and examine for wasted work/objects 2020-05-01 13:06:57 -04:00
README.md SOLR-14429: Convert .txt files to properly formatted .md files (#1450) 2020-04-27 08:43:04 +09:00
build.gradle LUCENE-9182: add apache license headers to all .gradle files and enforce in rat task 2020-01-27 12:05:34 -05:00
build.xml LUCENE-2899: Add OpenNLP Analysis capabilities as a module 2017-12-15 11:24:18 -05:00
ivy.xml SOLR-13462: Update dependency definitions to include Ukrainian dictionary. 2019-05-14 21:29:52 +02:00

README.md

Apache Solr - Analysis Extras

The analysis-extras plugin provides additional analyzers that rely upon large dependencies/dictionaries.

It includes integration with ICU for multilingual support, analyzers for Chinese and Polish, and integration with OpenNLP for multilingual tokenization, part-of-speech tagging lemmatization, phrase chunking, and named-entity recognition.

Each of the jars below relies upon including /dist/solr-analysis-extras-X.Y.jar in the solrconfig.xml

  • ICU relies upon lucene-libs/lucene-analyzers-icu-X.Y.jar and lib/icu4j-X.Y.jar

  • Smartcn relies upon lucene-libs/lucene-analyzers-smartcn-X.Y.jar

  • Stempel relies on lucene-libs/lucene-analyzers-stempel-X.Y.jar

  • Morfologik relies on lucene-libs/lucene-analyzers-morfologik-X.Y.jar and lib/morfologik-*.jar

  • OpenNLP relies on lucene-libs/lucene-analyzers-opennlp-X.Y.jar and lib/opennlp-*.jar