lucene/solr/contrib/analysis-extras
Erick Erickson 8cbfb192ab SOLR-14563: Fix or suppress warnings in solr/contrib 2020-06-12 10:56:22 -04:00
..
src SOLR-14563: Fix or suppress warnings in solr/contrib 2020-06-12 10:56:22 -04:00
README.md SOLR-14429: Convert .txt files to properly formatted .md files (#1450) 2020-04-27 08:43:04 +09:00
build.gradle LUCENE-9321: Port markdown task to Gradle (#1477) 2020-05-17 14:46:26 +02:00
build.xml LUCENE-2899: Add OpenNLP Analysis capabilities as a module 2017-12-15 11:24:18 -05:00
ivy.xml SOLR-13462: Update dependency definitions to include Ukrainian dictionary. 2019-05-14 21:29:52 +02:00

README.md

Apache Solr - Analysis Extras

The analysis-extras plugin provides additional analyzers that rely upon large dependencies/dictionaries.

It includes integration with ICU for multilingual support, analyzers for Chinese and Polish, and integration with OpenNLP for multilingual tokenization, part-of-speech tagging lemmatization, phrase chunking, and named-entity recognition.

Each of the jars below relies upon including /dist/solr-analysis-extras-X.Y.jar in the solrconfig.xml

  • ICU relies upon lucene-libs/lucene-analyzers-icu-X.Y.jar and lib/icu4j-X.Y.jar

  • Smartcn relies upon lucene-libs/lucene-analyzers-smartcn-X.Y.jar

  • Stempel relies on lucene-libs/lucene-analyzers-stempel-X.Y.jar

  • Morfologik relies on lucene-libs/lucene-analyzers-morfologik-X.Y.jar and lib/morfologik-*.jar

  • OpenNLP relies on lucene-libs/lucene-analyzers-opennlp-X.Y.jar and lib/opennlp-*.jar