lucene/solr/contrib/analysis-extras
Robert Muir 784ede4eda
LUCENE-9215: replace checkJavaDocs.py with doclet (#1802)
This has the same logic as the previous python, but no longer relies
upon parsing HTML output, instead using java's doclet processor.

The errors are reported like "normal" javadoc errors with source file
name and line number and happen when running "gradlew javadoc"

Although the "rules" are the same as the previous python, the python had
some bugs where the checker didn't quite do exactly what we wanted, so
some fixes were applied throughout.

Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
Co-authored-by: Uwe Schindler <uschindler@apache.org>
2020-09-02 08:29:17 -04:00
..
src fix links to JavaDocs 2020-09-01 13:36:23 -04:00
README.md SOLR-14429: Convert .txt files to properly formatted .md files (#1450) 2020-04-27 08:43:04 +09:00
build.gradle LUCENE-9215: replace checkJavaDocs.py with doclet (#1802) 2020-09-02 08:29:17 -04:00

README.md

Apache Solr - Analysis Extras

The analysis-extras plugin provides additional analyzers that rely upon large dependencies/dictionaries.

It includes integration with ICU for multilingual support, analyzers for Chinese and Polish, and integration with OpenNLP for multilingual tokenization, part-of-speech tagging lemmatization, phrase chunking, and named-entity recognition.

Each of the jars below relies upon including /dist/solr-analysis-extras-X.Y.jar in the solrconfig.xml

  • ICU relies upon lucene-libs/lucene-analyzers-icu-X.Y.jar and lib/icu4j-X.Y.jar

  • Smartcn relies upon lucene-libs/lucene-analyzers-smartcn-X.Y.jar

  • Stempel relies on lucene-libs/lucene-analyzers-stempel-X.Y.jar

  • Morfologik relies on lucene-libs/lucene-analyzers-morfologik-X.Y.jar and lib/morfologik-*.jar

  • OpenNLP relies on lucene-libs/lucene-analyzers-opennlp-X.Y.jar and lib/opennlp-*.jar