lucene/solr/contrib/extraction
Koen De Groote 67104dd615 LUCENE-8847: Code Cleanup: Rewrite StringBuilder.append with concatted strings (#707)
This specific commit affects all points in the casebase where the argument of a StringBuilder.append() call is itself a regular String concatenation.
This defeats the purpose of using StringBuilder and also introduces an extra alloction.
These changes should avoid that.

ant tests have run, succeeded on local machine.

Removing test files from the changes.

Another suggested rework.
2019-06-10 18:07:43 +02:00
..
src LUCENE-8847: Code Cleanup: Rewrite StringBuilder.append with concatted strings (#707) 2019-06-10 18:07:43 +02:00
README.txt SOLR-3650: checkpoint, migrated CHANGES.txt for contrib/uima and contrib/extraction 2012-07-31 01:37:17 +00:00
build.xml SOLR-8180: jcl-over-slf4j is officially a solrj/solr dependency now; not marked optional in a POM. 2015-12-01 18:12:00 +00:00
ivy.xml SOLR-12423: Upgrade to Tika 1.19.1 when available. Fixes #468 2018-10-17 13:06:09 -04:00

README.txt

Apache Solr Content Extraction Library (Solr Cell)

Introduction
------------

Apache Solr Extraction provides a means for extracting and indexing content contained in "rich" documents, such
as Microsoft Word, Adobe PDF, etc.  (Each name is a trademark of their respective owners)  This contrib module
uses Apache Tika to extract content and metadata from the files, which can then be indexed.  For more information,
see http://wiki.apache.org/solr/ExtractingRequestHandler

Getting Started
---------------
You will need Solr up and running.  Then, simply add the extraction JAR file, plus the Tika dependencies (in the ./lib folder)
to your Solr Home lib directory.  See http://wiki.apache.org/solr/ExtractingRequestHandler for more details on hooking it in
 and configuring.