lucene/solr/contrib/extraction/src
Mark Robert Miller 7ab2e1f787 SOLR-1301: Add a Solr contrib that allows for building Solr indexes via Hadoop's MapReduce.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1547139 13f79535-47bb-0310-9956-ffa450edef68
2013-12-02 18:42:23 +00:00
..
java SOLR-1301: Add a Solr contrib that allows for building Solr indexes via Hadoop's MapReduce. 2013-12-02 18:42:23 +00:00
test/org/apache/solr/handler/extraction SOLR-4679, SOLR-4908, SOLR-5124: Text extracted from HTML or PDF files using Solr Cell was missing ignorable whitespace, which is inserted by TIKA for convenience to support plain text extraction without using the HTML elements. This bug resulted in glued words. 2013-08-09 13:26:55 +00:00
test-files/extraction SOLR-4679, SOLR-4908, SOLR-5124: Text extracted from HTML or PDF files using Solr Cell was missing ignorable whitespace, which is inserted by TIKA for convenience to support plain text extraction without using the HTML elements. This bug resulted in glued words. 2013-08-09 13:26:55 +00:00