lucene/solr/contrib
Uwe Schindler 57acbcfd00 SOLR-4679, SOLR-4908, SOLR-5124: Text extracted from HTML or PDF files using Solr Cell was missing ignorable whitespace, which is inserted by TIKA for convenience to support plain text extraction without using the HTML elements. This bug resulted in glued words.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1512296 13f79535-47bb-0310-9956-ffa450edef68
2013-08-09 13:26:55 +00:00
| Name | Last commit | Date |
| --- | --- | --- |
| analysis-extras | SOLR-5126: Update Carrot2 clustering to version 3.8.0, update Morfologik to version 1.7.1 | 2013-08-09 08:39:21 +00:00 |
| clustering | SOLR-5126: Update Carrot2 clustering to version 3.8.0, update Morfologik to version 1.7.1 | 2013-08-09 08:39:21 +00:00 |
| dataimporthandler | LUCENE-5107: Properties files by Lucene are now written in UTF-8 encoding, Unicode is no longer escaped. Reading of legacy properties files with \u escapes is still possible | 2013-07-12 17:10:22 +00:00 |
| dataimporthandler-extras | SOLR-4942: test improvements to randomize use of compound files | 2013-06-22 06:00:18 +00:00 |
| extraction | SOLR-4679, SOLR-4908, SOLR-5124: Text extracted from HTML or PDF files using Solr Cell was missing ignorable whitespace, which is inserted by TIKA for convenience to support plain text extraction without using the HTML elements. This bug resulted in glued words. | 2013-08-09 13:26:55 +00:00 |
| langid | SOLR-4412: LanguageIdentifier lcmap for language field | 2013-07-02 14:38:47 +00:00 |
| uima | SOLR-4708: Enable ClusteringComponent by default in collection1 example. | 2013-08-09 09:42:57 +00:00 |
| velocity | LUCENE-5107: Properties files by Lucene are now written in UTF-8 encoding, Unicode is no longer escaped. Reading of legacy properties files with \u escapes is still possible | 2013-07-12 17:10:22 +00:00 |
| contrib-build.xml | LUCENE-3808: Switch LuceneTestCaseRunner to RandomizedRunner. Enforce Random sharing contracts. Enforce thread leaks. | 2012-04-15 14:41:44 +00:00 |