Commit Graph

135 Commits

Author SHA1 Message Date
Uwe Schindler 57acbcfd00 SOLR-4679, SOLR-4908, SOLR-5124: Text extracted from HTML or PDF files using Solr Cell was missing ignorable whitespace, which is inserted by TIKA for convenience to support plain text extraction without using the HTML elements. This bug resulted in glued words.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1512296 13f79535-47bb-0310-9956-ffa450edef68
2013-08-09 13:26:55 +00:00
Jan Høydahl e2fc82a978 SOLR-4986: Upgrade to Tika 1.4
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1499338 13f79535-47bb-0310-9956-ffa450edef68
2013-07-03 11:49:59 +00:00
Chris M. Hostetter be9bf006c2 SOLR-4942: test improvements to randomize use of compound files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1495668 13f79535-47bb-0310-9956-ffa450edef68
2013-06-22 06:00:18 +00:00
Steven Rowe 28b7111dc7 SOLR-3251: Dynamically add fields to schema.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1470539 13f79535-47bb-0310-9956-ffa450edef68
2013-04-22 14:26:55 +00:00
Robert Muir 29b5142e79 LUCENE-4877: throw exception for invalid arguments in analysis factories
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1463191 13f79535-47bb-0310-9956-ffa450edef68
2013-04-01 16:30:00 +00:00
Robert Muir abea806409 SOLR-4641: Schema should throw exceptoin on illegal field parameters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1462502 13f79535-47bb-0310-9956-ffa450edef68
2013-03-29 15:46:58 +00:00
Mark Robert Miller a8ce23b445 SOLR-4416: Upgrade to Tika 1.3.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1452499 13f79535-47bb-0310-9956-ffa450edef68
2013-03-04 20:39:36 +00:00
Erik Hatcher 13fc322cf0 SOLR-2856: Fix schema (scheam) typo in test configurations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1449211 13f79535-47bb-0310-9956-ffa450edef68
2013-02-22 20:52:17 +00:00
Mark Robert Miller 529af9a0bb tests: remove these annoying svn sub properties from config files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1440069 13f79535-47bb-0310-9956-ffa450edef68
2013-01-29 19:23:45 +00:00
Shai Erera e2f54df3ca fix missing Overrides
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1420195 13f79535-47bb-0310-9956-ffa450edef68
2012-12-11 14:31:01 +00:00
Chris M. Hostetter b6db7c76f1 SOLR-3904: progress adding package.html files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1406204 13f79535-47bb-0310-9956-ffa450edef68
2012-11-06 16:17:50 +00:00
Robert Muir feddbabf80 fix and detect invalid html tags in javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1401947 13f79535-47bb-0310-9956-ffa450edef68
2012-10-25 01:57:46 +00:00
Robert Muir f025322f81 fix all malformed javadocs (@code/deprecated/param/returns/throws with no args)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1389133 13f79535-47bb-0310-9956-ffa450edef68
2012-09-23 19:54:23 +00:00
Uwe Schindler 90e692e180 SOLR-3775: Add a test for Office 2003 file (.doc)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1379357 13f79535-47bb-0310-9956-ffa450edef68
2012-08-31 08:31:34 +00:00
Jan Høydahl 01cdc012a0 SOLR-3707: Upgrade Solr to Tika 1.2
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1374501 13f79535-47bb-0310-9956-ffa450edef68
2012-08-18 01:47:40 +00:00
Chris M. Hostetter 1cac548005 SOLR-3650: checkpoint, migrated CHANGES.txt for contrib/uima and contrib/extraction
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1367384 13f79535-47bb-0310-9956-ffa450edef68
2012-07-31 01:37:17 +00:00
Robert Muir 6d2a288be3 LUCENE-4262: add ivy sync=true, so when resolve runs, only jars that should be there are there
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1366509 13f79535-47bb-0310-9956-ffa450edef68
2012-07-27 19:18:49 +00:00
Robert Muir 78ca29ee09 LUCENE-4263: remove conflicting versions of the same jar dependencies
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1366381 13f79535-47bb-0310-9956-ffa450edef68
2012-07-27 13:36:13 +00:00
Steven Rowe 5abc76ea42 LUCENE-3747: Support Unicode 6.1.0.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1365971 13f79535-47bb-0310-9956-ffa450edef68
2012-07-26 12:56:35 +00:00
Uwe Schindler a956609a85 Copy over changes from 3.6 branch #2
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1364570 13f79535-47bb-0310-9956-ffa450edef68
2012-07-23 10:33:34 +00:00
Robert Muir 9df919c5ba LUCENE-4199: fix more charset/locale usages
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene4199@1358890 13f79535-47bb-0310-9956-ffa450edef68
2012-07-09 01:17:15 +00:00
Robert Muir b6a4748f21 make Locale.ENGLISH/US -> ROOT for locale-independent operations, now that we are on java6
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene4199@1358594 13f79535-47bb-0310-9956-ffa450edef68
2012-07-07 15:42:40 +00:00
Mark Robert Miller 38fc03163a fix javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1356435 13f79535-47bb-0310-9956-ffa450edef68
2012-07-02 19:43:52 +00:00
Jan Høydahl 032cad944a SOLR-1929: Index encrypted files in SolrCell
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1354887 13f79535-47bb-0310-9956-ffa450edef68
2012-06-28 09:46:49 +00:00
Jan Høydahl 52516ebb21 SOLR-1856: In Solr Cell, literals should override Tika-parsed values
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1354455 13f79535-47bb-0310-9956-ffa450edef68
2012-06-27 12:05:55 +00:00
Mark Robert Miller 1fc4347e41 SOLR-1770: Move the default core instance directory into a collection1 folder
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1354308 13f79535-47bb-0310-9956-ffa450edef68
2012-06-27 01:48:00 +00:00
David Wayne Smiley f12b94ab86 SOLR-3313 Rename Query Type to Request Handler
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1351932 13f79535-47bb-0310-9956-ffa450edef68
2012-06-20 04:23:07 +00:00
Chris M. Hostetter 2ac3eb27c4 LUCENE-3949: fix license headers to not be javadoc style comments
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1348980 13f79535-47bb-0310-9956-ffa450edef68
2012-06-11 19:26:18 +00:00
Robert Muir e650829677 4.0.0 -> 4.0.0-ALPHA
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1348289 13f79535-47bb-0310-9956-ffa450edef68
2012-06-09 00:57:26 +00:00
Robert Muir f5223505b0 4.0 -> 5.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1343848 13f79535-47bb-0310-9956-ffa450edef68
2012-05-29 17:01:58 +00:00
Ryan McKinley 800f5a54a6 SOLR-2857: add UpdateRequestHandler that handles xml,csv,json, and javabin
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1335768 13f79535-47bb-0310-9956-ffa450edef68
2012-05-08 21:06:10 +00:00
Robert Muir 167b296353 LUCENE-3981: fix regex queryparsing issues (improperly recognized as wildcard, etc)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1326893 13f79535-47bb-0310-9956-ffa450edef68
2012-04-17 02:51:43 +00:00
Jan Høydahl ff5206e64f SOLR-3254: Upgrade Solr to Tika 1.1
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311198 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 10:48:03 +00:00
Ryan McKinley 81ddd8cbe2 SOLR-3329: Dropping getSourceID() from SolrInfoMBean and most of the svn:keywords
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310219 13f79535-47bb-0310-9956-ffa450edef68
2012-04-06 08:13:15 +00:00
Chris M. Hostetter bb7bc2ff44 LUCENE-3945: use sha1 checksums to verify jars pulled from ivy match expectations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309503 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 17:53:32 +00:00
Robert Muir 2fe2e82584 LUCENE-1866: better RAT reporting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309248 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 05:03:53 +00:00
Robert Muir 09bf650fbc SOLR-3295: merge changes entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1308891 13f79535-47bb-0310-9956-ffa450edef68
2012-04-03 13:30:45 +00:00
Jan Høydahl 058179d177 SOLR-1052: Deprecate/Remove <indexDefaults> and <mainIndex> in favor of <indexConfig> in solrconfig.xml (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1308608 13f79535-47bb-0310-9956-ffa450edef68
2012-04-02 22:55:37 +00:00
Robert Muir 8f0d7cc135 LUCENE-3930: nuke jars from source tree and use ivy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307563 13f79535-47bb-0310-9956-ffa450edef68
2012-03-30 18:04:43 +00:00
Uwe Schindler 989530e17e LUCENE-3850: Fix rawtypes warnings for Java 7 compiler
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1297048 13f79535-47bb-0310-9956-ffa450edef68
2012-03-05 13:34:40 +00:00
Mark Robert Miller 54681c0cfd fix jdom license files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1293343 13f79535-47bb-0310-9956-ffa450edef68
2012-02-24 17:12:51 +00:00
Jan Høydahl 6b113cd170 SOLR-2901: Added jdom-1.0.jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1293309 13f79535-47bb-0310-9956-ffa450edef68
2012-02-24 16:00:14 +00:00
Jan Høydahl 8f550126f7 SOLR-2901: Upgrade Solr to Tika 1.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235753 13f79535-47bb-0310-9956-ffa450edef68
2012-01-25 14:18:06 +00:00
Robert Muir f5d1e967db LUCENE-3667: make tests reasonable if you dont have a supercomputer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225920 13f79535-47bb-0310-9956-ffa450edef68
2011-12-30 21:14:15 +00:00
Koji Sekiguchi 21822811a9 SOLR-2346: Add a chance to set content encoding explicitly via content type of stream.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225120 13f79535-47bb-0310-9956-ffa450edef68
2011-12-28 07:17:55 +00:00
Simon Willnauer 356fa5d137 preparing Lucene 3.5 release adding 3.6 constants and changes sections
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1204451 13f79535-47bb-0310-9956-ffa450edef68
2011-11-21 11:27:57 +00:00
Martijn van Groningen 19765406b6 SOLR-2591: Remove commitLockTimeout option from solrconfig.xml
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1197154 13f79535-47bb-0310-9956-ffa450edef68
2011-11-03 14:38:13 +00:00
Erik Hatcher ce41a7b231 SOLR-2854: Fix ExtractingRequestHandler to call getStream before getting stream attributes.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1189803 13f79535-47bb-0310-9956-ffa450edef68
2011-10-27 15:24:14 +00:00
Robert Muir 7af9fbd16d LUCENE-3521: upgrade icu jar to 4.8.1.1 / remove lucenetestcase hack
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1183738 13f79535-47bb-0310-9956-ffa450edef68
2011-10-15 21:48:50 +00:00
Jan Høydahl 2c6623a3b3 SOLR-2372: Fixed LICENSE and NOTICE files for apache-mime4j
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1179427 13f79535-47bb-0310-9956-ffa450edef68
2011-10-05 20:39:25 +00:00