Commit Graph

519 Commits

Author SHA1 Message Date
Uwe Schindler 57acbcfd00 SOLR-4679, SOLR-4908, SOLR-5124: Text extracted from HTML or PDF files using Solr Cell was missing ignorable whitespace, which is inserted by TIKA for convenience to support plain text extraction without using the HTML elements. This bug resulted in glued words.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1512296 13f79535-47bb-0310-9956-ffa450edef68
2013-08-09 13:26:55 +00:00
Dawid Weiss 0fd1bed95b SOLR-4708: Enable ClusteringComponent by default in collection1 example.
The solr.clustering.enabled system property is set to 'true' by default.



git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1512227 13f79535-47bb-0310-9956-ffa450edef68
2013-08-09 09:42:57 +00:00
Dawid Weiss b40f603f46 SOLR-5126: Update Carrot2 clustering to version 3.8.0, update Morfologik to version 1.7.1
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1512203 13f79535-47bb-0310-9956-ffa450edef68
2013-08-09 08:39:21 +00:00
Uwe Schindler 40968cf653 LUCENE-5107: Properties files by Lucene are now written in UTF-8 encoding, Unicode is no longer escaped. Reading of legacy properties files with \u escapes is still possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1502615 13f79535-47bb-0310-9956-ffa450edef68
2013-07-12 17:10:22 +00:00
Alan Woodward 5a42052fba SOLR-4914: Factor out core discovery and persistence logic
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1502276 13f79535-47bb-0310-9956-ffa450edef68
2013-07-11 16:35:01 +00:00
Steven Rowe b2bc345a60 SOLR-4916: Move MiniDfsCluster test dependencies from solr test-framework to solr-core; download solr-core test dependencies to solr/core/test-lib/ instead of solr/core/lib/; download DIH test dependencies to solr/contrib/dataimporthandler/test-lib instead of [...]/lib/
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1502105 13f79535-47bb-0310-9956-ffa450edef68
2013-07-11 04:03:24 +00:00
Uwe Schindler f092795fe9 LUCENE-5097: Analyzer now has an additional tokenStream(String fieldName, String text) method, so wrapping by StringReader for common use is no longer needed. This method uses an internal reuseable reader, which was previously only used by the Field class.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1500862 13f79535-47bb-0310-9956-ffa450edef68
2013-07-08 17:55:48 +00:00
Shalin Shekhar Mangar 6e638c8460 SOLR-4978: Time is stripped from datetime column when imported into Solr date field if convertType=true
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1500666 13f79535-47bb-0310-9956-ffa450edef68
2013-07-08 11:30:40 +00:00
Shalin Shekhar Mangar e98616b8f1 SOLR-4788: Multiple Entities DIH delta import: dataimporter.[entityName].last_index_time is empty
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1500652 13f79535-47bb-0310-9956-ffa450edef68
2013-07-08 11:11:58 +00:00
Dawid Weiss efc6826ed8 LUCENE-5089: Update to Morfologik 1.6.0. MorfologikAnalyzer and MorfologikFilter no longer support multiple "dictionaries" as there is only one dictionary available.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1499352 13f79535-47bb-0310-9956-ffa450edef68
2013-07-03 12:14:50 +00:00
Jan Høydahl e2fc82a978 SOLR-4986: Upgrade to Tika 1.4
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1499338 13f79535-47bb-0310-9956-ffa450edef68
2013-07-03 11:49:59 +00:00
Alan Woodward f38c5163cc SOLR-4948: Tidy up CoreContainer construction logic
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1498990 13f79535-47bb-0310-9956-ffa450edef68
2013-07-02 15:50:49 +00:00
Jan Høydahl 42b3251408 SOLR-4412: LanguageIdentifier lcmap for language field
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1498959 13f79535-47bb-0310-9956-ffa450edef68
2013-07-02 14:38:47 +00:00
Chris M. Hostetter be9bf006c2 SOLR-4942: test improvements to randomize use of compound files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1495668 13f79535-47bb-0310-9956-ffa450edef68
2013-06-22 06:00:18 +00:00
Shalin Shekhar Mangar 14d7d11d59 SOLR-4920: JdbcDataSource incorrectly suppresses exceptions when retrieving a connection from a JNDI context and falls back to trying to use DriverManager to obtain a connection. Additionally, if a SQLException is thrown while initializing a connection, such as in setAutoCommit(), the connection will not be closed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1492537 13f79535-47bb-0310-9956-ffa450edef68
2013-06-13 06:47:47 +00:00
Tommaso Teofili eee9bd713f SOLR-4865 - improved UIMA URP logging
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1486792 13f79535-47bb-0310-9956-ffa450edef68
2013-05-28 07:35:33 +00:00
Adrien Grand fa76c30147 LUCENE-4963: Completely remove deprecated options in 5.0.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1479171 13f79535-47bb-0310-9956-ffa450edef68
2013-05-04 20:25:51 +00:00
Steven Rowe 28b7111dc7 SOLR-3251: Dynamically add fields to schema.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1470539 13f79535-47bb-0310-9956-ffa450edef68
2013-04-22 14:26:55 +00:00
Shalin Shekhar Mangar ccf99d3e6c SOLR-4530: DIH: Provide configuration to use Tika's IdentityHtmlMapper
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1465879 13f79535-47bb-0310-9956-ffa450edef68
2013-04-09 04:40:12 +00:00
Robert Muir 29b5142e79 LUCENE-4877: throw exception for invalid arguments in analysis factories
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1463191 13f79535-47bb-0310-9956-ffa450edef68
2013-04-01 16:30:00 +00:00
Steven Rowe b1e4ba6843 SOLR-4658: In preparation for REST API requests that can modify the schema, a "managed schema" is introduced.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1463182 13f79535-47bb-0310-9956-ffa450edef68
2013-04-01 16:16:03 +00:00
Robert Muir abea806409 SOLR-4641: Schema should throw exceptoin on illegal field parameters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1462502 13f79535-47bb-0310-9956-ffa450edef68
2013-03-29 15:46:58 +00:00
Uwe Schindler bf368745b9 SOLR-4634: Fix scripting engine tests to work with Java 8's "Nashorn" Javascript implementation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1460069 13f79535-47bb-0310-9956-ffa450edef68
2013-03-23 00:53:25 +00:00
Uwe Schindler 9a54942863 LUCENE-4747: Fix nocommits
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1457747 13f79535-47bb-0310-9956-ffa450edef68
2013-03-18 13:07:24 +00:00
Uwe Schindler 6231597974 LUCENE-4747: Move to Java 7 in trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1457734 13f79535-47bb-0310-9956-ffa450edef68
2013-03-18 12:28:52 +00:00
James Dyer e9f2108093 SOLR-4361: DIH to allow handler parameters with dots in the name
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1455245 13f79535-47bb-0310-9956-ffa450edef68
2013-03-11 17:37:17 +00:00
Mark Robert Miller a8ce23b445 SOLR-4416: Upgrade to Tika 1.3.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1452499 13f79535-47bb-0310-9956-ffa450edef68
2013-03-04 20:39:36 +00:00
Dawid Weiss 2080245c8d SOLR-4259: Carrot2 dependency should be declared on the mini version, not the core.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1452220 13f79535-47bb-0310-9956-ffa450edef68
2013-03-04 09:30:00 +00:00
Erik Hatcher 13fc322cf0 SOLR-2856: Fix schema (scheam) typo in test configurations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1449211 13f79535-47bb-0310-9956-ffa450edef68
2013-02-22 20:52:17 +00:00
Erik Hatcher 1082bf8a71 SOLR-2470: Added more tests for VelocityResponseWriter (merged from lucene_4x r1449183)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1449188 13f79535-47bb-0310-9956-ffa450edef68
2013-02-22 19:57:47 +00:00
Steven Rowe 83fa975ae9 LUCENE-4789: fix typos
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1448400 13f79535-47bb-0310-9956-ffa450edef68
2013-02-20 20:53:46 +00:00
Tommaso Teofili 80430f5f62 LUCENE-4749 - added sample conf in solr/contrib/uima too
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1442112 13f79535-47bb-0310-9956-ffa450edef68
2013-02-04 13:41:06 +00:00
Jan Høydahl d6317b5e68 SOLR-3967: langid.enforceSchema option checks source field instead of target field
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1440226 13f79535-47bb-0310-9956-ffa450edef68
2013-01-30 00:26:39 +00:00
Mark Robert Miller 529af9a0bb tests: remove these annoying svn sub properties from config files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1440069 13f79535-47bb-0310-9956-ffa450edef68
2013-01-29 19:23:45 +00:00
Chris M. Hostetter 5dbcae4cd3 SOLR-4342: Fix DataImportHandler stats to be a prper Map
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1439536 13f79535-47bb-0310-9956-ffa450edef68
2013-01-28 18:03:26 +00:00
James Dyer f8c00c1f85 SOLR-4325: fix TestBuiltInEvaluators
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1438597 13f79535-47bb-0310-9956-ffa450edef68
2013-01-25 17:20:22 +00:00
Uwe Schindler 4be9caa5b2 Disable solr fails!
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1435628 13f79535-47bb-0310-9956-ffa450edef68
2013-01-19 18:11:42 +00:00
Dawid Weiss b0cba0346b SOLR-4288: Improve logging for FileDataSource (basePath, relative resources).
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1433849 13f79535-47bb-0310-9956-ffa450edef68
2013-01-16 08:22:03 +00:00
Steven Rowe 56b62c1ddb SOLR-4287: Removed "apache-" prefix from Solr distribution and artifact filenames.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1432483 13f79535-47bb-0310-9956-ffa450edef68
2013-01-12 17:51:57 +00:00
Erick Erickson faad008d7f removed println
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1432058 13f79535-47bb-0310-9956-ffa450edef68
2013-01-11 14:11:56 +00:00
Shalin Shekhar Mangar f41e563d7d SOLR-4112: Fixed DataImportHandler ZKAwarePropertiesWriter implementation so
import works fine with SolrCloud clusters

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1432045 13f79535-47bb-0310-9956-ffa450edef68
2013-01-11 13:38:23 +00:00
James Dyer 2a1e68e504 SOLR-2201: DIH "formatDate" evaluator timezone support
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1429957 13f79535-47bb-0310-9956-ffa450edef68
2013-01-07 18:54:21 +00:00
Dawid Weiss 0d05a14bd6 SOLR-4106: Javac/ ivy path warnings with morfologik fixed by updating to Morfologik 1.5.5 (no functional changes).
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1428823 13f79535-47bb-0310-9956-ffa450edef68
2013-01-04 12:42:00 +00:00
Stanisław Osiński ed09a998b6 SOLR-4253: Misleading resource loading warning from Carrot2 clustering component fixed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1427755 13f79535-47bb-0310-9956-ffa450edef68
2013-01-02 12:06:49 +00:00
Stanisław Osiński bf412aa62c SOLR-3279: upgrade Carrot2 to version 3.6.2
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1427747 13f79535-47bb-0310-9956-ffa450edef68
2013-01-02 11:41:19 +00:00
James Dyer 69383cb079 SOLR-4247: fix bug with TestSimplePropertiesWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1427215 13f79535-47bb-0310-9956-ffa450edef68
2012-12-31 18:30:41 +00:00
Erik Hatcher fcc4d6f01a SOLR-3735: Relocate the example mime-to-extension mapping, and upgrade Velocity Engine to 1.7
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1426716 13f79535-47bb-0310-9956-ffa450edef68
2012-12-29 08:41:53 +00:00
Mark Robert Miller f5b5d3e17f SOLR-4204: Make SolrCloud tests more friendly to FreeBSD blackhole 2 environments.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1422728 13f79535-47bb-0310-9956-ffa450edef68
2012-12-17 00:38:03 +00:00
Uwe Schindler a9e41d20a3 SOLR-4195: Further restrict security policy of tests to disallow writing to files outside the test's work dir (e.g. disallow writing to build/test-files)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1422368 13f79535-47bb-0310-9956-ffa450edef68
2012-12-15 21:50:57 +00:00
Tommaso Teofili 7e7bfe1f4d [SOLR-3902] - minor fixes (imports / warnings)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1421108 13f79535-47bb-0310-9956-ffa450edef68
2012-12-13 07:58:27 +00:00