Commit Graph

217 Commits

Author SHA1 Message Date
Christine Poerschke fc24fa9506 SOLR-14267: complete <query><HashDocSet> solrconfig.xml removal 2020-02-26 17:47:25 +00:00
Robert Muir 0203815ab2
LUCENE-9220: regenerate all stemmers/stopwords/test data from snowball 2.0 (#1262)
Previous situation:

* The snowball base classes (Among, SnowballProgram, etc) had accumulated local performance-related changes. There was a task that would also "patch" generated classes (e.g. GermanStemmer) after-the-fact.
* Snowball classes had many "non-changes" from the original such as removal of tabs addition of javadocs, license headers, etc.
* Snowball test data (inputs and expected stems) was incorporated into lucene testing, but this was maintained manually. Also files had become large, making the test too slow (Nightly).
* Snowball stopwords lists from their website were manually maintained. In some cases encoding fixes were manually applied.
* Some generated stemmers (such as Estonian and Armenian) exist in lucene, but have no corresponding `.sbl` file in snowball sources at all.

Besides this mess, snowball project is "moving along" and acquiring new languages, adding non-BSD-licensed test data, huge test data, and other complexity. So it is time to automate the integration better.

New situation:

* Lucene has a `gradle snowball` regeneration task. It works on Linux or Mac only. It checks out their repos, applies the `snowball.patch` in our repository, compiles snowball stemmers, regenerates all java code, applies any adjustments so that our build is happy.
* Tests data is automatically regenerated from the commit hash of the snowball test data repository. Not all languages are tested from their data: only where the license is simple BSD. Test data is also (deterministically) sampled, so that we don't have huge files. We just want to make sure our integration works.
* Randomized tests are still set to test every language with generated fake words. The regeneration task ensures all languages get tested (it writes a simple text file list of them).
* Stopword files are automatically regenerated from the commit hash of the snowball website repository.
* The regeneration procedure is idempotent. This way when stuff does change, you know exactly what happened. For example if test data changes to a different license, you may see a git deletion. Or if a new language/stopwords/test data gets added, you will see git additions.
2020-02-17 12:38:01 -05:00
Robert Muir 975df9ddd3
LUCENE-9182: add apache license headers to all .gradle files and enforce in rat task 2020-01-27 12:05:34 -05:00
Dawid Weiss d4a9842375 Initial gradle build layer. 2019-12-02 15:34:57 +01:00
Dawid Weiss 063c82ebd6 SOLR-13952: reverting Erick's commit (with permission). 2019-11-25 17:56:20 +01:00
Erick Erickson 4b34d726ab SOLR-13952: Separate out Gradle-specific code from other (mostly test) changes and commit separately 2019-11-24 13:24:40 -05:00
Andrzej Bialecki b4fe911cc8 SOLR-13817: Remove legacy SolrCache implementations. 2019-11-14 21:21:44 +01:00
Dawid Weiss 2a1d5eea42 SOLR-13779: Use the safe fork of simple-xml for clustering contrib 2019-09-19 12:24:26 +02:00
Uwe Schindler faaee86efb LUCENE-8738: Move to Java 11 as minimum Java version (merged branch: jira/LUCENE-8738)
Co-authored-by: Adrien Grand <jpountz@apache.org>
2019-04-16 14:00:09 +02:00
Kevin Risden 6222abf448
SOLR-9079: Remove commons-lang as a dependency
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-03-20 19:37:19 -04:00
Jason Gerlowski c34f29a4b4 SOLR-13090: Add sysprop override for maxBooleanClauses 2019-01-02 10:58:32 -05:00
Michael Braun f0e1864ceb Merge remote-tracking branch 'source/master' into remove-constructor-wrapper-classes 2018-07-14 13:39:37 -04:00
Erick Erickson 93ae3669b5 SOLR-12008: Settle a location for the log4j2.xml file 2018-07-11 20:57:38 -07:00
Michael Braun fb6574100e LUCENE-8345 - add wrapper class constructors to forbiddenapis 2018-06-03 15:40:50 -04:00
David Smiley 6e0da7e2f8 SOLR-12374 Add SolrCore.withSearcher(lambda accepting SolrIndexSearcher) 2018-05-29 16:27:11 -04:00
Dawid Weiss 0e4e0096fd SOLR-12396: Upgrade Carrot2 to 3.16.0, HPPC to 0.8.1, morfologik to 2.1.5 2018-05-25 09:16:41 +02:00
Christine Poerschke 4da0d68981 SOLR-12372: contrib/clustering (private) logger rename 2018-05-18 20:00:49 +01:00
David Smiley 1409ab8f84 SOLR-11914: Deprecated some SolrParams methods.
* toSolrParams(nl) moved to a NamedList method, which is more natural.
2018-04-23 13:26:49 -04:00
Erick Erickson 624d128b5e SOLR-7887: Upgrade Solr to use log4j2 -- log4j 1 now officially end of life 2018-03-25 19:16:09 -07:00
Uwe Schindler 7b313bb597 SOLR-11482: RunExecutableListener was removed for security reasons 2017-10-13 13:48:18 +02:00
David Smiley 82a44beb3c SOLR-6807: requestDispatcher/@handleSelect now defaults to false; stop using it.
Deprecated StandardRequestHandler; stop using it.
2017-06-28 17:22:44 -04:00
Chris Hostetter 9f56698d33 SOLR-10948: Fix extraction component to treat DatePointField the same as TrieDateField
SOLR-10947: Randomize the usage of Points based numerics in contrib test schemas

This commit also adds @SuppressPointFields needed for SOLR-10949
2017-06-26 17:03:00 -07:00
Uwe Schindler 5de15ff403 LUCENE-7883: Lucene/Solr no longer uses the context class loader when resolving resources 2017-06-22 20:20:50 +02:00
Jan Høydahl 4746ff0ec8 SOLR-9623: Disable remote streaming in example configs by default. Adjust Upload Limit defaults 2017-06-20 11:00:08 +02:00
Jan Høydahl e776cbe446 SOLR-7041: Cut over tests from <defaultSearchField> in schema to df on requests 2017-05-05 00:53:50 +02:00
Jan Høydahl 61f64829d8 SOLR-7041: Remove a lot of defaultOperator and defaultSearchField from test configs (still more work to do) 2017-04-29 20:39:50 +02:00
Steve Rowe 1a80e4d694 SOLR-10343: Update Solr default/example and test configs to use SynonymGraphFilterFactory 2017-03-28 11:47:02 -04:00
Steve Rowe b46b8278c6 SOLR-10344: Update Solr default/example and test configs to use WordDelimiterGraphFilterFactory 2017-03-27 23:53:55 -04:00
David Smiley f1aef3d12b SOLR-10304: Refactor new SolrDocumentFetcher out of SolrIndexSearcher 2017-03-24 23:01:32 -04:00
yonik c9522a3936 SOLR-9712: block when maxWarmingSearchers is exceeded instead of throwing exception, default to 1, remove from most configs 2016-12-16 11:46:48 -05:00
Dawid Weiss 401d77485d SOLR-7539: Upgrade the clustering plugin to Carrot2 3.15.0. 2016-11-04 16:02:00 +01:00
Noble Paul 5c5c0c93ae SOLR-9576: Make FieldAnalysisRequestHandler, DocumentAnalysisRequestHandler & DumpRequestHandler implicit 2016-09-28 20:04:04 +05:30
Mike McCandless 87016b5f0c LUCENE-7318: graduate StandardAnalyzer and make it the default for IndexWriterConfig 2016-06-14 16:38:04 -04:00
Mikhail Khludnev a9dea9a983 SOLR-8445: fix line separator in log4j.properties files 2016-06-07 00:21:49 +03:00
Erick Erickson 5341918a0c SOLR-9083: Remove deprecated <types> and <fields> from schemas 2016-05-10 06:54:11 -07:00
Steve Rowe 9427b7402d SOLR-8764: Remove deprecated methods and classes 2016-03-02 09:22:20 -05:00
Dawid Weiss 4c9dca8d90 SOLR-8711: follow-up removal of dependencies no longer used. (applying
again, with changes to the sources).

This reverts commit 594fb908f6.
2016-02-24 09:27:06 +01:00
Dawid Weiss 594fb908f6 Revert "SOLR-8711: follow-up removal of dependencies no longer used."
This reverts commit a77d67a926.
2016-02-24 09:08:18 +01:00
Dawid Weiss a77d67a926 SOLR-8711: follow-up removal of dependencies no longer used. 2016-02-23 10:49:23 +01:00
Uwe Schindler 0f43705788 LUCENE-7013: Revert changes to package-info.java files (adds newline after license header and before javadocs back) 2016-02-04 23:02:50 +01:00
Shai Erera 813ca77250 LUCENE-7013: Move license header to top of file 2016-02-04 15:51:19 +02:00
Christine Poerschke c4db8d57a4 SOLR-8317: use responseHeader and response accessors to SolrQueryResponse
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1721170 13f79535-47bb-0310-9956-ffa450edef68
2015-12-21 14:39:21 +00:00
Michael McCandless 7da175b0b6 LUCENE-6917: rename/deprecate numeric classes in favor of dimensional values
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1719562 13f79535-47bb-0310-9956-ffa450edef68
2015-12-11 21:13:41 +00:00
Varun Thacker 695870d7a2 SOLR-8131: fix test solrconfig.xml files for the contrib modules
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1718307 13f79535-47bb-0310-9956-ffa450edef68
2015-12-07 11:18:31 +00:00
Anshum Gupta 6caaa0c454 SOLR-8330: Standardize and fix logger creation and usage so that they aren't shared across source files.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1717590 13f79535-47bb-0310-9956-ffa450edef68
2015-12-02 09:22:22 +00:00
David Wayne Smiley ce0363c0ad SOLR-8180: jcl-over-slf4j is officially a solrj/solr dependency now; not marked optional in a POM.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1717481 13f79535-47bb-0310-9956-ffa450edef68
2015-12-01 18:12:00 +00:00
Chris M. Hostetter ecf410f413 SOLR-8239: Added ClassicSimilarityFactory, marked DefaultSimilarityFactory as deprecated
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1712979 13f79535-47bb-0310-9956-ffa450edef68
2015-11-06 15:45:50 +00:00
Chris M. Hostetter d207e8c39a SOLR-8239: reverting r1712840 from trunk -- backport to 5x was more complicated then I expected because aparently DefaultSimilarity deprecation with ClassicSimilarity was never backported? want to clarify/resolve that before moving forward here
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1712845 13f79535-47bb-0310-9956-ffa450edef68
2015-11-05 18:44:13 +00:00
Chris M. Hostetter 2a24085bcd SOLR-8239: Added ClassicSimilarityFactory, marked DefaultSimilarityFactory as deprecated
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1712840 13f79535-47bb-0310-9956-ffa450edef68
2015-11-05 18:19:54 +00:00
Dawid Weiss 9cf70a3afd SOLR-7969: Unavailable clustering engines should not fail the core.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1697661 13f79535-47bb-0310-9956-ffa450edef68
2015-08-25 12:21:39 +00:00