Commit Graph

34941 Commits

Author SHA1 Message Date
Peter Gromov 8f75933f3d
LUCENE-9716: Hunspell: support flag usage before its format is even specified (#2277) 2021-02-02 21:25:56 +01:00
Nhat Nguyen 47e3d06ce0
LUCENE-9722: Close merged readers on abort (#2288)
We fail to close the merged readers of an aborted merge if its 
output segment contains no document.

This bug was discovered by a test in Elasticsearch 
(elastic/elasticsearch#67884).
2021-02-02 11:24:10 -05:00
Andrzej Bialecki 4cb1000ea0 SOLR-15122: Tentative fix for the test failure - the node in the test could go down
before the new plugin was active on the Overseer.
2021-02-02 12:06:39 +01:00
Peter Gromov b48d5beb34
LUCENE-9707: Hunspell: check Lucene's implementation against Hunspel's test data (#2267) 2021-02-02 10:46:14 +01:00
Dawid Weiss 2da7a4a86d LUCENE-9686: Add changes entry. 2021-02-02 09:10:03 +01:00
zacharymorn 3835cb4e95
LUCENE-9686: Fix read past EOF handling in DirectIODirectory (#2258) 2021-02-02 09:07:30 +01:00
Chris Hostetter 15aaec60d9 SOLR-14330: ExpandComponent now supports an expand.nullGroup=true option 2021-02-01 16:19:34 -07:00
Mike Drob 99748384cf
SOLR-14253 Replace sleep calls with ZK waits (#1297)
Co-Authored-By: markrmiller <markrmiller@apache.org>
2021-02-01 13:25:17 -06:00
András Salamon e8bc758144
SOLR-15115: Remove unused methods from TestRerankBase (#2261) 2021-02-01 17:31:58 +00:00
Andrzej Bialecki 9e8ca98985 SOLR-15068: RefGuide documentation for replica placement plugins (plus
minor cleanups).
2021-02-01 16:50:25 +01:00
Peter Gromov 7a7949aed2
LUCENE-9708: Hunspell: support FLAG UTF-8 in absence of SET UTF-8 (#2270) 2021-02-01 10:36:24 +01:00
Peter Gromov 8a34cc7afd
LUCENE-9701: Hunspell: implement simple REP-based suggestion algorithm (#2251) 2021-02-01 10:23:54 +01:00
Peter Gromov 9d45dfe776
LUCENE-9710: Hunspell: support minor compounding-related flags (#2272)
* LUCENE-9710: Hunspell: support COMPOUNDFLAG

* LUCENE-9710: Hunspell: fix CHECKCOMPOUNDCASE support

* LUCENE-9710: Hunspell: support CHECKCOMPOUNDDUP

* LUCENE-9710: Hunspell: support triple flags (CHECKCOMPOUNDTRIPLE, SIMPLIFIEDTRIPLE)

* LUCENE-9710: Hunspell: support COMPOUNDFORBIDFLAG

* LUCENE-9710: Hunspell: support FORCEUCASE
2021-02-01 10:20:11 +01:00
Peter Gromov 40e92315ae
LUCENE-9709: Hunspell: no special dotted i treatment outside tr/az languages (#2271) 2021-02-01 10:05:28 +01:00
Mike Drob 5cca464517 SOLR-15122 Debug Logging 2021-01-29 15:49:10 -06:00
Mike McCandless 4d839225b1 LUCENE-9537: move to 8.9 section in CHANGES.txt; make it consistent with 8.x's CHANGES.txt; remove the leading UTF-8 BOM 2021-01-29 16:46:54 -05:00
Mike McCandless cac5c2a4b2 LUCENE-9694: make new DocumentSelector interface public so it is usable outside of its own package 2021-01-29 16:10:59 -05:00
cammiemw 9cc5c9b798
LUCENE-9537: Add initial Indri search engine functionality to Lucene 2021-01-29 14:47:24 -05:00
Patrick Zhai e4cede0e8c
LUCENE-9694: New tool for creating a deterministic index (#2246) 2021-01-29 13:32:24 -05:00
Eric Pugh 6d71a0aced
SOLR-14067: v4 Create /contrib/scripting module with ScriptingUpdateProcessor (#2257)
* Creating Scripting contrib module to centralize the less secure code related to scripts.

* tweak the changelog and update notice to explain why the name changed and the security posture thinking

* the test script happens to be a currency.xml, which made me think we were doing something specific to currency types, but instead any xml formatted file will suffice for the test.

* Update solr/contrib/scripting/src/java/org/apache/solr/scripting/update/ScriptUpdateProcessorFactory.java

* Update solr/contrib/scripting/src/java/org/apache/solr/scripting/update/package-info.java

* drop the ing, and be more specific on the name of the ref guide page

* comment out the script update chain.

The sample techproducts configSet is used by many of the solr unit tests, and by default doesn't have access to the jar file in the contrib module.   This is commented out, similar to how the lang contrib is.

* using a Mock for the script processor in order to keep the trusted configSets tests all together.

* tweak since we are using a mock script processor

Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-29 12:27:36 -05:00
Tim Dillon a7a434dbc4
SOLR-15025: MiniSolrCloudCluster.waitForAllNodes ignores passed timeout value (#2193)
* Change timeout values to seconds
2021-01-29 11:22:06 -06:00
Tim Owen 715caaae52
SOLR-15085 Prevent EmbeddedSolrServer calling shutdown on a CoreContainer that was passed to it 2021-01-29 11:15:22 -06:00
Mike Drob 0d4769e174
SOLR-15120 Reduce duplicated core creation work (#2266)
Use j.u.c collections instead of sync block
Rework how we load implicit handlers
Additional debug and trace logging for zookeeper comms
2021-01-29 10:20:16 -06:00
Peter Gromov ff943ece8f
LUCENE-9702: Hunspell: support alternate casing for short language codes (#2253) 2021-01-29 11:46:45 +01:00
Peter Gromov 6635d7a5e7
LUCENE-9704: Hunspell: support capitalization for German ß (#2260) 2021-01-29 10:03:37 +01:00
Peter Gromov 71705c900b
LUCENE-9703: Hunspell: prohibit FORBIDDENWORD words and their case variations (#2254) 2021-01-29 08:36:37 +01:00
Peter Gromov 4ba78f2ab2
LUCENE-9706: Hunspell: support NEEDAFFIX flag on affixes (#2262) 2021-01-29 08:24:23 +01:00
Peter Gromov 800f4d0919
LUCENE-9700: Hunspell: support words with trailing dots (#2249) 2021-01-29 08:23:03 +01:00
Cassandra Targett a9ad02cc54 SOLR-14616: remove leftover CDCR ref guide images 2021-01-28 15:40:32 -06:00
Mike Drob b335034615
Require Thread Names in Solr (#2264)
When we are creating a new thread we should give it a descriptive name and enforce this via ForbiddenAPIs. This doesn't apply to Runnable or Callable objects that we pass to an executor, since those should be getting named by the executor itself.

We don't require this in tests because the tests should be more self contained and there is less benefit in descriptive names. If somebody is already profiling a test, then they likely have the context to understand what the unnamed threads are doing, whereas a thread dump from a running Solr instance should have good thread names for everything. This is especially helpful when doing profiling, otherwise we end up with a bunch of Thread-# that are hard to tell apart and search on.
2021-01-28 15:04:15 -06:00
Mike Drob 9be71b3939
SOLR-15113 Do not attempt to start Solr server when embedded ZK fails (#2255) 2021-01-28 11:37:51 -06:00
Joel Bernstein acb98e549d SOLR-14672: Make timeouts configurable for the Streaming Expression SolrClientCache 2021-01-28 10:36:49 -05:00
Michael Sokolov d84925a201 Exclude degenerate edge case (maxConn=1) from TestKnnGraph.testSearch 2021-01-28 10:16:05 -05:00
Peter Gromov a176308aa6
LUCENE-9699: Support German-like compound words (#2248) 2021-01-27 22:31:58 +01:00
Michael Sokolov 38ec2602ce
LUCENE-9695: don't merge deleted vectors (#2239) 2021-01-27 12:18:55 -05:00
Andrzej Bialecki b1cd6b691f SOLR-15055: Fix this test to avoid OS dependency. 2021-01-27 13:38:46 +01:00
Houston Putman 7da0fa8a07 SOLR-15075: Remove docker gradle artifact
Gradle is currently picking up the dockerBuild task in "assemble",
because of the build artifact.
2021-01-26 13:33:32 -05:00
Julie Tibshirani 1887280475
LUCENE-9322: Move old field infos format to backwards-codecs. (#2245)
We introduced a new `Lucene90FieldInfosFormat`, so the old
`Lucene60FieldInfosFormat` should live in backwards-codecs.
2021-01-26 10:14:55 -08:00
Houston Putman 2b8d7bcd6a
SOLR-15075: Solr docker gradle improvements (#2197)
* Removed docker plugin from gradle builds.
* Removed package docker image.
* Tasks now have correct inputs/outputs/dependencies.
* Move gradle help text to docker folder.
* Reduce duplicated Docker layer by doing file removal and chmod in another stage.

Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-26 10:22:50 -05:00
Andrzej Bialecki 6c72e2e17b SOLR-15055: Add missing license headers. 2021-01-26 16:09:38 +01:00
Andrzej Bialecki 6e9185a33a SOLR-15055: Re-implement 'withCollection'. 2021-01-26 15:16:33 +01:00
Peter Gromov 695e789891
LUCENE-9698: Hunspell: reuse char[] when possible when stripping affix (#2243) 2021-01-26 13:03:44 +01:00
Peter Gromov 80e4def97b
LUCENE-9697: Hunspell Stemmer: use the same FST.BytesReader on all recursion levels (#2242) 2021-01-26 09:31:23 +01:00
Peter Gromov a82634db9d
LUCENE-9693: Hunspell: check that all flags are > 0 and fit char range (#2238) 2021-01-26 09:29:47 +01:00
epugh@opensourceconnections.com 0d88c14837 typo 2021-01-25 14:41:10 -05:00
Chris Hostetter 47a89aca71 SOLR-15078: Fix ExpandComponent behavior when expanding on numeric fields to differentiate '0' group from null group 2021-01-25 11:33:57 -07:00
Andrzej Bialecki 166d39a12e SOLR-15076: Fix wrong test assumption - type of this property has changed
in SOLR-14924.
2021-01-25 16:56:50 +01:00
epugh@opensourceconnections.com ce1bba6d66 Revert "SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor (#2215)"
This reverts commit cf5db8d651.
2021-01-25 09:53:36 -05:00
Gus Heck f942b2dd8a
@gus-asf LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains (#2241)
LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains to fix failure on seed 65EA739C95F40313
2021-01-25 07:49:46 -05:00
jimczi 3d737bd76a LUCENE-9575: Add a producer for new PatternTypingRule arguments in TestRandomChains. 2021-01-25 10:00:16 +01:00