Commit Graph

34916 Commits

Author SHA1 Message Date
Peter Gromov 71705c900b
LUCENE-9703: Hunspell: prohibit FORBIDDENWORD words and their case variations (#2254) 2021-01-29 08:36:37 +01:00
Peter Gromov 4ba78f2ab2
LUCENE-9706: Hunspell: support NEEDAFFIX flag on affixes (#2262) 2021-01-29 08:24:23 +01:00
Peter Gromov 800f4d0919
LUCENE-9700: Hunspell: support words with trailing dots (#2249) 2021-01-29 08:23:03 +01:00
Cassandra Targett a9ad02cc54 SOLR-14616: remove leftover CDCR ref guide images 2021-01-28 15:40:32 -06:00
Mike Drob b335034615
Require Thread Names in Solr (#2264)
When we are creating a new thread we should give it a descriptive name and enforce this via ForbiddenAPIs. This doesn't apply to Runnable or Callable objects that we pass to an executor, since those should be getting named by the executor itself.

We don't require this in tests because the tests should be more self contained and there is less benefit in descriptive names. If somebody is already profiling a test, then they likely have the context to understand what the unnamed threads are doing, whereas a thread dump from a running Solr instance should have good thread names for everything. This is especially helpful when doing profiling, otherwise we end up with a bunch of Thread-# that are hard to tell apart and search on.
2021-01-28 15:04:15 -06:00
Mike Drob 9be71b3939
SOLR-15113 Do not attempt to start Solr server when embedded ZK fails (#2255) 2021-01-28 11:37:51 -06:00
Joel Bernstein acb98e549d SOLR-14672: Make timeouts configurable for the Streaming Expression SolrClientCache 2021-01-28 10:36:49 -05:00
Michael Sokolov d84925a201 Exclude degenerate edge case (maxConn=1) from TestKnnGraph.testSearch 2021-01-28 10:16:05 -05:00
Peter Gromov a176308aa6
LUCENE-9699: Support German-like compound words (#2248) 2021-01-27 22:31:58 +01:00
Michael Sokolov 38ec2602ce
LUCENE-9695: don't merge deleted vectors (#2239) 2021-01-27 12:18:55 -05:00
Andrzej Bialecki b1cd6b691f SOLR-15055: Fix this test to avoid OS dependency. 2021-01-27 13:38:46 +01:00
Houston Putman 7da0fa8a07 SOLR-15075: Remove docker gradle artifact
Gradle is currently picking up the dockerBuild task in "assemble",
because of the build artifact.
2021-01-26 13:33:32 -05:00
Julie Tibshirani 1887280475
LUCENE-9322: Move old field infos format to backwards-codecs. (#2245)
We introduced a new `Lucene90FieldInfosFormat`, so the old
`Lucene60FieldInfosFormat` should live in backwards-codecs.
2021-01-26 10:14:55 -08:00
Houston Putman 2b8d7bcd6a
SOLR-15075: Solr docker gradle improvements (#2197)
* Removed docker plugin from gradle builds.
* Removed package docker image.
* Tasks now have correct inputs/outputs/dependencies.
* Move gradle help text to docker folder.
* Reduce duplicated Docker layer by doing file removal and chmod in another stage.

Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-26 10:22:50 -05:00
Andrzej Bialecki 6c72e2e17b SOLR-15055: Add missing license headers. 2021-01-26 16:09:38 +01:00
Andrzej Bialecki 6e9185a33a SOLR-15055: Re-implement 'withCollection'. 2021-01-26 15:16:33 +01:00
Peter Gromov 695e789891
LUCENE-9698: Hunspell: reuse char[] when possible when stripping affix (#2243) 2021-01-26 13:03:44 +01:00
Peter Gromov 80e4def97b
LUCENE-9697: Hunspell Stemmer: use the same FST.BytesReader on all recursion levels (#2242) 2021-01-26 09:31:23 +01:00
Peter Gromov a82634db9d
LUCENE-9693: Hunspell: check that all flags are > 0 and fit char range (#2238) 2021-01-26 09:29:47 +01:00
epugh@opensourceconnections.com 0d88c14837 typo 2021-01-25 14:41:10 -05:00
Chris Hostetter 47a89aca71 SOLR-15078: Fix ExpandComponent behavior when expanding on numeric fields to differentiate '0' group from null group 2021-01-25 11:33:57 -07:00
Andrzej Bialecki 166d39a12e SOLR-15076: Fix wrong test assumption - type of this property has changed
in SOLR-14924.
2021-01-25 16:56:50 +01:00
epugh@opensourceconnections.com ce1bba6d66 Revert "SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor (#2215)"
This reverts commit cf5db8d651.
2021-01-25 09:53:36 -05:00
Gus Heck f942b2dd8a
@gus-asf LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains (#2241)
LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains to fix failure on seed 65EA739C95F40313
2021-01-25 07:49:46 -05:00
jimczi 3d737bd76a LUCENE-9575: Add a producer for new PatternTypingRule arguments in TestRandomChains. 2021-01-25 10:00:16 +01:00
Peter Gromov e4ec3e3919
LUCENE-9692: Hunspell: extract Stemmer.stripAffix from similar code in prefix/suffix processing (#2237) 2021-01-25 09:11:11 +01:00
Peter Gromov f64e7cbbda
LUCENE-9691: Hunspell: support trailing comments on aff option lines (#2236)
plus cleanup & deduplicate parsing
2021-01-25 09:08:57 +01:00
Peter Gromov c7e1079da9
LUCENE-9690: Hunspell: support special title-case for words with apostrophe (#2235) 2021-01-25 09:01:12 +01:00
zacharymorn 7f4d4dfdbf
LUCENE-9668: Deprecate MinShouldMatchSumScorer with WANDScorer (#2205) 2021-01-25 07:28:19 +01:00
Mike McCandless 93107d6379 LUCENE-9575: fix style violations so precommit passes again 2021-01-24 08:06:50 -05:00
Gus Heck c087f6f8c0
LUCENE-9575 Add PatternTypingFilter to annotate tokens with flags and types (#1995)
LUCENE-9575 Add PatternTypingFilter
2021-01-23 18:40:13 -05:00
Christine Poerschke 9d4811e02f SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo.
Same fix as the #2210 PR commit earlier but this time not extending SystemInfoHandlerTest and also not adding a static SystemInfoHandler.getSecurityInfo variant for test use.
2021-01-22 18:33:28 +00:00
Christine Poerschke 64d445bbaa Revert "SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo (#2210)"
This reverts commit fb88b0268a.

Resolved Conflicts:
	solr/CHANGES.txt
2021-01-22 17:48:25 +00:00
Marcus 4bc5d51494
LUCENE-8626: Lucene standardize test naming part 3 and final (#2220) 2021-01-22 12:38:52 -05:00
Christine Poerschke 32e95ddb3f
SOLR-15071: Fix ArrayIndexOutOfBoundsException in contrib/ltr SolrFeatureScorer (#2196) 2021-01-22 13:34:58 +00:00
Christine Poerschke fb88b0268a
SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo (#2210) 2021-01-22 12:10:10 +00:00
Peter Gromov 0a1a3f4c40
LUCENE-9688: Hunspell: consider prefix's continuation flags when applying suffix (#2229) 2021-01-22 12:03:55 +01:00
Peter Gromov d7968130c3
LUCENE-9684: Hunspell: support COMPOUNDRULE (#2228) 2021-01-22 12:01:53 +01:00
Eric Pugh cf5db8d651
SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor (#2215)
* Creating Scripting contrib module to centralize the less secure code related to scripts.

* tweak the changelog and update notice to explain why the name changed and the security posture thinking

* the test script happens to be a currency.xml, which made me think we were doing something specific to currency types, but instead any xml formatted file will suffice for the test.

* drop the ing, and be more specific on the name of the ref guide page

* use the same name everywhere

Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-21 13:32:46 -05:00
András Salamon 83e0397c42
SOLR-14297: Replace commons-codec Base64 with JDK8 Base64 (#2222) 2021-01-21 13:10:33 -05:00
Anand e5a16f0b0f
LUCENE-9674: Use binary search in VectorValues.advance()
Lucene90VectorReader now implements advance() with binary search in place of prior linear scan
Co-authored-by: Anand Kotriwal <anakot@amazon.com>
2021-01-21 10:02:21 -05:00
Peter Gromov 37e31f2ac7
LUCENE-9685: Hunspell: prefix condition is only checked on suffix, not stem (#2226) 2021-01-21 13:38:45 +01:00
Peter Gromov fdf04d8c63
LUCENE-9681: Hunspell spellchecker: support numbers with separators (#2224) 2021-01-21 12:33:03 +01:00
Dawid Weiss 053060b925 LUCENE-9670: Workaround for gradle not liking the same stream for stdout and stderr sinks. 2021-01-21 09:51:56 +01:00
Cassandra Targett e8276e09a1
SOLR-13105 - Visual Guide to Math Expressions (#2227)
* SOLR-13105: The Visual Guide to Streaming Expressions and Math Expressions
2021-01-20 16:14:01 -06:00
Dawid Weiss 4da8f08c63
LUCENE-9683: fix incorrect logic for measuring stall time information. (#2225) 2021-01-20 19:05:22 +01:00
Peter Gromov 89456686ed
LUCENE-9681: deduplicate decodeFlags+hasFlag checks (#2223) 2021-01-20 17:27:04 +01:00
Peter Gromov 939699f550
LUCENE-9667: Hunspell: add spellchecker API, support BREAK and FORBIDDENWORD affix rules (#2207) 2021-01-20 10:57:27 +01:00
David Smiley a233ed2fd1
SOLR-15081: Metrics for core: isLeader, replicaState (#2198)
Note that getLastPublished returns an Enum type.  TextWriter.writeVal should probably support Enums, which would simplify this code.
2021-01-19 16:43:39 -05:00
Joel Bernstein 5328ced82e SOLR-14608: Reduce percentage of priority queue size above total hits 2021-01-19 15:35:57 -05:00