Commit Graph

34707 Commits

Author SHA1 Message Date
Michael Sokolov 38ec2602ce
LUCENE-9695: don't merge deleted vectors (#2239) 2021-01-27 12:18:55 -05:00
Andrzej Bialecki b1cd6b691f SOLR-15055: Fix this test to avoid OS dependency. 2021-01-27 13:38:46 +01:00
Houston Putman 7da0fa8a07 SOLR-15075: Remove docker gradle artifact
Gradle is currently picking up the dockerBuild task in "assemble",
because of the build artifact.
2021-01-26 13:33:32 -05:00
Julie Tibshirani 1887280475
LUCENE-9322: Move old field infos format to backwards-codecs. (#2245)
We introduced a new `Lucene90FieldInfosFormat`, so the old
`Lucene60FieldInfosFormat` should live in backwards-codecs.
2021-01-26 10:14:55 -08:00
Houston Putman 2b8d7bcd6a
SOLR-15075: Solr docker gradle improvements (#2197)
* Removed docker plugin from gradle builds.
* Removed package docker image.
* Tasks now have correct inputs/outputs/dependencies.
* Move gradle help text to docker folder.
* Reduce duplicated Docker layer by doing file removal and chmod in another stage.

Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-26 10:22:50 -05:00
Andrzej Bialecki 6c72e2e17b SOLR-15055: Add missing license headers. 2021-01-26 16:09:38 +01:00
Andrzej Bialecki 6e9185a33a SOLR-15055: Re-implement 'withCollection'. 2021-01-26 15:16:33 +01:00
Peter Gromov 695e789891
LUCENE-9698: Hunspell: reuse char[] when possible when stripping affix (#2243) 2021-01-26 13:03:44 +01:00
Peter Gromov 80e4def97b
LUCENE-9697: Hunspell Stemmer: use the same FST.BytesReader on all recursion levels (#2242) 2021-01-26 09:31:23 +01:00
Peter Gromov a82634db9d
LUCENE-9693: Hunspell: check that all flags are > 0 and fit char range (#2238) 2021-01-26 09:29:47 +01:00
epugh@opensourceconnections.com 0d88c14837 typo 2021-01-25 14:41:10 -05:00
Chris Hostetter 47a89aca71 SOLR-15078: Fix ExpandComponent behavior when expanding on numeric fields to differentiate '0' group from null group 2021-01-25 11:33:57 -07:00
Andrzej Bialecki 166d39a12e SOLR-15076: Fix wrong test assumption - type of this property has changed
in SOLR-14924.
2021-01-25 16:56:50 +01:00
epugh@opensourceconnections.com ce1bba6d66 Revert "SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor (#2215)"
This reverts commit cf5db8d651.
2021-01-25 09:53:36 -05:00
Gus Heck f942b2dd8a
@gus-asf LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains (#2241)
LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains to fix failure on seed 65EA739C95F40313
2021-01-25 07:49:46 -05:00
jimczi 3d737bd76a LUCENE-9575: Add a producer for new PatternTypingRule arguments in TestRandomChains. 2021-01-25 10:00:16 +01:00
Peter Gromov e4ec3e3919
LUCENE-9692: Hunspell: extract Stemmer.stripAffix from similar code in prefix/suffix processing (#2237) 2021-01-25 09:11:11 +01:00
Peter Gromov f64e7cbbda
LUCENE-9691: Hunspell: support trailing comments on aff option lines (#2236)
plus cleanup & deduplicate parsing
2021-01-25 09:08:57 +01:00
Peter Gromov c7e1079da9
LUCENE-9690: Hunspell: support special title-case for words with apostrophe (#2235) 2021-01-25 09:01:12 +01:00
zacharymorn 7f4d4dfdbf
LUCENE-9668: Deprecate MinShouldMatchSumScorer with WANDScorer (#2205) 2021-01-25 07:28:19 +01:00
Mike McCandless 93107d6379 LUCENE-9575: fix style violations so precommit passes again 2021-01-24 08:06:50 -05:00
Gus Heck c087f6f8c0
LUCENE-9575 Add PatternTypingFilter to annotate tokens with flags and types (#1995)
LUCENE-9575 Add PatternTypingFilter
2021-01-23 18:40:13 -05:00
Christine Poerschke 9d4811e02f SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo.
Same fix as the #2210 PR commit earlier but this time not extending SystemInfoHandlerTest and also not adding a static SystemInfoHandler.getSecurityInfo variant for test use.
2021-01-22 18:33:28 +00:00
Christine Poerschke 64d445bbaa Revert "SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo (#2210)"
This reverts commit fb88b0268a.

Resolved Conflicts:
	solr/CHANGES.txt
2021-01-22 17:48:25 +00:00
Marcus 4bc5d51494
LUCENE-8626: Lucene standardize test naming part 3 and final (#2220) 2021-01-22 12:38:52 -05:00
Christine Poerschke 32e95ddb3f
SOLR-15071: Fix ArrayIndexOutOfBoundsException in contrib/ltr SolrFeatureScorer (#2196) 2021-01-22 13:34:58 +00:00
Christine Poerschke fb88b0268a
SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo (#2210) 2021-01-22 12:10:10 +00:00
Peter Gromov 0a1a3f4c40
LUCENE-9688: Hunspell: consider prefix's continuation flags when applying suffix (#2229) 2021-01-22 12:03:55 +01:00
Peter Gromov d7968130c3
LUCENE-9684: Hunspell: support COMPOUNDRULE (#2228) 2021-01-22 12:01:53 +01:00
Eric Pugh cf5db8d651
SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor (#2215)
* Creating Scripting contrib module to centralize the less secure code related to scripts.

* tweak the changelog and update notice to explain why the name changed and the security posture thinking

* the test script happens to be a currency.xml, which made me think we were doing something specific to currency types, but instead any xml formatted file will suffice for the test.

* drop the ing, and be more specific on the name of the ref guide page

* use the same name everywhere

Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-21 13:32:46 -05:00
András Salamon 83e0397c42
SOLR-14297: Replace commons-codec Base64 with JDK8 Base64 (#2222) 2021-01-21 13:10:33 -05:00
Anand e5a16f0b0f
LUCENE-9674: Use binary search in VectorValues.advance()
Lucene90VectorReader now implements advance() with binary search in place of prior linear scan
Co-authored-by: Anand Kotriwal <anakot@amazon.com>
2021-01-21 10:02:21 -05:00
Peter Gromov 37e31f2ac7
LUCENE-9685: Hunspell: prefix condition is only checked on suffix, not stem (#2226) 2021-01-21 13:38:45 +01:00
Peter Gromov fdf04d8c63
LUCENE-9681: Hunspell spellchecker: support numbers with separators (#2224) 2021-01-21 12:33:03 +01:00
Dawid Weiss 053060b925 LUCENE-9670: Workaround for gradle not liking the same stream for stdout and stderr sinks. 2021-01-21 09:51:56 +01:00
Cassandra Targett e8276e09a1
SOLR-13105 - Visual Guide to Math Expressions (#2227)
* SOLR-13105: The Visual Guide to Streaming Expressions and Math Expressions
2021-01-20 16:14:01 -06:00
Dawid Weiss 4da8f08c63
LUCENE-9683: fix incorrect logic for measuring stall time information. (#2225) 2021-01-20 19:05:22 +01:00
Peter Gromov 89456686ed
LUCENE-9681: deduplicate decodeFlags+hasFlag checks (#2223) 2021-01-20 17:27:04 +01:00
Peter Gromov 939699f550
LUCENE-9667: Hunspell: add spellchecker API, support BREAK and FORBIDDENWORD affix rules (#2207) 2021-01-20 10:57:27 +01:00
David Smiley a233ed2fd1
SOLR-15081: Metrics for core: isLeader, replicaState (#2198)
Note that getLastPublished returns an Enum type.  TextWriter.writeVal should probably support Enums, which would simplify this code.
2021-01-19 16:43:39 -05:00
Joel Bernstein 5328ced82e SOLR-14608: Reduce percentage of priority queue size above total hits 2021-01-19 15:35:57 -05:00
Joel Bernstein 64df5a65f0 SOLR-14608: Faster sorting for the /export handler 2021-01-19 15:35:56 -05:00
Simon Willnauer 1aeb1dcb86
LUCENE-9669: Restore Lucene70Codec (#2221)
This change restores the Lucene70Codec for file format compatibility of
indices that are created within the Lucene 7 major version. These indices
can be opened via an expert API on DirectoryReader in read-only mode. Changes
to these indices are prohibited and will be rejected by the IndexWriter.
In fact, IndexWriter will not open an index that is created with a major version
less than N-1 to the current major version.
2021-01-19 21:07:54 +01:00
Chris Hostetter df692076a6 rectify 8.9 CHANGES sections from branch_8x 2021-01-19 12:41:33 -07:00
Chris Hostetter a4aa3d1d4f SOLR-15079: Block Collapse - Faster collapse code when groups are co-located via Block Join style nested doc indexing.
Used by default when field=_root_, or explicitly requsted for other fields via hint=block.
2021-01-19 11:40:29 -07:00
Cassandra Targett 12e42a2a11 Ref Guide: update upgrade notes for 8.8 release 2021-01-19 12:00:27 -06:00
jimczi 5af12b5f14 LUCENE-9675: Binary doc values fields now expose their configured compression mode in the attributes of the field info. 2021-01-19 10:03:13 +01:00
Patrick Marty 227256d951
LUCENE-9646: Set BM25Similarity discountOverlaps via the constructor 2021-01-19 09:49:57 +01:00
Peter Gromov 9f5bdf43b7
LUCENE-9678: Hunspell: fix off-by-one error to support prefixes of word.length - 1 (#2219) 2021-01-19 09:34:27 +01:00
Peter Gromov 422c89baef
LUCENE-9676: Hunspell: improve stemming of all-caps words (#2217)
Hunspell: improve stemming of all-caps words

Repeat Hunspell's logic:
* when encountering a mixed- or (inflectable) all-case dictionary entry, add its title-case analog as a hidden entry
* use that hidden entry for stemming case variants for title- and uppercase words, but don't consider it a valid word itself
* ...unless there's another explicit dictionary entry of that title case
2021-01-19 09:32:23 +01:00