Peter Gromov
80e4def97b
LUCENE-9697: Hunspell Stemmer: use the same FST.BytesReader on all recursion levels ( #2242 )
2021-01-26 09:31:23 +01:00
Peter Gromov
a82634db9d
LUCENE-9693: Hunspell: check that all flags are > 0 and fit char range ( #2238 )
2021-01-26 09:29:47 +01:00
epugh@opensourceconnections.com
0d88c14837
typo
2021-01-25 14:41:10 -05:00
Chris Hostetter
47a89aca71
SOLR-15078: Fix ExpandComponent behavior when expanding on numeric fields to differentiate '0' group from null group
2021-01-25 11:33:57 -07:00
Andrzej Bialecki
166d39a12e
SOLR-15076: Fix wrong test assumption - type of this property has changed in SOLR-14924.
2021-01-25 16:56:50 +01:00
epugh@opensourceconnections.com
ce1bba6d66
Revert "SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor ( #2215 )"
...
This reverts commit cf5db8d651.
2021-01-25 09:53:36 -05:00
Gus Heck
f942b2dd8a
@gus-asf LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains ( #2241 )
...
LUCENE-9575 Provide a producer for PatternTypingRule in TestRandomChains to fix failure on seed 65EA739C95F40313
2021-01-25 07:49:46 -05:00
jimczi
3d737bd76a
LUCENE-9575: Add a producer for new PatternTypingRule arguments in TestRandomChains.
2021-01-25 10:00:16 +01:00
Peter Gromov
e4ec3e3919
LUCENE-9692: Hunspell: extract Stemmer.stripAffix from similar code in prefix/suffix processing ( #2237 )
2021-01-25 09:11:11 +01:00
Peter Gromov
f64e7cbbda
LUCENE-9691: Hunspell: support trailing comments on aff option lines ( #2236 )
...
plus cleanup & deduplicate parsing
2021-01-25 09:08:57 +01:00
Peter Gromov
c7e1079da9
LUCENE-9690: Hunspell: support special title-case for words with apostrophe ( #2235 )
2021-01-25 09:01:12 +01:00
zacharymorn
7f4d4dfdbf
LUCENE-9668: Deprecate MinShouldMatchSumScorer with WANDScorer ( #2205 )
2021-01-25 07:28:19 +01:00
Mike McCandless
93107d6379
LUCENE-9575: fix style violations so precommit passes again
2021-01-24 08:06:50 -05:00
Gus Heck
c087f6f8c0
LUCENE-9575 Add PatternTypingFilter to annotate tokens with flags and types ( #1995 )
2021-01-23 18:40:13 -05:00
Christine Poerschke
9d4811e02f
SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo.
...
Same fix as the #2210 PR commit earlier but this time not extending SystemInfoHandlerTest and also not adding a static SystemInfoHandler.getSecurityInfo variant for test use.
2021-01-22 18:33:28 +00:00
Christine Poerschke
64d445bbaa
Revert "SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo ( #2210 )"
...
This reverts commit fb88b0268a.
Resolved Conflicts:
solr/CHANGES.txt
2021-01-22 17:48:25 +00:00
Marcus
4bc5d51494
LUCENE-8626: Lucene standardize test naming part 3 and final ( #2220 )
2021-01-22 12:38:52 -05:00
Christine Poerschke
32e95ddb3f
SOLR-15071: Fix ArrayIndexOutOfBoundsException in contrib/ltr SolrFeatureScorer ( #2196 )
2021-01-22 13:34:58 +00:00
Christine Poerschke
fb88b0268a
SOLR-15073: Fix ClassCastException in SystemInfoHandler.getSecurityInfo ( #2210 )
2021-01-22 12:10:10 +00:00
Peter Gromov
0a1a3f4c40
LUCENE-9688: Hunspell: consider prefix's continuation flags when applying suffix ( #2229 )
2021-01-22 12:03:55 +01:00
Peter Gromov
d7968130c3
LUCENE-9684: Hunspell: support COMPOUNDRULE ( #2228 )
2021-01-22 12:01:53 +01:00
Eric Pugh
cf5db8d651
SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor ( #2215 )
...
* Creating Scripting contrib module to centralize the less secure code related to scripts.
* tweak the changelog and update notice to explain why the name changed and the security posture thinking
* the test script happens to be a currency.xml, which made me think we were doing something specific to currency types, but instead any xml formatted file will suffice for the test.
* drop the ing, and be more specific on the name of the ref guide page
* use the same name everywhere
Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-21 13:32:46 -05:00
András Salamon
83e0397c42
SOLR-14297: Replace commons-codec Base64 with JDK8 Base64 ( #2222 )
2021-01-21 13:10:33 -05:00
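The kind of replacement SOLR-14297 describes can be illustrated with the JDK's own `java.util.Base64`. This is a minimal sketch only: the class name `Jdk8Base64Example` and its helper methods are hypothetical and are not taken from the Solr change itself.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

/** Hypothetical helper showing JDK 8 Base64 usage; not code from the Solr commit. */
final class Jdk8Base64Example {

  /** Encodes the UTF-8 bytes of the text with the standard (RFC 4648) alphabet. */
  static String encode(String text) {
    return Base64.getEncoder().encodeToString(text.getBytes(StandardCharsets.UTF_8));
  }

  /** Decodes a standard Base64 string back into UTF-8 text. */
  static String decode(String base64) {
    return new String(Base64.getDecoder().decode(base64), StandardCharsets.UTF_8);
  }
}
```

For example, `Jdk8Base64Example.encode("solr")` yields `c29scg==`, and decoding that string returns `solr`.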
Anand
e5a16f0b0f
LUCENE-9674: Use binary search in VectorValues.advance()
...
Lucene90VectorReader now implements advance() with binary search in place of prior linear scan
Co-authored-by: Anand Kotriwal <anakot@amazon.com>
2021-01-21 10:02:21 -05:00
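The idea behind LUCENE-9674, replacing a linear scan with a binary search in advance(), can be sketched roughly as follows. This is not the Lucene90VectorReader code: `SortedDocIdIterator` is a hypothetical stand-in, and it assumes the doc IDs are held in a sorted int array with no duplicates.

```java
import java.util.Arrays;

/** Hypothetical iterator over sorted doc IDs; an illustration, not Lucene's implementation. */
final class SortedDocIdIterator {
  static final int NO_MORE_DOCS = Integer.MAX_VALUE;

  private final int[] docIds; // assumed sorted ascending, no duplicates
  private int ord = -1;       // index of the current doc in docIds, -1 before the first call

  SortedDocIdIterator(int[] sortedDocIds) {
    this.docIds = sortedDocIds;
  }

  /** Moves to the first doc ID >= target that lies after the current position. */
  int advance(int target) {
    // Only search the part of the array after the current position.
    int from = Math.min(ord + 1, docIds.length);
    int idx = Arrays.binarySearch(docIds, from, docIds.length, target);
    if (idx < 0) {
      idx = -idx - 1; // insertion point: index of the first element greater than target
    }
    ord = idx;
    return ord < docIds.length ? docIds[ord] : NO_MORE_DOCS;
  }
}
```

Each call costs O(log n) comparisons over the remaining doc IDs instead of stepping through them one at a time.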
Peter Gromov
37e31f2ac7
LUCENE-9685: Hunspell: prefix condition is only checked on suffix, not stem ( #2226 )
2021-01-21 13:38:45 +01:00
Peter Gromov
fdf04d8c63
LUCENE-9681: Hunspell spellchecker: support numbers with separators ( #2224 )
2021-01-21 12:33:03 +01:00
Dawid Weiss
053060b925
LUCENE-9670: Workaround for gradle not liking the same stream for stdout and stderr sinks.
2021-01-21 09:51:56 +01:00
Cassandra Targett
e8276e09a1
SOLR-13105 - Visual Guide to Math Expressions ( #2227 )
...
* SOLR-13105: The Visual Guide to Streaming Expressions and Math Expressions
2021-01-20 16:14:01 -06:00
Dawid Weiss
4da8f08c63
LUCENE-9683: fix incorrect logic for measuring stall time information. ( #2225 )
2021-01-20 19:05:22 +01:00
Peter Gromov
89456686ed
LUCENE-9681: deduplicate decodeFlags+hasFlag checks ( #2223 )
2021-01-20 17:27:04 +01:00
Peter Gromov
939699f550
LUCENE-9667: Hunspell: add spellchecker API, support BREAK and FORBIDDENWORD affix rules ( #2207 )
2021-01-20 10:57:27 +01:00
David Smiley
a233ed2fd1
SOLR-15081: Metrics for core: isLeader, replicaState ( #2198 )
...
Note that getLastPublished returns an Enum type. TextWriter.writeVal should probably support Enums, which would simplify this code.
2021-01-19 16:43:39 -05:00
Joel Bernstein
5328ced82e
SOLR-14608: Reduce percentage of priority queue size above total hits
2021-01-19 15:35:57 -05:00
Joel Bernstein
64df5a65f0
SOLR-14608: Faster sorting for the /export handler
2021-01-19 15:35:56 -05:00
Simon Willnauer
1aeb1dcb86
LUCENE-9669: Restore Lucene70Codec ( #2221 )
...
This change restores the Lucene70Codec for file format compatibility of
indices that are created within the Lucene 7 major version. These indices
can be opened via an expert API on DirectoryReader in read-only mode. Changes
to these indices are prohibited and will be rejected by the IndexWriter.
In fact, IndexWriter will not open an index that was created with a major version
older than N-1, where N is the current major version.
2021-01-19 21:07:54 +01:00
Chris Hostetter
df692076a6
rectify 8.9 CHANGES sections from branch_8x
2021-01-19 12:41:33 -07:00
Chris Hostetter
a4aa3d1d4f
SOLR-15079: Block Collapse - Faster collapse code when groups are co-located via Block Join style nested doc indexing.
...
Used by default when field=_root_, or explicitly requested for other fields via hint=block.
2021-01-19 11:40:29 -07:00
Cassandra Targett
12e42a2a11
Ref Guide: update upgrade notes for 8.8 release
2021-01-19 12:00:27 -06:00
jimczi
5af12b5f14
LUCENE-9675: Binary doc values fields now expose their configured compression mode in the attributes of the field info.
2021-01-19 10:03:13 +01:00
Patrick Marty
227256d951
LUCENE-9646: Set BM25Similarity discountOverlaps via the constructor
2021-01-19 09:49:57 +01:00
Peter Gromov
9f5bdf43b7
LUCENE-9678: Hunspell: fix off-by-one error to support prefixes of word.length - 1 ( #2219 )
2021-01-19 09:34:27 +01:00
Peter Gromov
422c89baef
LUCENE-9676: Hunspell: improve stemming of all-caps words ( #2217 )
...
Hunspell: improve stemming of all-caps words
Repeat Hunspell's logic:
* when encountering a mixed- or (inflectable) all-caps dictionary entry, add its title-case analog as a hidden entry
* use that hidden entry for stemming case variants for title- and uppercase words, but don't consider it a valid word itself
* ...unless there's another explicit dictionary entry of that title case
2021-01-19 09:32:23 +01:00
Simon Willnauer
c1ae6dc07c
LUCENE-9669: Add an expert API to allow opening indices created < N-1 ( #2212 )
...
Today we force indices that were created with N-2 and older versions of Lucene
to fail on open. This check doesn't even check if the codecs are available. In order
to allow users to open older indices and for us to support N-2 versions this change
adds an API on DirectoryReader to specify a minimum index version on a per reader basis.
This doesn't apply to the IndexWriter, which will fail on opening older indices.
2021-01-19 09:23:49 +01:00
Peter Gromov
426c902bc9
LUCENE-9677: simplify Dictionary.affixData storage ( #2218 )
...
Use char[] instead of byte[], get rid of unnecessary byte array readers/writers.
2021-01-19 09:22:33 +01:00
Peter Gromov
ab08fdc6f0
LUCENE-9671: Hunspell: shorten Stemmer.applyAffix ( #2209 )
...
Call stem() recursively just once with different arguments depending on various conditions.
NOTE: committing directly as this is a refactoring, not a functional change (no CHANGES.txt entry).
2021-01-18 22:54:22 +01:00
Noble Paul
8505d4d416
SOLR-15052: Per-replica states for reducing overseer bottlenecks (trunk) ( #2177 )
2021-01-19 02:59:41 +11:00
Uwe Schindler
4b508aef24
LUCENE-8982: Add a note to MIGRATE.md
2021-01-18 00:50:02 +01:00
zacharymorn
a7747b63b4
LUCENE-8982: Make NativeUnixDirectory pure java with FileChannel direct IO flag, and rename to DirectIODirectory ( #2052 )
2021-01-17 23:57:56 +01:00
Namgyu Kim
eb24e95731
LUCENE-9661: Fix deadlock in TermsEnum.EMPTY
2021-01-16 06:49:23 +09:00
Cassandra Targett
30aa0f5ba4
Ref Guide: copy edits for 8.8 release
2021-01-15 14:54:41 -06:00