Naoto MINAMI
0cbb38ff4a
SOLR-15114: WAND does not work correctly on multiple segments ( #2259 )
...
In Solr 8.6.3, minCompetitiveScore of WANDScorer resets to zero for each index segment and remains zero until maxScore is updated.
There are two causes of this problem:
* MaxScoreCollector does not set minCompetitiveScore of MinCompetitiveScoreAwareScorable newly generated for another index segment.
* MaxScoreCollector updates minCompetitiveScore only if maxScore is updated. This behavior is correct considering the purpose of MaxScoreCollector.
For details, see the attached pdf https://issues.apache.org/jira/secure/attachment/13019548/wand.pdf .
2021-02-11 09:10:03 -08:00
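The first cause listed for SOLR-15114 can be illustrated with a minimal sketch. All class and method names below are hypothetical stand-ins, not the actual Solr or Lucene classes, and the real collector may compute its threshold differently. The sketch only shows a collector handing its current threshold to the scorer created for the next segment, so that the scorer does not start over from zero.

```java
/** Hypothetical stand-in for a per-segment scorer that can skip non-competitive hits. */
final class SegmentScorerSketch {
    // A freshly created scorer knows nothing about earlier segments.
    float minCompetitiveScore = 0f;

    void setMinCompetitiveScore(float score) {
        minCompetitiveScore = score;
    }
}

/** Hypothetical collector that tracks the best score seen across all segments. */
final class CarryOverCollectorSketch {
    private float maxScore = 0f;
    private SegmentScorerSketch current;

    /** Called when collection moves on to a new segment. */
    void startSegment(SegmentScorerSketch scorer) {
        current = scorer;
        // Carry the threshold over; without this call the new scorer
        // would keep 0 until maxScore next improves.
        current.setMinCompetitiveScore(maxScore);
    }

    /** Called for every hit in the current segment. */
    void collect(float score) {
        if (score > maxScore) {
            maxScore = score;
            current.setMinCompetitiveScore(maxScore);
        }
    }

    float maxScore() {
        return maxScore;
    }

    public static void main(String[] args) {
        CarryOverCollectorSketch collector = new CarryOverCollectorSketch();

        SegmentScorerSketch first = new SegmentScorerSketch();
        collector.startSegment(first);
        collector.collect(1f);
        collector.collect(3f);

        SegmentScorerSketch second = new SegmentScorerSketch();
        collector.startSegment(second);
        System.out.println("threshold in second segment: " + second.minCompetitiveScore);
    }
}
```

Removing the call inside `startSegment` reproduces the symptom described in the commit message: the second scorer stays at zero until a hit beats the running maximum.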
Ignacio Vera
683a9bd78a
LUCENE-9322: Add Vectors format to CodecReader accounting methods ( #2353 )
2021-02-11 16:54:59 +01:00
Peter Gromov
019872453d
LUCENE-9763: Hunspell: fix FORBIDDENWORD support ( #2351 )
...
Don't decompound if it's a simple word with a forbidden root, don't look up the word twice, and don't forbid stemming (behave like Hunspell).
2021-02-11 15:16:40 +01:00
Peter Gromov
01e34f8723
LUCENE-9764: Hunspell: try title case as FORCEUCASE misspelled word suggestions ( #2352 )
2021-02-11 15:12:58 +01:00
Peter Gromov
7f9b1f991b
LUCENE-9758: Hunspell: support NOSUGGEST option ( #2340 )
2021-02-11 09:15:37 +01:00
Peter Gromov
76b55509dd
LUCENE-9759: Hunspell: add more to TestHunspellRepositoryTestCases.EXPECTED_FAILURES ( #2341 )
2021-02-11 09:14:29 +01:00
Peter Gromov
a919f2df59
LUCENE-9761: Hunspell: check that FLAG and SET don't occur too far in the file, cleanup ( #2348 )
...
Thank you!
2021-02-11 09:13:34 +01:00
Peter Gromov
c0d32444be
LUCENE-9760: Hunspell: print total memory usage in TestAllDictionaries, cleanup ( #2347 )
2021-02-11 09:11:09 +01:00
Ignacio Vera
73d4f8cd03
Fix file leak in Lucene90VectorWriter ( #2331 )
...
Adds a new test class based on BaseIndexFileFormatTestCase
2021-02-11 08:39:48 +01:00
Jason Gerlowski
aea29464f1
SOLR-13608: Fix incremental-backup test failures
...
The initial incremental-backup commit introduced several test failures
on Windows test runs that I neglected to catch before committing. Most
of these failures resulted from bad 'location' path handling in the
test logic itself, though a few tweaks were also made to Solr code to
better handle Windows paths.
2021-02-10 15:39:13 -05:00
Cassandra Targett
131d060877
Remove "incubating" next to Zeppelin in parallel-sql-interface.adoc
2021-02-10 14:22:50 -06:00
Julie Tibshirani
29ea0043ef
LUCENE-9756: Extend FieldInfosFormat tests to cover points and vectors ( #2338 )
...
This commit adds coverage to `BaseFieldInfoFormatTestCase` for points,
vectors, and the soft deletes field.
2021-02-10 08:57:26 -08:00
Peter Gromov
c4c569b998
LUCENE-9752: Hunspell Stemmer: reduce parameter count ( #2333 )
2021-02-10 09:34:36 +01:00
Peter Gromov
6f525302dd
LUCENE-9753: Hunspell: disallow compounds with parts present in dictionary, space-separated ( #2335 )
2021-02-10 09:23:15 +01:00
Peter Gromov
c3166e1dc3
LUCENE-9750: Hunspell: improve suggestions for mixed-case misspelled words ( #2332 )
2021-02-10 09:21:01 +01:00
Peter Gromov
5fd18881e9
LUCENE-9749: Hunspell: apply output conversion (OCONV) to the suggestions ( #2329 )
2021-02-10 09:17:44 +01:00
Peter Gromov
f2b7cdc491
LUCENE-9748: Hunspell: suggest inflected dictionary entries similar to the misspelled word ( #2330 )
2021-02-10 09:16:06 +01:00
Dawid Weiss
1f5b37f299
LUCENE-9747: dodge javadoc reporter NPE bug on Java 11. ( #2337 )
2021-02-09 21:47:38 +01:00
Dawid Weiss
061b3f29c9
LUCENE-9740: scan affix stream once. ( #2327 )
2021-02-09 09:58:26 +01:00
Ignacio Vera
f93cbb324e
Add TestLucene90FieldInfosFormat ( #2269 )
2021-02-09 09:32:42 +01:00
Ignacio Vera
eafeb66434
LUCENE-9705: Move Lucene50CompoundFormat to Lucene90CompoundFormat ( #2268 )
2021-02-09 09:19:07 +01:00
Peter Gromov
24984ff4e2
LUCENE-9742: Hunspell: suggest dictionary entries similar to the misspelled word ( #2320 )
2021-02-09 08:12:34 +01:00
Jason Gerlowski
e186d8c439
Fix debug-logging-caused test failures
2021-02-08 19:32:20 -05:00
Dawid Weiss
077f8ccf70
LUCENE-9744: NPE on a degenerate query in MinimumShouldMatchIntervalsSource$MinimumMatchesIterator.getSubMatches() ( #2323 )
2021-02-08 21:49:00 +01:00
Peter Gromov
80803eb9ad
LUCENE-9746: Hunspell: unify case variation logic in Stemmer and SpellChecker ( #2322 )
2021-02-08 21:37:32 +01:00
Peter Gromov
d0b4ef66d7
LUCENE-9745: Hunspell: tolerate more aff/dic file typos ( #2321 )
2021-02-08 21:36:44 +01:00
Joel Bernstein
da8b8ecdb8
SOLR-15142: Allow the cat Streaming Expression to read gzip files
2021-02-08 15:07:39 -05:00
Jason Gerlowski
ed2eebfa4d
Debug logging for TestIncrementalCoreBackup Windows failures
2021-02-08 14:36:54 -05:00
Jason Gerlowski
cede9723fa
SOLR-15118: CHANGES.txt entry
2021-02-08 10:45:37 -05:00
Jason Gerlowski
e89fba6fe7
SOLR-15118: Convert /v2/collections APIs to annotations ( #2281 )
...
Solr supports two different ways to write v2 APIs: a JSON spec based
approach, and one based on annotated POJOs. The POJO method is now
preferred.
This commit switches the /v2/collections APIs over to the
annotation-based approach. Since V2RequestSupport only works with
jsonspec-based APIs, this commit also changes CollectionAdminRequest
to no longer implement that interface.
2021-02-08 10:11:58 -05:00
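As a rough illustration of what an annotation-based API style looks like in general (SOLR-15118 above), the sketch below declares commands as annotated methods on a plain Java object and dispatches to them by reflection. The annotation, the API class, and the dispatcher are all hypothetical names invented for this example; they are not Solr's actual annotations or classes.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Method;

/** Hypothetical annotation marking a method as the handler for a named command. */
@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.METHOD)
@interface CommandSketch {
    String name();
}

/** Hypothetical POJO whose annotated methods make up the API surface. */
final class CollectionsApiSketch {
    @CommandSketch(name = "create")
    public String create(String collection) {
        return "created " + collection;
    }

    @CommandSketch(name = "rename")
    public String rename(String collection) {
        return "renamed " + collection;
    }
}

/** Hypothetical dispatcher: finds the method annotated with the requested command name. */
final class AnnotationDispatcherSketch {
    static String dispatch(Object api, String command, String arg) {
        for (Method m : api.getClass().getDeclaredMethods()) {
            CommandSketch c = m.getAnnotation(CommandSketch.class);
            if (c != null && c.name().equals(command)) {
                try {
                    return (String) m.invoke(api, arg);
                } catch (ReflectiveOperationException e) {
                    throw new IllegalStateException("Command failed: " + command, e);
                }
            }
        }
        throw new IllegalArgumentException("Unknown command: " + command);
    }

    public static void main(String[] args) {
        System.out.println(dispatch(new CollectionsApiSketch(), "create", "books"));
    }
}
```

In this style the set of supported commands is read from the annotated class itself rather than from a separate JSON specification file.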
Peter Gromov
4f64e39ec6
LUCENE-9743: Hunspell: ignore original tests which are out of scope ( #2319 )
2021-02-08 11:50:40 +01:00
Peter Gromov
c3fe9afcc6
LUCENE-9739: Hunspell: speed up numeric flag parsing ( #2316 )
2021-02-08 11:02:13 +01:00
Peter Gromov
653626399f
LUCENE-9736: Hunspell: support MAP-based suggestions for groups of similar letters ( #2314 )
2021-02-08 10:59:53 +01:00
Peter Gromov
061233ca4e
LUCENE-9735: Hunspell: speed up flag checks by avoiding allocations ( #2315 )
2021-02-08 10:56:10 +01:00
Dawid Weiss
903782d756
LUCENE-9727: build side support for running Hunspell tests. ( #2313 )
2021-02-08 10:50:25 +01:00
Peter Gromov
1cc26b6bb4
LUCENE-9724: Hunspell: tolerate existing aff/dic file typos ( #2307 )
2021-02-07 12:49:53 +01:00
Peter Gromov
1852d7ad5a
LUCENE-9734: Hunspell: support suggestions based on "ph" morphological data ( #2308 )
2021-02-06 17:04:12 +01:00
Eric Pugh
573b442903
SOLR-15123: Make all Tool option descriptions follow the same general pattern. ( #2275 )
...
* Make all Tool option descriptions follow the same general pattern for describing them.
* Figure out a switch to determine level of either cluster or collection(s)
* Better wording on what cluster versus collection params mean
Co-authored-by: epugh@opensourceconnections.com <>
2021-02-05 15:17:58 -05:00
Peter Gromov
825d8dbfd9
LUCENE-9732: Hunspell: support dictionary entries starting with slash ( #2301 )
2021-02-05 11:25:32 +01:00
Jan Høydahl
2f6807cc76
Split the publish_maven step in two TODOs ( #2279 )
2021-02-05 09:58:48 +01:00
Peter Gromov
82f8d7ba1d
LUCENE-9728: Hunspell: add a performance test ( #2296 )
2021-02-05 09:47:02 +01:00
Peter Gromov
650f16ad5d
LUCENE-9729: Hunspell: support CHECKCOMPOUNDREP flags ( #2300 )
2021-02-05 09:46:22 +01:00
Peter Gromov
16764f1601
LUCENE-9733: Hunspell: exception when loading dictionaries with mixed-case words and aliased flags ( #2305 )
2021-02-05 09:40:06 +01:00
David Smiley
b5c1ed7129
SOLR-15011: BadApple the test
2021-02-04 23:10:09 -05:00
Julie Tibshirani
75755c837c
Update changelog with note about BM25FQuery.
2021-02-04 14:00:14 -08:00
Eric Pugh
d83a17490d
SOLR-15133: Document how to eliminate Failed to reserve shared memory warning ( #2304 )
...
* light copyediting
* document how to avoid shared memory issue
Co-authored-by: epugh@opensourceconnections.com <>
2021-02-04 16:15:43 -05:00
Julie Tibshirani
c3f5454d49
LUCENE-9725: Allow BM25FQuery to use other similarities. ( #2293 )
...
From a high level, BM25FQuery (1) computes statistics that represent the combined
field content and (2) passes these to a score function. This model makes sense
for many similarities besides BM25.
This PR unhardcodes BM25Similarity in BM25FQuery and instead uses the one
configured on IndexSearcher. It also renames BM25FQuery since it's no longer
specific to BM25.
2021-02-04 12:42:45 -08:00
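The two-step model described in LUCENE-9725 can be sketched as follows. This is a simplified illustration, not the Lucene implementation: the class and interface names are hypothetical, only term frequency and field length are combined, and the weighted-sum combination is an assumption made for the example.

```java
/** Hypothetical score function over combined statistics. */
interface ScoreFunctionSketch {
    double score(double freq, double fieldLength);
}

final class CombinedFieldScoringSketch {
    /** Step (1): fold per-field statistics into one combined frequency and length. */
    static double[] combine(double[] weights, double[] freqs, double[] lengths) {
        if (weights.length != freqs.length || weights.length != lengths.length) {
            throw new IllegalArgumentException("All arrays must have one entry per field");
        }
        double freq = 0;
        double length = 0;
        for (int i = 0; i < weights.length; i++) {
            freq += weights[i] * freqs[i];
            length += weights[i] * lengths[i];
        }
        return new double[] {freq, length};
    }

    /** Step (2): pass the combined statistics to whichever score function is configured. */
    static double score(ScoreFunctionSketch fn, double[] weights, double[] freqs, double[] lengths) {
        double[] combined = combine(weights, freqs, lengths);
        return fn.score(combined[0], combined[1]);
    }

    public static void main(String[] args) {
        double[] weights = {2, 1};   // e.g. a boosted title field and a body field
        double[] freqs = {1, 3};     // term frequency in each field
        double[] lengths = {10, 100};

        // Two interchangeable score functions, both invented for this sketch.
        ScoreFunctionSketch saturating = (f, len) -> f / (f + 1.0);
        ScoreFunctionSketch lengthNormalized = (f, len) -> f / len;

        System.out.println(score(saturating, weights, freqs, lengths));
        System.out.println(score(lengthNormalized, weights, freqs, lengths));
    }
}
```

Because step (1) does not depend on the formula used in step (2), the score function can be swapped without touching the statistics code.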
Michael Sokolov
67f71d453d
LUCENE-9715: fix int overflow in Lucene90VectorReader
2021-02-04 13:52:13 -05:00
Julie Tibshirani
f0a2f1fe03
LUCENE-9705: Create Lucene90LiveDocsFormat ( #2274 )
...
For now this is just a copy of Lucene50LiveDocsFormat. The existing
Lucene50LiveDocsFormat was moved to backwards-codecs.
2021-02-04 10:43:16 -08:00
Jason Gerlowski
7fd64aabcc
CHANGES.txt entry for SOLR-13608
2021-02-04 08:54:25 -05:00