Peter Gromov
c3fe9afcc6
LUCENE-9739: Hunspell: speed up numeric flag parsing ( #2316 )
2021-02-08 11:02:13 +01:00
Peter Gromov
653626399f
LUCENE-9736: Hunspell: support MAP-based suggestions for groups of similar letters ( #2314 )
2021-02-08 10:59:53 +01:00
Peter Gromov
061233ca4e
LUCENE-9735: Hunspell: speed up flag checks by avoiding allocations ( #2315 )
2021-02-08 10:56:10 +01:00
Dawid Weiss
903782d756
LUCENE-9727: build side support for running Hunspell tests. ( #2313 )
2021-02-08 10:50:25 +01:00
Peter Gromov
1cc26b6bb4
LUCENE-9724: Hunspell: tolerate existing aff/dic file typos ( #2307 )
2021-02-07 12:49:53 +01:00
Peter Gromov
1852d7ad5a
LUCENE-9734: Hunspell: support suggestions based on "ph" morphological data ( #2308 )
2021-02-06 17:04:12 +01:00
Eric Pugh
573b442903
SOLR-15123: Make all Tool option descriptions follow the same general pattern. ( #2275 )
...
* Make all Tool option descriptions follow the same general pattern for describing them.
* Figure out a switch to determine level of either cluster or collections(s)
* better wording on what cluster versus collection params mean
Co-authored-by: epugh@opensourceconnections.com <>
2021-02-05 15:17:58 -05:00
Peter Gromov
825d8dbfd9
LUCENE-9732: Hunspell: support dictionary entries starting with slash ( #2301 )
2021-02-05 11:25:32 +01:00
Jan Høydahl
2f6807cc76
Split the publish_maven step in two TODOs ( #2279 )
2021-02-05 09:58:48 +01:00
Peter Gromov
82f8d7ba1d
LUCENE-9728: Hunspell: add a performance test ( #2296 )
2021-02-05 09:47:02 +01:00
Peter Gromov
650f16ad5d
LUCENE-9729: Hunspell: support CHECKCOMPOUNDREP flags ( #2300 )
2021-02-05 09:46:22 +01:00
Peter Gromov
16764f1601
LUCENE-9733: Hunspell: exception when loading dictionaries with mixed-case words and aliased flags ( #2305 )
2021-02-05 09:40:06 +01:00
David Smiley
b5c1ed7129
SOLR-15011: BadApple the test
2021-02-04 23:10:09 -05:00
Julie Tibshirani
75755c837c
Update changelog with note about BM25FQuery.
2021-02-04 14:00:14 -08:00
Eric Pugh
d83a17490d
SOLR-15133: Document how to eliminate Failed to reserve shared memory warning ( #2304 )
...
* light copyediting
* document how to avoid shared memory issue
Co-authored-by: epugh@opensourceconnections.com <>
2021-02-04 16:15:43 -05:00
Julie Tibshirani
c3f5454d49
LUCENE-9725: Allow BM25FQuery to use other similarities. ( #2293 )
...
From a high level, BM25FQuery (1) computes statistic that represent the combined
field content and (2) passes these to a score function. This model makes sense
for many similarities besides BM25.
This PR unhardcodes BM25Similarity in BM25FQuery and instead uses the one
configured on IndexSearcher. It also renames BM25FQuery since it's no longer
specific to BM25.
2021-02-04 12:42:45 -08:00
Michael Sokolov
67f71d453d
LUCENE-9715: fix int overflow in Lucene90VectorReader
2021-02-04 13:52:13 -05:00
Julie Tibshirani
f0a2f1fe03
LUCENE-9705: Create Lucene90LiveDocsFormat ( #2274 )
...
For now this is just a copy of Lucene50LiveDocsFormat. The existing
Lucene50LiveDocsFormat was moved to backwards-codecs.
2021-02-04 10:43:16 -08:00
Jason Gerlowski
7fd64aabcc
CHANGES.txt entry for SOLR-13608
2021-02-04 08:54:25 -05:00
Jason Gerlowski
33d16b570c
SOLR-13608: Incremental backup file format ( #2250 )
...
This commit introduces a new way for Solr to do backups (with a new
underlying file structure). This new "incremental" backup process
improves over the existing backup mechanism in several ways:
- multiple backups "points" can now be stored at a given backup
location/name, allowing users to choose which point in time they want
to restore
- subsequent backups skip over uploading files that were uploaded by
previous backups, saving time and network time.
- files are checksumed as they're uploaded, ensuring that corrupted
indices aren't persisted and accidentally restored later.
Incremental backups are now the default, and traditional backups
should now be considered 'deprecated' but can still be created by
passing an `incremental=false` parameter on backup requests.
2021-02-04 08:47:30 -05:00
Dawid Weiss
894d0bbb59
LUCENE-9730: cleaned up temp. folder management in hunspell.
2021-02-04 09:27:02 +01:00
Peter Gromov
04167b27f5
LUCENE-9726: Hunspell: speed up spellchecking by stopping at a single… ( #2295 )
2021-02-04 09:13:11 +01:00
Michael Sokolov
e2cf6ee74d
LUCENE-9731: restore consistent random seed to HnswGraphBuilder ( #2299 )
2021-02-03 22:14:01 -05:00
Mike Drob
8fccdfe353
SOLR-15122 Replace sleeps with phaser await ( #2291 )
2021-02-03 19:39:04 -06:00
Mike Drob
40c5d6b750
SOLR-14253 Avoid writes in ZKSR.waitForState ( #2297 )
2021-02-03 14:40:07 -06:00
Chris Hostetter
d693a61185
SOLR-15092: remove link anchors that are no longer neccessary due to relaxed validation rules
...
commit generated using: perl -i -ple 's/<<(.*?)\.adoc#\1,/<<.adoc#,/g' src/*.adoc
...with manual cleanup of src/language-analysis.adoc due to adoc syntax ambiguity
2021-02-03 10:36:12 -07:00
Julie Tibshirani
2544a2243b
Remove write logic from Lucene70NormsFormat. ( #2287 )
...
Our policy is to not maintain write logic for old formats that can't be written
to. The write logic is moved to the test folder to support unit testing.
2021-02-03 09:28:48 -08:00
Julie Tibshirani
902ce0809d
Improve backwards compatibility tests for sorted indexes. ( #2276 )
...
This commit also cleans up some old checks that only applied to pre-6.0 indices.
2021-02-03 09:27:40 -08:00
Chris Hostetter
8a0c1f5a0e
SOLR-15092: eliminate overly strict rules against empty link anchors in ref-guide
...
legacy enforcement from the days of building a PDF
2021-02-03 10:07:34 -07:00
Peter Gromov
d95e405fec
LUCENE-9721: Hunspell: disallow ONLYINCOMPOUND suffixes at the very end of compound words ( #2294 )
2021-02-03 17:46:54 +01:00
Peter Gromov
a79f641561
LUCENE-9720: Hunspell: more ways to vary misspelled word variations for suggestions ( #2286 )
2021-02-03 17:45:56 +01:00
Andrzej Bialecki
d88264ba72
SOLR-14234: Unhelpful message in RemoteExecutionException.
2021-02-03 16:27:47 +01:00
Peter Gromov
84aa683b6b
LUCENE-9723: Hunspell: update sanity tests that load all dictionaries ( #2290 )
2021-02-03 10:45:35 +01:00
Peter Gromov
d0ae2bd2b9
LUCENE-9717: Hunspell: support CHECKCOMPOUNDPATTERN ( #2280 )
2021-02-03 08:58:40 +01:00
Nazerke Seidan
6509a3003c
SOLR-15011: /admin/logging now distributes setLevel to all nodes ( #2230 )
...
The admin UI will set nodes=all for this.
Co-authored-by: Nazerke Seidan <nseidan@salesforce.com>
Co-authored-by: David Smiley <dsmiley@apache.org>
2021-02-02 21:41:23 -05:00
orenovadia
8d0cbcbb53
LUCENE-9680 - Re-add IndexWriter::getFieldNames
2021-02-02 17:38:43 -05:00
sbeniwal12
a53e8e7228
LUCENE-9615: Expose HnswGraphBuilder index-time hyperparameters as FieldType attributes (from Shubham Beniwal))
2021-02-02 17:26:29 -05:00
Peter Gromov
8f75933f3d
LUCENE-9716: Hunspell: support flag usage before its format is even specified ( #2277 )
2021-02-02 21:25:56 +01:00
Nhat Nguyen
47e3d06ce0
LUCENE-9722: Close merged readers on abort ( #2288 )
...
We fail to close the merged readers of an aborted merge if its
output segment contains no document.
This bug was discovered by a test in Elasticsearch
(elastic/elasticsearch#67884 ).
2021-02-02 11:24:10 -05:00
Andrzej Bialecki
4cb1000ea0
SOLR-15122: Tentative fix for the test failure - the node in the test could go down
...
before the new plugin was active on the Overseer.
2021-02-02 12:06:39 +01:00
Peter Gromov
b48d5beb34
LUCENE-9707: Hunspell: check Lucene's implementation against Hunspel's test data ( #2267 )
2021-02-02 10:46:14 +01:00
Dawid Weiss
2da7a4a86d
LUCENE-9686: Add changes entry.
2021-02-02 09:10:03 +01:00
zacharymorn
3835cb4e95
LUCENE-9686: Fix read past EOF handling in DirectIODirectory ( #2258 )
2021-02-02 09:07:30 +01:00
Chris Hostetter
15aaec60d9
SOLR-14330: ExpandComponent now supports an expand.nullGroup=true option
2021-02-01 16:19:34 -07:00
Mike Drob
99748384cf
SOLR-14253 Replace sleep calls with ZK waits ( #1297 )
...
Co-Authored-By: markrmiller <markrmiller@apache.org>
2021-02-01 13:25:17 -06:00
András Salamon
e8bc758144
SOLR-15115: Remove unused methods from TestRerankBase ( #2261 )
2021-02-01 17:31:58 +00:00
Andrzej Bialecki
9e8ca98985
SOLR-15068: RefGuide documentation for replica placement plugins (plus
...
minor cleanups).
2021-02-01 16:50:25 +01:00
Peter Gromov
7a7949aed2
LUCENE-9708: Hunspell: support FLAG UTF-8 in absence of SET UTF-8 ( #2270 )
2021-02-01 10:36:24 +01:00
Peter Gromov
8a34cc7afd
LUCENE-9701: Hunspell: implement simple REP-based suggestion algorithm ( #2251 )
2021-02-01 10:23:54 +01:00
Peter Gromov
9d45dfe776
LUCENE-9710: Hunspell: support minor compounding-related flags ( #2272 )
...
* LUCENE-9710: Hunspell: support COMPOUNDFLAG
* LUCENE-9710: Hunspell: fix CHECKCOMPOUNDCASE support
* LUCENE-9710: Hunspell: support CHECKCOMPOUNDDUP
* LUCENE-9710: Hunspell: support triple flags (CHECKCOMPOUNDTRIPLE, SIMPLIFIEDTRIPLE)
* LUCENE-9710: Hunspell: support COMPOUNDFORBIDFLAG
* LUCENE-9710: Hunspell: support FORCEUCASE
2021-02-01 10:20:11 +01:00