Commit Graph

34941 Commits

Author SHA1 Message Date
Ignacio Vera 4cdfbbb95b
LUCENE-9322: Lucene90VectorReader can leak open files (#2371) 2021-02-16 15:08:50 +01:00
Noble Paul 4d7fc57818 added CHANGES.txt section from 7.7.3 release 2021-02-16 17:31:34 +11:00
Robert Muir dd91f5ca82
LUCENE-9773: upgrade icu to 68.2 (#2372)
Upgrade from icu 62.2 to 68.2, with Unicode 13 support.

Modify GenerateUTR30DataFiles to take the release tag as a program
argument. Gradle populates this automatically, removing a manual step
from regeneration process.
2021-02-15 14:56:13 -05:00
Peter Gromov ef920388e6
LUCENE-9771: Hunspell: don't lookup word roots unnecessarily to check flags (#2369) 2021-02-15 20:21:44 +01:00
Peter Gromov 1ff11dd02c
LUCENE-9772: Hunspell: CHECKCOMPOUNDCASE shouldn't prohibit dash-separated uppercase compounds (#2370) 2021-02-15 20:20:58 +01:00
Peter Gromov b20e277569
LUCENE-9770: Hunspell: don't perform compound check recursively when looking for space-separated word pairs (#2368) 2021-02-15 20:20:19 +01:00
Peter Gromov f1a1165ac8
LUCENE-9766: Hunspell: add API for retrieving dictionary morphologica… (#2363) 2021-02-15 20:17:43 +01:00
epugh@opensourceconnections.com ee447d1516 document xslt functionality moving to scripting contrib module 2021-02-15 13:19:56 -05:00
Eric Pugh e6d9eaaf00
SOLR-15121: Move XSLT (tr param) response writer and update request handler to scripting contrib (#2306)
* relocate xslt related classes into scripting contrib
* relocating files to scripting and seperating out unit tests
* relocate files under test-files/scripting/solr, similar to how we do it in other contribs.  deals with some issues in finding files
* Reformatting using the Google Java Format...
* use actual param name, not the variable to properly test api!
* Clean up references to paths, and deal with the mish mash of Xslt and XSLT in class names.
* Move XSLT processing out of XMLLoader
* Move TransformerProvider.Dedupe getTransformer logic.


Co-authored-by: epugh@opensourceconnections.com <>
Co-authored-by: David Smiley <dsmiley@apache.org>
2021-02-15 13:16:18 -05:00
jaison 5856c0f176
LUCENE-9663: Add compression to terms dict from SortedSet/Sorted DocValues.
Closes #2302
2021-02-15 18:12:04 +01:00
Alessandro Benedetti 227ef3b397
SOLR-15149: model creation errors fixes (#2350)
SOLR-15149: model loading errors fix + tests
2021-02-15 18:02:25 +01:00
Jason Gerlowski f993c392f1
SOLR-15101: Add list/delete APIs for incremental backups (#2336)
SOLR-13608 introduces a new "incremental" backup format, which allows
storage of multiple backup "points" in the same location. This
development introduces a need for APIs to manage these potentially
plural backups.

This commit introduces /admin/collections?action=LISTBACKUPS and
/admin/collections?action=DELETEBACKUP to handle these backups.
2021-02-15 07:04:05 -05:00
Ishan Chattopadhyaya 83ccf23fa2 SOLR-15138: Send an intra-process message after ZkClient#create() call 2021-02-15 12:22:22 +05:30
Dawid Weiss 8f56ae0a4b
LUCENE-9767: infrastructure for icu regeneration in place. (#2362) 2021-02-14 21:07:39 +01:00
Timothy Potter 97763ad3ce
SOLR-15145: Additional fix for SolrJ back-compat (#2366) 2021-02-14 10:19:20 -07:00
David Smiley 99f011a06e Revert "LUCENE-9762: FunctionScoreQuery must guard score() called twice (#2358)"
This reverts commit 6c140b6d
2021-02-14 00:53:32 -05:00
David Smiley 6c140b6dcf
LUCENE-9762: FunctionScoreQuery must guard score() called twice (#2358)
The score() may be called multiple times. It should take care to call DoubleValues.advanceExact only the first time, or risk faulty behavior including exceptions.
2021-02-14 00:13:23 -05:00
Ilan Ginzburg 23755ddfdd
SOLR-14928: allow cluster state updates to be done in a distributed way and not through Overseer (#2364) 2021-02-13 02:42:18 +01:00
Joel Bernstein 4a42ecd936 SOLR-15132: Add temporal graph query to the nodes Streaming Expression 2021-02-12 15:19:05 -05:00
Ishan Chattopadhyaya 4b113067d8 SOLR-15138: Collection creation for PerReplicaStates does not scale to large collections as well as regular collections (#2318) 2021-02-13 01:10:35 +05:30
Dawid Weiss f7e42bdb35
LUCENE-9768: Add source sets for src/tools, clean up forbidden API and formatting errors (#2361) 2021-02-12 17:03:54 +01:00
Dawid Weiss d89cb72fa7 Remove leftover debug statement. 2021-02-12 17:02:30 +01:00
epugh@opensourceconnections.com f95c150da2 common typo for defined fixed 2021-02-12 10:11:09 -05:00
epugh@opensourceconnections.com 3ea64b39cb typo fix 2021-02-12 10:07:46 -05:00
Peter Gromov 02ea7a1139
LUCENE-9765: Hunspell: rename SpellChecker to Hunspell, fix test name, update javadoc and CHANGES.txt (#2354) 2021-02-12 15:44:36 +01:00
Jason Gerlowski 9905c0cc2d Fix CHANGES.txt attribution typo 2021-02-12 07:11:02 -05:00
Ishan Chattopadhyaya 938039a688 SOLR-15136: Reduce excessive logging introduced with Per Replica States feature 2021-02-12 11:52:19 +05:30
David Smiley db61297590 SOLR-15011: Remove flawed test 2021-02-11 22:53:38 -05:00
Lu Xugang 9837bc4a4d
LUCENE-9590: Add javadoc for Lucene86PointsFormat class (#2194)
to Lucene's Confluence.
* also corrected some trivial errors in javadocs & comments
2021-02-11 15:25:28 -05:00
Timothy Potter 8662121ca5
SOLR-15145: solr.storeBaseUrl feature flag introduced in 8.8.1 should default to false for 9.x 2021-02-11 12:50:29 -07:00
Tomas Fernandez Lobbe b6db6c88d7 SOLR-15114: Add CHANGES entry 2021-02-11 09:16:39 -08:00
Julie Tibshirani 096f054d56
LUCENE-9705: Reset internal version in Lucene90FieldInfosFormat. (#2339)
Since this is a fresh format, we can remove older version logic and reset the
internal version to 0.
2021-02-11 09:11:47 -08:00
Naoto MINAMI 0cbb38ff4a
SOLR-15114: WAND does not work correctly on multiple segments (#2259)
In Solr 8.6.3, minCompetitiveScore of WANDScorer resets to zero for each index segment and remain zero until maxScore is updated.
There are two causes of this problem:
* MaxScoreCollector does not set minCompetitiveScore of MinCompetitiveScoreAwareScorable newly generated for another index segment.
* MaxScoreCollector updates minCompetitiveScore only if maxScore is updated. This behavior is correct considering the purpose of MaxScoreCollector.

For details, see the attached pdf https://issues.apache.org/jira/secure/attachment/13019548/wand.pdf.
2021-02-11 09:10:03 -08:00
Ignacio Vera 683a9bd78a
LUCENE-9322: Add Vectors format to CodecReader accounting methods (#2353) 2021-02-11 16:54:59 +01:00
Peter Gromov 019872453d
LUCENE-9763: Hunspell: fix FORBIDDENWORD support (#2351)
don't decompound if it's a simple word with a forbidden root, don't lookup the word twice, don't forbid stemming (be like Hunspell)
2021-02-11 15:16:40 +01:00
Peter Gromov 01e34f8723
LUCENE-9764: Hunspell: try title case as FORCEUCASE misspelled word suggestions (#2352) 2021-02-11 15:12:58 +01:00
Peter Gromov 7f9b1f991b
LUCENE-9758: Hunspell: support NOSUGGEST option (#2340) 2021-02-11 09:15:37 +01:00
Peter Gromov 76b55509dd
LUCENE-9759: Hunspell: add more to TestHunspellRepositoryTestCases.EXPECTED_FAILURES (#2341) 2021-02-11 09:14:29 +01:00
Peter Gromov a919f2df59
LUCENE-9761: Hunspell: check that FLAG and SET don't occur too far in the file, cleanup (#2348)
Thank you!
2021-02-11 09:13:34 +01:00
Peter Gromov c0d32444be
LUCENE-9760: Hunspell: print total memory usage in TestAllDictionaries, cleanup (#2347) 2021-02-11 09:11:09 +01:00
Ignacio Vera 73d4f8cd03
Fix file leak in Lucene90VectorWriter (#2331)
Adds a new test class based on BaseIndexFileFormatTestCase
2021-02-11 08:39:48 +01:00
Jason Gerlowski aea29464f1 SOLR-13608: Fix incremental-backup test failures
The initial incremental-backup commit introduced several test failures
on Windows test runs that I neglected to catch before committing.  Most
of these failures were the result of bad 'location' path handling in the
test logic itself, though there were a few tweaks made to Solr code
itself to better handle Windows paths as well.
2021-02-10 15:39:13 -05:00
Cassandra Targett 131d060877 Remove "incubating" next to Zeppelin in parallel-sql-interface.adoc 2021-02-10 14:22:50 -06:00
Julie Tibshirani 29ea0043ef
LUCENE-9756: Extend FieldInfosFormat tests to cover points and vectors (#2338)
This commit adds coverage to `BaseFieldInfoFormatTestCase ` for points,
vectors, and the soft deletes field.
2021-02-10 08:57:26 -08:00
Peter Gromov c4c569b998
LUCENE-9752: Hunspell Stemmer: reduce parameter count (#2333) 2021-02-10 09:34:36 +01:00
Peter Gromov 6f525302dd
LUCENE-9753: Hunspell: disallow compounds with parts present in dictionary, space-separated (#2335) 2021-02-10 09:23:15 +01:00
Peter Gromov c3166e1dc3
LUCENE-9750: Hunspell: improve suggestions for mixed-case misspelled words (#2332) 2021-02-10 09:21:01 +01:00
Peter Gromov 5fd18881e9
LUCENE-9749: Hunspell: apply output conversion (OCONV) to the suggestions (#2329) 2021-02-10 09:17:44 +01:00
Peter Gromov f2b7cdc491
LUCENE-9748: Hunspell: suggest inflected dictionary entries similar to the misspelled word (#2330) 2021-02-10 09:16:06 +01:00
Dawid Weiss 1f5b37f299
LUCENE-9747: dodge javadoc reporter NPE bug on Java 11. (#2337) 2021-02-09 21:47:38 +01:00