Commit Graph

35488 Commits

Author SHA1 Message Date
Ignacio Vera ad911df260
LUCENE-9820: Separate logic for reading the BKD index from logic to intersecting it (#7)
Extract BKD tree interface and move intersecting logic to the PointValues abstract class.
2021-11-19 08:28:01 +01:00
zacharymorn 07ee3ba83a
LUCENE-10236: Update field-weight used in CombinedFieldQuery scoring calculation (#444) 2021-11-18 21:36:38 -08:00
Andriy Redko 6bd5c14bf3
LUCENE-10242: The TopScoreDocCollector::createSharedManager should use ScoreDoc instead of FieldDoc (#450)
Signed-off-by: Andriy Redko <andriy.redko@aiven.io>
2021-11-18 16:35:59 +01:00
Patrick Zhai b4476e4318
LUCENE-10122 Use NumericDocValue to store taxonomy parent array instead of custom term positions (#451) 2021-11-17 19:32:34 -05:00
Dawid Weiss bae095ae48
LUCENE-10240: gradle regenerate fails on java 17 (#449) 2021-11-17 18:36:34 +01:00
Dawid Weiss 0eeba8d37c
LUCENE-10238: Update icu4j to 70.1. (#447) 2021-11-17 18:13:40 +01:00
Adrien Grand 556c7c5fb5 Add back-compat indices for 8.11.0. 2021-11-17 11:53:49 +01:00
Bruno Roustant c71cbac4f9
LUCENE-10225: Improve IntroSelector with 3-way partitioning. 2021-11-17 10:38:27 +01:00
Adrien Grand c0112dd2ff DOAP changes for release 8.11.0 2021-11-16 10:54:24 +01:00
Dawid Weiss f5e5cf008a
LUCENE-10234: Add automatic module name to JAR manifests. (#440) 2021-11-15 17:02:40 +01:00
Quentin Pradet 1e5e997880
LUCENE-10085: Rename DocValuesFieldExistsQuery test (#441)
FieldValueQuery got renamed to DocValuesFieldExistsQuery but the test
wasn't renamed.
2021-11-15 16:24:29 +01:00
Julie Tibshirani 3b914a4d73
LUCENE-10069: Document that kNN queries might not return all results (#434)
Performing a kNN search with very large k may return fewer than k documents.
This is due to the fact that the HNSW graph is not guaranteed to be connected.
This commit documents the behavior as part of a general warning that the results
of a kNN search may be approximate.
2021-11-12 14:19:20 -08:00
Julie Tibshirani 2a9adb81df LUCENE-10063: Fix score calculation in SimpleTextKnnVectorsFormat
The method VectorSimilarityFunction#convertToScore already reverses the
similarity, so we shouldn't reverse it again.
2021-11-11 11:22:03 -08:00
Dawid Weiss f725b27e12
LUCENE-10223: interval support in standard syntax parser (#429) 2021-11-11 08:54:59 +01:00
Julie Tibshirani 53586d4231
LUCENE-10228: Ensure PerFieldKnnVectorsFormat uses right format name (#432)
Before when creating a KnnVectorsWriter for merging, we consulted the existing
"PER_FIELD_SUFFIX_KEY" attribute to determine the format's per-field suffix.
This isn't correct since we could be using a new codec (that produces different
formats/ suffixes).

This commit modifies TestPerFieldDocValuesFormat#testMergeUsesNewFormat to
trigger the problem. Without the fix we it throws an error like
"java.nio.file.FileAlreadyExistsException: File
"_3_Lucene90HnswVectorsFormat_0.vem" was already written to."
2021-11-10 08:18:01 -08:00
Jim Ferenczi 94b66c0ed2
LUCENE-10208: Ensure that the minimum competitive score does not decrease in concurrent search (#431)
Co-authored-by: Adrien Grand <jpountz@gmail.com>
2021-11-09 11:04:17 +01:00
Dawid Weiss 263765a9b0 LUCENE-10226: test target creates a weird folder (lazy property). 2021-11-09 08:34:04 +01:00
Dawid Weiss ded77d8bfd LUCENE-10222: Enable github precommit check workflow on branch_9x 2021-11-05 09:04:10 +01:00
Dawid Weiss 32d7f52446
LUCENE-10220: Add an utility method to get IntervalSource from analyzed text (or token stream) (#427) 2021-11-05 08:57:48 +01:00
Dawid Weiss bec8eaef70 Add version 9.1.0 changes section on main too. 2021-11-04 15:29:31 +01:00
Dawid Weiss 4e047ab0d8 Tidy. 2021-11-04 15:21:08 +01:00
Adrien Grand ae29a3b326 Fix OOM during tests. 2021-11-04 10:46:41 +01:00
Dawid Weiss adec73dd28
LUCENE-10088: allow per-class override in HandleLimitFS. Bump the limit a bit for nightlies in TestIndexWriterMergePolicy. (#424) 2021-11-04 08:31:28 +01:00
Uwe Schindler 1ae6b2a6b9
UCENE-10218: Extend validateSourcePatterns task to scan for LTR/RTL unicode to catch "Trojan Source" source code attacks (#425)
Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
2021-11-03 17:19:24 +01:00
Adrien Grand f9be01d5cc Format javadocs of new versions in a way that Spotless is happy with. 2021-11-02 13:12:39 +01:00
Adrien Grand 1ddb01044a Fix backward-compatibility test and remove unnecessary backward compatibility. 2021-11-02 13:12:39 +01:00
Dawid Weiss 20c4a1466d Tidy. 2021-11-02 12:55:02 +01:00
Dawid Weiss a53d633bd9 LUCENE-10195: LUCENE-10195: Add gradle cache option and make some tasks
cacheable
2021-11-02 12:20:09 +01:00
Dawid Weiss d14ee2d01e Use a dumber offlineLinks map but expose more structured information to gradle. 2021-11-02 12:18:52 +01:00
Dawid Weiss dab6e8454a Add changes entry. 2021-11-02 12:18:52 +01:00
Dawid Weiss d819a2eec4 Cherry pick the validateSourcePatterns. Correct exclusion patterns (.gradle and .idea are only at the root project level). 2021-11-02 12:18:51 +01:00
Jerome Prinet d44e03b8b0 Make ratTask cacheable 2021-11-02 12:18:51 +01:00
Jerome Prinet b3e59232f5 Make checkBrokenLinks cacheable 2021-11-02 12:18:51 +01:00
Dawid Weiss 1a38cac68e LUCENE-10195: add commented-out org.gradle.caching=true to the generated local settings. 2021-11-02 12:18:51 +01:00
Adrien Grand 2a7dd4d7dc Add next major version 10.0.0 2021-11-02 11:47:19 +01:00
Adrien Grand cc2a31f2be LUCENE-10103: Move CHANGES entry to correct version. 2021-11-02 10:35:55 +01:00
Bruno Roustant 63b9e603e6
LUCENE-10196: Improve IntroSorter with 3-ways partitioning. 2021-11-01 10:55:44 +01:00
Dawid Weiss 0544819b78
LUCENE-10200: store git revision in the release folder and read it back from buildAndPushRelease (#419) 2021-11-01 09:29:06 +01:00
Dawid Weiss 1d152c5f67 LUCENE-10192: drop jars from binary distribution and an aggregate merge of related minor tasks. 2021-10-31 10:50:11 +01:00
Dawid Weiss 98b17952f9 LUCENE-10213: Use unicode escapes in message property files in Luke (remove hacks) 2021-10-31 10:41:54 +01:00
Dawid Weiss ded915b29b LUCENE-10192: Use modules instead of classpath for binary distribution testing. 2021-10-31 10:41:49 +01:00
Dawid Weiss 01839da593 LUCENE-10192: Adjust checks to the new binary file structure. 2021-10-31 10:41:42 +01:00
Dawid Weiss d23f37d02d LUCENE-10200: The branch does not have to be on origin remote. Replace this logic with a check whether the branch is up to date with the remote. 2021-10-31 10:41:32 +01:00
Dawid Weiss 6d8ea58ccd LUCENE-10200: Rename pddl-10.txt to reference glove. 2021-10-31 10:41:16 +01:00
Dawid Weiss 7f7007966e LUCENE-10192: No need for hacky classpath, add the log4j module to the root set. Automatic modules have access to all other modules by default. 2021-10-31 10:41:10 +01:00
Dawid Weiss 627ef4d469 LUCENE-9978: Integrate Luke with the binary release package. 2021-10-31 10:40:51 +01:00
Dawid Weiss 39d388330c LUCENE-10192: Move the test framework to a separate top-level folder. I'm not even sure it really needs to be in the binary distribution but it is distinctively different from the rest of the modules. 2021-10-31 10:40:45 +01:00
Dawid Weiss fda47a24f8 LUCENE-10192: Flatten the modules into a single jar folder to allow --module-path to be used. So much simpler.' 2021-10-31 10:40:23 +01:00
Dawid Weiss bcdfc4c8c9 LUCENE-10192: drop third party jars from the binary distribution. 2021-10-31 10:36:33 +01:00
Michael Sokolov 84a4797d14 Apply query score conversion to vector similarities in SimpleTextKnnVectorReader 2021-10-30 21:26:17 -04:00