35766 Commits

Author SHA1 Message Date
Ignacio Vera 78c8d7b7ea
LUCENE-9538: Detect polygon self-intersections in the Tessellator (#428)
Detect self-intersections so it can provide a more meaningful error to the users.
2021-11-29 11:05:03 +01:00
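
As a rough illustration of what detecting a self-intersection involves, here is a minimal brute-force sketch. It is not the Tessellator's implementation: the class and method names are hypothetical, the check is O(n^2), and it only reports proper crossings (touching vertices and collinear overlaps are ignored).

```java
// Hypothetical sketch, not Lucene code: brute-force self-intersection check for a closed ring.
final class SelfIntersectionCheck {

  /** Sign of the cross product (b - a) x (c - a): 1 = left turn, -1 = right turn, 0 = collinear. */
  static int orient(double ax, double ay, double bx, double by, double cx, double cy) {
    double cross = (bx - ax) * (cy - ay) - (by - ay) * (cx - ax);
    return cross > 0 ? 1 : (cross < 0 ? -1 : 0);
  }

  /** True if segments a-b and c-d cross at a single interior point of both. */
  static boolean properlyCross(double ax, double ay, double bx, double by,
                               double cx, double cy, double dx, double dy) {
    int o1 = orient(ax, ay, bx, by, cx, cy);
    int o2 = orient(ax, ay, bx, by, dx, dy);
    int o3 = orient(cx, cy, dx, dy, ax, ay);
    int o4 = orient(cx, cy, dx, dy, bx, by);
    return o1 * o2 < 0 && o3 * o4 < 0;
  }

  /**
   * Throws a descriptive error if two non-adjacent edges of the ring cross.
   * The ring is given without repeating the first vertex at the end.
   */
  static void checkSimple(double[] xs, double[] ys) {
    int n = xs.length;
    for (int i = 0; i < n; i++) {
      int i2 = (i + 1) % n;
      for (int j = i + 1; j < n; j++) {
        int j2 = (j + 1) % n;
        if (i2 == j || j2 == i) {
          continue; // adjacent edges share a vertex and cannot properly cross
        }
        if (properlyCross(xs[i], ys[i], xs[i2], ys[i2], xs[j], ys[j], xs[j2], ys[j2])) {
          throw new IllegalArgumentException(
              "Polygon self-intersection between edge " + i + " and edge " + j);
        }
      }
    }
  }
}
```

A unit square passes the check, while a "bowtie" ring such as (0,0), (1,1), (1,0), (0,1) is rejected with a message naming the two crossing edges.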
Ignacio Vera 634c22c527
LUCENE-10264: Clone index input when creating a PointTree in SimpleTextBKDReader (#478)
Fixes a race condition introduced in LUCENE-9820.
2021-11-29 09:20:20 +01:00
Robert Muir 63c89f678d
Speed up ECJ tasks by avoiding --release (#484)
LUCENE-10185 caused a large performance regression in ECJ tasks by using the --release flag.

Instead of using --release, we can just disable "terminal deprecation", and leave this check to `javac`. The --release flag makes this tool run 50% slower.
2021-11-28 15:10:32 -05:00
Robert Muir 1fb45da7bb
upgrade ecj linter from 3.25.0 -> 3.27.0 (#483)
The newest version has a significant performance increase for our
use-case.
2021-11-28 12:05:19 -05:00
Robert Muir 3772ff563a
speed up extremely slow test methods (runtime 15-30s) (#471) 2021-11-28 09:40:43 -05:00
Tomoko Uchida cb5f1b6ca0
Use the same analysis chain as StandardAnalyzer (a follow-up of #480) (#482) 2021-11-28 21:22:28 +09:00
Tomoko Uchida c041517304
set group to 'run' benchmark task (#481) 2021-11-28 21:22:07 +09:00
Tomoko Uchida 9eb7857199 fix typo in documentation 2021-11-28 10:11:49 +09:00
Uwe Schindler aed47c1862 Fix wrong path in documentation 2021-11-28 00:55:28 +01:00
Tomoko Uchida 57f695b14d
LUCENE-10261: clean up reflection stuff in luke module and make minor adjustments (#480) 2021-11-27 15:36:38 +09:00
Dawid Weiss 1029651d12 Don't log warnings from ant (different class loader, I guess). Makes Alan happier. 2021-11-26 11:39:55 +01:00
Dawid Weiss 651755aab7
LUCENE-10260: Luke's about window no longer shows version number (#473) 2021-11-26 08:32:23 +01:00
Ignacio Vera a590c6d2a0
LUCENE-10262: Lift up restrictions for navigating PointValues#PointTree (#476)
This change allows random navigation of a PointValues#PointTree.
2021-11-26 07:42:13 +01:00
Uwe Schindler d973e50c15
LUCENE-10259: Fix startup scripts to allow whitespace in path names and use /bin/sh only (#472) 2021-11-25 16:07:23 +01:00
Tomoko Uchida 40b38438c8
LUCENE-10261: Remove preset analyzer panel from Luke Analysis UI. (#475) 2021-11-25 20:30:36 +09:00
Ignacio Vera 800f002e44
LUCENE-9820: PointTree#size() should handle the case of balanced tree in pre-8.6 indexes (#462)
Properly handle the case where trees are fully balanced and the number of dimensions is > 1.
2021-11-25 11:03:16 +01:00
Adrien Grand 8710252116 Fix test failures with testIndexUpgraderCommandLineArgs and ExtraFS. 2021-11-25 08:51:56 +01:00
Adrien Grand f80d816ce7
Speed up TestBackwardsCompatibility#testCommandLineArgs. (#467)
This test unzips files that we have already unzipped. This commit copies the
already uncompressed files instead.
2021-11-24 08:25:22 +01:00
Adrien Grand 24fcd80a37
LUCENE-10168: Only test N-2 codecs on nightly runs. (#466)
In order for tests to keep running fast, this annotates all tests of N-2 codecs
with `@Nightly`. To keep good coverage of releases, the smoke tester is now
configured to run nightly tests.
2021-11-24 08:20:04 +01:00
Greg Miller 6ee69e06fb
LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals (#264) 2021-11-23 06:00:11 -08:00
David Smiley 0fcf9c825f
Javadocs, Sorter impls (#426)
* Javadocs, Sorter impls
* clarify which sorts are stable/not
* link from utility methods to the primary Sorter implementations for further information
* describe when InPlaceMergeSorter is useful. Fix incorrect statement that it uses insertion sort.

* Javadocs for Sorter
2021-11-23 07:13:40 -05:00
Tomoko Uchida 4193bcbc02 LUCENE-10200: fix luke launch script. 2021-11-22 18:46:28 +09:00
Greg Miller 78ee53f837 Add missing CHANGES entry 2021-11-21 07:41:25 -08:00
Greg Miller 9d7e5ef388 Fixup TestCombinedFieldQuery to not (randomly) use numHits = 0 2021-11-21 07:38:28 -08:00
Andriy Redko 5993b9050a
LUCENE-10244: Please consider opening MultiCollector::getCollectors for public use (#455) 2021-11-21 07:36:54 -08:00
Adrien Grand 0902d803fd Add dash between `rev` and the git hash. 2021-11-20 08:09:42 +01:00
Quentin Pradet 1a869c185b
LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery (#445)
Co-authored-by: Adrien Grand <jpountz@gmail.com>
2021-11-19 18:06:58 +01:00
Robert Muir af831d2810
LUCENE-10239: upgrade jflex (1.7.0 -> 1.8.2) (#452)
Upgrade jflex.

The change doesn't alter the behavior of any of the analyzers (unicode
version or grammar refactorings); it is just the minimum needed to get the
new tooling working.
2021-11-19 09:24:27 -05:00
Ignacio Vera ad911df260
LUCENE-9820: Separate logic for reading the BKD index from logic for intersecting it (#7)
Extract BKD tree interface and move intersecting logic to the PointValues abstract class.
2021-11-19 08:28:01 +01:00
zacharymorn 07ee3ba83a
LUCENE-10236: Update field-weight used in CombinedFieldQuery scoring calculation (#444) 2021-11-18 21:36:38 -08:00
Andriy Redko 6bd5c14bf3
LUCENE-10242: The TopScoreDocCollector::createSharedManager should use ScoreDoc instead of FieldDoc (#450)
Signed-off-by: Andriy Redko <andriy.redko@aiven.io>
2021-11-18 16:35:59 +01:00
Patrick Zhai b4476e4318
LUCENE-10122 Use NumericDocValue to store taxonomy parent array instead of custom term positions (#451) 2021-11-17 19:32:34 -05:00
Dawid Weiss bae095ae48
LUCENE-10240: gradle regenerate fails on java 17 (#449) 2021-11-17 18:36:34 +01:00
Dawid Weiss 0eeba8d37c
LUCENE-10238: Update icu4j to 70.1. (#447) 2021-11-17 18:13:40 +01:00
Adrien Grand 556c7c5fb5 Add back-compat indices for 8.11.0. 2021-11-17 11:53:49 +01:00
Bruno Roustant c71cbac4f9
LUCENE-10225: Improve IntroSelector with 3-way partitioning. 2021-11-17 10:38:27 +01:00
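
For readers unfamiliar with the term, the following is a minimal sketch of selection using 3-way partitioning. It is a plain quickselect over an `int[]`, not Lucene's `IntroSelector` (it has no introspective fallback, and the class and method names are hypothetical). The point of the 3-way split is that elements equal to the pivot are grouped in one pass, so inputs with many duplicates do not degrade.

```java
// Hypothetical sketch, not Lucene code: quickselect with 3-way (Dutch national flag) partitioning.
final class ThreeWaySelect {

  /** Rearranges a so that a[k] holds the k-th smallest value (0-based) and returns it. */
  static int select(int[] a, int k) {
    int lo = 0;
    int hi = a.length - 1;
    while (lo < hi) {
      int pivot = a[lo + (hi - lo) / 2];
      int lt = lo; // a[lo..lt-1] < pivot
      int i = lo;  // a[lt..i-1] == pivot
      int gt = hi; // a[gt+1..hi] > pivot
      while (i <= gt) {
        if (a[i] < pivot) {
          swap(a, lt++, i++);
        } else if (a[i] > pivot) {
          swap(a, i, gt--);
        } else {
          i++;
        }
      }
      if (k < lt) {
        hi = lt - 1;  // answer lies among the smaller elements
      } else if (k > gt) {
        lo = gt + 1;  // answer lies among the larger elements
      } else {
        return a[k];  // k falls inside the block equal to the pivot
      }
    }
    return a[k];
  }

  private static void swap(int[] a, int i, int j) {
    int tmp = a[i];
    a[i] = a[j];
    a[j] = tmp;
  }
}
```

Note that `select` reorders the array in place, so callers that need the original order should pass a copy.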
Adrien Grand c0112dd2ff DOAP changes for release 8.11.0 2021-11-16 10:54:24 +01:00
Dawid Weiss f5e5cf008a
LUCENE-10234: Add automatic module name to JAR manifests. (#440) 2021-11-15 17:02:40 +01:00
Quentin Pradet 1e5e997880
LUCENE-10085: Rename DocValuesFieldExistsQuery test (#441)
FieldValueQuery got renamed to DocValuesFieldExistsQuery but the test
wasn't renamed.
2021-11-15 16:24:29 +01:00
Julie Tibshirani 3b914a4d73
LUCENE-10069: Document that kNN queries might not return all results (#434)
Performing a kNN search with very large k may return fewer than k documents.
This is due to the fact that the HNSW graph is not guaranteed to be connected.
This commit documents the behavior as part of a general warning that the results
of a kNN search may be approximate.
2021-11-12 14:19:20 -08:00
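
The connectivity point above can be shown with a toy example. This sketch is not HNSW and does not use any Lucene API: it simply explores a small hand-built neighbor graph from one entry node and ranks whatever it reaches, with all names and data being hypothetical. Nodes in a component that the entry node cannot reach are never considered, so a large k yields fewer than k results.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.Deque;
import java.util.List;

// Hypothetical sketch, not Lucene code: nearest-neighbor search limited to reachable graph nodes.
final class DisconnectedGraphKnn {

  /** Returns up to k node ids closest to query, considering only nodes reachable from entry. */
  static List<Integer> search(float[] points, int[][] neighbors, int entry, float query, int k) {
    boolean[] visited = new boolean[points.length];
    Deque<Integer> queue = new ArrayDeque<>();
    List<Integer> reached = new ArrayList<>();
    visited[entry] = true;
    queue.add(entry);
    while (!queue.isEmpty()) {
      int node = queue.poll();
      reached.add(node);
      for (int nb : neighbors[node]) {
        if (!visited[nb]) {
          visited[nb] = true;
          queue.add(nb);
        }
      }
    }
    // Rank the reachable nodes by distance to the query (1-D points keep the example small).
    reached.sort(Comparator.comparingDouble(n -> Math.abs(points[n] - query)));
    return new ArrayList<>(reached.subList(0, Math.min(k, reached.size())));
  }
}
```

With five points split into the components {0, 1, 2} and {3, 4}, a search entering at node 0 with k = 5 returns only three ids, even though five points exist.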
Julie Tibshirani 2a9adb81df LUCENE-10063: Fix score calculation in SimpleTextKnnVectorsFormat
The method VectorSimilarityFunction#convertToScore already reverses the
similarity, so we shouldn't reverse it again.
2021-11-11 11:22:03 -08:00
Dawid Weiss f725b27e12
LUCENE-10223: interval support in standard syntax parser (#429) 2021-11-11 08:54:59 +01:00
Julie Tibshirani 53586d4231
LUCENE-10228: Ensure PerFieldKnnVectorsFormat uses right format name (#432)
Previously, when creating a KnnVectorsWriter for merging, we consulted the existing
"PER_FIELD_SUFFIX_KEY" attribute to determine the format's per-field suffix.
This isn't correct since we could be using a new codec (that produces different
formats/suffixes).

This commit modifies TestPerFieldDocValuesFormat#testMergeUsesNewFormat to
trigger the problem. Without the fix it throws an error like
"java.nio.file.FileAlreadyExistsException: File
"_3_Lucene90HnswVectorsFormat_0.vem" was already written to."
2021-11-10 08:18:01 -08:00
Jim Ferenczi 94b66c0ed2
LUCENE-10208: Ensure that the minimum competitive score does not decrease in concurrent search (#431)
Co-authored-by: Adrien Grand <jpountz@gmail.com>
2021-11-09 11:04:17 +01:00
Dawid Weiss 263765a9b0 LUCENE-10226: test target creates a weird folder (lazy property). 2021-11-09 08:34:04 +01:00
Dawid Weiss ded77d8bfd LUCENE-10222: Enable github precommit check workflow on branch_9x 2021-11-05 09:04:10 +01:00
Dawid Weiss 32d7f52446
LUCENE-10220: Add a utility method to get IntervalSource from analyzed text (or token stream) (#427) 2021-11-05 08:57:48 +01:00
Dawid Weiss bec8eaef70 Add version 9.1.0 changes section on main too. 2021-11-04 15:29:31 +01:00
Dawid Weiss 4e047ab0d8 Tidy. 2021-11-04 15:21:08 +01:00
Adrien Grand ae29a3b326 Fix OOM during tests. 2021-11-04 10:46:41 +01:00