Commit Graph

35546 Commits

Author SHA1 Message Date
Robert Muir 95095d0d49
upgrade ecj linter from 3.25.0 -> 3.27.0 (#483)
The newest version has a significant performance increase for our
use-case.
2021-11-28 12:05:42 -05:00
Robert Muir 756550f88b
speed up extremely slow test methods (runtime 15-30s) (#471) 2021-11-28 09:41:15 -05:00
Tomoko Uchida 38762ee8cf Use the same analysis chain to StandardAnalyzer (a follow-up of #480) (#482) 2021-11-28 21:24:11 +09:00
Tomoko Uchida 93bb52c601 set group to 'run' benchmark task (#481) 2021-11-28 21:23:48 +09:00
Tomoko Uchida eb912a9158 fix typo in documentation 2021-11-28 10:13:28 +09:00
Uwe Schindler 92a2428906 Fix wrong path in documentation 2021-11-28 00:56:27 +01:00
Tomoko Uchida e222031943 LUCENE-10261: clean up reflection stuff in luke module and make minor adjustments (#480) 2021-11-27 15:42:16 +09:00
Dawid Weiss f599a8e2ee LUCENE-10260: Luke's about window no longer shows version number (#473) 2021-11-26 08:33:30 +01:00
Ignacio Vera cb0c2b87ed LUCENE-10262: Lift up restrictions for navigating PointValues#PointTree (#476)
This change allows random navigation of a PointValues#PointTree.
2021-11-26 07:43:43 +01:00
Uwe Schindler 30c4b8d5b8 LUCENE-10259: Fix startup scripts to allow whitespace in path names and use /bin/sh only (#472) 2021-11-25 16:11:37 +01:00
Tomoko Uchida bfa3f01a17 LUCENE-10261: Remove preset analyzer panel from Luke Analysis UI. (#475) 2021-11-25 20:33:34 +09:00
Ignacio Vera 58ef7d911a
LUCENE-9820: PointTree#size() should handle the case of balanced tree in pre-8.6 indexes (#462) (#474)
Handle properly the case where trees are fully balanced for number of dimension > 1
2021-11-25 11:19:02 +01:00
Adrien Grand a97a1e2815 Fix test failures with testIndexUpgraderCommandLineArgs and ExtraFS. 2021-11-25 08:50:27 +01:00
David Smiley e2e99da4a8
Javadocs, Sorter impls (#426)
* Javadocs, Sorter impls
* clarify which sorts are stable/not
* link from utility methods to the primary Sorter implementations for further information
* describe when InPlaceMergeSorter is useful.  Fix incorrect statement that is uses insertion sort.

* Javadocs for Sorter
2021-11-25 00:44:58 -05:00
Adrien Grand b3a36166a5 Speed up TestBackwardsCompatibility#testCommandLineArgs. (#467)
This test unzip files that we already unzipped. This commit copies the already
uncompressed files instead.
2021-11-24 08:26:42 +01:00
Adrien Grand 3f634e2ab9 LUCENE-10168: Only test N-2 codecs on nightly runs.
In order for tests to keep running fast, this annotates all tests of N-2 codecs
with `@Nightly`. To keep good coverage of releases, the smoke tester is now
configured to run nightly tests.
2021-11-24 08:26:42 +01:00
Tomoko Uchida 170137129a LUCENE-10200: fix luke lauch script. 2021-11-22 19:16:18 +09:00
Andriy Redko 51c37db005 LUCENE-10244: MultiCollector::getCollectors is now public 2021-11-21 07:44:08 -08:00
Adrien Grand ee8829da5b Add dash between `rev` and the git hash. 2021-11-20 08:09:33 +01:00
Greg Miller 0ba310782f
LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals 2021-11-19 13:11:42 -08:00
Patrick Zhai 6b99f03cdd
LUCENE-10122 Use NumericDocValue to store taxonomy parent array (#454) 2021-11-19 13:05:56 -05:00
Quentin Pradet 631d1ad749 LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery (#445)
Co-authored-by: Adrien Grand <jpountz@gmail.com>
2021-11-19 18:07:29 +01:00
Robert Muir ee56d31425
LUCENE-10239: upgrade jflex (1.7.0 -> 1.8.2) (#452)
Upgrade jflex.

Change doesn't alter the behavior of any of the analyzers (unicode
version or grammar refactorings), just the minimal to get new tooling
working.
2021-11-19 09:28:11 -05:00
Ignacio Vera 9adf7e27f9
LUCENE-9820: Separate logic for reading the BKD index from logic to intersecting it (#7) (#457)
Extract BKD tree interface and move intersecting logic to the PointValues abstract class.
2021-11-19 08:39:28 +01:00
Jim Ferenczi 2e5c4bb5a5 LUCENE-10208: Ensure that the minimum competitive score does not decrease in concurrent search (#431)
Co-authored-by: Adrien Grand <jpountz@gmail.com>
2021-11-18 17:33:04 +01:00
Andriy Redko 42bee6f223 LUCENE-10242: The TopScoreDocCollector::createSharedManager should use ScoreDoc instead of FieldDoc (#450)
Signed-off-by: Andriy Redko <andriy.redko@aiven.io>
2021-11-18 16:36:32 +01:00
Dawid Weiss 8d07018050 LUCENE-10240: gradle regenerate fails on java 17 (#449) 2021-11-17 18:36:58 +01:00
Dawid Weiss 4c22d30f80 LUCENE-10238: Update icu4j to 70.1. (#447) 2021-11-17 18:14:33 +01:00
Adrien Grand 7ce0cfa9c5 Add back-compat indices for 8.11.0 2021-11-17 11:51:18 +01:00
Bruno Roustant 02a63f688c
LUCENE-10225: Improve IntroSelector with 3-way partitioning. 2021-11-17 11:31:11 +01:00
Adrien Grand b6f456573a DOAP changes for release 8.11.0 2021-11-16 10:55:08 +01:00
Dawid Weiss 9d0eb88d2c LUCENE-10234: Add automatic module name to JAR manifests. (#440) 2021-11-15 17:03:08 +01:00
Quentin Pradet e034a2d6e2 LUCENE-10085: Rename DocValuesFieldExistsQuery test (#441)
FieldValueQuery got renamed to DocValuesFieldExistsQuery but the test
wasn't renamed.
2021-11-15 16:24:57 +01:00
Julie Tibshirani 607b10dc2a LUCENE-10069: Document that kNN queries might not return all results (#434)
Performing a kNN search with very large k may return fewer than k documents.
This is due to the fact that the HNSW graph is not guaranteed to be connected.
This commit documents the behavior as part of a general warning that the results
of a kNN search may be approximate.
2021-11-12 14:20:09 -08:00
Julie Tibshirani 68be365283 LUCENE-10063: Fix score calculation in SimpleTextKnnVectorsFormat
The method VectorSimilarityFunction#convertToScore already reverses the
similarity, so we shouldn't reverse it again.
2021-11-11 11:36:50 -08:00
Julie Tibshirani 9c73562161 LUCENE-10228: Ensure PerFieldKnnVectorsFormat uses right format name (#432)
Before when creating a KnnVectorsWriter for merging, we consulted the existing
"PER_FIELD_SUFFIX_KEY" attribute to determine the format's per-field suffix.
This isn't correct since we could be using a new codec (that produces different
formats/ suffixes).

This commit modifies TestPerFieldDocValuesFormat#testMergeUsesNewFormat to
trigger the problem. Without the fix we it throws an error like
"java.nio.file.FileAlreadyExistsException: File
"_3_Lucene90HnswVectorsFormat_0.vem" was already written to."
2021-11-11 11:22:52 -08:00
Dawid Weiss ff9ee28c60 LUCENE-10223: interval support in standard syntax parser (#429) 2021-11-11 08:56:48 +01:00
Dawid Weiss 238cd5fd0c LUCENE-10226: test target creates a weird folder (lazy property). 2021-11-09 08:38:42 +01:00
Dawid Weiss ffe40d23e1 LUCENE-10222: Enable github precommit check workflow on branch_9x 2021-11-05 09:01:45 +01:00
Dawid Weiss 5de05f3556 LUCENE-10220: Add an utility method to get IntervalSource from analyzed text (or token stream) (#427) 2021-11-05 08:58:37 +01:00
Uwe Schindler 6ccee3204f UCENE-10218: Extend validateSourcePatterns task to scan for LTR/RTL unicode to catch "Trojan Source" source code attacks (#425)
Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
# Conflicts:
#	gradle/validation/validate-source-patterns.gradle
2021-11-03 17:21:15 +01:00
Adrien Grand 5fa093bdba Format javadocs of new versions in a way that Spotless is happy with. 2021-11-02 13:23:45 +01:00
Adrien Grand 713385004f Add next minor version 9.1.0 2021-11-02 13:20:20 +01:00
Adrien Grand cc2a31f2be LUCENE-10103: Move CHANGES entry to correct version. 2021-11-02 10:35:55 +01:00
Bruno Roustant 63b9e603e6
LUCENE-10196: Improve IntroSorter with 3-ways partitioning. 2021-11-01 10:55:44 +01:00
Dawid Weiss 0544819b78
LUCENE-10200: store git revision in the release folder and read it back from buildAndPushRelease (#419) 2021-11-01 09:29:06 +01:00
Dawid Weiss 1d152c5f67 LUCENE-10192: drop jars from binary distribution and an aggregate merge of related minor tasks. 2021-10-31 10:50:11 +01:00
Dawid Weiss 98b17952f9 LUCENE-10213: Use unicode escapes in message property files in Luke (remove hacks) 2021-10-31 10:41:54 +01:00
Dawid Weiss ded915b29b LUCENE-10192: Use modules instead of classpath for binary distribution testing. 2021-10-31 10:41:49 +01:00
Dawid Weiss 01839da593 LUCENE-10192: Adjust checks to the new binary file structure. 2021-10-31 10:41:42 +01:00