Uwe Schindler
1ae6b2a6b9
UCENE-10218: Extend validateSourcePatterns task to scan for LTR/RTL unicode to catch "Trojan Source" source code attacks ( #425 )
...
Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
2021-11-03 17:19:24 +01:00
Adrien Grand
f9be01d5cc
Format javadocs of new versions in a way that Spotless is happy with.
2021-11-02 13:12:39 +01:00
Adrien Grand
1ddb01044a
Fix backward-compatibility test and remove unnecessary backward compatibility.
2021-11-02 13:12:39 +01:00
Dawid Weiss
20c4a1466d
Tidy.
2021-11-02 12:55:02 +01:00
Dawid Weiss
a53d633bd9
LUCENE-10195: LUCENE-10195: Add gradle cache option and make some tasks
...
cacheable
2021-11-02 12:20:09 +01:00
Dawid Weiss
d14ee2d01e
Use a dumber offlineLinks map but expose more structured information to gradle.
2021-11-02 12:18:52 +01:00
Dawid Weiss
dab6e8454a
Add changes entry.
2021-11-02 12:18:52 +01:00
Dawid Weiss
d819a2eec4
Cherry pick the validateSourcePatterns. Correct exclusion patterns (.gradle and .idea are only at the root project level).
2021-11-02 12:18:51 +01:00
Jerome Prinet
d44e03b8b0
Make ratTask cacheable
2021-11-02 12:18:51 +01:00
Jerome Prinet
b3e59232f5
Make checkBrokenLinks cacheable
2021-11-02 12:18:51 +01:00
Dawid Weiss
1a38cac68e
LUCENE-10195: add commented-out org.gradle.caching=true to the generated local settings.
2021-11-02 12:18:51 +01:00
Adrien Grand
2a7dd4d7dc
Add next major version 10.0.0
2021-11-02 11:47:19 +01:00
Adrien Grand
cc2a31f2be
LUCENE-10103: Move CHANGES entry to correct version.
2021-11-02 10:35:55 +01:00
Bruno Roustant
63b9e603e6
LUCENE-10196: Improve IntroSorter with 3-ways partitioning.
2021-11-01 10:55:44 +01:00
Dawid Weiss
0544819b78
LUCENE-10200: store git revision in the release folder and read it back from buildAndPushRelease ( #419 )
2021-11-01 09:29:06 +01:00
Dawid Weiss
1d152c5f67
LUCENE-10192: drop jars from binary distribution and an aggregate merge of related minor tasks.
2021-10-31 10:50:11 +01:00
Dawid Weiss
98b17952f9
LUCENE-10213: Use unicode escapes in message property files in Luke (remove hacks)
2021-10-31 10:41:54 +01:00
Dawid Weiss
ded915b29b
LUCENE-10192: Use modules instead of classpath for binary distribution testing.
2021-10-31 10:41:49 +01:00
Dawid Weiss
01839da593
LUCENE-10192: Adjust checks to the new binary file structure.
2021-10-31 10:41:42 +01:00
Dawid Weiss
d23f37d02d
LUCENE-10200: The branch does not have to be on origin remote. Replace this logic with a check whether the branch is up to date with the remote.
2021-10-31 10:41:32 +01:00
Dawid Weiss
6d8ea58ccd
LUCENE-10200: Rename pddl-10.txt to reference glove.
2021-10-31 10:41:16 +01:00
Dawid Weiss
7f7007966e
LUCENE-10192: No need for hacky classpath, add the log4j module to the root set. Automatic modules have access to all other modules by default.
2021-10-31 10:41:10 +01:00
Dawid Weiss
627ef4d469
LUCENE-9978: Integrate Luke with the binary release package.
2021-10-31 10:40:51 +01:00
Dawid Weiss
39d388330c
LUCENE-10192: Move the test framework to a separate top-level folder. I'm not even sure it really needs to be in the binary distribution but it is distinctively different from the rest of the modules.
2021-10-31 10:40:45 +01:00
Dawid Weiss
fda47a24f8
LUCENE-10192: Flatten the modules into a single jar folder to allow --module-path to be used. So much simpler.'
2021-10-31 10:40:23 +01:00
Dawid Weiss
bcdfc4c8c9
LUCENE-10192: drop third party jars from the binary distribution.
2021-10-31 10:36:33 +01:00
Michael Sokolov
84a4797d14
Apply query score conversion to vector similarities in SimpleTextKnnVectorReader
2021-10-30 21:26:17 -04:00
David Smiley
c2c215d3a8
LUCENE-10201: Upgrade Spatial4j to 0.8 ( #409 )
...
Upgrading Spatial4j to 0.8 improving a varitety of minor things.
See release notes:
https://github.com/locationtech/spatial4j/releases/tag/spatial4j-0.8
Test-only dependency on JTS is upgraded to 1.17 as well
2021-10-29 22:01:52 -04:00
Mike Drob
23256a30fa
Replace deprecated Gradle 7.2 properties ( #417 )
2021-10-29 09:59:47 -05:00
Adrien Grand
53b40e0fb7
LUCENE-10145: Revert change to computeMinMax.
...
This part of the change would call `ArrayUtil#getUnsignedComparator` on a
length that is rarely 4 or 8. In such cases it's better to use
`Arrays#compareUnsigned`.
2021-10-28 16:29:05 +02:00
Mike McCandless
512cad0e01
LUCENE-9673: fix IntBlockPool's slice allocator to actually grow properly with larger and larger slice-chained int[]; excise wasted RAM due to unused (overallocation) of int[] to track in-memory postings
2021-10-28 09:37:36 -04:00
Dawid Weiss
727c6b1e0b
LUCENE-10209: Temporarily comment out gradle validation.
2021-10-27 21:12:14 +02:00
Dawid Weiss
62eb9a809e
LUCENE-10200: remove unused dangling license exclusions. Add references to the remaining ones.
2021-10-27 20:40:39 +02:00
Julie Tibshirani
abd5ec4ff0
LUCENE-9614: Fix KnnVectorQuery failure when numDocs is 0 ( #413 )
...
When the reader has no live docs, `KnnVectorQuery` can error out. This happens
because `IndexReader#numDocs` is 0, and we end up passing an illegal value of
`k = 0` to the search method.
This commit removes the problematic optimization in `KnnVectorQuery` and
replaces with a lower-level based on the total number of vectors in the segment.
2021-10-27 11:08:47 -07:00
Nik Everett
941df98c3f
LUCENE-10206 Implement O(1) count on query cache ( #415 )
...
When we load a query into the query cache we always calculate the count
of matching documents. This uses that count to power the new `O(1)`
`Weight#count` method.
2021-10-27 10:20:10 +02:00
Dawid Weiss
1613355149
LUCENE-10163: update smoke tester - README inside lucene/ is no longer there in the source release.
2021-10-26 21:58:20 +02:00
Dawid Weiss
4329450392
LUCENE-10198: remove debug statement that crept in.
2021-10-26 21:33:19 +02:00
Dawid Weiss
fb6aaa7b2c
LUCENE-10199: drop binary .zip artifact. ( #407 )
2021-10-26 21:21:30 +02:00
Dawid Weiss
08c0356664
LUCENE-10163: clean up and remove some old cruft in readme files. Move binary release only README.md to the distribution project so that it doesn't look weird in the source tree. ( #406 )
2021-10-26 21:20:42 +02:00
Dawid Weiss
780846a732
LUCENE-10198: Allow external JAVA_OPTS in gradlew scripts; use sane defaults (heap, stack and system proxies) ( #405 )
...
Co-authored-by: balmukundblr <balmukund.mandal@intel.com>
2021-10-26 09:15:55 +02:00
Dawid Weiss
486141f0eb
LUCENE-9660: correct help/tests.txt.
2021-10-26 08:45:58 +02:00
Mayya Sharipova
2ed6e4aa78
LUCENE-10154 NumericLeafComparator to define getPointValues ( #364 )
...
This patch adds getPointValues to NumericLeafComparatorsimilar how it
has getNumericDocValues.
Numeric Sort optimization with points relies on the assumption that
points and doc values record the same information, as we substitute
iterator over doc_values with one over points.
If we override getNumericDocValues it almost certainly means that whatever
PointValues NumericComparator is going to look at shouldn't be used to
skip non-competitive documents. Returning null for pointValues in this
case will force comparator NOT to use sort optimization with points,
and continue with a traditional way of iterating over doc values.
2021-10-25 09:38:37 -04:00
Dawid Weiss
81f5b4d642
LUCENE-9660: add tests.neverUpToDate=true option which, by default, makes test tasks always execute. ( #410 )
2021-10-25 14:51:11 +02:00
David Smiley
2719cf6630
LUCENE-9431: UnifiedHighlighter WEIGHT_MATCHES is now true by default ( #362 )
...
Co-authored-by: Animesh Pandey <apanimesh061@gmail.com>
2021-10-22 20:40:22 -04:00
Michael McCandless
e3151d6c7d
LUCENE-10093: fix conflicting test assert to match how TieredMergePolicy (TMP) works; improv TMP javadocs ( #375 )
2021-10-21 09:23:17 -04:00
Adrien Grand
8b6c90eccd
LUCENE-10165: Fix test failures.
2021-10-21 11:32:10 +02:00
Adrien Grand
9e84b2fd41
LUCENE-10165: Implement Lucene90DocValuesProducer#getMergeInstance. ( #374 )
...
This speeds up merging by returning doc values that perform faster when all doc
IDs and values are consumed.
2021-10-21 08:41:47 +02:00
Nhat Nguyen
4c2692e897
Do not run testHighOrdsSortedSetDV with SimpleTextCodec ( #403 )
...
Avoid running testHighOrdsSortedSetDV with SimpleTextCodec as it
requires a lot of memory and the bug was with Lucene90 Codec.
2021-10-20 18:22:34 -04:00
Adrien Grand
3a11983de2
LUCENE-10189: Optimize flush of doc-value fields that are effectively single-valued. ( #399 )
2021-10-20 19:05:40 +02:00
Adrien Grand
0e1f9fcf31
LUCENE-10193: Cut over more array access to VarHandles. ( #402 )
...
LZ4 is interesting because it used to read data in little-endian order even
though Directory APIs were big endian. So most calls to LZ4 in backward-codecs
have been changed to change the endianness of the input/output.
2021-10-20 19:04:01 +02:00