Commit Graph

36214 Commits

Author SHA1 Message Date
Greg Miller 8f459eb0f9
LUCENE-10603: Mark SortedSetDocValues#NO_MORE_ORDS deprecated (#969) 2022-06-21 09:05:15 -07:00
Adrien Grand 68f77dbf53
Handle more cases in `BooleanWeight#count`. (#961)
As suggested by @zhaih on #950, we could support more cases in
`BooleanWeight#count`. This PR adds support for these cases specifically:
 - Pure disjunctions where only one clause has a non-zero count.
 - Pure disjunctions where one clause matches all docs.
 - Negations where positive clauses match all docs (pure negation).
 - Negations where positive clauses match no docs.
 - Negations where negative clauses match no docs.
 - Negations where negative clauses match all docs.
2022-06-21 17:29:43 +02:00
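The cases listed in the commit above can be sketched as plain integer arithmetic. This is a minimal illustration and not the code from the PR: the class `BooleanCountSketch`, its method names, and the idea of passing pre-aggregated per-clause counts are all assumptions made for the example. The only convention borrowed from Lucene is that `-1` means "count not cheaply known", as in `Weight#count`.

```java
// Hypothetical sketch; names and signatures are illustrative, not Lucene's.
final class BooleanCountSketch {
  static final int UNKNOWN = -1;

  /** Count for a pure disjunction, given each clause's count (or UNKNOWN). */
  static int disjunctionCount(int[] clauseCounts, int numDocs) {
    int nonZeroClauses = 0;
    int lastNonZero = 0;
    boolean anyUnknown = false;
    for (int c : clauseCounts) {
      if (c == numDocs) {
        return numDocs; // one clause matches all docs, so the disjunction does too
      }
      if (c == UNKNOWN) {
        anyUnknown = true;
      } else if (c > 0) {
        nonZeroClauses++;
        lastNonZero = c;
      }
    }
    if (anyUnknown) {
      return UNKNOWN;
    }
    if (nonZeroClauses == 0) {
      return 0;
    }
    // With exactly one non-zero clause there is no overlap to worry about.
    return nonZeroClauses == 1 ? lastNonZero : UNKNOWN;
  }

  /** Count for "positive AND NOT negative", given the count of each side. */
  static int negationCount(int positive, int negative, int numDocs) {
    if (positive == 0 || negative == numDocs) {
      return 0; // nothing to match, or everything is excluded
    }
    if (negative == 0) {
      return positive; // nothing is excluded (may itself be UNKNOWN)
    }
    if (positive == numDocs && negative != UNKNOWN) {
      return numDocs - negative; // pure negation
    }
    return UNKNOWN;
  }
}
```

Any combination not covered by the listed cases falls through to `UNKNOWN`, which leaves the caller to count matches the slow way.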
Adrien Grand adcf58fe87 LUCENE-10507: Fix test failure. 2022-06-21 09:15:10 +02:00
JoeHF 08a9dfddeb
LUCENE-10618: Implement BooleanQuery rewrite rules based on minimumShouldMatch (#965) 2022-06-20 14:04:38 +02:00
Greg Miller 6ba759df86
LUCENE-10603: Change iteration methodology for SSDV ordinals in the faceting module (#954) 2022-06-17 10:37:24 -07:00
Adrien Grand b180a8a97e LUCENE-10078: Fix more TestIndexWriterWithThreads failures. 2022-06-17 17:54:26 +02:00
Luca Cavanna e78b2f2fe0
Make it more likely to perform concurrent search in tests (#959)
1) Replace default useThreads value: rarely() -> randomBoolean()
2) apply lower slices thresholds more frequently: randomBoolean() -> frequently
3) lower maxDocsPerSlice and maxSegmentsPerSlice threshold when applied
4) apply lower maxSegments and maxSegmentsPerSlice also when wrapWithAssertions is true
2022-06-17 17:27:08 +02:00
Julie Tibshirani 450ee81154 LUCENE-10611: Tweak the CHANGES description 2022-06-16 12:04:47 -07:00
Kaival Parikh 6df6cb093c
LUCENE-10611: Fix Heap Error in HnswGraphSearcher (#958)
The HNSW graph search does not consider that visitedLimit may be reached in the
upper levels of the graph search itself.

This occurs when the pre-filter is too restrictive (and its count sets the
visitedLimit). So instead of switching over to exactSearch, it tries to pop
from an empty heap and throws an error.

We can check if results are incomplete after searching in upper levels, and
break out accordingly. This way it won't throw heap errors, and will gracefully
switch to exactSearch instead.
2022-06-16 12:01:54 -07:00
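The control flow described in the commit above can be illustrated with a toy model. Everything here is hypothetical: `VisitedLimitSketch`, the `Outcome` enum, and the representation of each upper level as "number of nodes the search would need to visit" are assumptions for the example, and none of it is taken from `HnswGraphSearcher`. The point is only the early exit: once the visit budget runs out on an upper level, the search reports incomplete results instead of continuing.

```java
// Hypothetical toy model of the early exit; not Lucene's implementation.
final class VisitedLimitSketch {
  enum Outcome { GRAPH_RESULTS, FALL_BACK_TO_EXACT }

  /**
   * nodesNeededPerLevel lists, from the top level downwards, how many nodes
   * the search would have to visit on each level.
   */
  static Outcome searchLevels(int[] nodesNeededPerLevel, int visitedLimit) {
    int remaining = visitedLimit;
    for (int needed : nodesNeededPerLevel) {
      if (needed > remaining) {
        // Budget exhausted on this level: results are incomplete, so stop
        // here and let the caller switch to an exact search.
        return Outcome.FALL_BACK_TO_EXACT;
      }
      remaining -= needed;
    }
    return Outcome.GRAPH_RESULTS;
  }
}
```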
Lu Xugang 78b7b17f93
LUCENE-10600: SortedSetDocValues#docValueCount should be an int, not long (#960) 2022-06-16 12:22:05 +08:00
Greg Miller 0b5e0bfa4f
LUCENE-10584: Properly support #getSpecificValue for hierarchical dims in SSDV faceting (#929) 2022-06-15 11:31:23 -07:00
Adrien Grand d52122605d LUCENE-10608: Ignore conjunctions that have a non-zero minShouldMatch from the count logic. 2022-06-15 10:13:22 +02:00
Adrien Grand 7bbc1bf28d LUCENE-10078, LUCENE-10617: Tentative fix for TestMergeSchedulerExternal. 2022-06-15 09:45:47 +02:00
Adrien Grand 3be18bc62e LUCENE-10078: Fix TestIndexWriterOutOfFileDescriptors failures due to merge-on-refresh. 2022-06-15 09:20:21 +02:00
Julie Tibshirani 89dbe651cc
Make sure KnnVectorQuery applies search boost (#956)
Before, the rewritten query DocAndScoreQuery ignored the boost.
2022-06-14 08:34:02 -07:00
Adrien Grand 83461601ad
LUCENE-10608: Implement Weight#count on pure conjunctions. (#950) 2022-06-14 09:41:38 +02:00
Lu Xugang 7504b0a258
LUCENE-10598: Use count to record docValueCount, as SortedNumericDocValues does (#942) 2022-06-14 13:03:19 +08:00
Dawid Weiss 76d418676e LUCENE-10613: Clean up outdated NOTICE.txt information concerning morfologik 2022-06-13 19:50:54 +02:00
Rushabh Shah fcd98fd337
LUCENE-10266 Move nearest-neighbor search on points to core (#897)
Co-authored-by: Rushabh Shah <shahrs87@apache.org>
2022-06-13 18:41:39 +02:00
Adrien Grand d850a22a51 LUCENE-10078: Fix TestIndexWriterExceptions' expectations regarding merges on full flushes. 2022-06-13 09:25:48 +02:00
Craig Taverner 66b65b79e8
LUCENE-10604: Add Tessellation monitor for easier debugging of triangulation algorithm (#946) 2022-06-10 07:33:07 +02:00
Sun WuQiang 26f21ae36d
LUCENE-10605: fix error in 32bit jvm object alignment gap calculation (#949) 2022-06-09 18:07:37 -07:00
Adrien Grand b1eddec821
LUCENE-10599: Improve LogMergePolicy's handling of maxMergeSize. (#935)
With this change, segments are more likely to be considered for merging until
they reach the max merge size. Before this change, LogMergePolicy would exclude
an entire window of `mergeFactor` segments from merging if this window contained
a segment that was too large and other segments were on the same tier.
2022-06-09 17:43:19 +02:00
Dawid Weiss 54c67db10d
Try to fix the gradle compilation in idea (#945)
* Try to fix the gradle compilation in idea
* Try to detect sync and build phases within intellij and act accordingly to support both modes of compilation (gradle and intellij).
2022-06-09 11:14:10 +02:00
Craig Taverner 1dceff12c8
LUCENE-10563: Fix failure to tessellate complex polygon (#933) 2022-06-08 10:58:40 +02:00
Adrien Grand b5795db0cf
LUCENE-10078: Enable merge-on-refresh by default. (#921)
This gives implementations of `findFullFlushMerges` to `LogMergePolicy` and
`TieredMergePolicy` and enables merge-on-refresh with a default timeout of
500ms.

The idea behind the 500ms default is that it felt both high enough to leave time
to run merges of small segments, and low enough that the freshness of the data
wouldn't look badly affected for users who have high refresh rates (e.g.
refreshing every second).

In both cases, `findFullFlushMerges` delegates to `findMerges` and filters
merges whose segments are all below the min/floor size.
2022-06-07 16:59:55 +02:00
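The filtering step mentioned at the end of the commit above might look roughly like the sketch below. It reads "filters" as "keeps only", which is an assumption, and it models each candidate merge as a bare array of segment sizes in bytes. `FullFlushMergeSketch` and `keepSmallMerges` are hypothetical names, not part of `LogMergePolicy` or `TieredMergePolicy`.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the filter; each long[] holds one merge's segment sizes.
final class FullFlushMergeSketch {
  /** Keeps only the merges whose segments are all smaller than floorSizeBytes. */
  static List<long[]> keepSmallMerges(List<long[]> candidateMerges, long floorSizeBytes) {
    List<long[]> kept = new ArrayList<>();
    for (long[] merge : candidateMerges) {
      boolean allSmall = true;
      for (long segmentSize : merge) {
        if (segmentSize >= floorSizeBytes) {
          allSmall = false; // one large segment disqualifies the whole merge
          break;
        }
      }
      if (allSmall) {
        kept.add(merge);
      }
    }
    return kept;
  }
}
```

Restricting full-flush merges to small segments fits the stated goal of letting them finish within the 500ms timeout.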
Patrick Zhai 7e9d5ab768
Revise javadoc of ExactPhraseMatcher#advancePosition (#944) 2022-06-03 23:05:44 -07:00
Dawid Weiss e8c1720956 Revert "Try to detect sync and build phases within intellij and act accordingly to support both modes of compilation (gradle and intellij)."
This reverts commit 13e10eaae2.
2022-06-03 20:19:21 +02:00
Dawid Weiss 13e10eaae2 Try to detect sync and build phases within intellij and act accordingly to support both modes of compilation (gradle and intellij). 2022-06-03 20:18:04 +02:00
Adrien Grand 3738beb038
LUCENE-10574: Avoid O(n^2) merging with LogMergePolicy (#936) 2022-06-03 11:02:27 +02:00
Adrien Grand 267a5fcc15 LUCENE-10598: Fix docValueCount() on Lucene80 sparse sorted set doc values. 2022-06-02 09:33:04 +02:00
Tomoko Uchida ee99a2a452
LUCENE-10597: move globalMaxScore to MaxScoreCache (#931) 2022-06-02 10:20:25 +09:00
Lu Xugang a383253fe1
LUCENE-10598: SortedSetDocValues#docValueCount() should be always greater than zero (#934)
* LUCENE-10598: SortedSetDocValues#docValueCount() should be always greater than zero
2022-06-02 01:33:47 +08:00
Tomoko Uchida 4c53a53238
LUCENE-10588: log elapsed time for initializing gui (#937) 2022-06-01 21:01:08 +09:00
Michael Sokolov 1b105f0eeb
Create Lucene93 Codec and move Lucene92 to backwards_codecs (#924) 2022-05-31 17:05:12 -04:00
tang donghai 40e9e5a00d
LUCENE-10596: Remove unused parameter in #getOrAddPerField (#930) 2022-05-31 15:33:51 +09:00
Yannick Welsch e319a5223c
LUCENE-10582: Fix merging of CollectionStatistics in CombinedFieldQuery (#910)
CombinedFieldQuery does not properly combine overridden collection statistics, resulting in an IllegalArgumentException during searches.
2022-05-30 11:02:40 -07:00
Adrien Grand 318177af83 LUCENE-10574: Fix TestTieredMergePolicy's expectations about the segment count. 2022-05-30 11:40:59 +02:00
Greg Miller 8db1e41fc0
LUCENE-10585: Scrub copy/paste code in the facets module and attempt to simplify a bit (#915) 2022-05-29 01:26:51 -07:00
András Salamon 3a80968ddf
LUCENE-10591 Invalid character in SortableSingleDocSource.java (#925) 2022-05-25 17:46:10 -07:00
Alan Woodward 9fa2b53a69 Fix path to backcompat indexes in release wizard 2022-05-25 12:17:32 +01:00
Tomoko Uchida 2620b5669f
LUCENE-10589: increase upper bound of test range query (#920) 2022-05-25 17:43:27 +09:00
Alan Woodward 893018ffe9 Add back-compat indices for 9.2.0 2022-05-24 15:35:26 +01:00
Alan Woodward d79d95bffd DOAP changes for release 9.2.0 2022-05-24 14:22:35 +01:00
Tomoko Uchida f5c1f11a2a
LUCENE-10586: Minor cleanup local variables in BlockTreeTermsReader (#918) 2022-05-24 15:22:34 +09:00
Tomoko Uchida 2be00cc393
Refine contribution guide and pull request template (#916) 2022-05-24 14:56:37 +09:00
Robert Muir c86f9b2d8c
remove commented-out/obsolete AwaitsFix (#901)
* remove commented-out/obsolete AwaitsFix

All of these issues are fixed, but the AwaitsFix annotation is still there, just commented out. This causes confusion and makes it harder to keep an eye on (or review) the AwaitsFix tests, e.g. false positives when running 'git grep AwaitsFix'.

* Remove @AwaitsFix from TestMatchRegionRetriever. The problem has been fixed in LUCENE-10229.

Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
2022-05-23 09:27:18 -04:00
Dawid Weiss 5b92002fed LUCENE-10370: recreate temporary location in case it's wiped by a clean. 2022-05-23 10:51:02 +02:00
Shai Erera 4d02335154
Update dev-docs (#919) 2022-05-23 11:13:07 +03:00
Rushabh Shah d17c6056d8 LUCENE-8519: MultiDocValues.getNormValues should not call getMergedFieldInfos (#902)
Co-authored-by: Rushabh Shah <shahrs87@apache.org>
2022-05-21 23:28:29 -04:00