Commit Graph

36052 Commits

Author SHA1 Message Date
Adrien Grand 7bbc1bf28d LUCENE-10078, LUCENE-10617: Tentative fix for TestMergeSchedulerExternal. 2022-06-15 09:45:47 +02:00
Adrien Grand 3be18bc62e LUCENE-10078: Fix TestIndexWriterOutOfFileDescriptors failures due to merge-on-refresh. 2022-06-15 09:20:21 +02:00
Julie Tibshirani 89dbe651cc
Make sure KnnVectorQuery applies search boost (#956)
Before, the rewritten query DocAndScoreQuery ignored the boost.
2022-06-14 08:34:02 -07:00
Adrien Grand 83461601ad
LUCENE-10608: Implement Weight#count on pure conjunctions. (#950) 2022-06-14 09:41:38 +02:00
Lu Xugang 7504b0a258
LUCENE-10598: Use count to record docValueCount similar to SortedNumericDocValues did (#942) 2022-06-14 13:03:19 +08:00
Dawid Weiss 76d418676e LUCENE-10613: Clean up outdated NOTICE.txt information concerning morfologik 2022-06-13 19:50:54 +02:00
Rushabh Shah fcd98fd337
LUCENE-10266 Move nearest-neighbor search on points to core (#897)
Co-authored-by: Rushabh Shah <shahrs87@apache.org>
2022-06-13 18:41:39 +02:00
Adrien Grand d850a22a51 LUCENE-10078: Fix TestIndexWriterExceptions' expectations regarding merges on full flushes. 2022-06-13 09:25:48 +02:00
Craig Taverner 66b65b79e8
LUCENE-10604: Add Tessellation monitor for easier debugging of triangulation algorithm (#946) 2022-06-10 07:33:07 +02:00
Sun WuQiang 26f21ae36d
LUCENE-10605: fix error in 32bit jvm object alignment gap calculation (#949) 2022-06-09 18:07:37 -07:00
Adrien Grand b1eddec821
LUCENE-10599: Improve LogMergePolicy's handling of maxMergeSize. (#935)
With this change, segments are more likely to be considered for merging until
they reach the max merge size. Before this change, LogMergePolicy would exclude
an entire window of `mergeFactor` segments from merging if this window had a
too large segment and other segments were on the same tier.
2022-06-09 17:43:19 +02:00
Dawid Weiss 54c67db10d
Try to fix the gradle compilation in idea (#945)
* Try to fix the gradle compilation in idea
* Try to detect sync and build phases within intellij and act accordingly to support both modes of compilation (gradle and intellij).
2022-06-09 11:14:10 +02:00
Craig Taverner 1dceff12c8
LUCENE-10563: Fix failure to tessellate complex polygon (#933) 2022-06-08 10:58:40 +02:00
Adrien Grand b5795db0cf
LUCENE-10078: Enable merge-on-refresh by default. (#921)
This gives implementations of `findFullFlushMerges` to `LogMergePolicy` and
`TieredMergePolicy` and enables merge-on-refresh with a default timeout of
500ms.

The idea behind the 500ms default is that it felt both high-enough to have time
to run merges of small segments, and low enough that the freshness of the data
wouldn't look badly affected for users who have high refresh rates (e.g.
refreshing every second).

In both cases, `findFullFlushMerges` delegates to `findMerges` and filters
merges whose segments are all below the min/floor size.
2022-06-07 16:59:55 +02:00
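The filtering step described in the LUCENE-10078 commit above can be sketched as follows. This is a simplified illustration, not Lucene's code: it reads "filters" as "keeps only", models each candidate merge as a plain list of segment sizes, and the names `FullFlushMergeFilter`, `keepSmallMerges` and `floorSize` are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper illustrating the commit message; not part of Lucene. */
public final class FullFlushMergeFilter {

  /**
   * Keeps only the candidate merges in which every segment is strictly
   * smaller than floorSize. Each candidate merge is modeled as a list of
   * segment sizes (an assumption made for this sketch).
   */
  public static List<List<Long>> keepSmallMerges(List<List<Long>> candidates, long floorSize) {
    List<List<Long>> kept = new ArrayList<>();
    for (List<Long> merge : candidates) {
      boolean allSmall = true;
      for (long size : merge) {
        if (size >= floorSize) {
          allSmall = false; // one large segment disqualifies the whole merge
          break;
        }
      }
      if (allSmall) {
        kept.add(merge);
      }
    }
    return kept;
  }

  public static void main(String[] args) {
    List<List<Long>> candidates = List.of(List.of(1L, 2L), List.of(1L, 50L));
    // With a floor size of 10, only the first candidate survives.
    System.out.println(keepSmallMerges(candidates, 10L));
  }
}
```

Restricting merge-on-refresh to small segments is what would let such merges plausibly finish within the 500ms default timeout mentioned in the commit message.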
Patrick Zhai 7e9d5ab768
Revise javadoc of ExactPhraseMatcher#advancePosition (#944) 2022-06-03 23:05:44 -07:00
Dawid Weiss e8c1720956 Revert "Try to detect sync and build phases within intellij and act accordingly to support both modes of compilation (gradle and intellij)."
This reverts commit 13e10eaae2.
2022-06-03 20:19:21 +02:00
Dawid Weiss 13e10eaae2 Try to detect sync and build phases within intellij and act accordingly to support both modes of compilation (gradle and intellij). 2022-06-03 20:18:04 +02:00
Adrien Grand 3738beb038
LUCENE-10574: Avoid O(n^2) merging with LogMergePolicy (#936) 2022-06-03 11:02:27 +02:00
Adrien Grand 267a5fcc15 LUCENE-10598: Fix docValueCount() on Lucene80 sparse sorted set doc values. 2022-06-02 09:33:04 +02:00
Tomoko Uchida ee99a2a452
LUCENE-10597: move globalMaxScore to MaxScoreCache (#931) 2022-06-02 10:20:25 +09:00
Lu Xugang a383253fe1
LUCENE-10598: SortedSetDocValues#docValueCount() should be always greater than zero (#934)
* LUCENE-10598: SortedSetDocValues#docValueCount() should be always greater than zero
2022-06-02 01:33:47 +08:00
Tomoko Uchida 4c53a53238
LUCENE-10588: log elapsed time for initializing gui (#937) 2022-06-01 21:01:08 +09:00
Michael Sokolov 1b105f0eeb
Create Lucene93 Codec and move Lucene92 to backwards_codecs (#924) 2022-05-31 17:05:12 -04:00
tang donghai 40e9e5a00d
LUCENE-10596: Remove unused parameter in #getOrAddPerField (#930) 2022-05-31 15:33:51 +09:00
Yannick Welsch e319a5223c
LUCENE-10582: Fix merging of CollectionStatistics in CombinedFieldQuery (#910)
CombinedFieldQuery does not properly combine overridden collection statistics, resulting in an IllegalArgumentException during searches.
2022-05-30 11:02:40 -07:00
Adrien Grand 318177af83 LUCENE-10574: Fix TestTieredMergePolicy's expectations about the segment count. 2022-05-30 11:40:59 +02:00
Greg Miller 8db1e41fc0
LUCENE-10585: Scrub copy/paste code in the facets module and attempt to simplify a bit (#915) 2022-05-29 01:26:51 -07:00
András Salamon 3a80968ddf
LUCENE-10591 Invalid character in SortableSingleDocSource.java (#925) 2022-05-25 17:46:10 -07:00
Alan Woodward 9fa2b53a69 Fix path to backcompat indexes in release wizard 2022-05-25 12:17:32 +01:00
Tomoko Uchida 2620b5669f
LUCENE-10589: increase upper bound of test range query (#920) 2022-05-25 17:43:27 +09:00
Alan Woodward 893018ffe9 Add back-compat indices for 9.2.0 2022-05-24 15:35:26 +01:00
Alan Woodward d79d95bffd DOAP changes for release 9.2.0 2022-05-24 14:22:35 +01:00
Tomoko Uchida f5c1f11a2a
LUCENE-10586: Minor cleanup local variables in BlockTreeTermsReader (#918) 2022-05-24 15:22:34 +09:00
Tomoko Uchida 2be00cc393
Refine contribution guide and pull request template (#916) 2022-05-24 14:56:37 +09:00
Robert Muir c86f9b2d8c
remove commented-out/obsolete AwaitsFix (#901)
* remove commented-out/obsolete AwaitsFix

All of these issues are fixed, but the AwaitsFix annotation is still there, just commented out. This causes confusion and makes it harder to keep an eye/review the AwaitsFix tests, e.g. false positives when running 'git grep AwaitsFix'

* Remove @AwaitsFix from TestMatchRegionRetriever. The problem has been fixed in LUCENE-10229.

Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
2022-05-23 09:27:18 -04:00
Dawid Weiss 5b92002fed LUCENE-10370: recreate temporary location in case it's wiped by a clean. 2022-05-23 10:51:02 +02:00
Shai Erera 4d02335154
Update dev-docs (#919) 2022-05-23 11:13:07 +03:00
Rushabh Shah d17c6056d8 LUCENE-8519: MultiDocValues.getNormValues should not call getMergedFieldInfos (#902)
Co-authored-by: Rushabh Shah <shahrs87@apache.org>
2022-05-21 23:28:29 -04:00
Tomoko Uchida 59b6d41bd1
LUCENE-10531: Disable distribution test (gui test) on windows. (#917) 2022-05-21 23:46:07 +09:00
Tomoko Uchida 10a43d916e LUCENE-10184: avoid smoke tester failure 2022-05-21 16:11:00 +09:00
Tomoko Uchida 71a9acb2e2
LUCENE-10312: MIGRATE entry and small follow-ups (#908) 2022-05-21 12:43:02 +09:00
Dawid Weiss 63b66e06cd
LUCENE-10370: pass proper classpath/module arguments for forking jvms from within tests. (#909) 2022-05-21 00:30:02 +02:00
Robert Muir 2090ac4318
LUCENE-10579: fix smoketester backwards-check to not parse stdout (#903)
This is very noisy, can contain gradle status updates, various other tests.verbose prints from other threads, you name it.

It causes the check to be flaky, and randomly "miss" seeing a test that executed.

Instead, let's look at the zip files. We can still preserve the essence of what the test wants to do, but without any flakiness.
2022-05-20 07:20:57 -04:00
Adrien Grand 5e9dfbed27
LUCENE-10574: Keep allowing unbalanced merges if they would reclaim lots of deletes. (#905)
`TestTieredMergePolicy` caught this special case: if a segment has lots of
deletes, we should still allow unbalanced merges.
2022-05-20 10:06:38 +02:00
Adrien Grand 8e777a1320 Fix precommit. 2022-05-19 09:49:11 +02:00
Adrien Grand 3960c16296 LUCENE-10574: More test failures.
- MergeOnFlushMergePolicy doesn't try to avoid O(n^2) merges, so I'm disabling
  the test on it for now.
- TestUpgradeIndexMergePolicy would sometimes wrap with a non-standard merge
  policy like the alcoholic merge policy, I forced it to wrap a
  TieredMergePolicy.
2022-05-19 09:35:17 +02:00
Adrien Grand bf07d98f13 LUCENE-10574: Fix test failure.
LogDocMergePolicy would previously always force-merge an index that has 10
segments of size 1 to 10, due to the min doc count. This is not the case
anymore, but the test was assuming that such an index would get merged, so I
fixed the test's expectations.

Also changed the merge policy to keep working when RAM buffers are flushed in
such a way that segments do not appear in decreasing size order using the same
logic as LogMergePolicy.
2022-05-19 09:24:54 +02:00
Adrien Grand 4240159b44 LUCENE-10574: Fix test failure.
If a LogByteSizeMergePolicy is used, then it might decide to not merge the two
one-document segments if their on-disk sizes are too different. Using a
LogDocMergePolicy addresses the issue as both segments are always considered
the same size.
2022-05-18 23:33:08 +02:00
Adrien Grand 268d29b845
LUCENE-10574: Prevent pathological merging. (#900)
This updates TieredMergePolicy and Log(Doc|Size)MergePolicy to only ever
consider merges where the resulting segment would be at least 50% bigger than
the biggest input segment. While a merge that only grows the biggest segment by
50% is still quite inefficient, this constraint is good enough to prevent
pathological O(N^2) merging.
2022-05-18 23:05:54 +02:00
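The 50% constraint described in the LUCENE-10574 commit above can be illustrated with a minimal sketch. It is not the actual TieredMergePolicy or LogMergePolicy code: it assumes the merged segment's size is roughly the sum of its inputs and ignores deletes, and the names `MergeGrowthCheck`, `isBalancedEnough` and `MIN_GROWTH` are hypothetical.

```java
import java.util.List;

/** Hypothetical helper illustrating the commit message; not part of Lucene. */
public final class MergeGrowthCheck {

  // From the commit message: the merged segment must be at least 50% bigger
  // than the biggest input segment, i.e. at least 1.5x its size.
  private static final double MIN_GROWTH = 1.5;

  /**
   * Returns true if merging segments of the given sizes would produce a
   * segment at least 50% bigger than the biggest input. The merged size is
   * approximated as the sum of the inputs (an assumption of this sketch).
   */
  public static boolean isBalancedEnough(List<Long> segmentSizes) {
    long total = 0;
    long biggest = 0;
    for (long size : segmentSizes) {
      total += size;
      biggest = Math.max(biggest, size);
    }
    return total >= MIN_GROWTH * biggest;
  }

  public static void main(String[] args) {
    // 100 + 1 = 101, below 150: rejected as too unbalanced.
    System.out.println(isBalancedEnough(List.of(100L, 1L)));
    // 100 + 60 = 160, at least 150: accepted.
    System.out.println(isBalancedEnough(List.of(100L, 60L)));
  }
}
```

The intuition is that repeatedly merging one tiny segment into one large segment rewrites the large segment each time, which is the O(N^2) pattern the commit aims to prevent. The later LUCENE-10574 follow-up (#905) notes that unbalanced merges remain allowed when they would reclaim many deletes, which this sketch does not model.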
Alan Woodward ac2267035a Add next minor version 9.2.0 2022-05-18 16:37:10 +01:00