Tomoko Uchida
2a3e5ca07f
LUCENE-10475: Merge o.a.l.a.[ja|ko].util into o.a.l.a.[ja|ko].dict ( #772 )
2022-03-29 21:09:26 +09:00
Tomoko Uchida
ac6c36d406
LUCENE-10184: add CONTRIBUTING.md; reorganize README. ( #771 )
2022-03-29 16:52:27 +09:00
Greg Miller
d438a0cde7
Add CHANGES entry for LUCENE-10325
2022-03-28 15:57:18 -07:00
Yuting Gan
7c33f04d37
LUCENE-10325: Add getTopDims functionality to Facets ( #747 )
2022-03-28 15:54:07 -07:00
Tomoko Uchida
0f93130d7b
remove obsolete image/description from luke/README.md
2022-03-28 08:44:29 +09:00
Uwe Schindler
ff263f0aa4
Upgrade to forbiddenapis 3.3 ( #768 )
2022-03-26 17:09:42 +01:00
Tomoko Uchida
bd22f199de
LUCENE-10393: Unify binary dictionary and dictionary writer in kuromoji and nori ( #740 )
2022-03-25 18:44:36 +09:00
Mike Drob
b3906e96ea
LUCENE-9651 Update benchmark module docs ( #759 )
2022-03-23 14:51:28 -05:00
Lu Xugang
5450d72258
LUCENE-10458: BoundedDocSetIdIterator may supply error count in Weigth#count(LeafReaderContext) when missingValue enables ( #736 )
2022-03-23 15:54:52 +01:00
Mike Drob
1c6f631678
LUCENE-10481: FacetsCollector will not request scores if it does not use them ( #760 )
2022-03-23 09:44:02 -05:00
Christine Poerschke
779c332a8c
LUCENE-10477: mention 'call multiple times' in Query.rewrite javadoc ( #758 )
2022-03-22 15:39:59 +00:00
Adrien Grand
04127ed9fc
Add back-compat indices for 9.1.0.
2022-03-22 16:10:10 +01:00
Adrien Grand
3105998ce6
Synchronize CHANGES.
2022-03-22 16:08:59 +01:00
Christine Poerschke
ca252d6621
LUCENE-10464, LUCENE-10477: WeightedSpanTermExtractor.extractWeightedSpanTerms to rewrite sufficiently ( #737 )
2022-03-22 14:53:41 +00:00
Adrien Grand
28d3adcf69
Add version 9.1.0.
2022-03-22 15:43:27 +01:00
Adrien Grand
0a3bad5985
DOAP changes for release 9.1.0
2022-03-22 15:22:27 +01:00
Alan Woodward
42bf77229e
LUCENE-10422: Make errorprone happy
2022-03-22 09:18:27 +00:00
Tomoko Uchida
fa61953afd
LUCENE-10478: mark Test4GBStoredFields as @Monster ( #757 )
2022-03-22 17:58:05 +09:00
mogui
be99178956
LUCENE-10422: Read-only monitor implementation ( #679 )
...
This commit adds a read-only monitor implementation that can
search the QueryIndex of another monitor without supporting adding
new queries.
2022-03-21 16:42:03 +00:00
Adrien Grand
f239c0e03c
LUCENE-10473: Make tests a bit faster when running nightly. ( #754 )
2022-03-21 10:37:57 +01:00
Julie Tibshirani
a4b30b4cf4
LUCENE-9905: Fix check in TestPerFieldKnnVectorsFormat#testMergeUsesNewFormat
...
Before the assertion checked if two sets were equal, which resulted in rare
failures. Now we use 'contains' from hamcrest matchers.
2022-03-18 15:38:30 -07:00
Julie Tibshirani
18f9d31608
LUCENE-9614: Fix rare TestKnnVectorQuery failures
...
Some of our checks relied on doc IDs corresponding to the order in which docs
were passed to IndexWriter. This is fragile and sometimes resulted in failures.
Now we check against an "id" field instead.
2022-03-18 14:52:00 -07:00
Luca Cavanna
bb7568d865
LUCENE-10472: Fix TestMatchAllDocsQuery#testEarlyTermination ( #753 )
...
As part of #716 I moved the test to use a collector manager, but I forgot to update one of the assertions.
We can't rely on totalHits being accurate when the search is executed my multiple threads and early terminated.
2022-03-18 18:49:20 +01:00
Adrien Grand
1dcb64b492
LUCENE-10418: Move CHANGES to the correct section.
2022-03-17 16:44:01 +01:00
Adrien Grand
8fb6543280
LUCENE-10418: Optimize `Query#rewrite` in the non-scoring case. ( #672 )
2022-03-17 16:41:55 +01:00
Adrien Grand
86bd921fce
LUCENE-10469: Fix score mode propagation in ConstantScoreQuery. ( #750 )
2022-03-16 13:16:33 +01:00
Peter Gromov
0e3c315b76
LUCENE-10452, LUCENE-10451: mention hunspell changes in CHANGES.txt
2022-03-16 09:18:58 +01:00
Peter Gromov
af97c5ef37
LUCENE-10452: Hunspell: call checkCanceled less frequently to reduce the overhead ( #723 )
2022-03-16 09:04:08 +01:00
Julie Tibshirani
6b7953b8ce
Add 9.2.0 section to release notes
2022-03-15 11:26:20 -07:00
Peter Gromov
92a20c24e9
LUCENE-10451 Hunspell: don't perform potentially expensive spellchecking after timeout ( #721 )
...
move all expensive operations closer to the suggestion creation, encapsulate case and output conversion into a new Suggestion class
2022-03-15 18:43:56 +01:00
Tomoko Uchida
b6c1024f55
LUCENE-10463: increment java version to 17 in smoke tester ( #748 )
2022-03-15 19:54:54 +09:00
Dawid Weiss
25c4310bd5
LUCENE-10461: fix windows launch script for luke so that it works with integration tests AND actual command line. Cmd escaping rules and start command line is absolutely insane. ( #743 )
2022-03-12 19:39:31 +09:00
Dawid Weiss
9e9c457f80
LUCENE-10459: Update smoke tester for 9.1 ( #744 )
...
Add demo dependencies to third party modules. Add an IT that checks whether
demo classes are loadable.
Co-authored-by: Tomoko Uchida <tomoko.uchida.1111@gmail.com>
Co-authored-by: Julie Tibshirani <julietibs@apache.org>
2022-03-11 10:22:17 -08:00
Dawid Weiss
e999056c19
LUCENE-10311: avoid division by zero on small sets.
2022-03-09 11:41:01 +01:00
Mayya Sharipova
e5717cddfd
LUCENE-10408 Test correction checksum ( #734 )
...
Use double instead of float to test vector values checksum
2022-03-09 08:02:40 +00:00
Spyros Kapnissis
8afec33e74
LUCENE-10171: OpenNLPOpsFactory should directly cache DictionaryLemmatizer objects ( #380 )
...
Instead of caching dictionary strings and building multiple redundant DictionaryLemmatizer objects.
Co-authored-by: Michael Gibney <michael@michaelgibney.net>
2022-03-08 12:47:16 -05:00
Daniel Doubrovkine (dB.)
7aec489945
Fix: typo + +minScore. ( #735 )
2022-03-08 08:21:22 -05:00
Yuting Gan
10b78714c1
Fixed a typo in the Javadoc of TaxonomyReader ( #732 )
...
Co-authored-by: Yuting Gan <ganyutingi@gmail.com>
2022-03-06 19:21:34 -05:00
Adrien Grand
ae16917c1d
LUCENE-10311: Remove pop_XXX helpers from `BitUtil`. ( #724 )
...
As @rmuir noted, it would be as simple and create less cognitive overhead to
use `Long#bitCount` directly.
2022-03-05 18:38:57 +01:00
Adrien Grand
8086ef9f45
LUCENE-10455: CHANGES entry.
2022-03-05 18:32:41 +01:00
Adrien Grand
9d732380ae
LUCENE-10453: Speed up euclidean distances. ( #725 )
2022-03-05 18:31:56 +01:00
Chris Lu
2700c6b525
LUCENE-10455: IndexSortSortedNumericDocValuesRangeQuery should implement Weight#scorerSupplier(LeafReaderContext) ( #729 )
2022-03-05 18:27:29 +01:00
Alan Woodward
e049e426dd
LUCENE-10431: Remove MultiTermQuery.setRewriteMethod() ( #726 )
2022-03-04 11:54:02 +00:00
Dawid Weiss
81ab1e598f
LUCENE-10447: always use utf8 for forked process encoding. Use the sa… ( #717 )
2022-03-03 20:53:20 +01:00
Alan Woodward
3f994dec53
LUCENE-10431: Deprecate MultiTermQuery.setRewriteMethod() ( #722 )
...
Allowing users to mutate MultiTermQuery can give rise to odd bugs, for example
in wrapper queries such as BooleanQuery which lazily calculate their hashcodes
and then cache the result. This commit deprecates the setRewriteMethod()
method on MultiTermQuery, in preparation for removing it entirely, and adds
constructor parameters to the various MTQ implementations as a preferred
way to set the rewrite method.
2022-03-03 11:08:39 +00:00
Adrien Grand
bff4246476
LUCENE-10002: Fix test failure.
...
When IndexSearcher is created with a threadpool it becomes impossible to assert
on the number of evaluated hits overall.
2022-03-03 10:10:35 +01:00
Adrien Grand
44a2a82319
LUCENE-10428: Avoid infinite loop under error conditions. ( #711 )
...
Co-authored-by: dblock <dblock@dblock.org>
2022-03-03 09:42:12 +01:00
Adrien Grand
ca73ed1c28
LUCENE-10311: Make FixedBitSet#approximateCardinality faster (and actually approximate). ( #710 )
...
This computes a pop count on a sample of the longs that back the bitset.
Quick benchmarks suggest that this runs 5x-10x faster than
`FixedBitSet#cardinality` depending on the length of the bitset.
2022-03-03 08:48:44 +01:00
Peter Gromov
9ed526b70e
[hunspell] make SuggestionTimeoutException public
...
to make it easier for custom checkCanceled implementations to throw it depending on their ad-hoc conditions and get partial results
2022-03-02 21:24:24 +01:00
Adrien Grand
46f9a25216
LUCENE-10237: Move CHANGES entry to 9.1.
2022-03-02 09:39:54 +01:00