Commit Graph

173 Commits

Author SHA1 Message Date
Robert Muir 975df9ddd3
LUCENE-9182: add apache license headers to all .gradle files and enforce in rat task 2020-01-27 12:05:34 -05:00
Robert Muir c53cc3edaf
LUCENE-9167: test speedup for slowest/pathological tests (round 3) 2020-01-24 08:58:59 -05:00
Robert Muir 1051db4038
LUCENE-9163: test speedup for slowest/pathological tests
Calming down individual test methods with double-digit execution times
after running tests many times.

There are a few more issues remaining, but this solves the majority of them.
2020-01-22 17:49:33 -05:00
Mike 338d386ae0
LUCENE-9145 First pass addressing static analysis (#1181)
Fixed a bunch of the smaller warnings found by error-prone compiler
plugin, while ignoring a lot of the bigger ones.
2020-01-17 13:30:39 -06:00
Alan Woodward 9d72bfc1af
LUCENE-9068: Build FuzzyQuery automata up-front (#1042)
FuzzyTermsEnum can now either take an array of compiled automata, and
an AttributeSource, to be used across multiple segments (eg during
FuzzyQuery rewrite); or it can take a term, edit distance, prefix and transition
boolean and build the automata itself if only being used once (eg for fuzzy
nearest neighbour calculations).

Rather than interact via attribute sources and specialized attributes, users of
FuzzyTermsEnum can get the boost and set minimum competitive boosts
directly on the enum.
2020-01-15 14:58:11 +00:00
Dawid Weiss abd5102312 Merge with master. 2019-12-13 15:35:14 +01:00
Bruno Roustant 1812b367ab
LUCENE-9089: FST Builder renamed FSTCompiler with fluent-style Builder.
Closes #1070
2019-12-13 14:37:54 +01:00
Dawid Weiss d4a9842375 Initial gradle build layer. 2019-12-02 15:34:57 +01:00
Alan Woodward fbd05167f4
LUCENE-3041: QueryVisitor (#581)
This commit adds an introspection API to Query, allowing users to traverse
the nested structure of a query and examine its leaves.  It replaces the existing
`extractTerms` method on Weight, and alters some highlighting code to use
the new API
2019-03-14 15:04:33 +00:00
Bruno P. Kinoshita 5310152450 Fix some spell check issues 2019-02-12 21:35:30 -08:00
Tommaso Teofili 33896dde5a LUCENE-5698 - minor adjustments 2019-01-24 10:24:52 +01:00
Tommaso Teofili 6df32fbc35 LUCENE-5698 - forbidden APIs related fixes 2019-01-20 09:24:25 +01:00
Tommaso Teofili 552c367c5d LUCENE-5698 - added test for 20n dataset, minor code adjustments 2019-01-18 21:10:33 +01:00
David Smiley fd9164801e LUCENE-7875: Moved MultiFields static methods to MultiTerms, FieldInfos and MultiBits.
MultiBits is now public and has getLiveDocs.
2018-10-18 19:49:14 -04:00
Adrien Grand 9ca053712a LUCENE-8430: TopDocs.totalHits may now be a lower bound of the hit count. 2018-07-30 16:38:05 +02:00
Adrien Grand 331ccf3910 LUCENE-8405: Remove TopDocs.maxScore. 2018-07-18 08:38:57 +02:00
Robert Muir 2c1ab31b4e LUCENE-7690: Add preserveOriginal option to the NGram and EdgeNGram filters 2018-06-04 21:24:20 -04:00
Tommaso Teofili b3cf209f6f LUCENE-8223 - remove time dependent checks in performance test 2018-03-27 10:14:55 +02:00
Christine Poerschke a3a0e0b11e Revert "LUCENE-8115: remove one TODO-on-its-own javadoc."
This reverts commit bd69d64ad0.
2018-01-08 20:18:45 +00:00
Christine Poerschke bd69d64ad0 LUCENE-8115: remove one TODO-on-its-own javadoc. 2018-01-08 20:04:58 +00:00
Christine Poerschke 07afc23dce LUCENE-8115: remove unnecessary-on-its-own {@inheritDoc} annotations. 2018-01-08 19:45:19 +00:00
Alan Woodward d250a1463d LUCENE-8133: Rename TermContext to TermStates, and load TermState lazily if term stats are not required 2018-01-05 14:17:15 +00:00
Adrien Grand 8fd7ead940 LUCENE-8116: SimScorer now only takes a frequency and a norm as per-document scoring factors. 2018-01-04 15:13:36 +01:00
Christine Poerschke 152d223b32 Revert "LUCENE-8110: Fix potential IndexOutOfBoundsException in *Classifier.getClasses(?,int)."
This reverts commit af41d02eae.
2017-12-28 15:11:39 +00:00
Christine Poerschke af41d02eae LUCENE-8110: Fix potential IndexOutOfBoundsException in *Classifier.getClasses(?,int). 2017-12-28 12:28:41 +00:00
Christine Poerschke 6aa312a8db LUCENE-7991: KNearestNeighborDocumentClassifier.knnSearch no longer applies a previous boosted field's factor to subsequent unboosted fields. 2017-10-18 13:18:40 +01:00
Tommaso Teofili 97f246b4ba LUCENE-7981 - CTB should check that CR is not null 2017-09-29 15:34:52 +02:00
Tommaso Teofili c2c2e8a85e LUCENE-7950 - fixed potential NPE when no docs have the class field 2017-09-02 14:43:59 +02:00
Tommaso Teofili 92e460389d LUCENE-7838 - removed dep from sandbox, created a minimal FLT version specific for knn classification 2017-06-29 10:01:49 +02:00
Adrien Grand 44d1f1fe3f LUCENE-7872: TopDocs.totalHits is now a long. 2017-06-21 09:55:02 +02:00
Tomas Fernandez Lobbe d30d012c7c SOLR-10042, LUCENE-7838: Fix precommit 2017-05-18 17:11:40 -07:00
Adrien Grand c9bdce937a LUCENE-7838: Remove unused imports. 2017-05-18 16:36:14 +02:00
Tommaso Teofili c53d19e7b2 LUCENE-7838 - removed unused import 2017-05-18 14:42:56 +02:00
Tommaso Teofili bd9e32d358 LUCENE-7838 - added knn classifier based on flt 2017-05-18 14:36:18 +02:00
Tommaso Teofili 8990500183 LUCENE-7823 - added bm25 nb classifier 2017-05-11 16:26:32 +02:00
Christine Poerschke 5c5254341e LUCENE-7776: change javadocs default mention from Classic to BM25
(Also kinda added missing javadoc for new method to fix 'ant precommit'.)
2017-04-11 19:36:44 +01:00
Tommaso Teofili 9c00fc6795 LUCENE-7776 - visualize diff btwn BytesRef values in ClassificationTestBase 2017-04-11 17:12:13 +02:00
Christine Poerschke 1bf3696228 LUCENE-7776: remove unused import 2017-04-11 11:35:15 +01:00
Tommaso Teofili 0f60c4233c LUCENE-7776 - use bm25 for knn classifier 2017-04-11 10:44:36 +02:00
Tommaso Teofili f37fad206b LUCENE-5548 - improved testing for SNBC 2017-04-07 10:59:20 +02:00
Tommaso Teofili 276ccff751 LUCENE-6853 - re-enabled test classification measures for bpc 2017-04-07 10:59:20 +02:00
Tommaso Teofili cbad533d7a LUCENE-6853 - renamed threshold to bias, initialize to avg tf 2017-04-07 10:59:20 +02:00
Adrien Grand 53064e46dd LUCENE-7754: Inner classes should be static whenever possible. 2017-03-28 15:35:04 +02:00
Tommaso Teofili 87d8b5450a LUCENE-7591 - approximate to no. of terms when DVs are not available 2016-12-12 10:00:21 +01:00
Tommaso Teofili 96489d2384 SOLR-8871 - adjusted header positioning 2016-11-24 23:46:20 +01:00
Tommaso Teofili 53a34b312e LUCENE-7400 - splitter should be able to group using sorted set dv 2016-07-30 07:58:25 +02:00
Mike McCandless 7b5d7b3962 LUCENE-7337: empty boolean query now rewrites to MatchNoDocsQuery instead of vice/versa 2016-06-21 11:26:19 -04:00
Tommaso Teofili daa26090a3 Let CachingNBClassifier be constructed from IndexReaders 2016-06-21 13:43:54 +02:00
Tommaso Teofili fcf4389d82 LUCENE-7350 - Let classifiers be constructed from IndexReaders 2016-06-21 13:10:34 +02:00
Christine Poerschke 52f5c50246 LUCENE-7319: remove unused imports 2016-06-13 09:59:14 +01:00