Robert Muir
975df9ddd3
LUCENE-9182: add apache license headers to all .gradle files and enforce in rat task
2020-01-27 12:05:34 -05:00
Robert Muir
c53cc3edaf
LUCENE-9167: test speedup for slowest/pathological tests (round 3)
2020-01-24 08:58:59 -05:00
Robert Muir
1051db4038
LUCENE-9163: test speedup for slowest/pathological tests
...
Calming down individual test methods with double-digit execution times
after running tests many times.
There are a few more issues remaining, but this solves the majority of them.
2020-01-22 17:49:33 -05:00
Mike
338d386ae0
LUCENE-9145 First pass addressing static analysis ( #1181 )
...
Fixed a bunch of the smaller warnings found by error-prone compiler
plugin, while ignoring a lot of the bigger ones.
2020-01-17 13:30:39 -06:00
Alan Woodward
9d72bfc1af
LUCENE-9068: Build FuzzyQuery automata up-front ( #1042 )
...
FuzzyTermsEnum can now either take an array of compiled automata, and
an AttributeSource, to be used across multiple segments (eg during
FuzzyQuery rewrite); or it can take a term, edit distance, prefix and transition
boolean and build the automata itself if only being used once (eg for fuzzy
nearest neighbour calculations).
Rather than interact via attribute sources and specialized attributes, users of
FuzzyTermsEnum can get the boost and set minimum competitive boosts
directly on the enum.
2020-01-15 14:58:11 +00:00
Dawid Weiss
abd5102312
Merge with master.
2019-12-13 15:35:14 +01:00
Bruno Roustant
1812b367ab
LUCENE-9089: FST Builder renamed FSTCompiler with fluent-style Builder.
...
Closes #1070
2019-12-13 14:37:54 +01:00
Dawid Weiss
d4a9842375
Initial gradle build layer.
2019-12-02 15:34:57 +01:00
Alan Woodward
fbd05167f4
LUCENE-3041: QueryVisitor ( #581 )
...
This commit adds an introspection API to Query, allowing users to traverse
the nested structure of a query and examine its leaves. It replaces the existing
`extractTerms` method on Weight, and alters some highlighting code to use
the new API
2019-03-14 15:04:33 +00:00
Bruno P. Kinoshita
5310152450
Fix some spell check issues
2019-02-12 21:35:30 -08:00
Tommaso Teofili
33896dde5a
LUCENE-5698 - minor adjustments
2019-01-24 10:24:52 +01:00
Tommaso Teofili
6df32fbc35
LUCENE-5698 - forbidden APIs related fixes
2019-01-20 09:24:25 +01:00
Tommaso Teofili
552c367c5d
LUCENE-5698 - added test for 20n dataset, minor code adjustments
2019-01-18 21:10:33 +01:00
David Smiley
fd9164801e
LUCENE-7875: Moved MultiFields static methods to MultiTerms, FieldInfos and MultiBits.
...
MultiBits is now public and has getLiveDocs.
2018-10-18 19:49:14 -04:00
Adrien Grand
9ca053712a
LUCENE-8430: TopDocs.totalHits may now be a lower bound of the hit count.
2018-07-30 16:38:05 +02:00
Adrien Grand
331ccf3910
LUCENE-8405: Remove TopDocs.maxScore.
2018-07-18 08:38:57 +02:00
Robert Muir
2c1ab31b4e
LUCENE-7690: Add preserveOriginal option to the NGram and EdgeNGram filters
2018-06-04 21:24:20 -04:00
Tommaso Teofili
b3cf209f6f
LUCENE-8223 - remove time dependent checks in performance test
2018-03-27 10:14:55 +02:00
Christine Poerschke
a3a0e0b11e
Revert "LUCENE-8115: remove one TODO-on-its-own javadoc."
...
This reverts commit bd69d64ad0
.
2018-01-08 20:18:45 +00:00
Christine Poerschke
bd69d64ad0
LUCENE-8115: remove one TODO-on-its-own javadoc.
2018-01-08 20:04:58 +00:00
Christine Poerschke
07afc23dce
LUCENE-8115: remove unnecessary-on-its-own {@inheritDoc} annotations.
2018-01-08 19:45:19 +00:00
Alan Woodward
d250a1463d
LUCENE-8133: Rename TermContext to TermStates, and load TermState lazily if term stats are not required
2018-01-05 14:17:15 +00:00
Adrien Grand
8fd7ead940
LUCENE-8116: SimScorer now only takes a frequency and a norm as per-document scoring factors.
2018-01-04 15:13:36 +01:00
Christine Poerschke
152d223b32
Revert "LUCENE-8110: Fix potential IndexOutOfBoundsException in *Classifier.getClasses(?,int)."
...
This reverts commit af41d02eae
.
2017-12-28 15:11:39 +00:00
Christine Poerschke
af41d02eae
LUCENE-8110: Fix potential IndexOutOfBoundsException in *Classifier.getClasses(?,int).
2017-12-28 12:28:41 +00:00
Christine Poerschke
6aa312a8db
LUCENE-7991: KNearestNeighborDocumentClassifier.knnSearch no longer applies a previous boosted field's factor to subsequent unboosted fields.
2017-10-18 13:18:40 +01:00
Tommaso Teofili
97f246b4ba
LUCENE-7981 - CTB should check that CR is not null
2017-09-29 15:34:52 +02:00
Tommaso Teofili
c2c2e8a85e
LUCENE-7950 - fixed potential NPE when no docs have the class field
2017-09-02 14:43:59 +02:00
Tommaso Teofili
92e460389d
LUCENE-7838 - removed dep from sandbox, created a minimal FLT version specific for knn classification
2017-06-29 10:01:49 +02:00
Adrien Grand
44d1f1fe3f
LUCENE-7872: TopDocs.totalHits is now a long.
2017-06-21 09:55:02 +02:00
Tomas Fernandez Lobbe
d30d012c7c
SOLR-10042, LUCENE-7838: Fix precommit
2017-05-18 17:11:40 -07:00
Adrien Grand
c9bdce937a
LUCENE-7838: Remove unused imports.
2017-05-18 16:36:14 +02:00
Tommaso Teofili
c53d19e7b2
LUCENE-7838 - removed unused import
2017-05-18 14:42:56 +02:00
Tommaso Teofili
bd9e32d358
LUCENE-7838 - added knn classifier based on flt
2017-05-18 14:36:18 +02:00
Tommaso Teofili
8990500183
LUCENE-7823 - added bm25 nb classifier
2017-05-11 16:26:32 +02:00
Christine Poerschke
5c5254341e
LUCENE-7776: change javadocs default mention from Classic to BM25
...
(Also kinda added missing javadoc for new method to fix 'ant precommit'.)
2017-04-11 19:36:44 +01:00
Tommaso Teofili
9c00fc6795
LUCENE-7776 - visualize diff btwn BytesRef values in ClassificationTestBase
2017-04-11 17:12:13 +02:00
Christine Poerschke
1bf3696228
LUCENE-7776: remove unused import
2017-04-11 11:35:15 +01:00
Tommaso Teofili
0f60c4233c
LUCENE-7776 - use bm25 for knn classifier
2017-04-11 10:44:36 +02:00
Tommaso Teofili
f37fad206b
LUCENE-5548 - improved testing for SNBC
2017-04-07 10:59:20 +02:00
Tommaso Teofili
276ccff751
LUCENE-6853 - re-enabled test classification measures for bpc
2017-04-07 10:59:20 +02:00
Tommaso Teofili
cbad533d7a
LUCENE-6853 - renamed threshold to bias, initialize to avg tf
2017-04-07 10:59:20 +02:00
Adrien Grand
53064e46dd
LUCENE-7754: Inner classes should be static whenever possible.
2017-03-28 15:35:04 +02:00
Tommaso Teofili
87d8b5450a
LUCENE-7591 - approximate to no. of terms when DVs are not available
2016-12-12 10:00:21 +01:00
Tommaso Teofili
96489d2384
SOLR-8871 - adjusted header positioning
2016-11-24 23:46:20 +01:00
Tommaso Teofili
53a34b312e
LUCENE-7400 - splitter should be able to group using sorted set dv
2016-07-30 07:58:25 +02:00
Mike McCandless
7b5d7b3962
LUCENE-7337: empty boolean query now rewrites to MatchNoDocsQuery instead of vice/versa
2016-06-21 11:26:19 -04:00
Tommaso Teofili
daa26090a3
Let CachingNBClassifier be constructed from IndexReaders
2016-06-21 13:43:54 +02:00
Tommaso Teofili
fcf4389d82
LUCENE-7350 - Let classifiers be constructed from IndexReaders
2016-06-21 13:10:34 +02:00
Christine Poerschke
52f5c50246
LUCENE-7319: remove unused imports
2016-06-13 09:59:14 +01:00