Olli Kuonanoja
351e21f620
Fix MLT like text with custom frequencies
...
When an analyzer with custom term frequencies is used with MLT like
texts, the custom term frequencies are incorrectly omitted and a fixed
frequency of 1 is used instead.
This commit fixes the issue by using `TermFrequencyAttribute` to get
the term frequencies instead of using fixed 1. Also adds test cases
for the mentioned issue.
2019-04-30 12:13:49 -04:00
Andrzej Bialecki
170f5fb7a3
SOLR-13423: Upgrade RRD4j to version 3.5.
2019-04-24 12:33:48 +02:00
Mike McCandless
e522147eb1
cosmetic fix in FST.Builder assertion (thanks zhenyuan wei)
2019-04-22 11:52:18 -04:00
Tomoko Uchida
62ed046e52
LUCENE-2562: Preserve map key order for the user interface.
2019-04-22 00:36:41 +09:00
Tomoko Uchida
1204327b56
LUCENE-8738: Force locale to be 'en_US' in javadocs task.
2019-04-21 21:28:14 +09:00
Tomoko Uchida
511efc89fe
LUCENE-2562: Use Map#copyOf(), List#copyOf() and Set#copyOf() instead of Collections#unmodifiableMap(), unmodifiableList() and unmodifiableSet()
2019-04-21 18:18:23 +09:00
Uwe Schindler
cd0706bd43
Revert previous change to use separate Groovy artifacts: Use groovy-all again (bugs in ivy)
2019-04-20 10:51:05 +02:00
Uwe Schindler
a43fa13d11
Revert Groovy update and downgrade Groovy to 2.4.16, as new version is not compatible with Java 13 EA builds!
2019-04-20 02:57:26 +02:00
Uwe Schindler
52090c9b11
Update flexmark to latest version
2019-04-20 01:12:51 +02:00
Uwe Schindler
f1911f82d5
LUCENE-8768: Fix Javadocs build in Java 11
2019-04-20 01:01:13 +02:00
Uwe Schindler
77e1bec7dc
LUCENE-8738: Add missing dependency for Maven build
2019-04-20 00:51:55 +02:00
Uwe Schindler
b8494c8702
LUCENE-8738: Update Groovy to make the warnings with Java 11 a bit more silent (only one warning on first Groovy invocation)
2019-04-20 00:09:56 +02:00
Tomoko Uchida
7ba8bdca8f
LUCENE-2562: Suppress SimpleText codec for CommitsImplTest.
2019-04-19 11:07:01 +09:00
Nicholas Knize
faa78ad72c
LUCENE-8736: Fix Polygon.contains to appropriately check longitude range, and pass correct line segment vertices in EdgeTree
2019-04-18 13:15:07 -05:00
Andrzej Bialecki
bd8905150d
SOLR-12461: Upgrade Dropwizard Metrics to 4.0.5 release.
2019-04-18 19:08:20 +02:00
Adrien Grand
61d7569f78
LUCENE-7386: Flatten nested disjunctions.
2019-04-18 11:24:50 +02:00
Zhao Yang
48a68365bb
Fix OneDimensionBKDWriter valueCount validation
...
Signed-off-by: Adrien Grand <jpountz@gmail.com>
2019-04-18 11:24:38 +02:00
Adrien Grand
793635eb0e
LUCENE-2562: Don't randomly enable term vectors on fields.
2019-04-18 09:27:09 +02:00
Simon Willnauer
3a6f2f7543
Use Map.copyOf in lucene core ( #649 )
...
This cuts over several places that use the pattern of creating a copy of
the supplied map with Map.copyOf.
2019-04-17 15:26:12 +02:00
Uwe Schindler
faaee86efb
LUCENE-8738: Move to Java 11 as minimum Java version (merged branch: jira/LUCENE-8738)
...
Co-authored-by: Adrien Grand <jpountz@apache.org>
2019-04-16 14:00:09 +02:00
Jan Høydahl
181ca4ea7d
SOLR-12121: Upgrade jose4j to v0.6.5 which adds Java11 RSA PSS signature support
2019-04-15 22:11:27 +02:00
Simon Willnauer
a302be381e
LUCENE-8671: Introduce Reader attributes ( #640 )
...
Reader attributes allow a per-IndexReader configuration of codec internals.
For instance, this allows configuring per reader whether FSTs are loaded into memory or are left
on disk.
2019-04-15 20:39:36 +02:00
Nicholas Knize
8975ae5954
LUCENE-8736: Fix line detection bug in Polygon2D and Line2D componentRelateTriangle
2019-04-15 10:43:56 -05:00
Uwe Schindler
7830cae571
LUCENE-8752: Add license header to patch file
...
Revert "LUCENE-8752: Fix precommit error: patch files cannot have a license header" - This reverts commit b60548f6d8.
2019-04-14 02:01:00 +02:00
Uwe Schindler
b60548f6d8
LUCENE-8752: Fix precommit error: patch files cannot have a license header
2019-04-14 01:39:49 +02:00
Tomoko Uchida
7619c07d3a
LUCENE-8752: Add Japanese new imperial era '令和' (Reiwa) to the dictionary used in JapaneseTokenizer
2019-04-13 21:07:27 +09:00
Tomoko Uchida
f85c08224b
LUCENE-2562: Luke has no Maven artifacts
2019-04-13 01:23:46 +09:00
Jan Høydahl
ef9566e13e
LUCENE-8725: Fix precommit (javadoc)
2019-04-12 09:26:46 +02:00
Noble Paul
5f34c49813
LUCENE-8725: Make TermsQuery.SeekingTermSetTermsEnum a top level class and public
2019-04-12 06:22:09 +10:00
Nicholas Knize
55c241d87f
LUCENE-8736: Fix LatLonShapePolygonQuery and Polygon2D.contains to correctly include points that fall on the boundary
2019-04-11 09:27:36 -05:00
Uwe Schindler
2ed86c6d54
Fix precommit error
2019-04-10 16:09:19 +02:00
jimczi
81fe814428
LUCENE-8751: Weight#matches now uses the ScorerSupplier to build scorers with a lead cost of 1 (single document).
2019-04-10 12:20:02 +02:00
Tomoko Uchida
3a78c1999a
LUCENE-2562: Add Luke as a Lucene module
2019-04-10 19:15:57 +09:00
Simon Willnauer
a9503d2e81
LUCENE-8754: Prevent ConcurrentModificationException in SegmentInfo ( #637 )
...
In order to prevent ConcurrentModificationException this change makes
an unmodifiable copy on write for all maps in SegmentInfo. MergePolicies
can access these maps without synchronization and cause exceptions if
it's modified in the merge thread.
2019-04-10 09:29:22 +02:00
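The copy-on-write approach described in LUCENE-8754 above can be illustrated with a minimal sketch. This is not the actual SegmentInfo code; the class `CopyOnWriteAttributes` and its method names are hypothetical and use only the JDK. The idea shown is that every write publishes a fresh unmodifiable map, so a reader that holds an earlier snapshot can iterate it without synchronization.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a holder whose map is read by other threads without locking. */
public class CopyOnWriteAttributes {
    // Always points at an immutable snapshot; volatile so readers see the latest one.
    private volatile Map<String, String> attributes = Map.of();

    /** Readers get the current immutable snapshot and can iterate it safely. */
    public Map<String, String> getAttributes() {
        return attributes;
    }

    /** Writers copy the map, modify the copy, then publish a new unmodifiable map. */
    public synchronized void putAttribute(String key, String value) {
        Map<String, String> copy = new HashMap<>(attributes);
        copy.put(key, value);
        attributes = Map.copyOf(copy);
    }

    public static void main(String[] args) {
        CopyOnWriteAttributes info = new CopyOnWriteAttributes();
        info.putAttribute("codec", "example");
        Map<String, String> snapshot = info.getAttributes();
        info.putAttribute("mode", "test");
        // The earlier snapshot is unaffected by the later write.
        System.out.println(snapshot.size() + " " + info.getAttributes().size());
    }
}
```

The trade-off is an extra copy per write, which is usually acceptable when writes are rare and reads are frequent.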
jimczi
c1222b57e9
LUCENE-8477: Restore public ctr for FilteredIntervalsSource
2019-04-08 13:29:49 +02:00
jimczi
b8829480f8
LUCENE-8701: remove unused import
2019-04-05 17:08:04 +02:00
jimczi
04c33c77bf
LUCENE-8701: force the score mode of the child weight to COMPLETE if the score is required
2019-04-05 17:06:57 +02:00
jimczi
926de465ee
LUCENE-8701: force the score mode of the child weight to COMPLETE if the score is required
2019-04-05 17:05:40 +02:00
jimczi
4b0c36b17b
LUCENE-8701: ToParentBlockJoinQuery now creates a child scorer that disallows skipping over
...
non-competitive documents if the score of a parent depends on the score of multiple
children (avg, max, min). Additionally the score mode `none` that assigns a constant score to
each parent can terminate the collection of top scores early.
2019-04-05 15:24:09 +02:00
Simon Willnauer
1ec229b604
LUCENE-8671: Expose FST off/on-heap options on Lucene50PostingsFormat ( #613 )
...
Before we can expose options to configure this postings format
on a per-reader basis we need to expose the option to load the terms
index FST off or on heap on the postings format. This already allows changing
the default in a per-field postings format if an expert user
wants to change the defaults. This essentially provides the ability to change
defaults globally while still involving some glue code.
2019-04-04 16:59:37 +02:00
Ishan Chattopadhyaya
bbf6722e6a
Adding 6.6.6 backcompat indexes
2019-04-03 15:23:10 +05:30
Alan Woodward
06a2b73245
Add setMissingValue to sorts from Double/LongValuesSource
2019-04-03 09:42:36 +01:00
erick
58376874aa
SOLR-13363: Upgrade to ZooKeeper 3.4.14
2019-04-02 20:18:01 -07:00
Christine Poerschke
de13c8e79d
LUCENE-8742: add 'missing slop attribute' test coverage for SpanNearBuilder
2019-04-02 19:31:03 +01:00
Alan Woodward
3de0b36719
LUCENE-8730: WordDelimiterGraphFilter always emits its original token first
2019-04-02 08:58:49 +01:00
Erick Erickson
e04a28e140
SOLR-12028: BadApple and AwaitsFix annotations usage
2019-04-01 09:27:33 -07:00
jimczi
7e83451610
LUCENE-8732: ConstantScoreQuery can now early terminate the query if the minimum score is greater than the constant score and total hits are not requested
2019-03-27 18:09:23 +01:00
Alan Woodward
1c46dd1f9b
LUCENE-8741: Fix ClassCastException in ValueSource.fromDoubleValuesSource()
2019-03-27 16:08:25 +00:00
Alan Woodward
3a63c58db3
LUCENE-8477: Add CHANGES entry
2019-03-27 11:29:20 +00:00
Alan Woodward
f1782d0dd1
LUCENE-8477: Automatically rewrite disjunctions when internal gaps matter ( #620 )
...
We have a number of IntervalsSource implementations where automatic minimization of
disjunctions can lead to surprising results:
* PHRASE queries can miss matches because a longer matching sub-source is minimized
away, leaving a gap
* MAXGAPS queries can miss matches for the same reason
* CONTAINING, NOT_CONTAINING, CONTAINED_BY and NOT_CONTAINED_BY queries
can miss matches if the 'big' interval gets minimized
The proper way to deal with this is to rewrite the queries by pulling disjunctions to the top
of the query tree, so that PHRASE("a", OR(PHRASE("b", "c"), "c")) is rewritten to
OR(PHRASE("a", "b", "c"), PHRASE("a", "c")). To be able to do this generally, we need to
add a new pullUpDisjunctions() method to IntervalsSource that performs this rewriting
for each source that it would apply to.
Because these rewritten queries will in general be less efficient due to the duplication of
effort (eg the rewritten PHRASE query above pulls 5 term iterators rather than 4 in the
original), we also add an option to Intervals.or() that will prevent this happening, so that
consumers can choose speed over accuracy if it suits their usecase.
2019-03-27 11:23:43 +00:00
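The rewrite described in LUCENE-8477 above can be sketched on a toy model. This does not use the Lucene IntervalsSource API; the class `PullUpDisjunctions` and its list-based representation are assumptions made for illustration. Each phrase position holds one or more alternatives, and pulling the disjunction to the top amounts to taking the Cartesian product of those alternatives.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model: a phrase is a list of positions, each position a list of alternative term sequences. */
public class PullUpDisjunctions {
    /** Returns the flat phrases that make up the equivalent top-level OR. */
    public static List<List<String>> pullUp(List<List<List<String>>> positions) {
        List<List<String>> phrases = new ArrayList<>();
        phrases.add(new ArrayList<>());
        for (List<List<String>> alternatives : positions) {
            List<List<String>> next = new ArrayList<>();
            // Extend every phrase built so far with every alternative at this position.
            for (List<String> prefix : phrases) {
                for (List<String> alternative : alternatives) {
                    List<String> extended = new ArrayList<>(prefix);
                    extended.addAll(alternative);
                    next.add(extended);
                }
            }
            phrases = next;
        }
        return phrases;
    }

    public static void main(String[] args) {
        // PHRASE("a", OR(PHRASE("b", "c"), "c"))
        List<List<List<String>>> query = List.of(
            List.of(List.of("a")),
            List.of(List.of("b", "c"), List.of("c")));
        // Prints [[a, b, c], [a, c]], i.e. OR(PHRASE("a", "b", "c"), PHRASE("a", "c"))
        System.out.println(pullUp(query));
    }
}
```

The result contains five terms in total against four in the original query, which matches the duplication of effort the commit message mentions.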
Simon Willnauer
8d7619f4e6
LUCENE-8735: fix changes entry after backport
2019-03-26 15:57:43 +01:00
Henning Andersen
04afdb6442
LUCENE-8735: Avoid FileAlreadyExistsException on windows. ( #619 )
...
FilterDirectory.getPendingDeletions() did not delegate the call, which
resulted in a new IndexWriter on the same directory not considering pending
delete files. This could in turn result in a FileAlreadyExistsException
when running on Windows.
2019-03-26 14:56:45 +01:00
Tommaso Teofili
42a548e28e
LUCENE-6687 - avoid unnecessary looping
2019-03-26 11:27:09 +01:00
Kevin Risden
361d1e84ce
SOLR-13112: Upgrade jackson to 2.9.8
...
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-03-25 10:38:00 -04:00
Kevin Risden
4e230388ba
SOLR-13335: Upgrade to velocity 2.0 and velocity-tools 3.0
...
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-03-25 09:58:44 -04:00
Simon Willnauer
65e1b3ef2a
Add assume to test since it only works with mmap directory
2019-03-21 14:20:23 +01:00
Kevin Risden
6222abf448
SOLR-9079: Remove commons-lang as a dependency
...
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-03-20 19:37:19 -04:00
Simon Willnauer
14175c46d2
LUCENE-8671: Load FST off-heap if reader is not opened from an index writer ( #610 )
...
Today we never load FSTs of ID-like fields off-heap since we need
very fast access for updates. Yet, a reader that is not loaded from
an IndexWriter can also leave the FST on disk. This change adds
this information to SegmentReadState to allow the postings format
to make this decision without configuration.
2019-03-20 11:28:10 +01:00
Uwe Schindler
2a1ed6e484
LUCENE-8729: Workaround to allow compile under JDK13+
2019-03-19 19:05:15 +01:00
Adrien Grand
07f3535793
LUCENE-8138: Check that dv producers' next/advance and advanceExact impls are consistent.
2019-03-19 10:53:10 +01:00
Adrien Grand
577bef53dd
LUCENE-8166: Require merge instances to be consumed in the thread that created them.
2019-03-19 10:51:54 +01:00
Alan Woodward
5ca2524927
LUCENE-8694: Payload-filtered term intervals
2019-03-18 09:41:00 +00:00
Simon Willnauer
af5c2b22da
Fix more tests to not use concurrent flushing in RIW#commit()
2019-03-15 22:18:20 +01:00
Simon Willnauer
ad457d188e
Improve RIW exception handling and opt out of concurrent flushing if exception is expected
2019-03-15 11:00:16 +01:00
Adrien Grand
425f207f40
LUCENE-8688: Forced merges merge more than necessary.
2019-03-15 10:27:27 +01:00
Alan Woodward
fbd05167f4
LUCENE-3041: QueryVisitor ( #581 )
...
This commit adds an introspection API to Query, allowing users to traverse
the nested structure of a query and examine its leaves. It replaces the existing
`extractTerms` method on Weight, and alters some highlighting code to use
the new API
2019-03-14 15:04:33 +00:00
Simon Willnauer
ffb1fc83de
Concurrently flush next buffer during commit in RandomIndexWriter ( #607 )
...
This is a spin-off from `LUCENE-8700` that is satisfied by IndexWriter#flushNextBuffer.
The idea here is to additionally call flushNextBuffer in RandomIndexWriter for better
test coverage. This is a test-only change.
2019-03-14 15:43:35 +01:00
jimczi
2afa6cb00a
Add 8.0.0 back compat test indexes and reenable backward compatibility tests for 9
2019-03-14 14:00:13 +01:00
Alan Woodward
9096b846af
LUCENE-8719: Traverse all paths at the end of a TokenStream in FixedShingleFilter
2019-03-14 11:10:02 +00:00
Alan Woodward
84785e8e45
LUCENE-8726: Changes entry
2019-03-14 10:57:34 +00:00
Alan Woodward
d19dcb4ff0
LUCENE-8726: ValueSource.asDoubleValuesSource() could leak a reference to IndexSearcher
2019-03-14 10:17:12 +00:00
Mike McCandless
c1bea96cf9
LUCENE-8720: fix int overflow in NameIntCacheLRU
2019-03-12 12:19:33 -04:00
jimczi
b2c83de361
LUCENE-8652: remove unused import
2019-03-12 09:56:14 +01:00
Yeongsu Kim
b1f870a416
LUCENE-8631: The Korean user dictionary now picks the longest-matching word and discards the other matches.
2019-03-12 09:49:16 +01:00
jimczi
c87e7614f1
LUCENE-8652: ensure that the norm doesn't influence the score in TestSynonymQuery#testBoosts
2019-03-12 09:33:40 +01:00
iverase
fd8d9d5199
LUCENE-8713: Add Line2D tests
2019-03-12 09:18:54 +01:00
Ignacio Vera
458205396e
LUCENE-8712: Polygon2D does not detect crossings in some cases ( #598 )
...
LUCENE-8712: revert crossing logic to use boolean logic and skip lines
over the dateline to support dateline crossing logic
2019-03-12 08:40:54 +01:00
Erick Erickson
b893548d97
SOLR-13268: Clean up any test failures resulting from defaulting to async logging. Put TestXmlQParser back
2019-03-11 21:40:23 -07:00
Erick Erickson
3a1603dab3
SOLR-13268: Clean up any test failures resulting from defaulting to async logging. Fix precommit test to not include the baseDir
2019-03-11 11:03:31 -07:00
jimczi
d8cfeba912
LUCENE-8652: SynonymQuery can now deboost the document frequency of each term when blending the score of the synonym
2019-03-11 10:20:11 +01:00
erick
9272c29539
SOLR-13268: Clean up any test failures resulting from defaulting to async logging
2019-03-10 11:05:00 -07:00
Ishan Chattopadhyaya
aeb0774b38
Adding 7.7.1 version
2019-03-06 01:00:20 +05:30
Adrien Grand
a940c40b18
Add 7.7.1 back compat test indexes
2019-03-04 10:51:32 +01:00
Alan Woodward
7ad0ac0191
LUCENE-8714: Don't use NoMergePolicy in norms tests
...
This can cause spurious failures when run in conjunction with HandleLimitFS,
as we can end up with lots of very small segments which trips the file handles
limit
2019-03-01 14:47:54 +00:00
iverase
d381b73999
LUCENE-8709: Set pointWriter to null when size of the HeapPointWriter is 0
2019-03-01 10:44:07 +01:00
iverase
15dd4a1738
LUCENE-8703: Build point writers in the BKD tree only when they are needed
2019-03-01 07:41:16 +01:00
iverase
5b2f064ba7
LUCENE-8709: Handle case of creating a HeapPointWriter with size equal 0
2019-02-28 10:41:06 +01:00
Karl Wright
78a57f75cb
Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr
2019-02-27 02:28:50 -05:00
Karl Wright
ff799ac03d
LUCENE-8696: Rework how endpoint circles are represented to allow for consistency on WGS84.
2019-02-27 02:28:33 -05:00
Mike McCandless
1ad45a7974
fix typo in javadocs
2019-02-26 18:04:24 -05:00
Karl Wright
149469ddbc
LUCENE-8696: Refactor, in preparation for creating a new SegmentEndpoint implementation to fix the problem.
2019-02-26 08:46:17 -05:00
Karl Wright
8c34da8a62
LUCENE-8696: Fix precommit objections
2019-02-25 09:46:53 -05:00
Karl Wright
303d11921f
Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr
2019-02-25 04:06:09 -05:00
Karl Wright
9bf2c8cb93
LUCENE-8696: Update test to be what's actually failing
2019-02-25 04:05:56 -05:00
Alan Woodward
edc73ee77c
LUCENE-8702: Simplify some Intervals factory methods
2019-02-25 08:58:46 +00:00
Karl Wright
c599ad9802
Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr
2019-02-25 03:34:05 -05:00
Karl Wright
3754959797
LUCENE-8696: Come up with a simple test case that exercises the random failure
2019-02-25 03:33:35 -05:00
erick
fe5a96a284
SOLR-13268: Clean up any test failures resulting from SOLR-12055 (async logging). Kevin's upgrades
2019-02-24 09:50:04 -08:00
iverase
9b8a4a9e6e
LUCENE-8699: Add lucene internal tag to PointValue interface
...
and fix some typos
2019-02-22 11:04:46 +01:00
iverase
f5a4159d75
LUCENE-8704: Use the HeapPointWriter to spill offline as BKDWriter does
2019-02-21 12:26:37 +01:00
jimczi
4625270ce0
LUCENE-8686: Fix test bug in TestTaxonomyFacetSumValueSource#testRandom
2019-02-21 16:36:26 +09:00
iverase
3ef6e015dd
LUCENE-8699: Change HeapPointWriter to use a single byte array instead of a list
...
of byte arrays. In addition a new interface PointValue is added to abstract out
the different formats between offline and on-heap writers.
2019-02-20 12:14:58 +01:00
Kevin Risden
af3ff118ef
SOLR-11763: Upgrade Guava to 25.1-jre (Markus Jelsma, Kevin Risden)
...
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-02-19 17:27:58 -05:00
Mike McCandless
ec801b4c54
LUCENE-8635: add option to move FSTs off-heap, and do so for the FST terms index in the default codec for non-primary-key fields if MMapDirectory is being used
2019-02-19 12:52:22 -05:00
iverase
6f61bdea0e
LUCENE-8685: Refactor LatLonShape tests
2019-02-19 16:58:21 +01:00
Alan Woodward
55b4d2dcaa
LUCENE-8697: GraphTokenStreamFiniteStrings correctly handles side paths with gaps
2019-02-19 13:56:17 +00:00
Simon Willnauer
4a513fa99f
LUCENE-8292: Make TermsEnum fully abstract ( #574 )
2019-02-15 17:32:55 +01:00
Alan Woodward
10acaf02f8
SOLR-13126: Correctly combine multiplicative query boosts
2019-02-15 10:09:26 +00:00
Bruno P. Kinoshita
5310152450
Fix some spell check issues
2019-02-12 21:35:30 -08:00
jimczi
4e34a0cb41
Add 7.7.0 back compat test indexes
2019-02-12 15:11:59 +01:00
iverase
9db39ab0ca
LUCENE-8687: Optimise radix partitioning for points on heap
2019-02-11 08:11:23 +01:00
yyuan2
a3a4ecd80b
LUCENE-8662: Change TermsEnum.seekExact(BytesRef) to abstract
2019-02-08 15:10:38 -08:00
iverase
56007af4a4
LUCENE-8680: Add CHANGES.txt entry
2019-02-08 16:26:22 +01:00
iverase
06c1ebc09e
LUCENE-8680: Refactor EdgeTree#relateTriangle method
2019-02-08 16:19:38 +01:00
Kevin Risden
475f2fd36b
SOLR-9515 - Remove pom dependency kerby-kerb
...
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-02-07 11:38:18 -05:00
iverase
5d1d6448b9
LUCENE-8673: Use radix partitioning when merging dimensional points instead of sorting all dimensions before hand.
2019-02-07 08:12:13 +01:00
Alan Woodward
1b077cf0e3
LUCENE-8655: Add .getSource() method to FunctionScoreQuery
2019-02-05 09:46:15 +00:00
markrmiller
4a3ddc94d8
SOLR-9515: Update to Hadoop 3 (Mark Miller, Kevin Risden)
...
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-02-01 15:04:56 -05:00
jimczi
e9c02a6f71
LUCENE-8676: The Korean tokenizer does not update the last position if the backtrace is caused by a big buffer (1024 chars).
2019-02-01 11:37:31 +01:00
iverase
fdb6353539
LUCENE-8679: return WITHIN in EdgeTree#relateTriangle only when polygon and triangle share one edge
2019-02-01 11:23:31 +01:00
Kevin Risden
e4f202c1e3
Revert "SOLR-9515: Update to Hadoop 3"
...
This reverts commit 6bb24673f4.
2019-01-31 19:46:02 -05:00
Nicholas Knize
edb05314b3
LUCENE-8669: Fix LatLonShape WITHIN queries that fail with Multiple search Polygons that share the dateline.
2019-01-31 16:19:42 -06:00
markrmiller
6bb24673f4
SOLR-9515: Update to Hadoop 3
...
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-01-31 13:00:20 -05:00
Luca Cavanna
57f693bfc2
LUCENE-8664: Add equals/hashcode methods to TotalHits class
...
Adding equals/hashcode to TotalHits is convenient for quick comparisons, especially for users that may use it as part of bigger objects that have their own equals/hashcode.
2019-01-30 13:11:28 +01:00
Dawid Weiss
b802a52685
LUCENE-8663: NRTCachingDirectory.slowFileExists may open a file while it's inaccessible
2019-01-30 09:44:32 +01:00
jimczi
cf39708609
Add next minor release version 8.1.0
2019-01-29 14:12:38 +01:00
jimczi
a269a4d1cb
LUCENE-8660: TopDocsCollectors now return an accurate count (instead of a lower bound) if the total hit count is equal to the provided threshold.
2019-01-29 08:58:09 +01:00
Dawid Weiss
d7dc53ff7c
LUCENE-8474: Remove deprecated RAMDirectory.
2019-01-28 13:49:03 +01:00
Alan Woodward
7713a4f245
LUCENE-8650: Fix end() and reset() in ConcatenatingTokenStream
2019-01-28 09:00:01 +00:00
Mikhail Khludnev
f543b4e1f4
LUCENE-8640: fixing DateRangePrefixTreeTest, accepting 0..59 minutes and seconds.
2019-01-28 11:14:34 +03:00
Mikhail Khludnev
8e69d12dd7
LUCENE-8640: date range syntax validation
2019-01-27 21:45:28 +03:00
Tommaso Teofili
000785e68e
LUCENE-8659 - updated sha1 for OpenNLP dependency
2019-01-26 23:32:59 +01:00
Tommaso Teofili
48073a9778
LUCENE-8659 - upgrade Lucene/Solr to use OpenNLP 1.9.1
2019-01-26 13:11:30 +01:00
Adrien Grand
23bd97bae7
Refactor IndexedDISI to avoid method call when advancing by small gaps.
2019-01-25 10:21:48 +01:00
Adrien Grand
ef47582fd5
LUCENE-8658: Fix illegal assertion in WANDScorer.
2019-01-25 07:21:16 +01:00
Tommaso Teofili
c32e9b6c0a
Merge branch 'master' of github.com:apache/lucene-solr into LUCENE-5698
2019-01-24 10:25:10 +01:00
Tommaso Teofili
33896dde5a
LUCENE-5698 - minor adjustments
2019-01-24 10:24:52 +01:00
Jan Høydahl
ea2c8ba38e
SOLR-12121: JWT Token authentication plugin with OpenID Connect implicit flow login through Admin UI
2019-01-23 13:03:01 +01:00
iverase
893f5cda57
LUCENE-8654: Polygon2D#relateTriangle returns the wrong answer if polygon is inside the triangle
2019-01-23 08:25:50 +01:00
Uwe Schindler
452ffa3626
LUCENE-8340: Fix typo in CHANGES.txt
2019-01-22 15:54:40 +01:00
Alan Woodward
7d7ab14776
LUCENE-8646: Multi-term intervals
2019-01-22 13:36:02 +00:00
Alan Woodward
87d68c8253
LUCENE-8645: Intervals.fixField()
2019-01-22 13:35:15 +00:00
iverase
01dfe7bf4b
LUCENE-8649: LatLonShape's within and disjoint queries can return false positives with indexed multi-shapes
2019-01-21 16:00:28 +01:00
Tommaso Teofili
6df32fbc35
LUCENE-5698 - forbidden APIs related fixes
2019-01-20 09:24:25 +01:00
Tommaso Teofili
e32170db83
Merge branch 'master' of github.com:apache/lucene-solr into LUCENE-5698
2019-01-20 09:24:02 +01:00
Erick Erickson
73d1b07f8e
LUCENE-8585: fix precommit failure
2019-01-18 18:37:52 -08:00
Toke Eskildsen
c13645bd4c
LUCENE-8585: Create jump-tables for DocValues at index-time
2019-01-18 22:42:04 +01:00
Tommaso Teofili
552c367c5d
LUCENE-5698 - added test for 20n dataset, minor code adjustments
2019-01-18 21:10:33 +01:00
Dawid Weiss
efef89adc6
LUCENE-8642: RamUsageTester.sizeOf ignores arrays and collections if --illegal-access=deny.
2019-01-18 11:55:53 +01:00
David Smiley
70dd3ee06a
LUCENE-8581: Unreference LatLonPoint.BYTES from LatLonShape & Rectangle2D.
2019-01-17 09:00:18 -05:00
Dawid Weiss
f2352e9456
Revert "LUCENE-8642, LUCENE-8641: correct RamUsageTester.sizeOf's handling of ByteBuffers. Throw exceptions on denied reflection to catch problems early. This affects tests only."
...
This reverts commit a16f0833ed.
2019-01-17 13:05:36 +01:00
Dawid Weiss
a16f0833ed
LUCENE-8642, LUCENE-8641: correct RamUsageTester.sizeOf's handling of ByteBuffers. Throw exceptions on denied reflection to catch problems early. This affects tests only.
2019-01-17 12:23:30 +01:00
Dawid Weiss
9926b45597
LUCENE-8643: TestLatLonLineShapeQueries.testRandomBig fails with suite
...
timeout (simple text codec used, decreased iteration counts).
2019-01-17 10:10:11 +01:00
Simon Willnauer
e35adf6824
LUCENE-8639: Prevent new threadstates from being created while we cut over to a new delete queue ( #535 )
...
This prevents an edge case where suddenly a lot of threads start indexing
while we carry over sequence ids from the previous to the new delete queue.
We now lock creation of new thread states for a very short time until we created and assigned
a new delete queue.
2019-01-16 16:37:49 +01:00
Alan Woodward
a826649241
LUCENE-8633: Remove term weighting from IntervalQuery scores
2019-01-16 13:59:16 +00:00
Dawid Weiss
09778b2133
LUCENE-8641: add awaits fix to stop jenkins from creating subsequent alerts for this problem.
2019-01-16 11:20:11 +01:00
Dawid Weiss
d4e016afdf
LUCENE-8474: (partial) removal of accesses to RAMFile and RAMDirectory streams. Removal of GrowableByteArrayDataOutput.
2019-01-15 13:42:25 +01:00
Dawid Weiss
f083473b89
LUCENE-8636: follow-up speedups and cleanups.
2019-01-15 12:02:15 +01:00
Dawid Weiss
d965b3547e
LUCENE-8636: TestPointQueries times out on nightly (decreased big range to 50k, excluded simple text codec).
2019-01-14 14:12:45 +01:00
iverase
8762b071bb
LUCENE-8623: Decrease I/O pressure when merging high dimensional points
2019-01-14 07:56:06 +01:00
Steve Rowe
283b19a8da
LUCENE-8527: Upgrade JFlex to 1.7.0. StandardTokenizer and UAX29URLEmailTokenizer now support Unicode 9.0, and provide UTS#51 v11.0 Emoji tokenization with the '<EMOJI>' token type.
2019-01-08 13:33:49 -05:00
Alan Woodward
e015afadaa
LUCENE-8622: Minimum-should-match interval function
2019-01-07 14:08:40 +00:00
Alan Woodward
7d34bfdaad
LUCENE-8629: New interval functions: before, after, within, overlapping
2019-01-07 13:33:59 +00:00
Alan Woodward
906f94a8cc
Remove unnecessary back compatibility
2019-01-07 13:20:01 +00:00
Alan Woodward
04f48dba8b
Move to version 9.0.0
2019-01-07 13:04:01 +00:00
Mike McCandless
63dfba4c7d
LUCENE-8601: attributes added to IndexableFieldType during indexing will now be preserved in the index and accessible at search time via FieldInfo attributes
2019-01-03 18:44:41 -05:00
Christine Poerschke
63a6c250d7
Update copyright year(s) in lucene/NOTICE.txt and solr/NOTICE.txt files.
2019-01-03 18:46:30 +00:00
Jim Ferenczi
7c70365811
LUCENE-8627: Fix SearchAfter#testQueries to always count the number of hits accurately.
2019-01-02 09:37:10 +01:00
Alan Woodward
2532a5d31c
LUCENE-8612: Add Intervals.extend()
2019-01-01 17:55:19 +00:00
Dawid Weiss
c9c3ef0ef9
LUCENE-8625: int overflow in ByteBuffersDataInput.sliceBufferList
2018-12-28 12:10:48 +01:00
Dawid Weiss
45c50a0273
LUCENE-8624: int overflow in ByteBuffersDataOutput.size().
2018-12-28 10:55:25 +01:00
Steve Rowe
86bc028f0d
LUCENE-8611: Fix Maven test compilation by making lucene-test-framework's dependencies transitive for all modules that depend on it, by not writing out the <dependencyManagement>/<exclusions> section for it in the grandparent POM.
2018-12-20 09:30:16 -05:00
Dawid Weiss
f28c5bec9b
LUCENE-8604: TestRuleLimitSysouts now has an optional "hard limit" of bytes that can be written to stderr and stdout (anything beyond the hard limit is ignored). The default hard limit is 2 GB of logs per test class.
2018-12-18 22:03:44 +01:00
Adrien Grand
dcd4a288b4
LUCENE-8600: Use a faster sort in DocValuesFieldUpdates.
2018-12-18 17:45:17 +01:00
iverase
d185ba99de
LUCENE-8581: Change LatLonShape encoding to use 4 bytes Per Dimension
2018-12-18 16:50:24 +01:00
Cao Manh Dat
f80e8e1167
Merge jira/http2 branch to master
2018-12-16 16:58:20 +00:00
Alan Woodward
dae3e304a1
LUCENE-8597: TestIntervals.assertGaps() needs to take docBase into account
2018-12-16 11:26:40 +00:00
Jim Ferenczi
53b05c5e7e
LUCENE-8603 Fix the inversion of right ids for additional nouns in the Korean user dictionary
2018-12-16 15:03:12 +07:00
Dawid Weiss
e916f1fb86
LUCENE-8611: Update randomizedtesting to 2.7.2, JUnit to 4.12, add hamcrest-core dependency.
2018-12-15 09:49:36 +01:00
Simon Willnauer
5c5c42cc37
LUCENE-8609: Remove deprecated IW#numDocs() and IW#maxDoc() methods
2018-12-14 19:36:25 +01:00
Simon Willnauer
e974311d91
LUCENE-8609: Allow getting consistent docstats from IndexWriter
...
Today we have #numDocs() and #maxDoc() on IndexWriter. This is enough
to get all stats for the current index but it's subject to concurrency
and might return numbers that are not consistent, i.e. some cases can
return maxDoc < numDocs, which is undesirable. This change adds a getDocStats()
method to index writer to allow fetching consistent numbers for these stats.
This change also deprecates IndexWriter#numDocs() and IndexWriter#maxDoc()
and replaces all their usages with IndexWriter#getDocStats().
2018-12-14 19:36:25 +01:00
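The consistency problem described in LUCENE-8609 above can be sketched with a toy class. This is not the IndexWriter implementation; `DocStatsSketch`, its counters, and its nested `DocStats` holder are hypothetical. The point illustrated is that reading both numbers in one synchronized step and returning them as an immutable pair rules out a result where maxDoc < numDocs, which two separate getter calls cannot guarantee under concurrent updates.

```java
/** Hypothetical writer-like class that hands out both counters as one consistent snapshot. */
public class DocStatsSketch {
    /** Immutable pair; both values are captured under the same lock. */
    public static final class DocStats {
        public final int maxDoc;
        public final int numDocs;

        DocStats(int maxDoc, int numDocs) {
            this.maxDoc = maxDoc;
            this.numDocs = numDocs;
        }
    }

    private int maxDoc;   // all documents ever added, including deleted ones
    private int deleted;  // documents marked as deleted

    public synchronized void addDocument() {
        maxDoc++;
    }

    public synchronized void deleteDocument() {
        if (deleted < maxDoc) {
            deleted++;
        }
    }

    /** Reads both counters in one critical section, so numDocs can never exceed maxDoc. */
    public synchronized DocStats getDocStats() {
        return new DocStats(maxDoc, maxDoc - deleted);
    }

    public static void main(String[] args) {
        DocStatsSketch writer = new DocStatsSketch();
        writer.addDocument();
        writer.addDocument();
        writer.deleteDocument();
        DocStats stats = writer.getDocStats();
        System.out.println(stats.maxDoc + " " + stats.numDocs);
    }
}
```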
Nicholas Knize
d3ee925bdd
Add 7.6.0 back compat test indexes
2018-12-13 21:50:53 -06:00
Simon Willnauer
4272c631ca
LUCENE-8608: Extract utility class to iterate over terms docs
...
Today we re-implement the same algorithm in various places
when we want to consume all docs for a set/list of terms. This
caused serious slowdowns for instance in the case of applying
updates fixed in LUCENE-8602. This change extracts the common
usage and shares the iteration code including logic to reuse
Terms and PostingsEnum instances as much as possible and adds
tests for it.
2018-12-13 15:30:34 +01:00
Alan Woodward
fa025e1f78
LUCENE-8607: MatchAllDocsQuery can skip counting hits
2018-12-13 09:01:49 +00:00
iverase
ce9a8012c0
LUCENE-8605: Separate bounding box spatial logic from query logic on LatLonShapeBoundingBoxQuery
2018-12-12 13:46:35 +01:00
Adrien Grand
55993ecb9b
Fix test failure when floorSegmentMB > maxMergedSegmentMB.
2018-12-12 13:21:06 +01:00
Adrien Grand
21dc237248
Revert "test"
...
This reverts commit 783f01bab3e096f4f843b9de62af25d6fbe07340.
2018-12-12 13:20:37 +01:00
Adrien Grand
adf73ff6bd
test
2018-12-12 13:20:37 +01:00
Simon Willnauer
61e448666d
LUCENE-8599: Add hasNoValue bitset to ramBytesUsed calculation
2018-12-11 22:36:12 +01:00
Karl Wright
6587f640d2
Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr
2018-12-11 13:48:44 -05:00
Karl Wright
2edf5adedc
LUCENE-8587: Fix the equals method of GeoComplexPolygon
2018-12-11 13:48:34 -05:00
Simon Willnauer
d08e2d49d3
LUCENE-8602: Share TermsEnum if possible while applying DV updates
...
Today we pull a new terms enum when we apply DV updates even though the
field stays the same which is the common case. Benchmarking this on a
larger term dictionary with a significant number of updates shows a
2x improvement in performance.
2018-12-11 19:14:16 +01:00
Karl Wright
874937aba8
Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr
2018-12-11 09:30:17 -05:00
Karl Wright
da62c73282
LUCENE-8587: Change GeoPoint serialization to make serialization/unserialization non-lossy
2018-12-11 09:30:01 -05:00
Toke Eskildsen
8a20705b82
Revert "LUCENE-8374 part 1/4: Reduce reads for sparse DocValues".
...
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.
This reverts commit 58a7a8ada5.
2018-12-11 14:17:57 +01:00
Toke Eskildsen
1da6d39b41
Revert "LUCENE-8374 part 2/4: Reduce reads for sparse DocValues".
...
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.
This reverts commit 7ad027627a.
2018-12-11 14:14:22 +01:00
Toke Eskildsen
6c5d87a505
Revert "LUCENE-8374 part 3/4: Reduce reads for sparse DocValues".
...
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.
This reverts commit 7949b98f80.
2018-12-11 14:14:07 +01:00
Toke Eskildsen
3158d0c485
Revert "LUCENE-8374 part 4/4: Reduce reads for sparse DocValues".
...
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.
This reverts commit e356d793ca.
2018-12-11 14:13:37 +01:00
Toke Eskildsen
870bb11cc8
Revert "Pre-commit fixes for LUCENE-8374 (JavaDoc + arguments)".
...
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.
This reverts commit 6c11161111.
2018-12-11 14:13:17 +01:00
Noble Paul
c651cbfd9a
SOLR-12885: BinaryResponseWriter (javabin format) should directly copy from BytesRef to output
2018-12-11 18:35:57 +11:00