David Smiley
20a7ee9e11
LUCENE-8408: Highlighter: Remove obsolete private AttributeFactory instance
2018-07-20 12:33:34 -04:00
Jim Ferenczi
a19bc5ecb6
LUCENE-8402: Remove invalid identityHashCode assertions in TestPriorityQueue.
2018-07-20 10:13:19 +02:00
Nicholas Knize
9d3cc1e16f
LUCENE-8399: Fix LatLonShapeBoundingBoxQuery to use orient method for computing edge intersections
2018-07-19 13:29:05 -05:00
David Smiley
eafc9ffc6f
LUCENE-8407: Add SpanTermQuery.getTermStates getter
2018-07-18 09:09:31 -04:00
Adrien Grand
7d8fc543f0
LUCENE-8411: Remove fillFields from TopFieldCollector factory methods.
2018-07-18 12:55:05 +02:00
Mike McCandless
a2f113c5c6
LUCENE-8400: make BytesRefHash.compact public
2018-07-18 05:53:59 -04:00
Adrien Grand
331ccf3910
LUCENE-8405: Remove TopDocs.maxScore.
2018-07-18 08:38:57 +02:00
Adrien Grand
8093c450c1
LUCENE-8263: Replace TieredMergePolicy's reclaimDeletesWeight with deletesPctAllowed.
2018-07-17 18:31:06 +02:00
Adrien Grand
d730c8b214
LUCENE-8060: Remove usage of TopDocs#totalHits that should really be IndexSearcher#count.
...
Many tests were written before we introduced IndexSearcher#count and used
`searcher.search(query, 1).totalHits` to get the number of matches of a query
rather than `searcher.count(query)`.
2018-07-17 14:32:02 +02:00
Mike McCandless
ae3929c3ed
LUCENE-8487: add IndexSearcher.getSlices
2018-07-16 12:26:58 -04:00
Erick Erickson
8ce46b6c45
LUCENE-8398: TieredMergePolicy.getMaxMergedSegmentMB has rounding error
2018-07-16 08:12:35 -07:00
Jim Ferenczi
4b9e2c406e
LUCENE-8402: Mute test
2018-07-16 14:35:39 +02:00
Uwe Schindler
c97f27b06c
Merge branch 'remove-constructor-wrapper-classes' of https://github.com/michaelbraun/lucene-solr :
...
LUCENE-8345, GitHub PR #392 : Remove instantiation of redundant wrapper classes for primitives; add wrapper class constructors to forbiddenapis.
This closes #392
2018-07-16 12:24:45 +02:00
Adrien Grand
b6d6f1e3b5
LUCENE-8399: Disable test.
2018-07-16 11:23:02 +02:00
Nicholas Knize
f291f3eb6c
isolate large vertex shape test from basic shape test
2018-07-15 16:12:06 -05:00
Nicholas Knize
d387cb5241
update TestLatLonShape to handle large number of vertices
2018-07-15 09:36:18 -05:00
Nicholas Knize
e56c81b452
Fix simple tessellator test errors
2018-07-14 23:45:43 -05:00
Erick Erickson
db8db0d2b6
SOLR-12551: Upgrade to Tika 1.18
2018-07-14 15:45:33 -07:00
Nicholas Knize
3b1714e737
LUCENE-8396: silence random large poly test - for now
2018-07-14 12:48:28 -05:00
Michael Braun
f0e1864ceb
Merge remote-tracking branch 'source/master' into remove-constructor-wrapper-classes
2018-07-14 13:39:37 -04:00
Nicholas Knize
b5ef13330f
LUCENE-8396: Add Points Based Shape Indexing and Search that decomposes shapes into a triangular mesh and indexes individual triangles as a 6 dimension point
2018-07-14 11:28:37 -05:00
Mike McCandless
07de1dc011
LUCENE-8397: add DirectoryTaxonomyWriter.getCache
2018-07-13 10:20:03 -04:00
Alan Woodward
8997d41357
LUCENE-8395: WordDelimiterGraphFilter can incorrectly add holes
2018-07-12 13:13:04 +01:00
Adrien Grand
b68829e656
Add 6.6.5 bwc indices.
2018-07-12 11:08:04 +02:00
Adrien Grand
e01eefe965
Add version 6.6.5.
2018-07-12 11:02:21 +02:00
Adrien Grand
14e132b29f
LUCENE-8382: Don't propagate calls to setMinCompetitiveScore in MultiCollector.
2018-07-10 09:17:34 +02:00
Adrien Grand
b1bb11b79d
LUCENE-8391: More tests for merge policies.
2018-07-10 09:17:34 +02:00
Adrien Grand
a864ef8231
LUCENE-8394: Explicitly handle the case when segments are smaller than the floor size.
2018-07-10 09:17:34 +02:00
Adrien Grand
8e0d9849bd
LUCENE-8393: TieredMergePolicy needs to take into account the maximum segment size when computing the allowed number of segments.
2018-07-10 09:17:34 +02:00
Adrien Grand
9433ab79e2
LUCENE-8392: Avoir corner cases when maxMergeAtOnce is greater than segmentsPerTier.
2018-07-10 09:17:34 +02:00
Adrien Grand
ad01baedbf
LUCENE-8383: Fix computation of mergingBytes in TieredMergePolicy.
2018-07-09 15:23:36 +02:00
Adrien Grand
41ddac5b44
LUCENE-8385: Fix computation of the allowed segment count in TieredMergePolicy.
2018-07-09 15:21:10 +02:00
Alan Woodward
963cceebff
LUCENE-8390: Replace MatchesIteratorSupplier with IOSupplier
2018-07-09 12:35:12 +01:00
Alexandre Rafalovitch
b7d14c50fb
SOLR-11694: Remove outdated UIMA module
2018-07-07 09:58:57 -04:00
Alan Woodward
6d6e67140b
LUCENE-8388: Remove PostingsEnum#attributes()
2018-07-06 18:16:09 +01:00
Mike McCandless
fa3cd86de8
LUCENE-8379: add TermQuery.getTermStates
2018-07-05 10:16:55 -04:00
Mike McCandless
4dc2008dc1
LUCENE-8378: add DocIdSetIterator.range method
2018-07-05 10:01:53 -04:00
Nhat Nguyen
7e548ba860
LUCENE-8384: Fix missing advance DVGen in PendingSoftDeletes
...
Today we only advance the docValues update generation in
PendingSoftDeletes for the soft-deletes field. If we update a
soft-deletes DV field, then update a non-soft-deletes DV field, then
onNewReader will consider that we never update DV field although we did.
This commit makes sure that we always advance the docValues update
generation when handling docValues update in PendingSoftDeletes.
Co-authored-by: Simon Willnauer <simonw@apache.org>
2018-07-04 22:15:47 -04:00
Nhat Nguyen
d96ef95e0f
LUCENE-8381: Add change log entry
2018-07-04 20:19:43 -04:00
Nhat Nguyen
0ccd5d580d
LUCENE-8381: Do not count hard-deletes as soft-deletes in merges
...
Today if a document is soft-deleted, then hard-deleted, IW, however,
considers that doc as soft-deleted when wrapping readers for merges.
This change makes sure that IW excludes the hard-deleted documents from
a merge reader, and does not count them as soft-deletes.
Co-authored-by: Simon Willnauer <simonw@apache.org>
2018-07-04 14:16:34 -04:00
Adrien Grand
799d2acd88
LUCENE-7976: Fix indentation.
2018-07-04 13:36:54 +02:00
Dawid Weiss
0f652627a0
LUCENE-8380: UTF8TaxonomyWriterCache page/ offset calculation bug
2018-07-04 09:06:33 +02:00
Alan Woodward
f835d24997
LUCENE-8376, LUCENE-8371: ConditionalTokenFilter fixes
2018-07-02 15:36:12 +01:00
Alan Woodward
ad727bac43
LUCENE-8373: Removed StandardAnalyzer.ENGLISH_STOP_WORDS_SET
2018-06-29 16:07:09 +01:00
Alan Woodward
6566b59f36
LUCENE-8373: Move ENGLISH_STOPWORDS_SET to EnglishAnalyzer
2018-06-29 16:07:08 +01:00
Erick Erickson
c303c5f126
LUCENE-8370: Reproducing TestLucene{54,70}DocValuesFormat.testSortedSetVariableLengthBigVsStoredFields() failures
2018-06-28 18:28:37 -07:00
Mike McCandless
4c646dab3f
Fix PriorityQueue constructor to throw IllegalArgumentException if you pass Integer.MAX_VALUE as maxSize (thanks rsaavedraf)
2018-06-28 12:11:18 -04:00
Christine Poerschke
e1d2749b20
SOLR-12419: standardise solr/contrib (private) logger names
2018-06-28 14:47:33 +01:00
Mike McCandless
e8d1057403
LUCENE-8367: make per-dimension drill down optional for each facet dimension
2018-06-27 15:57:21 -04:00
Alan Woodward
39fb4b8a18
LUCENE-8356: Remove StandardFilter and StandardFilterFactory
2018-06-27 12:05:23 +01:00
Alan Woodward
0fbe3d257e
LUCENE-8356: Remove StandardFilter from Solr schemas
2018-06-27 12:05:23 +01:00
Alan Woodward
fce841534a
LUCENE-8356: Deprecate StandardFilter
2018-06-27 12:05:23 +01:00
Erick Erickson
1d85cd7838
SOLR-12513 Reproducing TestCodecSupport.testMixedCompressionMode failure
2018-06-25 15:17:40 -07:00
Shalin Shekhar Mangar
ffd9944315
SOLR-12468: Upgrade Jetty to 9.4.11.v20180605
2018-06-25 19:02:06 +05:30
Adrien Grand
3b9d3a760a
Add 7.4.0 bw indices.
2018-06-22 13:31:42 +02:00
Robert Muir
2ea416ee3d
LUCENE-8366: Upgrade to ICU 62.1
2018-06-21 20:09:13 -04:00
Marc-Andre Morissette
008bc74beb
LUCENE-8365: Fix ArrayIndexOutOfBoundsException in UnifiedHighlighter
...
This fixes a "off by one" error in the UnifiedHighlighter's code that
is only triggered when two nested SpanNearQueries contain the same term.
Signed-off-by: Simon Willnauer <simonw@apache.org>
Closes #408
2018-06-20 16:49:49 +02:00
Karl Wright
dcfbaf31db
LUCENE-8363: Interpolate should yield points on the ellipsoid.
2018-06-19 02:31:40 -04:00
Alan Woodward
ab2fec1642
LUCENE-8237: Correct handling of position increments in sub-tokenstreams
2018-06-18 09:57:38 +01:00
Alan Woodward
d1dc874d90
LUCENE-8360: NGram filters should set posIncAtt in end()
2018-06-18 09:57:38 +01:00
Erick
2519025fda
LUCENE-7976: Make TieredMergePolicy respect maxSegmentSizeMB and allow singleton merges of very large segments
2018-06-15 12:33:02 -07:00
Mike McCandless
e7aac9b0c9
fix minor javadoc typos
2018-06-15 13:57:24 -04:00
Simon Willnauer
772e171ac6
LUCENE-8358: Relax assertion in IW#writeSomeDocValuesUpdates
...
This assertion is too strict since we can see this situation if for instance
a ReadersAndUpdates instance gets written to disk concurrently and
readerpooling is off. This change also simplifies ReaderPool#getReadersByRam and
adds a test for it.
2018-06-15 12:01:29 +02:00
Erick Erickson
a98fcda9c6
SOLR-12028: BadApple and AwaitsFix annotations usage
2018-06-14 13:16:39 -07:00
Alan Woodward
6f24be9457
LUCENE-8357: Fix function score explanations
2018-06-14 11:07:30 +01:00
David Smiley
228a84fd6d
LUCENE-8344: TokenStreamToAutomaton doesn't ignore trailing posInc when preservePositionIncrements=false
2018-06-13 23:35:44 -04:00
Alan Woodward
5ae716c412
LUCENE-7444: StandardAnalyzer not longer uses english stopwords by default
2018-06-13 11:38:16 +01:00
Simon Willnauer
7eb74ac50f
LUCENE-8164: IndexWriter silently accepts broken payload.
...
This has been fixed via LUCENE-8165 since we are now checking
for offset+length going out of bounds.
Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co>
Co-authored-by: Robert Muir <rmuir@apache.org>
2018-06-13 10:16:33 +02:00
Simon Willnauer
61e68ec1e8
LUCENE-8355: Prevent IW from opening an already dropped segment while DV updates are written
...
This change fixes an isse where IW asks ReadersAndUpdates to write a DV updates for a
segment that has been dropped concurrently. The race only occurs if ram buffers are filled
up enough to trigger flushing DV to disk.
Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co>
2018-06-13 10:10:01 +02:00
Steve Rowe
90e4eca9db
LUCENE-8278: move CHANGES entry to 7.4 section
2018-06-12 09:34:53 -04:00
Steve Rowe
ead05a10b1
LUCENE-8278: Some end-of-input no-scheme domain-only URL tokens are typed as <ALPHANUM> rather than <URL>
2018-06-11 17:09:30 -04:00
ivera
36b7cdde06
LUCENE-8350: Fix for time-out in RandomGeoPolygonTests
2018-06-08 12:47:26 +02:00
Alan Woodward
a4fa168962
LUCENE-8273: Don't wrap ShingleFilter in conditions in testRandomChains
2018-06-07 12:54:10 +01:00
Adrien Grand
a82c9cfcf4
Add a 7.5.0 version.
2018-06-07 10:33:52 +02:00
Nhat Nguyen
8a6f1bf5ad
LUCENE-8165: Ban copyOf and copyOfRange.
...
These methods are lenient with out-of-bounds indices.
Signed-off-by: Adrien Grand <jpountz@gmail.com>
2018-06-07 10:08:21 +02:00
Andrzej Bialecki
3022bbce2e
SOLR-12445: Upgrade Dropwizard Metrics to version 3.2.6.
2018-06-06 15:14:22 +02:00
Ignacio Vera
af7dfb182e
LUCENE-8299: Geo3D wrapper uses new polygon method factory that gives better support for polygons with many points (>100)
2018-06-06 11:39:39 +02:00
Jim Ferenczi
2b406a57c4
LUCENE-8346: Remove final modifier on QueryBuilder#createSpanQuery to allow override
2018-06-06 09:18:25 +02:00
Michael Braun
78079fc552
Merge remote-tracking branch 'source/master' into remove-constructor-wrapper-classes
2018-06-05 18:48:55 -04:00
Michael Braun
6d33beb92d
Remove extra T
2018-06-05 18:29:46 -04:00
David Smiley
f9f5e83745
LUCENE-8332: New ConcatenateGraphFilter (from CompletionTokenStream).
...
* Added a test for FingerprintFilter and clarified FF's end condition.
2018-06-04 23:07:31 -04:00
Robert Muir
5c6a49b13f
LUCENE-7960: remove deprecations
2018-06-04 22:40:48 -04:00
Robert Muir
208d4a9c34
LUCENE-7960: Add preserveOriginal option to the NGram and EdgeNGram filters
...
(this is a correction of the issue number in both the CHANGES.txt and the commit message, sorry for the noise).
2018-06-04 21:32:53 -04:00
Robert Muir
2c1ab31b4e
LUCENE-7690: Add preserveOriginal option to the NGram and EdgeNGram filters
2018-06-04 21:24:20 -04:00
Simon Willnauer
59087d148a
[TEST] Ensure MDW.assertNoUnreferencedFilesOnClose is threadsafe
2018-06-04 17:33:18 +02:00
Simon Willnauer
fe83838ec3
LUCENE-8341: Record soft deletes in SegmentCommitInfo
...
This change add the number of documents that are soft deletes but
not hard deleted to the segment commit info. This is the last step
towards making soft deletes as powerful as hard deltes since now the
number of document can be read from commit points without opening a
full blown reader. This also allows merge posliies to make decisions
without requiring an NRT reader to get the relevant statistics. This
change doesn't enforce any field to be used as soft deletes and the statistic
is maintained per segment.
2018-06-04 15:05:12 +02:00
Simon Willnauer
e7a0a12926
LUCENE-8335: Enforce soft-deletes field up-front
...
Soft deletes field must be marked as such once it's introduced
and can't be changed after the fact.
Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co>
2018-06-04 08:28:38 +02:00
Michael Braun
fb6574100e
LUCENE-8345 - add wrapper class constructors to forbiddenapis
2018-06-03 15:40:50 -04:00
Simon Willnauer
3dc4fa199c
Revert "LUCENE-8335: Enforce soft-deletes field up-front."
...
This reverts commit a2d9276674
.
2018-06-02 13:47:24 +02:00
Simon Willnauer
a2d9276674
LUCENE-8335: Enforce soft-deletes field up-front.
...
Soft deletes field must be marked as such once it's introduced
and can't be changed after the fact.
2018-06-02 13:14:53 +02:00
Simon Willnauer
76263087b5
LUCENE-8338: Ensure number returned for PendingDeletes are well defined
...
Today a call to PendingDeletes#numPendingDeletes might return 0
if the deletes are written to disk. This doesn't mean these values are committed
or refreshed in the latest reader. Some places in IW use these numbers to make
decisions if there has been deletes added since last time checked (BufferedUpdateStream)
which can cause wrong (while not fatal) decision ie. to kick of new merges.
Now this API is made protected and not visible outside of PendingDeletes to prevent
any kind of confusion. The APIs now allow to get absolute numbers of getDelCount and numDocs
which have the same name and semantics as their relatives on IndexReader/Writer
and SegmentCommitInfo.
2018-05-30 20:48:51 +02:00
Alan Woodward
e3d4c7e9b7
LUCENE-8300: Allow unordered intervals to exclude overlaps
2018-05-30 16:15:39 +01:00
Karl Wright
ceb4f768bf
LUCENE-8337: Fix problems with how travel planes too close to edge of world are disallowed, and increase the size of the disallowed window by an order of magnitude.
2018-05-29 21:03:03 -04:00
Karl Wright
7ce6dbda18
LUCENE-8337: Add disabled unit test for failure
2018-05-29 12:42:56 -04:00
Alan Woodward
4ea9d2ea8c
LUCENE-8273: Adjust position increments when filtering stacked tokens
2018-05-29 15:59:36 +01:00
Simon Willnauer
34741a863a
LUCENE-8330: Exclude MockRandomMP from basic tests
2018-05-29 16:58:03 +02:00
Andrzej Bialecki
6bbce38b77
SOLR-11779: Basic long-term collection of aggregated metrics.
2018-05-29 10:35:39 +02:00
Yuri Astrakhan
d039730638
Fix incorrect BitUtil.deinterleave() description
...
Signed-off-by: Adrien Grand <jpountz@gmail.com>
Closes #367
2018-05-28 17:10:47 +02:00
Adrien Grand
1971ef3109
LUCENE-8186: LowerCaseTokenizerFactory now lowercases text in multi-term queries.
2018-05-28 16:20:49 +02:00
Adrien Grand
78ca82e63a
Add 6.6.4 back compat test indexes.
2018-05-28 16:11:32 +02:00