Jim Ferenczi
49e3cca77f
LUCENE-8204: Boolean queries with a mix of required and optional clauses are now faster if the total hit count is not required
2018-08-08 15:49:58 +02:00
Adrien Grand
e56c8722ce
Revert "Make the nightly test smaller so that it does not fail with GC overhead exceeded (OOM). Clean up random number fetching to make it shorter."
...
This reverts commit 3203e99d8f
.
2018-08-01 15:44:57 +02:00
Adrien Grand
86a39fa29f
Revert "Fix AAIOOBE in GeoTestUtil."
...
This reverts commit c3e813188e
.
2018-08-01 15:44:47 +02:00
Adrien Grand
c3e813188e
Fix AAIOOBE in GeoTestUtil.
2018-08-01 15:17:53 +02:00
Dawid Weiss
3203e99d8f
Make the nightly test smaller so that it does not fail with GC overhead exceeded (OOM). Clean up random number fetching to make it shorter.
2018-08-01 14:05:02 +02:00
Adrien Grand
99dbe93681
LUCENE-8060: IndexSearcher's search and searchAfter methods now only compute total hit counts accurately up to 1,000.
2018-08-01 09:01:21 +02:00
Steve Rowe
a08eadb480
Fix InfixSuggestersTest.testShutdownDuringBuild() failures
2018-07-30 22:49:49 -04:00
Adrien Grand
61e89e3ca0
LUCENE-8431: Top-docs collectors now collect lower bounds of the hit count.
2018-07-30 16:38:05 +02:00
Adrien Grand
9ca053712a
LUCENE-8430: TopDocs.totalHits may now be a lower bound of the hit count.
2018-07-30 16:38:05 +02:00
Dawid Weiss
d25f62634b
LUCENE-8415: test quirk follow up. MockDirectoryWriter uses AccessDeniedException (a subclass of IOException) to signal files still open for writing when read access is requested.
2018-07-25 11:34:31 +02:00
Dawid Weiss
8892c0d9af
LUCENE-8415: Clean up Directory contracts (write-once, no reads-before-write-completed). Minor test improvements and cleanups.
2018-07-24 08:47:50 +02:00
Jason Gerlowski
6ed9607f74
SOLR-12555: Add add'l expectThrows() test helper
2018-07-23 20:37:04 -04:00
Alan Woodward
028c86b1fa
LUCENE-8306: Allow iteration over submatches
...
Also includes LUCENE-8404, adding match iteration to SpanQuery
2018-07-23 10:02:01 +01:00
Alan Woodward
6e3f61f6f9
Revert "LUCENE-8306: Allow iteration over submatches"
...
Incorrect patch committed in error
This reverts commit a8839b7eab
.
2018-07-22 22:36:46 +01:00
Alan Woodward
a8839b7eab
LUCENE-8306: Allow iteration over submatches
2018-07-22 21:42:46 +01:00
Adrien Grand
331ccf3910
LUCENE-8405: Remove TopDocs.maxScore.
2018-07-18 08:38:57 +02:00
Adrien Grand
8093c450c1
LUCENE-8263: Replace TieredMergePolicy's reclaimDeletesWeight with deletesPctAllowed.
2018-07-17 18:31:06 +02:00
Adrien Grand
d730c8b214
LUCENE-8060: Remove usage of TopDocs#totalHits that should really be IndexSearcher#count.
...
Many tests were written before we introduced IndexSearcher#count and used
`searcher.search(query, 1).totalHits` to get the number of matches of a query
rather than `searcher.count(query)`.
2018-07-17 14:32:02 +02:00
Michael Braun
f0e1864ceb
Merge remote-tracking branch 'source/master' into remove-constructor-wrapper-classes
2018-07-14 13:39:37 -04:00
Nicholas Knize
b5ef13330f
LUCENE-8396: Add Points Based Shape Indexing and Search that decomposes shapes into a triangular mesh and indexes individual triangles as a 6 dimension point
2018-07-14 11:28:37 -05:00
Adrien Grand
b1bb11b79d
LUCENE-8391: More tests for merge policies.
2018-07-10 09:17:34 +02:00
Adrien Grand
41ddac5b44
LUCENE-8385: Fix computation of the allowed segment count in TieredMergePolicy.
2018-07-09 15:21:10 +02:00
Erick Erickson
c303c5f126
LUCENE-8370: Reproducing TestLucene{54,70}DocValuesFormat.testSortedSetVariableLengthBigVsStoredFields() failures
2018-06-28 18:28:37 -07:00
Alan Woodward
ab2fec1642
LUCENE-8237: Correct handling of position increments in sub-tokenstreams
2018-06-18 09:57:38 +01:00
Nhat Nguyen
8a6f1bf5ad
LUCENE-8165: Ban copyOf and copyOfRange.
...
These methods are lenient with out-of-bounds indices.
Signed-off-by: Adrien Grand <jpountz@gmail.com>
2018-06-07 10:08:21 +02:00
Michael Braun
78079fc552
Merge remote-tracking branch 'source/master' into remove-constructor-wrapper-classes
2018-06-05 18:48:55 -04:00
Simon Willnauer
59087d148a
[TEST] Ensure MDW.assertNoUnreferencedFilesOnClose is threadsafe
2018-06-04 17:33:18 +02:00
Simon Willnauer
fe83838ec3
LUCENE-8341: Record soft deletes in SegmentCommitInfo
...
This change add the number of documents that are soft deletes but
not hard deleted to the segment commit info. This is the last step
towards making soft deletes as powerful as hard deltes since now the
number of document can be read from commit points without opening a
full blown reader. This also allows merge posliies to make decisions
without requiring an NRT reader to get the relevant statistics. This
change doesn't enforce any field to be used as soft deletes and the statistic
is maintained per segment.
2018-06-04 15:05:12 +02:00
Simon Willnauer
e7a0a12926
LUCENE-8335: Enforce soft-deletes field up-front
...
Soft deletes field must be marked as such once it's introduced
and can't be changed after the fact.
Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co>
2018-06-04 08:28:38 +02:00
Michael Braun
fb6574100e
LUCENE-8345 - add wrapper class constructors to forbiddenapis
2018-06-03 15:40:50 -04:00
Simon Willnauer
3dc4fa199c
Revert "LUCENE-8335: Enforce soft-deletes field up-front."
...
This reverts commit a2d9276674
.
2018-06-02 13:47:24 +02:00
Simon Willnauer
a2d9276674
LUCENE-8335: Enforce soft-deletes field up-front.
...
Soft deletes field must be marked as such once it's introduced
and can't be changed after the fact.
2018-06-02 13:14:53 +02:00
Simon Willnauer
34741a863a
LUCENE-8330: Exclude MockRandomMP from basic tests
2018-05-29 16:58:03 +02:00
Simon Willnauer
c93f628317
LUCENE-8330: Detach IndexWriter from MergePolicy
...
This change introduces a new MergePolicy.MergeContext interface
that is easy to mock and cuts over all instances of IW to MergeContext.
Since IW now implements MergeContext the cut over is straight forward.
This reduces the exposed API available in MP dramatically and allows
efficient testing without relying on IW to improve the coverage and
testability of our MP implementations.
2018-05-25 07:37:09 +02:00
Simon Willnauer
70cfe46689
LUCENE-8320: Fix NPE in WindowsFS if target file exists but isn't open
2018-05-18 19:38:11 +02:00
Alan Woodward
b1ee23c525
LUCENE-8273: Fix end() and posInc handling
2018-05-18 13:11:39 +01:00
Simon Willnauer
42a79970d5
LUCENE-8320: Fix WindowsFS#rename with hardlinks
2018-05-18 09:33:50 +02:00
Simon Willnauer
3fe612bed2
LUCENE-8318: Ensure pending delete is not brought back on a try delete attempt
...
When renaming a file, `FSDirectory#rename` tries to delete the dest file
if it's in the pending deletes list. If that delete fails, it adds the
dest to the pending deletes list again. This causes the dest file to be
deleted later by `deletePendingFiles`.
2018-05-17 11:02:35 +02:00
Adrien Grand
6d69824a6b
LUCENE-8314: More checks on AssertingScorer.
2018-05-16 17:54:19 +02:00
Adrien Grand
9b9776a714
LUCENE-8313: Simplify SimScorer.
2018-05-16 17:53:56 +02:00
Simon Willnauer
585952797c
LUCENE-8310: Ensure IndexFileDeleter accounts for pending deletes
...
Today we fail creating the IndexWriter when the directory has a
pending delete. Yet, this is mainly done to prevent writing still
existing files more than once. IndexFileDeleter already accounts for
that for existing files which we can now use to also take pending
deletes into account which ensures that all file generations per segment
always go forward.
2018-05-16 11:17:43 +02:00
Adrien Grand
d764156f91
LUCENE-8303: Make the overflow test a Monster rather than Nightly.
2018-05-11 14:36:42 +02:00
Simon Willnauer
a3c86373e4
LUCENE-8298: Allow DocValues updates to reset a value
...
Today once a document has a value in a certain DV field this values
can only be changed but not removed. While resetting / removing a value
from a field is certainly a corner case it can be used to undelete a
soft-deleted document unless it's merged away.
This allows to rollback changes without rolling back to another commitpoint
or trashing all uncommitted changes. In certain cenarios it can be used to
"repair" history of documents in distributed systems.
2018-05-09 18:57:57 +02:00
Adrien Grand
8dc69428e3
LUCENE-8303: Make LiveDocsFormat only responsible for serialization/deserialization of live docs.
2018-05-09 15:40:14 +02:00
Dawid Weiss
85c00e77ef
LUCENE-8267: removed references to memory codecs.
2018-05-08 10:32:11 +02:00
Adrien Grand
67c13bbe2e
LUCENE-8142: Fix QueryUtils to only call getMaxScore when it is legal to do so.
2018-05-02 17:42:18 +02:00
Adrien Grand
46ecb73976
LUCENE-8142: Fix AssertingImpactsEnum and add missing javadoc.
2018-05-02 17:20:42 +02:00
Adrien Grand
af680af77f
LUCENE-8142: Make postings APIs expose raw impacts rather than scores.
2018-05-02 14:49:32 +02:00
Simon Willnauer
933d8a6995
LUCENE-8275: Fix BaseLockFactoryTestCase to step out on Windowns if pending files are found
...
The particular test here is #testStressLocks that has several protectesion against
WindowsFS and special logic in the catch clause that steps out on fatal exceptions with
pending deletes. Since we now check this consistently in the IW ctor we need to also
skip this entire test if we are on windows and have pending deletes.
2018-04-26 12:10:10 +02:00
Alan Woodward
e167e91247
LUCENE-8270: Remove MatchesIterator.term()
2018-04-23 16:51:17 +01:00
Simon Willnauer
6f0a884582
LUCENE-8269: Detach downstream classes from IndexWriter
...
IndexWriter today is shared with many classes like BufferedUpdateStream,
DocumentsWriter and DocumentsWriterPerThread. Some of them even acquire locks
on the writer instance or assert that the current thread doesn't hold a lock.
This makes it very difficult to have a manageable threading model.
This change separates out the IndexWriter from those classes and makes them all
independent of IW. IW now implements a new interface for DocumentsWriter to communicate
on failed or successful flushes and tragic events. This allows IW to make it's critical
methods private and execute all lock critical actions on it's private queue that ensures
that the IW lock is not held. Follow-up changes will try to detach more code like
publishing flushed segments to ensure we never call back into IW in an uncontrolled way.
2018-04-23 17:17:40 +02:00
Simon Willnauer
c70cceaee5
LUCENE-8253: Account for soft-deletes before they are flushed to disk
...
Inside the IndexWriter buffers are only written to disk if it's needed
or "worth it" which doesn't guarantee soft deletes to be accounted
in time. This is not necessarily a problem since they are eventually
collected and segments that have soft-deletes will me merged eventually
but for tests and on par behavior compared to hard deletes this behavior
is tricky.
This change cuts over to accounting in-place just like hard-deletes. This
results in accurate delete numbers for soft deletes at any give point in time
once the reader is loaded or a pending soft delete occurs.
This change also fixes an issue where all updates to a DV field are allowed
event if the field is unknown. Now this only works if the field is equal
to the soft deletes field. This behavior was never released.
2018-04-16 16:17:06 +02:00
Mike McCandless
7c0387ad3f
LUCENE-8248: MergePolicyWrapper is renamed to FilterMergePolicy and now also overrides getMaxCFSSegmentSizeMB
2018-04-13 15:45:19 -04:00
Alan Woodward
040a9601b1
LUCENE-8229: Add Weight.matches() to iterate over match positions
2018-04-11 09:43:27 +01:00
Alan Woodward
798d351034
LUCENE-8242: Deprecate createNormalizedWeight
2018-04-09 15:07:04 +01:00
Simon Willnauer
ed62b990d8
LUCENE-8237: Add a SoftDeletesDirectoryReaderWrapper
...
This adds support for soft deletes if the reader is opened form a directory.
Today we only support soft deletes for NRT readers, this change allows to wrap
existing DirectoryReader with a SoftDeletesDirectoryReaderWrapper to also filter
out soft deletes in the case of a non-NRT reader.
2018-04-09 11:50:38 +02:00
Simon Willnauer
ecc17f9023
LUCENE-8233: Add support for soft deletes to IndexWriter
...
This change adds support for soft deletes as a fully supported feature
by the index writer. Soft deletes are accounted for inside the index
writer and therefor also by merge policies.
This change also adds a SoftDeletesRetentionMergePolicy that allows
users to selectively carry over soft_deleted document across merges
for renention policies. The merge policy selects documents that should
be kept around in the merged segment based on a user provided query.
2018-04-04 13:45:14 +02:00
Robert Muir
e595541ef3
LUCENE-8192: always enforce index-time offsets are correct with BaseTokenStreamTestCase
2018-03-26 22:02:34 -04:00
Alan Woodward
fac84c01c8
LUCENE-8202: Add FixedShingleFilter
2018-03-21 13:45:03 +00:00
Simon Willnauer
2e35ef2b3d
LUCENE-8215: Fix several fragile exception handling places in o.a.l.index
...
Several places in the index package don't handle exceptions well or ignores them.
This change adds some utility methods and cuts over to make use of try/with blocks
to simplify exception handling.
2018-03-20 10:50:12 +01:00
Adrien Grand
3048e5da22
LUCENE-8008: Remove unintended changes.
2018-03-20 09:52:24 +01:00
Robert Muir
97299ed006
LUCENE-8191: if a tokenstream has broken offsets, its broken. IndexWriter always checks, so a separate whitelist can't work
2018-03-04 11:23:45 -05:00
Erick
ad7e94afb2
SOLR-12028: BadApple and AwaitsFix annotations usage
2018-03-03 21:42:14 -08:00
Uwe Schindler
7dba350c7a
SOLR-12028: Make initialization of constants dynamic (by reading the annotation), also add missing reproduce info
2018-02-28 00:47:00 +01:00
Erick Erickson
1fe45606b9
SOLR-12028: BadApple and AwaitsFix annotations usage
2018-02-26 20:35:12 -08:00
Adrien Grand
317a2e0c3d
LUCENE-8153: Make impacts checks lighter by default.
...
The new `-slow` switch makes checks more complete but also more heavy. This
option also cross-checks term vectors.
2018-02-20 17:14:11 +01:00
Adrien Grand
4fb7e3d02c
LUCENE-8135: Implement block-max WAND.
2018-02-15 15:13:58 +01:00
Alan Woodward
342e38217a
LUCENE-8163: BaseDirectoryTestCase produces random filenames that fail on Windows
2018-02-09 09:14:02 +00:00
Adrien Grand
f410df8113
LUCENE-4198: Give codecs the opportunity to index impacts.
2018-01-31 14:54:52 +01:00
Adrien Grand
75d50b4492
LUCENE-8116: Remove unnecessary IOException.
2018-01-11 11:49:36 +01:00
Adrien Grand
838c604b76
LUCENE-8119: Remove SimScorer.maxScore(float maxFreq).
2018-01-09 14:42:16 +01:00
Alan Woodward
d250a1463d
LUCENE-8133: Rename TermContext to TermStates, and load TermState lazily if term stats are not required
2018-01-05 14:17:15 +00:00
Adrien Grand
8fd7ead940
LUCENE-8116: SimScorer now only takes a frequency and a norm as per-document scoring factors.
2018-01-04 15:13:36 +01:00
Alan Woodward
c1030eeb74
LUCENE-8012: Explanation takes Number rather than float
2018-01-02 11:06:59 +00:00
Adrien Grand
b2f248164c
LUCENE-8010: Fix similarities so that they pass tests.
2017-12-29 10:06:00 +01:00
Steve Rowe
3e2f9e62d7
LUCENE-2899: Add OpenNLP Analysis capabilities as a module
2017-12-15 11:24:18 -05:00
Adrien Grand
d5c72eb588
LUCENE-8081: Remove unused import.
2017-12-08 08:45:18 +01:00
Simon Willnauer
ede46fe6e9
LUCENE-8081: Allow IndexWriter to opt out of flushing on indexing threads
...
Index/Update Threads try to help out flushing pending document buffers to
disk. This change adds an expert setting to opt ouf of this behavior unless
flusing is falling behind.
2017-12-07 16:22:52 +01:00
Adrien Grand
4fc5a872de
LUCENE-4100: Faster disjunctions when the hit count is not needed.
2017-12-07 10:49:39 +01:00
Adrien Grand
63b63c5734
LUCENE-8015: Fixed DFR similarities' scores to not decrease when tfn increases.
2017-12-06 18:19:57 +01:00
Adrien Grand
a8a63464e7
LUCENE-7996: Queries are now required to produce positive scores.
2017-12-06 14:06:03 +01:00
Simon Willnauer
01d12777c4
LUCENE-8068: Allow IndexWriter to write a single DWPT to disk
...
Adds a `flushNextBuffer` method to IndexWriter that allows the caller to
synchronously move the next pending or the biggest non-pending index buffer to
disk. This enables flushing selected buffer to disk without highjacking an
indexing thread. This is for instance useful if more than one IW (shards) must
be maintained in a single JVM / system.
2017-11-30 18:57:27 +01:00
Adrien Grand
d27ddcb409
LUCENE-8008: Reduce leniency in CheckHits.
2017-11-29 18:09:38 +01:00
David Smiley
64d95e6a6d
LUCENE-8049: IndexWriter.getMergingSegments() signature changed to return Set instead of Collection
2017-11-26 23:25:06 -05:00
Alan Woodward
183571c085
LUCENE-6278: Remove Scorer.freq()
2017-11-15 11:14:16 +00:00
Alan Woodward
276e317e94
LUCENE-8042: Add SegmentCachable interface
2017-11-10 12:17:50 +00:00
Alan Woodward
1aa049bb27
LUCENE-8014: Remove deprecated SimScorer methods
2017-11-10 09:43:18 +00:00
Alan Woodward
764abcb31a
Revert "LUCENE-8014: Remove deprecated SimScorer methods"
...
Reverting to fix test failures
This reverts commit 946ec9d5b9
.
2017-11-10 09:02:03 +00:00
Alan Woodward
946ec9d5b9
LUCENE-8014: Remove deprecated SimScorer methods
2017-11-09 14:05:34 +00:00
Alan Woodward
a886a001a4
LUCENE-8017: Add Weight.getCacheHelper()
2017-11-03 10:40:14 +00:00
Robert Muir
ca5f9b3457
LUCENE-8007: Make scoring statistics mandatory
2017-11-02 23:02:21 -04:00
Robert Muir
875d45ff14
LUCENE-8030: fix buggy assert
2017-10-31 22:30:33 -04:00
Robert Muir
e0bde57981
LUCENE-8020: don't force sim to score bogus terms (e.g. docfreq=0)
2017-10-30 20:32:12 -04:00
Robert Muir
489ca238c4
LUCENE-8021: Add AssertingSimilarity
2017-10-30 18:38:26 -04:00
Robert Muir
42717d5f4b
LUCENE-7997: More sanity testing of similarities
2017-10-24 22:48:04 -04:00
Mike McCandless
ea36f5040c
LUCENE-7999: upgrade int to long for tracking the counter for the next segment name to prevent overflow
2017-10-24 13:13:41 -04:00
Dawid Weiss
46cd679e91
LUCENE-7983: IndexWriter.IndexReaderWarmer is now a functional interface instead of an abstract class with a single method.
2017-10-04 10:59:16 +02:00
Nicholas Knize
bf71650ad7
LUCENE-7392: Add point based LatLonBoundingBox as new RangeField Type.
2017-09-19 14:45:04 -05:00
yonik
a4374e840d
SOLR-11173: implement Points support in TermsComponent via PointMerger
2017-08-19 18:02:11 -04:00
Adrien Grand
9c83d025e4
LUCENE-7897: IndexOrDocValuesQuery now requires the range cost to be more than 8x greater than the cost of the lead iterator in order to use doc values.
2017-08-10 12:10:44 +02:00
Adrien Grand
bc9c4144df
Remove unnecessary backward compatibility.
2017-07-03 14:17:09 +02:00
Alan Woodward
af1ee47f2b
LUCENE-7867: Remove deprecated Token class
2017-06-23 09:51:05 +01:00
Mike McCandless
58105a203a
LUCENE-7868: use multiple threads to concurrently resolve deletes and DV udpates
2017-06-21 13:47:15 -04:00
Adrien Grand
44d1f1fe3f
LUCENE-7872: TopDocs.totalHits is now a long.
2017-06-21 09:55:02 +02:00
David Smiley
abc393dbfd
LUCENE-7500: Remove LeafReader.fields in lieu of LeafReader.terms.
...
Optimized MultiFields.getTerms.
2017-06-15 00:59:01 -04:00
David Smiley
f470bbcbdc
LUCENE-7876 avoid leafReader.fields
2017-06-13 22:44:04 -04:00
Mike McCandless
d276acfbbc
LUCENE-7854: enable indexing custom term frequencies
2017-06-06 13:37:31 -04:00
Adrien Grand
14320a584c
LUCENE-7847: Fix the all-docs-match optimization of range queries on range fields.
2017-05-23 18:46:50 +02:00
Adrien Grand
06a6034d9b
LUCENE-7730: Better accuracy for the length normalization factor.
2017-05-18 16:27:31 +02:00
Dawid Weiss
e52d860994
LUCENE-7796: Make IOUtils.reThrow idiom declare Error return type so
...
callers may use it in a way that compiler knows subsequent code is
unreachable. reThrow is now deprecated in favor of IOUtils.rethrowAlways.
2017-04-28 12:17:21 +02:00
Adrien Grand
d3494c2cd6
LUCENE-7781: Call ensureOpen when registering closed listeners.
2017-04-13 08:41:47 +02:00
Adrien Grand
15a1561d43
LUCENE-7767: SortedDocValues.ordValue() now throws an IOException.
2017-04-11 10:02:51 +02:00
Adrien Grand
23b002a0fd
LUCENE-7756: Only record the major Lucene version that created the index, and record the minimum Lucene version that contributed to segments.
2017-04-04 09:57:16 +02:00
Adrien Grand
e80643e5a7
LUCENE-7753: Make fields static when possible.
2017-03-31 17:28:21 +02:00
Adrien Grand
53064e46dd
LUCENE-7754: Inner classes should be static whenever possible.
2017-03-28 15:35:04 +02:00
Nicholas Knize
d34d81f9af
LUCENE-7740: Refactor Range Fields to remove Field suffix (e.g., DoubleRange),
...
move InetAddressRange and InetAddressPoint from sandbox to misc module, and
refactor all other range fields from sandbox to core.
2017-03-13 02:22:29 -05:00
Adrien Grand
8ed2b764ed
LUCENE-6819: Remove index-time boosts.
2017-03-02 19:18:16 +01:00
Adrien Grand
540a237231
LUCENE-7410: Make TestReaderClosed pass if the IndexSearcher wraps a threadpool.
2017-03-01 19:12:14 +01:00
Adrien Grand
df6f830723
LUCENE-7410: Make cache keys and close listeners less trappy.
2017-02-28 14:46:45 +01:00
Ishan Chattopadhyaya
9a1702a8f5
SOLR-8029: Reverting the previous commit and the merge
2017-02-15 07:13:28 +05:30
Noble Paul
325cbf0018
Revert "Merge remote-tracking branch 'origin/master'"
...
This reverts commit 26298f35df
, reversing
changes made to 65c6c576b7
.
undoing the merge commit
2017-02-14 17:32:14 -08:00
Christine Poerschke
738da0998a
LUCENE-7693: Replace "org.apache." logic in GetMavenDependenciesTask. (Daniel Collins, Christine Poerschke)
2017-02-14 14:27:44 +00:00
Mike McCandless
f1c5cd5784
LUCENE-7690: also handle expected CorruptIndexException in this test
2017-02-13 10:29:56 -05:00
Mike McCandless
a3bfc3e67c
LUCENE-7690: also handle expected CorruptIndexException in this test
2017-02-11 13:55:09 -05:00
Mike McCandless
be007a6c11
LUCENE-7662: throw CorruptIndexException if index files are missing
2017-02-10 20:20:53 -05:00
Steve Rowe
6c1a4b673a
SOLR-6246: SolrSuggester.build() now throws SolrCoreState.CoreIsClosedException when a core reload/shutdown happens; add a random test lookup dictionary with configurable size; add {Analyzing,Blended}InfixSuggester reload/build tests; add a wrapped-exception expectThrows() variant to LuceneTestCase
2017-02-03 18:25:38 -05:00
Mike McCandless
72eaeab715
LUCENE-7668: add new test case; remove dead code; improve CannedTokenStream to copy all Token attributes
2017-01-31 11:56:07 -05:00
Adrien Grand
cf943c5454
LUCENE-7656: Implement geo box/distance queries using doc values.
2017-01-26 13:59:35 +01:00
Adrien Grand
94530940e4
LUCENE-7647: CompressingStoredFieldsFormat should reclaim memory more aggressively.
2017-01-25 16:15:04 +01:00
Alan Woodward
94e3460305
Revert "LUCENE-7628: Scorer.getChildren() returns only matching Scorers"
...
This reverts commit 5bdc492c9c
.
2017-01-23 11:30:03 +00:00
Alan Woodward
5bdc492c9c
LUCENE-7628: Scorer.getChildren() returns only matching Scorers
2017-01-23 09:35:16 +00:00
Mike McCandless
637915b890
LUCENE-7619: add WordDelimiterGraphFilter (replacing WordDelimiterFilter) to produce a correct token stream graph when splitting words
2017-01-17 10:38:07 -05:00
Jim Ferenczi
1acd2ee2bb
Various fixes and updates for index sorting on flush
...
* IndexWriter.validateIndexSort now throws a CorruptIndexException if a segment created by version >= 6.5.0 is not sorted (already applied in branch_6x)
* Removes unneeded check in AssertingLiveDocsFormat (already applied in branch_6x)
* Removes try/finally block when stored fields consumer finishes (already applied in branch_6x).
2017-01-17 14:22:47 +01:00
Adrien Grand
86233cb95d
LUCENE-7055: Add ScorerProvider to get an estimation of the cost of scorers before building them.
2017-01-17 08:51:58 +01:00
Alan Woodward
8fa0a8dd1e
LUCENE-7627: Add #intersect(CompiledAutomaton) to Sorted*DocValues
2017-01-14 09:40:19 +00:00
Mike McCandless
64b86331c2
LUCENE-7626: IndexWriter no longer accepts broken offsets
2017-01-13 17:46:02 -05:00
Mike McCandless
f985fcaa23
promote this test case to core
2017-01-08 06:38:37 -05:00
Mike McCandless
e64111c654
TokenStreamToAutomaton failed to handle certain holes correctly
2017-01-08 06:26:08 -05:00
Uwe Schindler
db9190db93
LUCENE-7595: Fix bug with RamUsageTester incorrectly handling Iterables outside Java Runtime
2016-12-29 20:31:47 +01:00
Uwe Schindler
f29d2b5668
LUCENE-7595: Improve RAMUsageTester in test-framework to estimate memory usage of runtime classes and work with Java 9 EA (b148+). Disable static field heap usage checker in LuceneTestCase
2016-12-29 01:56:23 +01:00
Adrien Grand
ba47f530d1
LUCENE-7401: Make sure BKD trees index all dimensions.
2016-12-26 10:37:16 +01:00
Mike McCandless
c0467bb929
LUCENE-6664: add SynonymGraphFilter for correct multi-token synonym handling
2016-12-22 15:39:17 -05:00
Uwe Schindler
d5e87898b1
LUCENE-6989: Fix some tests that hardcode MMapDirectory (and also the FSDirectory randomizer), to only use MMapDirectory on Windows, if it supports unmapping. Otherwise tests will fail.
2016-12-21 23:07:50 +01:00
Mike McCandless
4ccb9fbd2b
LUCENE-7579: sort segments at flush too
2016-12-20 06:45:06 -05:00
Mike McCandless
268d4ace36
remove bad assertion
2016-12-15 09:30:25 -05:00
Adrien Grand
be74f4f7a7
LUCENE-7569: Fix index sorting test failures.
2016-11-24 14:10:19 +01:00
Mike McCandless
24b5a624bb
LUCENE-7652: improve BasePostingsFormatTestCase with a level 2 ghost test
2016-11-16 05:19:05 -05:00
Mike McCandless
6c3c6bc379
LUCENE-7537: Index time sorting now supports multi-valued sorts using selectors (MIN, MAX, etc.)
2016-11-15 16:22:51 -05:00
Mike McCandless
c0f9bdf3c3
don't use BooleanSimilarity in RandomSimilarity
2016-11-13 18:41:18 -05:00
Adrien Grand
c4c5e868d2
LUCENE-7531: Removed packing support from FST.
2016-11-10 15:01:49 +01:00
Adrien Grand
6b9f11311a
LUCENE-7524: More detailed explanation of idf.
2016-11-10 15:01:49 +01:00
Adrien Grand
3e15233b23
LUCENE-5867: Add a BooleanSimilarity.
2016-11-10 15:01:49 +01:00
Adrien Grand
af60048097
LUCENE-7429: AnalyzerWrapper can now wrap the normalization analysis chain too.
2016-10-27 16:27:45 +02:00
Adrien Grand
9aca4c9d56
LUCENE-7462: Give doc values APIs an `advanceExact` method.
2016-10-24 10:51:23 +02:00
Dawid Weiss
bc0116af69
LUCENE-7515: RunListenerPrintReproduceInfo may try to access static rule fields without
...
the rule being called. This flag is needed to ensure this isn't the case.
2016-10-21 11:19:12 +02:00
Dawid Weiss
a19ec194d2
LUCENE-7513: Update to randomizedtesting 2.4.0.
2016-10-21 11:14:37 +02:00
Adrien Grand
c22725f0b5
LUCENE-7494: Give points a per-field API.
2016-10-18 10:14:27 +02:00
Adrien Grand
ea1212232d
LUCENE-7495: Fix doc values iterators' assertions in advance().
2016-10-14 14:38:32 +02:00
Mike McCandless
1b7a88f61e
LUCENE-7491: fix merge exception if the same field has points in some segments but not in others
2016-10-12 09:00:26 -04:00
yonik
98191225eb
SOLR-9579: SchemaField should implement lucene.IndexableFieldType to avoid repeated creation
2016-10-11 11:17:55 -04:00
Adrien Grand
63ef459022
LUCENE-7487: Remove unnecessary synchronization from Lucene70NormsProducer.
2016-10-11 08:31:29 +02:00
Mike McCandless
4fe3110e49
automaton initial state is always 0
2016-10-10 13:43:46 -04:00
Adrien Grand
fb088817eb
LUCENE-7479: SimpleText should directly implement the new doc values APIs.
2016-10-07 10:59:11 +02:00
Adrien Grand
9128bdbaf5
LUCENE-7475: Make norms sparse.
2016-10-06 14:07:27 +02:00
Mike McCandless
001a3ca55b
LUCENE-7407: speed up iterating norms a bit by having default codec implement the iterator directly
2016-10-05 14:18:55 -04:00
Adrien Grand
32446e9205
LUCENE-7453: Create a Lucene70Codec.
2016-10-04 10:50:34 +02:00
Noble Paul
e8767daa7b
SOLR-8995: use lambdas for CoreCloseListener
2016-09-29 11:33:22 +05:30
Mike McCandless
24ec3bd085
fix test bug
2016-09-23 06:13:26 -04:00
Mike McCandless
f7aa200d40
LUCENE-7407: switch doc values usage to an iterator API, based on DocIdSetIterator, instead of random acces, freeing codecs for future improvements
2016-09-21 09:41:41 -04:00
Mike McCandless
c7fb49d7b5
LUCENE-7439: FuzzyQuery now matches all terms within the specified edit distance, even if they are short
2016-09-15 14:44:26 -04:00
Robert Muir
f6253d5ed1
LUCENE-7426: remove deprecated DataInput/Output map/string methods
2016-08-25 12:02:52 -04:00
Mike McCandless
f0963b3b4f
add test case to ensure SearchManager can handle concurrent close, refresh, searching and IndexWriter closing
2016-08-19 09:28:17 -04:00
Robert Muir
18373e1129
LUCENE-7414: TestUtil.checkIterator can make tests very slow
2016-08-16 16:12:55 -04:00
Mike McCandless
36d67aaba9
use method reference instead of lambda in these tests
2016-08-09 08:07:07 -04:00
Mike McCandless
04086fbfc4
LUCENE-7409: improve MockDirectoryWrapper's IndexInput to detect if a clone is being used after its parent was closed
2016-08-09 05:03:29 -04:00
Mike McCandless
1aecdd28d1
LUCENE-7390: revert this change, since it's obsoleted by the much better LUCENE-7396
2016-07-29 14:02:00 -04:00
Adrien Grand
bc29a3d7ed
Add missing update of `lastDocId` in AssertingPointsFormat.
2016-07-28 15:27:14 +02:00
Mike McCandless
3a0a9fd2c2
LUCENE-7390: improve points indexing performance by letting the codec use transient heap in proportion to IndexWriter's indexing buffer, by default
2016-07-25 11:33:34 -04:00
Steve Rowe
51d4af6859
LUCENE-7013: add licence header position checker to -validate-source-patterns, and fix the violations it found
2016-07-13 13:21:36 -04:00
Christine Poerschke
15e8719b8a
LUCENE-7372: Factor out an org.apache.lucene.search.FilterWeight class.
2016-07-13 16:04:24 +01:00
Adrien Grand
866398bea6
LUCENE-7371: Better compression of values in Lucene60PointsFormat.
2016-07-12 17:57:56 +02:00
Adrien Grand
e92a38af90
LUCENE-7355: Add Analyzer#normalize() and use it in query parsers.
2016-07-12 16:33:07 +02:00
Adrien Grand
5def78ba10
LUCENE-7368: Remove queryNorm.
2016-07-12 14:38:36 +02:00
Mike McCandless
df9efb8b6d
LUCENE-7373: deprecate Directory.renameFile, which both renamed and fsync'd the directory, replacing it with separate rename and syncMetaData methods
2016-07-11 11:15:46 -04:00
Adrien Grand
f1528bf338
LUCENE-7369: Similarity.coord and BooleanQuery.disableCoord are removed.
2016-07-07 10:02:29 +02:00
Steve Rowe
17d113dac1
LUCENE-2605: Add classic QueryParser option setSplitOnWhitespace() to control whether to split on whitespace prior to text analysis. Default behavior remains unchanged: split-on-whitespace=true.
2016-07-05 18:03:42 -04:00
Adrien Grand
1e4d51f408
LUCENE-7352: Fix CheckHits for DisjunctionMax queries that generate negative scores.
2016-06-23 19:03:39 +02:00
Karl Wright
b11e48c755
LUCENE-7194: Ban Math.toRadians and Math.toDegrees
2016-06-22 03:53:47 -04:00
Mike McCandless
b338a0c0f0
LUCENE-7345: MDW now disables VirusCheckingFS while trying to corrupt files
2016-06-21 04:50:34 -04:00
Mike McCandless
c311eb1b7e
LUCENE-7345: RAMDirectory now enforces write-once as well
2016-06-20 10:03:19 -04:00
Mike McCandless
4c630b2216
LUCENE-7345: make sure MDW's efforts to corrumpt un-sync'd files is successful
2016-06-20 09:17:44 -04:00
Mike McCandless
87016b5f0c
LUCENE-7318: graduate StandardAnalyzer and make it the default for IndexWriterConfig
2016-06-14 16:38:04 -04:00
Mike McCandless
a893c64595
LUCENE-6171: pass the StandardOpenOption.CREATE_NEW so the filesystem ensures Lucene really is write-once
2016-06-14 15:15:57 -04:00
Christine Poerschke
95c7e6d716
LUCENE-7319: remove one more unused import
2016-06-13 11:11:02 +01:00
Christine Poerschke
52f5c50246
LUCENE-7319: remove unused imports
2016-06-13 09:59:14 +01:00
Uwe Schindler
cc7cede7a5
LUCENE-7333: Fix test bug where randomSimpleString() generated a filename that is a reserved device name on Windows
2016-06-11 11:31:12 +02:00
Mike McCandless
816fae9622
LUCENE-7326: don't use postings format by name in this test
2016-06-10 11:57:41 -04:00
Mike McCandless
067fb25e43
LUCENE-7323: compound file writing now verifies checksum and segment ID for the incoming sub-files, to catch hardware issues or filesystem bugs earlier
2016-06-09 14:48:58 -04:00
Mike McCandless
c8570ed821
LUCENE-7132: BooleanQuery sometimes assigned the wrong score when ranges of documents had only one clause matching while other ranges had more than one clause matchng
2016-06-06 10:35:16 -04:00
Mike McCandless
817c7f069a
sequence numbers: merge master
2016-05-26 15:30:12 -04:00