1617 Commits

Author SHA1 Message Date
Jim Ferenczi 49e3cca77f LUCENE-8204: Boolean queries with a mix of required and optional clauses are now faster if the total hit count is not required 2018-08-08 15:49:58 +02:00
Adrien Grand e56c8722ce Revert "Make the nightly test smaller so that it does not fail with GC overhead exceeded (OOM). Clean up random number fetching to make it shorter."
This reverts commit 3203e99d8f.
2018-08-01 15:44:57 +02:00
Adrien Grand 86a39fa29f Revert "Fix AAIOOBE in GeoTestUtil."
This reverts commit c3e813188e.
2018-08-01 15:44:47 +02:00
Adrien Grand c3e813188e Fix AAIOOBE in GeoTestUtil. 2018-08-01 15:17:53 +02:00
Dawid Weiss 3203e99d8f Make the nightly test smaller so that it does not fail with GC overhead exceeded (OOM). Clean up random number fetching to make it shorter. 2018-08-01 14:05:02 +02:00
Adrien Grand 99dbe93681 LUCENE-8060: IndexSearcher's search and searchAfter methods now only compute total hit counts accurately up to 1,000. 2018-08-01 09:01:21 +02:00
Steve Rowe a08eadb480 Fix InfixSuggestersTest.testShutdownDuringBuild() failures 2018-07-30 22:49:49 -04:00
Adrien Grand 61e89e3ca0 LUCENE-8431: Top-docs collectors now collect lower bounds of the hit count. 2018-07-30 16:38:05 +02:00
Adrien Grand 9ca053712a LUCENE-8430: TopDocs.totalHits may now be a lower bound of the hit count. 2018-07-30 16:38:05 +02:00
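The LUCENE-8060, LUCENE-8431 and LUCENE-8430 entries above describe hit counts that are exact only up to a threshold and are otherwise reported as a lower bound. A minimal sketch of that idea follows. The class, enum, and method names are hypothetical and chosen for illustration; they are not taken from the Lucene sources, and the array of matching documents stands in for a real collector.

```java
public class HitCountSketch {
    /** Whether the reported value is exact or only a lower bound. */
    public enum Relation { EQUAL_TO, GREATER_THAN_OR_EQUAL_TO }

    /** Illustrative holder for a hit count and how to interpret it. */
    public static final class TotalHits {
        public final long value;
        public final Relation relation;

        public TotalHits(long value, Relation relation) {
            this.value = value;
            this.relation = relation;
        }
    }

    /**
     * Counts matches exactly until `threshold` hits have been seen. If at
     * least one more match exists, counting stops and `threshold` is
     * returned as a lower bound instead of an exact count.
     */
    public static TotalHits countUpTo(int[] matchingDocs, int threshold) {
        for (int i = 0; i < matchingDocs.length; i++) {
            if (i == threshold) {
                // `threshold` docs already counted and another one exists.
                return new TotalHits(threshold, Relation.GREATER_THAN_OR_EQUAL_TO);
            }
        }
        return new TotalHits(matchingDocs.length, Relation.EQUAL_TO);
    }

    public static void main(String[] args) {
        TotalHits few = countUpTo(new int[] {1, 4, 9}, 1000);
        TotalHits many = countUpTo(new int[5000], 1000);
        System.out.println(few.value + " " + few.relation);
        System.out.println(many.value + " " + many.relation);
    }
}
```

Callers that need an exact count would check the relation before treating the value as exact.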
Dawid Weiss d25f62634b LUCENE-8415: test quirk follow up. MockDirectoryWriter uses AccessDeniedException (a subclass of IOException) to signal files still open for writing when read access is requested. 2018-07-25 11:34:31 +02:00
Dawid Weiss 8892c0d9af LUCENE-8415: Clean up Directory contracts (write-once, no reads-before-write-completed). Minor test improvements and cleanups. 2018-07-24 08:47:50 +02:00
Jason Gerlowski 6ed9607f74 SOLR-12555: Add add'l expectThrows() test helper 2018-07-23 20:37:04 -04:00
Alan Woodward 028c86b1fa LUCENE-8306: Allow iteration over submatches
Also includes LUCENE-8404, adding match iteration to SpanQuery
2018-07-23 10:02:01 +01:00
Alan Woodward 6e3f61f6f9 Revert "LUCENE-8306: Allow iteration over submatches"
Incorrect patch committed in error

This reverts commit a8839b7eab.
2018-07-22 22:36:46 +01:00
Alan Woodward a8839b7eab LUCENE-8306: Allow iteration over submatches 2018-07-22 21:42:46 +01:00
Adrien Grand 331ccf3910 LUCENE-8405: Remove TopDocs.maxScore. 2018-07-18 08:38:57 +02:00
Adrien Grand 8093c450c1 LUCENE-8263: Replace TieredMergePolicy's reclaimDeletesWeight with deletesPctAllowed. 2018-07-17 18:31:06 +02:00
Adrien Grand d730c8b214 LUCENE-8060: Remove usage of TopDocs#totalHits that should really be IndexSearcher#count.
Many tests were written before we introduced IndexSearcher#count and used
`searcher.search(query, 1).totalHits` to get the number of matches of a query
rather than `searcher.count(query)`.
2018-07-17 14:32:02 +02:00
Michael Braun f0e1864ceb Merge remote-tracking branch 'source/master' into remove-constructor-wrapper-classes 2018-07-14 13:39:37 -04:00
Nicholas Knize b5ef13330f LUCENE-8396: Add Points Based Shape Indexing and Search that decomposes shapes into a triangular mesh and indexes individual triangles as a 6 dimension point 2018-07-14 11:28:37 -05:00
Adrien Grand b1bb11b79d LUCENE-8391: More tests for merge policies. 2018-07-10 09:17:34 +02:00
Adrien Grand 41ddac5b44 LUCENE-8385: Fix computation of the allowed segment count in TieredMergePolicy. 2018-07-09 15:21:10 +02:00
Erick Erickson c303c5f126 LUCENE-8370: Reproducing TestLucene{54,70}DocValuesFormat.testSortedSetVariableLengthBigVsStoredFields() failures 2018-06-28 18:28:37 -07:00
Alan Woodward ab2fec1642 LUCENE-8237: Correct handling of position increments in sub-tokenstreams 2018-06-18 09:57:38 +01:00
Nhat Nguyen 8a6f1bf5ad LUCENE-8165: Ban copyOf and copyOfRange.
These methods are lenient with out-of-bounds indices.

Signed-off-by: Adrien Grand <jpountz@gmail.com>
2018-06-07 10:08:21 +02:00
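The leniency mentioned in LUCENE-8165 can be seen with the JDK alone: `Arrays.copyOfRange` accepts an end index past the end of the source array and pads the result with zeros. The sketch below contrasts it with a strict copy built on `System.arraycopy`. The helper name `strictCopyOfRange` is hypothetical and is not the replacement used by the commit.

```java
import java.util.Arrays;

public class StrictCopySketch {
    /**
     * Hypothetical strict variant: copies src[from, to) and fails if the
     * range is not fully inside the source array.
     */
    public static int[] strictCopyOfRange(int[] src, int from, int to) {
        int[] copy = new int[to - from];
        // System.arraycopy throws IndexOutOfBoundsException for bad ranges.
        System.arraycopy(src, from, copy, 0, to - from);
        return copy;
    }

    public static void main(String[] args) {
        int[] src = {1, 2, 3};

        // Lenient: the end index 5 is past the end, the copy is zero-padded.
        System.out.println(Arrays.toString(Arrays.copyOfRange(src, 1, 5)));

        // Strict: the same range is rejected.
        try {
            strictCopyOfRange(src, 1, 5);
        } catch (IndexOutOfBoundsException e) {
            System.out.println("rejected out-of-bounds range");
        }
    }
}
```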
Michael Braun 78079fc552 Merge remote-tracking branch 'source/master' into remove-constructor-wrapper-classes 2018-06-05 18:48:55 -04:00
Simon Willnauer 59087d148a [TEST] Ensure MDW.assertNoUnreferencedFilesOnClose is threadsafe 2018-06-04 17:33:18 +02:00
Simon Willnauer fe83838ec3 LUCENE-8341: Record soft deletes in SegmentCommitInfo
This change adds the number of documents that are soft deleted but
not hard deleted to the segment commit info. This is the last step
towards making soft deletes as powerful as hard deletes since now the
number of documents can be read from commit points without opening a
full-blown reader. This also allows merge policies to make decisions
without requiring an NRT reader to get the relevant statistics. This
change doesn't enforce any field to be used as soft deletes and the statistic
is maintained per segment.
2018-06-04 15:05:12 +02:00
Simon Willnauer e7a0a12926 LUCENE-8335: Enforce soft-deletes field up-front
Soft deletes field must be marked as such once it's introduced
and can't be changed after the fact.

Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co>
2018-06-04 08:28:38 +02:00
Michael Braun fb6574100e LUCENE-8345 - add wrapper class constructors to forbiddenapis 2018-06-03 15:40:50 -04:00
Simon Willnauer 3dc4fa199c Revert "LUCENE-8335: Enforce soft-deletes field up-front."
This reverts commit a2d9276674.
2018-06-02 13:47:24 +02:00
Simon Willnauer a2d9276674 LUCENE-8335: Enforce soft-deletes field up-front.
Soft deletes field must be marked as such once it's introduced
and can't be changed after the fact.
2018-06-02 13:14:53 +02:00
Simon Willnauer 34741a863a LUCENE-8330: Exclude MockRandomMP from basic tests 2018-05-29 16:58:03 +02:00
Simon Willnauer c93f628317 LUCENE-8330: Detach IndexWriter from MergePolicy
This change introduces a new MergePolicy.MergeContext interface
that is easy to mock and cuts over all instances of IW to MergeContext.
Since IW now implements MergeContext the cut over is straightforward.
This reduces the exposed API available in MP dramatically and allows
efficient testing without relying on IW to improve the coverage and
testability of our MP implementations.
2018-05-25 07:37:09 +02:00
Simon Willnauer 70cfe46689 LUCENE-8320: Fix NPE in WindowsFS if target file exists but isn't open 2018-05-18 19:38:11 +02:00
Alan Woodward b1ee23c525 LUCENE-8273: Fix end() and posInc handling 2018-05-18 13:11:39 +01:00
Simon Willnauer 42a79970d5 LUCENE-8320: Fix WindowsFS#rename with hardlinks 2018-05-18 09:33:50 +02:00
Simon Willnauer 3fe612bed2 LUCENE-8318: Ensure pending delete is not brought back on a try delete attempt
When renaming a file, `FSDirectory#rename` tries to delete the dest file
if it's in the pending deletes list. If that delete fails, it adds the
dest to the pending deletes list again. This causes the dest file to be
deleted later by `deletePendingFiles`.
2018-05-17 11:02:35 +02:00
Adrien Grand 6d69824a6b LUCENE-8314: More checks on AssertingScorer. 2018-05-16 17:54:19 +02:00
Adrien Grand 9b9776a714 LUCENE-8313: Simplify SimScorer. 2018-05-16 17:53:56 +02:00
Simon Willnauer 585952797c LUCENE-8310: Ensure IndexFileDeleter accounts for pending deletes
Today we fail creating the IndexWriter when the directory has a
pending delete. Yet, this is mainly done to prevent writing still
existing files more than once. IndexFileDeleter already accounts for
that for existing files which we can now use to also take pending
deletes into account which ensures that all file generations per segment
always go forward.
2018-05-16 11:17:43 +02:00
Adrien Grand d764156f91 LUCENE-8303: Make the overflow test a Monster rather than Nightly. 2018-05-11 14:36:42 +02:00
Simon Willnauer a3c86373e4 LUCENE-8298: Allow DocValues updates to reset a value
Today once a document has a value in a certain DV field this value
can only be changed but not removed. While resetting / removing a value
from a field is certainly a corner case it can be used to undelete a
soft-deleted document unless it's merged away.
This allows rolling back changes without rolling back to another commit point
or trashing all uncommitted changes. In certain scenarios it can be used to
"repair" history of documents in distributed systems.
2018-05-09 18:57:57 +02:00
Adrien Grand 8dc69428e3 LUCENE-8303: Make LiveDocsFormat only responsible for serialization/deserialization of live docs. 2018-05-09 15:40:14 +02:00
Dawid Weiss 85c00e77ef LUCENE-8267: removed references to memory codecs. 2018-05-08 10:32:11 +02:00
Adrien Grand 67c13bbe2e LUCENE-8142: Fix QueryUtils to only call getMaxScore when it is legal to do so. 2018-05-02 17:42:18 +02:00
Adrien Grand 46ecb73976 LUCENE-8142: Fix AssertingImpactsEnum and add missing javadoc. 2018-05-02 17:20:42 +02:00
Adrien Grand af680af77f LUCENE-8142: Make postings APIs expose raw impacts rather than scores. 2018-05-02 14:49:32 +02:00
Simon Willnauer 933d8a6995 LUCENE-8275: Fix BaseLockFactoryTestCase to step out on Windows if pending files are found
The particular test here is #testStressLocks that has several protections against
WindowsFS and special logic in the catch clause that steps out on fatal exceptions with
pending deletes. Since we now check this consistently in the IW ctor we need to also
skip this entire test if we are on Windows and have pending deletes.
2018-04-26 12:10:10 +02:00
Alan Woodward e167e91247 LUCENE-8270: Remove MatchesIterator.term() 2018-04-23 16:51:17 +01:00
Simon Willnauer 6f0a884582 LUCENE-8269: Detach downstream classes from IndexWriter
IndexWriter today is shared with many classes like BufferedUpdateStream,
DocumentsWriter and DocumentsWriterPerThread. Some of them even acquire locks
on the writer instance or assert that the current thread doesn't hold a lock.
This makes it very difficult to have a manageable threading model.

This change separates out the IndexWriter from those classes and makes them all
independent of IW. IW now implements a new interface for DocumentsWriter to communicate
on failed or successful flushes and tragic events. This allows IW to make its critical
methods private and execute all lock critical actions on its private queue that ensures
that the IW lock is not held. Follow-up changes will try to detach more code like
publishing flushed segments to ensure we never call back into IW in an uncontrolled way.
2018-04-23 17:17:40 +02:00
Simon Willnauer c70cceaee5 LUCENE-8253: Account for soft-deletes before they are flushed to disk
Inside the IndexWriter buffers are only written to disk if it's needed
or "worth it" which doesn't guarantee soft deletes to be accounted
in time. This is not necessarily a problem since they are eventually
collected and segments that have soft-deletes will be merged eventually
but for tests and on par behavior compared to hard deletes this behavior
is tricky.
This change cuts over to accounting in-place just like hard-deletes. This
results in accurate delete numbers for soft deletes at any given point in time
once the reader is loaded or a pending soft delete occurs.

This change also fixes an issue where all updates to a DV field are allowed
even if the field is unknown. Now this only works if the field is equal
to the soft deletes field. This behavior was never released.
2018-04-16 16:17:06 +02:00
Mike McCandless 7c0387ad3f LUCENE-8248: MergePolicyWrapper is renamed to FilterMergePolicy and now also overrides getMaxCFSSegmentSizeMB 2018-04-13 15:45:19 -04:00
Alan Woodward 040a9601b1 LUCENE-8229: Add Weight.matches() to iterate over match positions 2018-04-11 09:43:27 +01:00
Alan Woodward 798d351034 LUCENE-8242: Deprecate createNormalizedWeight 2018-04-09 15:07:04 +01:00
Simon Willnauer ed62b990d8 LUCENE-8237: Add a SoftDeletesDirectoryReaderWrapper
This adds support for soft deletes if the reader is opened from a directory.
Today we only support soft deletes for NRT readers, this change allows wrapping
an existing DirectoryReader with a SoftDeletesDirectoryReaderWrapper to also filter
out soft deletes in the case of a non-NRT reader.
2018-04-09 11:50:38 +02:00
Simon Willnauer ecc17f9023 LUCENE-8233: Add support for soft deletes to IndexWriter
This change adds support for soft deletes as a fully supported feature
by the index writer. Soft deletes are accounted for inside the index
writer and therefore also by merge policies.

This change also adds a SoftDeletesRetentionMergePolicy that allows
users to selectively carry over soft-deleted documents across merges
for retention policies. The merge policy selects documents that should
be kept around in the merged segment based on a user provided query.
2018-04-04 13:45:14 +02:00
Robert Muir e595541ef3 LUCENE-8192: always enforce index-time offsets are correct with BaseTokenStreamTestCase 2018-03-26 22:02:34 -04:00
Alan Woodward fac84c01c8 LUCENE-8202: Add FixedShingleFilter 2018-03-21 13:45:03 +00:00
Simon Willnauer 2e35ef2b3d LUCENE-8215: Fix several fragile exception handling places in o.a.l.index
Several places in the index package don't handle exceptions well or ignore them.
This change adds some utility methods and cuts over to make use of try/with blocks
to simplify exception handling.
2018-03-20 10:50:12 +01:00
Adrien Grand 3048e5da22 LUCENE-8008: Remove unintended changes. 2018-03-20 09:52:24 +01:00
Robert Muir 97299ed006 LUCENE-8191: if a tokenstream has broken offsets, it's broken. IndexWriter always checks, so a separate whitelist can't work 2018-03-04 11:23:45 -05:00
Erick ad7e94afb2 SOLR-12028: BadApple and AwaitsFix annotations usage 2018-03-03 21:42:14 -08:00
Uwe Schindler 7dba350c7a SOLR-12028: Make initialization of constants dynamic (by reading the annotation), also add missing reproduce info 2018-02-28 00:47:00 +01:00
Erick Erickson 1fe45606b9 SOLR-12028: BadApple and AwaitsFix annotations usage 2018-02-26 20:35:12 -08:00
Adrien Grand 317a2e0c3d LUCENE-8153: Make impacts checks lighter by default.
The new `-slow` switch makes checks more complete but also more heavy. This
option also cross-checks term vectors.
2018-02-20 17:14:11 +01:00
Adrien Grand 4fb7e3d02c LUCENE-8135: Implement block-max WAND. 2018-02-15 15:13:58 +01:00
Alan Woodward 342e38217a LUCENE-8163: BaseDirectoryTestCase produces random filenames that fail on Windows 2018-02-09 09:14:02 +00:00
Adrien Grand f410df8113 LUCENE-4198: Give codecs the opportunity to index impacts. 2018-01-31 14:54:52 +01:00
Adrien Grand 75d50b4492 LUCENE-8116: Remove unnecessary IOException. 2018-01-11 11:49:36 +01:00
Adrien Grand 838c604b76 LUCENE-8119: Remove SimScorer.maxScore(float maxFreq). 2018-01-09 14:42:16 +01:00
Alan Woodward d250a1463d LUCENE-8133: Rename TermContext to TermStates, and load TermState lazily if term stats are not required 2018-01-05 14:17:15 +00:00
Adrien Grand 8fd7ead940 LUCENE-8116: SimScorer now only takes a frequency and a norm as per-document scoring factors. 2018-01-04 15:13:36 +01:00
Alan Woodward c1030eeb74 LUCENE-8012: Explanation takes Number rather than float 2018-01-02 11:06:59 +00:00
Adrien Grand b2f248164c LUCENE-8010: Fix similarities so that they pass tests. 2017-12-29 10:06:00 +01:00
Steve Rowe 3e2f9e62d7 LUCENE-2899: Add OpenNLP Analysis capabilities as a module 2017-12-15 11:24:18 -05:00
Adrien Grand d5c72eb588 LUCENE-8081: Remove unused import. 2017-12-08 08:45:18 +01:00
Simon Willnauer ede46fe6e9 LUCENE-8081: Allow IndexWriter to opt out of flushing on indexing threads
Index/Update Threads try to help out flushing pending document buffers to
disk. This change adds an expert setting to opt out of this behavior unless
flushing is falling behind.
2017-12-07 16:22:52 +01:00
Adrien Grand 4fc5a872de LUCENE-4100: Faster disjunctions when the hit count is not needed. 2017-12-07 10:49:39 +01:00
Adrien Grand 63b63c5734 LUCENE-8015: Fixed DFR similarities' scores to not decrease when tfn increases. 2017-12-06 18:19:57 +01:00
Adrien Grand a8a63464e7 LUCENE-7996: Queries are now required to produce positive scores. 2017-12-06 14:06:03 +01:00
Simon Willnauer 01d12777c4 LUCENE-8068: Allow IndexWriter to write a single DWPT to disk
Adds a `flushNextBuffer` method to IndexWriter that allows the caller to
synchronously move the next pending or the biggest non-pending index buffer to
disk. This enables flushing a selected buffer to disk without hijacking an
indexing thread. This is for instance useful if more than one IW (shards) must
be maintained in a single JVM / system.
2017-11-30 18:57:27 +01:00
Adrien Grand d27ddcb409 LUCENE-8008: Reduce leniency in CheckHits. 2017-11-29 18:09:38 +01:00
David Smiley 64d95e6a6d LUCENE-8049: IndexWriter.getMergingSegments() signature changed to return Set instead of Collection 2017-11-26 23:25:06 -05:00
Alan Woodward 183571c085 LUCENE-6278: Remove Scorer.freq() 2017-11-15 11:14:16 +00:00
Alan Woodward 276e317e94 LUCENE-8042: Add SegmentCachable interface 2017-11-10 12:17:50 +00:00
Alan Woodward 1aa049bb27 LUCENE-8014: Remove deprecated SimScorer methods 2017-11-10 09:43:18 +00:00
Alan Woodward 764abcb31a Revert "LUCENE-8014: Remove deprecated SimScorer methods"
Reverting to fix test failures

This reverts commit 946ec9d5b9.
2017-11-10 09:02:03 +00:00
Alan Woodward 946ec9d5b9 LUCENE-8014: Remove deprecated SimScorer methods 2017-11-09 14:05:34 +00:00
Alan Woodward a886a001a4 LUCENE-8017: Add Weight.getCacheHelper() 2017-11-03 10:40:14 +00:00
Robert Muir ca5f9b3457 LUCENE-8007: Make scoring statistics mandatory 2017-11-02 23:02:21 -04:00
Robert Muir 875d45ff14 LUCENE-8030: fix buggy assert 2017-10-31 22:30:33 -04:00
Robert Muir e0bde57981 LUCENE-8020: don't force sim to score bogus terms (e.g. docfreq=0) 2017-10-30 20:32:12 -04:00
Robert Muir 489ca238c4 LUCENE-8021: Add AssertingSimilarity 2017-10-30 18:38:26 -04:00
Robert Muir 42717d5f4b LUCENE-7997: More sanity testing of similarities 2017-10-24 22:48:04 -04:00
Mike McCandless ea36f5040c LUCENE-7999: upgrade int to long for tracking the counter for the next segment name to prevent overflow 2017-10-24 13:13:41 -04:00
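The overflow that LUCENE-7999 guards against is plain Java integer arithmetic: an `int` counter wraps to a negative value after `Integer.MAX_VALUE`, while a `long` keeps counting. The sketch below shows both. The `segmentName` helper, including its base-36 naming scheme, is an assumption made for illustration and is not copied from the commit.

```java
public class SegmentCounterSketch {
    /** Advances an int counter; wraps around at Integer.MAX_VALUE. */
    public static int nextInt(int counter) {
        return counter + 1;
    }

    /** Advances a long counter; no wrap-around in any realistic range. */
    public static long nextLong(long counter) {
        return counter + 1;
    }

    /** Hypothetical naming scheme: underscore plus the counter in base 36. */
    public static String segmentName(long counter) {
        return "_" + Long.toString(counter, Character.MAX_RADIX);
    }

    public static void main(String[] args) {
        // The int counter becomes negative once it passes its maximum.
        System.out.println(nextInt(Integer.MAX_VALUE));
        // The long counter simply moves on to the next value.
        System.out.println(nextLong(Integer.MAX_VALUE));
        System.out.println(segmentName(nextLong(Integer.MAX_VALUE)));
    }
}
```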
Dawid Weiss 46cd679e91 LUCENE-7983: IndexWriter.IndexReaderWarmer is now a functional interface instead of an abstract class with a single method. 2017-10-04 10:59:16 +02:00
Nicholas Knize bf71650ad7 LUCENE-7392: Add point based LatLonBoundingBox as new RangeField Type. 2017-09-19 14:45:04 -05:00
yonik a4374e840d SOLR-11173: implement Points support in TermsComponent via PointMerger 2017-08-19 18:02:11 -04:00
Adrien Grand 9c83d025e4 LUCENE-7897: IndexOrDocValuesQuery now requires the range cost to be more than 8x greater than the cost of the lead iterator in order to use doc values. 2017-08-10 12:10:44 +02:00
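The rule stated in LUCENE-7897 is a simple cost comparison. A minimal sketch, assuming both costs are non-negative estimates of how many documents each iterator would visit, is shown below. The method name and the exact form of the comparison are assumptions and are not taken from IndexOrDocValuesQuery itself.

```java
public class CostHeuristicSketch {
    /**
     * Returns true when the range is estimated to be more than 8x as
     * costly as the lead iterator, which is the condition the commit
     * message gives for using doc values.
     */
    public static boolean useDocValues(long rangeCost, long leadCost) {
        // Guard against overflow of 8 * leadCost for very large costs.
        if (leadCost > Long.MAX_VALUE / 8) {
            return false;
        }
        return rangeCost > 8 * leadCost;
    }

    public static void main(String[] args) {
        System.out.println(useDocValues(1_000_000, 100)); // far above 8x
        System.out.println(useDocValues(800, 100));       // exactly 8x, not more
    }
}
```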
Adrien Grand bc9c4144df Remove unnecessary backward compatibility. 2017-07-03 14:17:09 +02:00
Alan Woodward af1ee47f2b LUCENE-7867: Remove deprecated Token class 2017-06-23 09:51:05 +01:00
Mike McCandless 58105a203a LUCENE-7868: use multiple threads to concurrently resolve deletes and DV updates 2017-06-21 13:47:15 -04:00
Adrien Grand 44d1f1fe3f LUCENE-7872: TopDocs.totalHits is now a long. 2017-06-21 09:55:02 +02:00
David Smiley abc393dbfd LUCENE-7500: Remove LeafReader.fields in lieu of LeafReader.terms.
Optimized MultiFields.getTerms.
2017-06-15 00:59:01 -04:00
David Smiley f470bbcbdc LUCENE-7876 avoid leafReader.fields 2017-06-13 22:44:04 -04:00
Mike McCandless d276acfbbc LUCENE-7854: enable indexing custom term frequencies 2017-06-06 13:37:31 -04:00
Adrien Grand 14320a584c LUCENE-7847: Fix the all-docs-match optimization of range queries on range fields. 2017-05-23 18:46:50 +02:00
Adrien Grand 06a6034d9b LUCENE-7730: Better accuracy for the length normalization factor. 2017-05-18 16:27:31 +02:00
Dawid Weiss e52d860994 LUCENE-7796: Make IOUtils.reThrow idiom declare Error return type so
callers may use it in a way that compiler knows subsequent code is
unreachable. reThrow is now deprecated in favor of IOUtils.rethrowAlways.
2017-04-28 12:17:21 +02:00
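The idiom described in LUCENE-7796 relies on a method that always throws but is declared to return `Error`, so the caller can write `throw rethrowAlways(t)` and the compiler treats the following code as unreachable. The sketch below is a standalone stand-in written for illustration. It is not the IOUtils source, and its body is an assumption about how such a helper could behave.

```java
import java.io.IOException;

public class RethrowSketch {
    /**
     * Always throws. The Error return type exists only so that callers
     * can write "throw rethrowAlways(t);" and satisfy flow analysis.
     */
    public static Error rethrowAlways(Throwable th) throws IOException {
        if (th == null) {
            throw new AssertionError("rethrow argument must not be null");
        }
        if (th instanceof IOException) {
            throw (IOException) th;
        }
        if (th instanceof RuntimeException) {
            throw (RuntimeException) th;
        }
        if (th instanceof Error) {
            throw (Error) th;
        }
        // Any other checked exception is wrapped.
        throw new RuntimeException(th);
    }

    /** Without the "throw" keyword below, javac reports a missing return. */
    public static int parse(String s) throws IOException {
        try {
            return Integer.parseInt(s);
        } catch (Throwable t) {
            throw rethrowAlways(t);
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(parse("42"));
    }
}
```

A helper declared with a `void` return type would force callers to add a dummy `return` or a second `throw` after the call, which is the awkwardness the commit message alludes to.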
Adrien Grand d3494c2cd6 LUCENE-7781: Call ensureOpen when registering closed listeners. 2017-04-13 08:41:47 +02:00
Adrien Grand 15a1561d43 LUCENE-7767: SortedDocValues.ordValue() now throws an IOException. 2017-04-11 10:02:51 +02:00
Adrien Grand 23b002a0fd LUCENE-7756: Only record the major Lucene version that created the index, and record the minimum Lucene version that contributed to segments. 2017-04-04 09:57:16 +02:00
Adrien Grand e80643e5a7 LUCENE-7753: Make fields static when possible. 2017-03-31 17:28:21 +02:00
Adrien Grand 53064e46dd LUCENE-7754: Inner classes should be static whenever possible. 2017-03-28 15:35:04 +02:00
Nicholas Knize d34d81f9af LUCENE-7740: Refactor Range Fields to remove Field suffix (e.g., DoubleRange),
move InetAddressRange and InetAddressPoint from sandbox to misc module, and
refactor all other range fields from sandbox to core.
2017-03-13 02:22:29 -05:00
Adrien Grand 8ed2b764ed LUCENE-6819: Remove index-time boosts. 2017-03-02 19:18:16 +01:00
Adrien Grand 540a237231 LUCENE-7410: Make TestReaderClosed pass if the IndexSearcher wraps a threadpool. 2017-03-01 19:12:14 +01:00
Adrien Grand df6f830723 LUCENE-7410: Make cache keys and close listeners less trappy. 2017-02-28 14:46:45 +01:00
Ishan Chattopadhyaya 9a1702a8f5 SOLR-8029: Reverting the previous commit and the merge 2017-02-15 07:13:28 +05:30
Noble Paul 325cbf0018 Revert "Merge remote-tracking branch 'origin/master'"
This reverts commit 26298f35df, reversing
changes made to 65c6c576b7.

undoing the merge commit
2017-02-14 17:32:14 -08:00
Christine Poerschke 738da0998a LUCENE-7693: Replace "org.apache." logic in GetMavenDependenciesTask. (Daniel Collins, Christine Poerschke) 2017-02-14 14:27:44 +00:00
Mike McCandless f1c5cd5784 LUCENE-7690: also handle expected CorruptIndexException in this test 2017-02-13 10:29:56 -05:00
Mike McCandless a3bfc3e67c LUCENE-7690: also handle expected CorruptIndexException in this test 2017-02-11 13:55:09 -05:00
Mike McCandless be007a6c11 LUCENE-7662: throw CorruptIndexException if index files are missing 2017-02-10 20:20:53 -05:00
Steve Rowe 6c1a4b673a SOLR-6246: SolrSuggester.build() now throws SolrCoreState.CoreIsClosedException when a core reload/shutdown happens; add a random test lookup dictionary with configurable size; add {Analyzing,Blended}InfixSuggester reload/build tests; add a wrapped-exception expectThrows() variant to LuceneTestCase 2017-02-03 18:25:38 -05:00
Mike McCandless 72eaeab715 LUCENE-7668: add new test case; remove dead code; improve CannedTokenStream to copy all Token attributes 2017-01-31 11:56:07 -05:00
Adrien Grand cf943c5454 LUCENE-7656: Implement geo box/distance queries using doc values. 2017-01-26 13:59:35 +01:00
Adrien Grand 94530940e4 LUCENE-7647: CompressingStoredFieldsFormat should reclaim memory more aggressively. 2017-01-25 16:15:04 +01:00
Alan Woodward 94e3460305 Revert "LUCENE-7628: Scorer.getChildren() returns only matching Scorers"
This reverts commit 5bdc492c9c.
2017-01-23 11:30:03 +00:00
Alan Woodward 5bdc492c9c LUCENE-7628: Scorer.getChildren() returns only matching Scorers 2017-01-23 09:35:16 +00:00
Mike McCandless 637915b890 LUCENE-7619: add WordDelimiterGraphFilter (replacing WordDelimiterFilter) to produce a correct token stream graph when splitting words 2017-01-17 10:38:07 -05:00
Jim Ferenczi 1acd2ee2bb Various fixes and updates for index sorting on flush
* IndexWriter.validateIndexSort now throws a CorruptIndexException if a segment created by version >= 6.5.0 is not sorted (already applied in branch_6x)
* Removes unneeded check in AssertingLiveDocsFormat (already applied in branch_6x)
* Removes try/finally block when stored fields consumer finishes (already applied in branch_6x).
2017-01-17 14:22:47 +01:00
Adrien Grand 86233cb95d LUCENE-7055: Add ScorerProvider to get an estimation of the cost of scorers before building them. 2017-01-17 08:51:58 +01:00
Alan Woodward 8fa0a8dd1e LUCENE-7627: Add #intersect(CompiledAutomaton) to Sorted*DocValues 2017-01-14 09:40:19 +00:00
Mike McCandless 64b86331c2 LUCENE-7626: IndexWriter no longer accepts broken offsets 2017-01-13 17:46:02 -05:00
Mike McCandless f985fcaa23 promote this test case to core 2017-01-08 06:38:37 -05:00
Mike McCandless e64111c654 TokenStreamToAutomaton failed to handle certain holes correctly 2017-01-08 06:26:08 -05:00
Uwe Schindler db9190db93 LUCENE-7595: Fix bug with RamUsageTester incorrectly handling Iterables outside Java Runtime 2016-12-29 20:31:47 +01:00
Uwe Schindler f29d2b5668 LUCENE-7595: Improve RAMUsageTester in test-framework to estimate memory usage of runtime classes and work with Java 9 EA (b148+). Disable static field heap usage checker in LuceneTestCase 2016-12-29 01:56:23 +01:00
Adrien Grand ba47f530d1 LUCENE-7401: Make sure BKD trees index all dimensions. 2016-12-26 10:37:16 +01:00
Mike McCandless c0467bb929 LUCENE-6664: add SynonymGraphFilter for correct multi-token synonym handling 2016-12-22 15:39:17 -05:00
Uwe Schindler d5e87898b1 LUCENE-6989: Fix some tests that hardcode MMapDirectory (and also the FSDirectory randomizer), to only use MMapDirectory on Windows, if it supports unmapping. Otherwise tests will fail. 2016-12-21 23:07:50 +01:00
Mike McCandless 4ccb9fbd2b LUCENE-7579: sort segments at flush too 2016-12-20 06:45:06 -05:00
Mike McCandless 268d4ace36 remove bad assertion 2016-12-15 09:30:25 -05:00
Adrien Grand be74f4f7a7 LUCENE-7569: Fix index sorting test failures. 2016-11-24 14:10:19 +01:00
Mike McCandless 24b5a624bb LUCENE-7652: improve BasePostingsFormatTestCase with a level 2 ghost test 2016-11-16 05:19:05 -05:00
Mike McCandless 6c3c6bc379 LUCENE-7537: Index time sorting now supports multi-valued sorts using selectors (MIN, MAX, etc.) 2016-11-15 16:22:51 -05:00
Mike McCandless c0f9bdf3c3 don't use BooleanSimilarity in RandomSimilarity 2016-11-13 18:41:18 -05:00
Adrien Grand c4c5e868d2 LUCENE-7531: Removed packing support from FST. 2016-11-10 15:01:49 +01:00
Adrien Grand 6b9f11311a LUCENE-7524: More detailed explanation of idf. 2016-11-10 15:01:49 +01:00
Adrien Grand 3e15233b23 LUCENE-5867: Add a BooleanSimilarity. 2016-11-10 15:01:49 +01:00
Adrien Grand af60048097 LUCENE-7429: AnalyzerWrapper can now wrap the normalization analysis chain too. 2016-10-27 16:27:45 +02:00
Adrien Grand 9aca4c9d56 LUCENE-7462: Give doc values APIs an `advanceExact` method. 2016-10-24 10:51:23 +02:00
Dawid Weiss bc0116af69 LUCENE-7515: RunListenerPrintReproduceInfo may try to access static rule fields without
the rule being called. This flag is needed to ensure this isn't the case.
2016-10-21 11:19:12 +02:00
Dawid Weiss a19ec194d2 LUCENE-7513: Update to randomizedtesting 2.4.0. 2016-10-21 11:14:37 +02:00
Adrien Grand c22725f0b5 LUCENE-7494: Give points a per-field API. 2016-10-18 10:14:27 +02:00
Adrien Grand ea1212232d LUCENE-7495: Fix doc values iterators' assertions in advance(). 2016-10-14 14:38:32 +02:00
Mike McCandless 1b7a88f61e LUCENE-7491: fix merge exception if the same field has points in some segments but not in others 2016-10-12 09:00:26 -04:00
yonik 98191225eb SOLR-9579: SchemaField should implement lucene.IndexableFieldType to avoid repeated creation 2016-10-11 11:17:55 -04:00
Adrien Grand 63ef459022 LUCENE-7487: Remove unnecessary synchronization from Lucene70NormsProducer. 2016-10-11 08:31:29 +02:00
Mike McCandless 4fe3110e49 automaton initial state is always 0 2016-10-10 13:43:46 -04:00
Adrien Grand fb088817eb LUCENE-7479: SimpleText should directly implement the new doc values APIs. 2016-10-07 10:59:11 +02:00
Adrien Grand 9128bdbaf5 LUCENE-7475: Make norms sparse. 2016-10-06 14:07:27 +02:00
Mike McCandless 001a3ca55b LUCENE-7407: speed up iterating norms a bit by having default codec implement the iterator directly 2016-10-05 14:18:55 -04:00
Adrien Grand 32446e9205 LUCENE-7453: Create a Lucene70Codec. 2016-10-04 10:50:34 +02:00
Noble Paul e8767daa7b SOLR-8995: use lambdas for CoreCloseListener 2016-09-29 11:33:22 +05:30
Mike McCandless 24ec3bd085 fix test bug 2016-09-23 06:13:26 -04:00
Mike McCandless f7aa200d40 LUCENE-7407: switch doc values usage to an iterator API, based on DocIdSetIterator, instead of random access, freeing codecs for future improvements 2016-09-21 09:41:41 -04:00
Mike McCandless c7fb49d7b5 LUCENE-7439: FuzzyQuery now matches all terms within the specified edit distance, even if they are short 2016-09-15 14:44:26 -04:00
Robert Muir f6253d5ed1 LUCENE-7426: remove deprecated DataInput/Output map/string methods 2016-08-25 12:02:52 -04:00
Mike McCandless f0963b3b4f add test case to ensure SearchManager can handle concurrent close, refresh, searching and IndexWriter closing 2016-08-19 09:28:17 -04:00
Robert Muir 18373e1129 LUCENE-7414: TestUtil.checkIterator can make tests very slow 2016-08-16 16:12:55 -04:00
Mike McCandless 36d67aaba9 use method reference instead of lambda in these tests 2016-08-09 08:07:07 -04:00
Mike McCandless 04086fbfc4 LUCENE-7409: improve MockDirectoryWrapper's IndexInput to detect if a clone is being used after its parent was closed 2016-08-09 05:03:29 -04:00
Mike McCandless 1aecdd28d1 LUCENE-7390: revert this change, since it's obsoleted by the much better LUCENE-7396 2016-07-29 14:02:00 -04:00
Adrien Grand bc29a3d7ed Add missing update of `lastDocId` in AssertingPointsFormat. 2016-07-28 15:27:14 +02:00
Mike McCandless 3a0a9fd2c2 LUCENE-7390: improve points indexing performance by letting the codec use transient heap in proportion to IndexWriter's indexing buffer, by default 2016-07-25 11:33:34 -04:00
Steve Rowe 51d4af6859 LUCENE-7013: add licence header position checker to -validate-source-patterns, and fix the violations it found 2016-07-13 13:21:36 -04:00
Christine Poerschke 15e8719b8a LUCENE-7372: Factor out an org.apache.lucene.search.FilterWeight class. 2016-07-13 16:04:24 +01:00
Adrien Grand 866398bea6 LUCENE-7371: Better compression of values in Lucene60PointsFormat. 2016-07-12 17:57:56 +02:00
Adrien Grand e92a38af90 LUCENE-7355: Add Analyzer#normalize() and use it in query parsers. 2016-07-12 16:33:07 +02:00
Adrien Grand 5def78ba10 LUCENE-7368: Remove queryNorm. 2016-07-12 14:38:36 +02:00
Mike McCandless df9efb8b6d LUCENE-7373: deprecate Directory.renameFile, which both renamed and fsync'd the directory, replacing it with separate rename and syncMetaData methods 2016-07-11 11:15:46 -04:00
Adrien Grand f1528bf338 LUCENE-7369: Similarity.coord and BooleanQuery.disableCoord are removed. 2016-07-07 10:02:29 +02:00
Steve Rowe 17d113dac1 LUCENE-2605: Add classic QueryParser option setSplitOnWhitespace() to control whether to split on whitespace prior to text analysis. Default behavior remains unchanged: split-on-whitespace=true. 2016-07-05 18:03:42 -04:00
Adrien Grand 1e4d51f408 LUCENE-7352: Fix CheckHits for DisjunctionMax queries that generate negative scores. 2016-06-23 19:03:39 +02:00
Karl Wright b11e48c755 LUCENE-7194: Ban Math.toRadians and Math.toDegrees 2016-06-22 03:53:47 -04:00
Mike McCandless b338a0c0f0 LUCENE-7345: MDW now disables VirusCheckingFS while trying to corrupt files 2016-06-21 04:50:34 -04:00
Mike McCandless c311eb1b7e LUCENE-7345: RAMDirectory now enforces write-once as well 2016-06-20 10:03:19 -04:00
Mike McCandless 4c630b2216 LUCENE-7345: make sure MDW's efforts to corrupt un-sync'd files are successful 2016-06-20 09:17:44 -04:00
Mike McCandless 87016b5f0c LUCENE-7318: graduate StandardAnalyzer and make it the default for IndexWriterConfig 2016-06-14 16:38:04 -04:00
Mike McCandless a893c64595 LUCENE-6171: pass the StandardOpenOption.CREATE_NEW so the filesystem ensures Lucene really is write-once 2016-06-14 15:15:57 -04:00
Christine Poerschke 95c7e6d716 LUCENE-7319: remove one more unused import 2016-06-13 11:11:02 +01:00
Christine Poerschke 52f5c50246 LUCENE-7319: remove unused imports 2016-06-13 09:59:14 +01:00
Uwe Schindler cc7cede7a5 LUCENE-7333: Fix test bug where randomSimpleString() generated a filename that is a reserved device name on Windows 2016-06-11 11:31:12 +02:00
Mike McCandless 816fae9622 LUCENE-7326: don't use postings format by name in this test 2016-06-10 11:57:41 -04:00
Mike McCandless 067fb25e43 LUCENE-7323: compound file writing now verifies checksum and segment ID for the incoming sub-files, to catch hardware issues or filesystem bugs earlier 2016-06-09 14:48:58 -04:00
Mike McCandless c8570ed821 LUCENE-7132: BooleanQuery sometimes assigned the wrong score when ranges of documents had only one clause matching while other ranges had more than one clause matching 2016-06-06 10:35:16 -04:00
Mike McCandless 817c7f069a sequence numbers: merge master 2016-05-26 15:30:12 -04:00