Commit Graph

31087 Commits

Author SHA1 Message Date
Jim Ferenczi 9f29ed0757 LUCENE-8592: Fix index sorting corruption due to numeric overflow
The merge sort of sorted segments can produce an invalid
sort if the sort field is an Integer/Long that uses reverse order and contains values equal to
Integer/Long#MIN_VALUE. These values are always sorted first during a merge
(instead of last because of the reverse order) due to this bug.
Indices affected by the bug can be detected by running the CheckIndex command on a
distribution that contains the fix (7.6+).
2018-12-07 19:16:42 +01:00
Jim Ferenczi da7919f7a4 Revert "Add the ability to set the number of hits to track accurately"
This reverts commit 84d5015eee.
2018-12-07 19:15:44 +01:00
Jim Ferenczi 84d5015eee Add the ability to set the number of hits to track accurately
In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested.
It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected.

Relates #33028
2018-12-07 19:11:59 +01:00
Joel Bernstein 17fca051c5 SOLR-13046: Suppress SSL for StreamingTest 2018-12-07 09:19:40 -05:00
Luca Cavanna b24af10d59 LUCENE-8591: add LegacyBM25Similarity#getDiscountOverlaps
Signed-off-by: Adrien Grand <jpountz@gmail.com>
2018-12-07 12:31:22 +01:00
Simon Willnauer f0db05b37e LUCENE-8595: Fix interleaved DV update and reset
This change fixes a bug where interleaved update and reset value
to the same doc in the same updates package looses an update
if the reset comes before the update as well as loosing the reset
if the update comes frist.

Co-authored-by: Adrien Grand <jpountz@gmail.com>
2018-12-07 08:24:52 +01:00
Simon Willnauer aaa64d7015 LUCENE-8594: DV update are broken for updates on new field
A segmemnt written with Lucene70Codec failes if it ties to update
a DV field that didn't exist in the index before it was upgraded to
Lucene80Codec. We bake the DV format into the FieldInfo when it's used
the first time and therefor never go to the codec if we need to update.
yet on a field that didn't exist before and was added during an indexing
operation we have to consult the coded and get an exception.
This change fixes this issue and adds the relevant bwc tests.
2018-12-06 20:16:03 +01:00
Diego Ceccarelli 2e586a7e7c LUCENE-8539: fix some typos and improve style in TestStopFilter
Split the test in two tests, use random
2018-12-06 10:48:50 -05:00
markrmiller 46d7994045 SOLR-13028: @AwaitsFix AutoAddReplicasPlanActionTest#testSimple. 2018-12-06 08:56:03 -06:00
markrmiller b9a966e5f7 SOLR-12833: Add configurable timeout to VersionBucket lock. 2018-12-06 08:36:56 -06:00
Simon Willnauer b4e1fe4393 LUCENE-8593: Specialize single value numeric DV updates
The case when all values are the the same on a numeric field update
is common for soft_deletes. With the new infrastucture for buffering
DV updates we can gain an easy win by specializing the applied updates
if all values are the same.
2018-12-06 09:40:59 +01:00
markrmiller 38cfd0e259 SOLR-13040: Add AwaitsFix annotation to TestSQLHandler and improve exception information related to that test. 2018-12-05 19:34:13 -06:00
Simon Willnauer b0e6a92ede LUCENE-8590: Optimize DocValues update datastructures
Today we are using a LinkedHashMap to buffer doc-values updates in
BufferedUpdates. This on the one hand uses an Object based datastructure
and on the other requires re-encoding the data into a more compact representation
once the BufferedUpdates are frozen. This change uses a more compact represenation
for the updates already in the BufferedUpdates in a parallel-array like datastructure
that can be reused in FrozenBufferedDeletes. It also adds an much simpler to use
API to consume the updates and allows for internal memory optimization for common
case updates.
2018-12-05 17:22:40 +01:00
Cassandra Targett c81822e157 SOLR-12422: Add section on performance issues with Solr Cell; Overhaul and re-organize entire page 2018-12-04 15:08:22 -06:00
Cassandra Targett ad373ff36e Ref Guide: typo in "parameter" 2018-12-04 15:08:22 -06:00
Jason Gerlowski a5fb6f8adc Add 'prelim_sort' helper to SolrJ's TermsFacetMap 2018-12-04 14:29:29 -05:00
Chris Hostetter 868e005760 SOLR-13032: harden MaxSizeAutoCommitTest to eliminate race conditions and eliminate use of arbitrary sleep 2018-12-04 10:44:05 -07:00
Shai Erera ca4555ed93 close DirectoryReader in test 2018-12-04 16:37:35 +02:00
Shai Erera f637c50c11 fix typo 2018-12-04 16:36:14 +02:00
Jason Gerlowski f7faae075b SOLR-13019: Fix typo in MailEntityProcessor
Path provided by Tommy Marshment-Howell.

Closes #509
2018-12-04 08:49:58 -05:00
Alan Woodward c78429a554 LUCENE-8586: Fix infinite loop in Intervals.or() 2018-12-04 09:47:42 +00:00
Alan Woodward 75a053dd69 LUCENE-8509: WordDelimiterGraphFilter no longer adjusts offsets by default 2018-12-04 09:47:42 +00:00
Alan Woodward f5867a1413 LUCENE-8564: Add GraphTokenFilter 2018-12-04 09:47:42 +00:00
Toke Eskildsen 6c11161111 Pre-commit fixes for LUCENE-8374 (JavaDoc + arguments) 2018-12-04 10:05:09 +01:00
Gus Heck 59919b4ac0 SOLR-13036 Fix retry logic in JettySolrRunner 2018-12-03 21:54:34 -05:00
markrmiller e14432efd8 SOLR-12801: Wait for thread on close and add AwaitsFix annotation - most of these sim tests can fail to elect shard leaders. 2018-12-03 17:40:51 -06:00
markrmiller e3cbdd0505 SOLR-13032: Add @AwaitsFix annotation to test. 2018-12-03 11:07:40 -06:00
markrmiller 1408f5255f SOLR-12801: Disable TimeRoutedAliasUpdateProcessorTest because the feature leaks threads. 2018-12-03 09:07:27 -06:00
Toke Eskildsen e356d793ca LUCENE-8374 part 4/4: Reduce reads for sparse DocValues
Offset and index jump-table for norm blocks.
2018-12-03 14:30:30 +01:00
Toke Eskildsen 7949b98f80 LUCENE-8374 part 3/4: Reduce reads for sparse DocValues
Offset jump-table for variable bits per value blocks.
2018-12-03 14:29:54 +01:00
Toke Eskildsen 7ad027627a LUCENE-8374 part 2/4: Reduce reads for sparse DocValues
Index jump-table for DENSE indexedDISI blocks.
2018-12-03 14:24:40 +01:00
Toke Eskildsen 58a7a8ada5 LUCENE-8374 part 1/4: Reduce reads for sparse DocValues
Offset and index jump-table for IndexedDISI blocks.
2018-12-03 14:23:11 +01:00
Christophe Bismuth 643ffc6f9f LUCENE-8548: The KoreanTokenizer no longer splits unknown words on combining diacritics and
detects script boundaries more accurately with Character#UnicodeScript#of.

Signed-off-by: Jim Ferenczi <jimczi@apache.org>
2018-12-03 10:57:22 +01:00
Gus Heck b6b9f9554e SOLR-12801 fix cut/paste oops in comment 2018-12-02 16:41:55 -05:00
Gus Heck cf4d749410 SOLR-12801 Revert breaking commit a3ec5b5fdf and improve test
to be more complete. Also, add comments to make it easier for
others to understand TimeRoutedAliasUpdateProcessorTest
2018-12-02 16:34:40 -05:00
Jason Gerlowski c951578fca SOLR-6117: Unify ReplicationHandler error handling
Prior to this commit, ReplicationHandler had a few inconsistencies
in how it reported errors:
  - Sometimes the 'status' field was used as an enum (e.g. 'success'
    v. 'failure'.  Elsewhere it is used to hold error messages.
  - Sometimes an explicit 'message' field was used, but often not.
  - Sometimes a stack trace was provided in place of an error message.

This commit tweaks the various error cases in ReplicationHandler to
report errors consistently.  'status' is always an enum-type value.  A
'message' field is provided for all errors, with an optional 'exception'
field.
2018-12-02 16:23:04 -05:00
markrmiller 16241f4484 LUCENE-8546: Fix ant beast to fail and succeed based on whether beasting actually fails or succeeds.
LUCENE-8541: Fix ant beast to not overwrite junit xml results for each beast.iters iteration.
2018-12-02 10:10:10 -06:00
Jason Gerlowski 6e4c9b3b58 SOLR-12555: Use `expectThrows` for expected exceptions
This commit replaces the `try { doX(); fail(); }` pattern with the
`expectThrows` test helper, which was created for this purpose.

This commit makes these changes in the core package:
`o.a.solr.core`.
2018-12-01 17:19:34 -05:00
markrmiller e5281ef3d5 SOLR-12801: Don't possibly block forever in this test in remaining spots. 2018-12-01 13:02:58 -06:00
markrmiller b294d8460f SOLR-12801: Don't possibly block forever in this test in another spot. 2018-12-01 12:55:05 -06:00
markrmiller b5a9cca168 SOLR-12801: Raise THREAD_POOL_MAX_IDLE_TIME_MS. 2018-12-01 12:42:28 -06:00
markrmiller cbe9f82202 SOLR-12801: Don't possibly block forever in this test. 2018-12-01 08:58:13 -06:00
markrmiller 755044b58f SOLR-12801: Wait for collection creation in test. 2018-12-01 08:09:03 -06:00
markrmiller 9b0b9032e2 SOLR-12801: Fix thread leak in test. 2018-12-01 00:53:12 -06:00
markrmiller 33c40a8da4 SOLR-13027: Change retries to work across JVM impls properly by looking for an IOException. 2018-12-01 00:26:15 -06:00
markrmiller 7f88bfa112 SOLR-12801: Wait for collections properly. 2018-12-01 00:26:03 -06:00
markrmiller a3ec5b5fdf SOLR-12801: Wait for executor to finish shutdown. 2018-12-01 00:26:03 -06:00
markrmiller d8f482f5fb SOLR-13030: Update executor usage to work correctly with Java 11 and update Mockito & HttpComponents to work with Java 11 and fix get remote info retry to work across jvms better. 2018-12-01 00:26:02 -06:00
markrmiller a01d0d9ef9 SOLR-13030: Close MetricsHistoryHandler inline. 2018-11-30 18:13:26 -06:00
Chris Hostetter 5dc988f5ee SOLR-12839: JSON 'terms' Faceting now supports a 'prelim_sort' option to use when initially selecting the top ranking buckets, prior to the final 'sort' option used after refinement. 2018-11-30 15:49:06 -07:00