Commit Graph

32852 Commits

Author SHA1 Message Date
Michael Froh a5475de57f LUCENE-8962: Fix intermittent test failures
1. TestIndexWriterMergePolicy.testMergeOnCommit will fail if the last
   commit (the one that should trigger the full merge) doesn't have any
   pending changes (which could occur if the last indexing thread
   commits at the end). We can fix that by adding one more document
   before that commit.
2. The previous implementation was throwing IOException if the commit
   thread gets interrupted while waiting for merges to complete. This
   violates IndexWriter's documented behavior of throwing
   ThreadInterruptedException.
2020-03-02 18:09:19 -05:00
msfroh a1791e7714 LUCENE-8962: Add ability to selectively merge on commit (#1155)
* LUCENE-8962: Add ability to selectively merge on commit

This adds a new "findCommitMerges" method to MergePolicy, which can
specify merges to be executed before the
IndexWriter.prepareCommitInternal method returns.

If we have many index writer threads, they will flush their DWPT buffers
on commit, resulting in many small segments, which can be merged before
the commit returns.

* Add missing Javadoc

* Fix incorrect comment

* Refactoring and fix intermittent test failure

1. Made some changes to the callback to update toCommit, leveraging
SegmentInfos.applyMergeChanges.
2. I realized that we'll never end up with 0 registered merges, because
we throw an exception if we fail to register a merge.
3. Moved the IndexWriterEvents.beginMergeOnCommit notification to before
we call MergeScheduler.merge, since we may not be merging on another
thread.
4. There was an intermittent test failure due to randomness in the time
it takes for merges to complete. Before doing the final commit, we wait
for pending merges to finish. We may still end up abandoning the final
merge, but we can detect that and assert that either the merge was
abandoned (and we have > 1 segment) or we did merge down to 1 segment.

* Fix typo

* Fix/improve comments based on PR feedback

* More comment improvements from PR feedback

* Rename method and add new MergeTrigger

1. Renamed findCommitMerges -> findFullFlushMerges.
2. Added MergeTrigger.COMMIT, passed to findFullFlushMerges and to
   MergeScheduler when merging on commit.

* Update renamed method name in strings and comments
2020-03-02 18:09:03 -05:00
Cassandra Targett f2a6ff1494 SOLR-14241: fix typos & incorrect example param 2020-03-02 16:53:46 -06:00
Cassandra Targett 1f549dc474 SOLR-14270: Move .gz example to CLI page; Remove bin/solr export from command-line-utilities.adoc 2020-03-02 16:53:36 -06:00
Cassandra Targett 3b595ab0bf Ref Guide: standardize i.e., e.g., spellings; fix typos 2020-03-02 15:46:12 -06:00
Cassandra Targett 902c6016d0 Ref Guide: Add license header, typos in Package Manager docs 2020-03-02 15:45:52 -06:00
Namgyu Kim 0de87de900 LUCENE-9253: Support custom dictionaries in KoreanTokenizer
Signed-off-by: Namgyu Kim <namgyu@apache.org>
2020-03-03 02:17:03 +09:00
Adrien Grand 6f3780e8d5 LUCENE-9247: Fix compile error. 2020-03-02 09:47:33 +01:00
Adrien Grand 3ab8bcf0b3 LUCENE-9247: Fix test failures with ExtraFS. 2020-03-02 08:59:14 +01:00
Ignacio Vera d9787406f8 LUCENE-9243: Add fudge factor when creating a bounding box of a xycircle (#1278) 2020-03-02 06:58:26 +01:00
Shalin Shekhar Mangar e7a9fd0a37 SOLR-13996: Rename LegacyReplicaSource to StandaloneReplicaSource
(cherry picked from commit 4897a64713)
2020-03-01 07:01:47 +05:30
David Smiley 3728178320 SOLR-14256, SOLR-14258: Deprecations 2020-02-29 16:16:25 -05:00
Tomoko Uchida 983b5e9784 Merge branch 'branch_8x' of github.com:apache/lucene-solr into branch_8x 2020-02-29 18:15:11 +09:00
Tomoko Uchida 3ab908afc0 SOLR-14295: Add the parameter description about 'discardCompoundToken' for JapaneseTokenizer in RefGuide 2020-02-29 17:45:22 +09:00
Adrien Grand b0c911ebca LUCENE-9247: Exclude `write.lock` from files whose integrity is expected to be verified. 2020-02-29 08:45:39 +01:00
Gus Heck f083067328 SOLR-14283 ensure that the test only runs on JVM's where it can succeeed 2020-02-28 14:20:35 -05:00
Adrien Grand a703c6e334 LUCENE-9247: Add tests for `checkIntegrity`. (#1284)
This adds a test to `BaseIndexFileFormatTestCase` that the combination
of opening a reader and calling `checkIntegrity` on it reads all bytes
of all files (including index headers and footers). This would help
detect most cases when `checkIntegrity` is not implemented correctly.
2020-02-28 15:27:01 +01:00
Adrien Grand 05830b463b LUCENE-9246: Remove `dOff` argument from `LZ4#decompress`. (#1283)
It is always set to 0 at call sites.
2020-02-28 15:27:01 +01:00
Bruno Roustant 7302effd9c
LUCENE-9237: Faster UniformSplit intersect TermsEnum. 2020-02-28 15:15:53 +01:00
Bruno Roustant c4475920b0
LUCENE-9245: Reduce AutomatonTermsEnum memory usage. 2020-02-28 11:02:28 +01:00
Jan Høydahl 3c8ab725c5 SOLR-14287: Admin UI Properties screen does not show colons (#1300)
(cherry picked from commit 198d9db674)
2020-02-28 10:37:33 +01:00
Ignacio Vera 16663db099 LUCENE-9250: Add support for Circle2d#intersectsLine around the dateline. (#1289) 2020-02-28 10:23:35 +01:00
Gus Heck 9c9a69c643 SOLR-14283 - fix NPE in SolrTestCaseJ4 2020-02-27 21:39:50 -05:00
Cao Manh Dat f2ac34373f SOLR-14286: Try and fix sha1 file. 2020-02-28 08:36:00 +07:00
Mikhail Khludnev 64193f052a SOLR-13411: reject incremental update for route.field, uniqueKey and
_version_.
2020-02-27 21:56:11 +03:00
Jan Høydahl e8922a2299 SOLR-14281: Make sharedLib configurable through SysProp (#1288)
(cherry picked from commit 62f5bd50cd)
2020-02-27 14:23:12 +01:00
Andrzej Bialecki ad9b2d2e19 SOLR-14252: NullPointerException in AggregateMetric. 2020-02-27 13:39:12 +01:00
Cao Manh Dat 0e0aa6e207 SOLR-14286: Update sha files 2020-02-27 15:11:52 +07:00
Cao Manh Dat 5b4f07ee75 SOLR-14286: Upgrade Jaegar to 1.1.0 2020-02-27 15:11:40 +07:00
noble 4e70711c95 SOLR-14270: fix incorrect date format 2020-02-27 17:30:58 +11:00
Christine Poerschke 449a7ef7b5 SOLR-14267: complete <query><HashDocSet> solrconfig.xml removal 2020-02-26 17:59:07 +00:00
Eric Pugh a5e372369c SOLR-13965: In GraphHandler, support <expressible> configuration and deprecate <streamFunctions> configuration.
(Eric Pugh via Christine Poerschke)

Closes #1033 pull request.
2020-02-26 17:59:07 +00:00
Mike McCandless 570109f73d LUCENE-9252: fix javac linter warnings in spatial-extras (thanks Andras Salamon) 2020-02-26 09:35:29 -05:00
Chris Hostetter d91f7856ba Some additional testing to prove 'defered function computation' in a syntax variant i was recently asked about
(cherry picked from commit 852f02b4b7)
2020-02-25 13:21:36 -07:00
Namgyu Kim b2796bf0c8
LUCENE-8954: refactor Nori analyzer 2020-02-25 22:46:26 +09:00
Namgyu Kim 8037234142
Revert "LUCENE-8954: refactor Nori analyzer"
This reverts commit 29b7e1a95c.
2020-02-25 02:44:23 +09:00
Namgyu Kim 29b7e1a95c
LUCENE-8954: refactor Nori analyzer
Signed-off-by: Namgyu Kim <namgyu@apache.org>
2020-02-25 00:08:46 +09:00
Alan Woodward 90028a7b93 LUCENE-9212: Intervals.multiterm() should take CompiledAutomaton 2020-02-24 11:11:19 +00:00
Alessandro Benedetti 2752d50dd1 SOLR-12238: Handle boosts in QueryBuilder
QueryBuilder now detects per-term boosts supplied by a BoostAttribute when
building queries using a TokenStream.  This commit also adds a DelimitedBoostTokenFilter
that parses boosts from tokens using a delimiter token, and exposes this in Solr
2020-02-24 10:33:51 +00:00
Jan Høydahl e49dd55c47 SOLR-14114: Add WARN to Solr log that embedded ZK is not supported in production (#1273)
(cherry picked from commit 57c7139ea3)
2020-02-24 10:00:15 +01:00
Jan Høydahl 365728f7f4 SOLR-10306: Document in Reference Guide how to disable or reduce swapping (#1256)
(cherry picked from commit 8b98befe93)
2020-02-24 09:55:01 +01:00
iverase 799ab23bc5 LUCENE-9238: Do not use Math#toRadians() in branch_8x 2020-02-21 17:40:03 +01:00
Erick Erickson a79a90dcd3 SOLR-11035: (at least) 2 distinct failures possible when clients attempt searches during SolrCore reload
(cherry picked from commit 292bed0c42)
2020-02-21 10:58:37 -05:00
Ignacio Vera 53cb6acf91 LUCENE-9238: Add new XYPointField, queries and sorting capabilities (#1272)
New XYPointField field and Queries for indexing, searching and sorting cartesian points.
2020-02-21 11:33:42 +01:00
Jan Høydahl 8ea947eeb2 SOLR-14250: Do not log error when trying to consume non-existing input stream due to Expect: 100-continue (#1250)
(cherry picked from commit 89b13377a1)
2020-02-21 10:31:38 +01:00
Noble Paul fb9024f0e5 SOLR-14270 export command to have an option to write to a zip file (#1266) 2020-02-21 13:42:57 +11:00
Nhat Nguyen 83ccb8d2a2 LUCENE-9228: Sort dvUpdates by terms before apply
With this change, we sort dvUpdates in the term order before applying if
they all update a single field to the same value. This optimization can
reduce the flush time by around 20% for the docValues update user cases.
2020-02-20 13:27:39 -05:00
Shalin Shekhar Mangar d5e51bf994 SOLR-12550: Adding entry to CHANGES.txt
(cherry picked from commit 2fdd3b02bb)
2020-02-20 04:35:58 -08:00
Marc A. Morissette 8eb6b03d51 SOLR-12550: ConcurrentUpdateSolrClient doesn't respect timeouts for commits and optimize (#417)
ConcurrentUpdateSolrClient now propagates its connection and read timeouts to the private HttpSolrClient used to commit and optimize.

(cherry picked from commit 051133c13f)
2020-02-20 04:35:50 -08:00
iverase ac46213e83 LUCENE-8707: fix test bug. when bounding box if a triangle
is within a circle, the triangle is within the circle as well.
2020-02-19 18:22:12 +01:00