Commit Graph

32787 Commits

Author SHA1 Message Date
Ignacio Vera ca3319cdbc LUCENE-9218: XYGeometries should expose values as floats (#1252) 2020-02-14 11:40:04 +01:00
Chris Hostetter 3dd484ba29 SOLR-14245: Fix ReplicaListTransformerTest
Previous changes to this issue 'fixed' the way the test was creating mock Replica instances,
to ensure all properties were specified -- but these changes tickled a bug in the existing test
scaffolding that caused its "expectations" to be based on a regex check against only the base "url"
even though the test logic itself looked at the entire "core url"

The result is that there were reproducible failures if/when the randomly generated regex matched
".*1.*" because the existing test logic did not expect that to match the url or a Replica with
a core name of "core1" because it only considered the base url

(cherry picked from commit 49e20dbee4)
2020-02-12 11:13:55 -07:00
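
The mismatch described in the commit above can be illustrated with a minimal Python sketch (not the Java test itself). The two URLs are hypothetical values chosen for illustration; only the regex `.*1.*` and the core name `core1` come from the commit message.

```python
import re

# Hypothetical URLs for illustration only; they do not come from the test.
base_url = "http://hostA:8983/solr"
core_url = base_url + "/core1/"

# The randomly generated regex named in the failure description.
pattern = re.compile(r".*1.*")

# Expectations built from the base URL alone conclude "no match" ...
matches_base = pattern.fullmatch(base_url) is not None
# ... while logic that inspects the entire core URL sees the "1" in "core1".
matches_core = pattern.fullmatch(core_url) is not None

print(matches_base, matches_core)
```

Whenever the two checks disagree like this, expectations derived from the base URL contradict what the code under test observes, which is consistent with the reproducible failures the message describes.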
Erick Erickson 974c9ac8d9 Code comment only change
(cherry picked from commit 0767a9d4d7)
2020-02-11 19:33:34 -05:00
yonik c2125f331b SOLR-14058: fix peersync bounds check iterating over versions 2020-02-11 10:44:22 -08:00
David Smiley 8d95df364f
SOLR-14194: Highlighters now support docValues for the uniqueKey
and the original highlighter can highlight docValues.

(cherry picked from commit 9a4f7661e9)
2020-02-11 02:21:24 -05:00
Shalin Shekhar Mangar 78e567c57e SOLR-13996: Refactor HttpShardHandler.prepDistributed method (#1220)
SOLR-13996: Refactor HttpShardHandler.prepDistributed method into smaller pieces

This commit introduces an interface named ReplicaSource which is marked as experimental. It has two implementations named CloudReplicaSource (for SolrCloud) and LegacyReplicaSource (for non-cloud clusters). The prepDistributed method now calls out to these implementations depending on whether the cluster is running in cloud mode or not.

(cherry picked from commit c65b97665c)
2020-02-10 20:02:08 +05:30
Ignacio Vera 49a37708a0 LUCENE-9216: Make sure we index LEAST_DOUBLE_VALUE (#1246) 2020-02-10 11:50:55 +01:00
Mikhail Khludnev 07de70ba62 SOLR-14209: specify charset via String for Java 8. 2020-02-09 16:38:48 +03:00
David Smiley e6701680f4 SOLR-14149: CHANGES.txt Remove off-topic stuff
* No Introduction (to Solr) header.  Point at solr-upgrade-notes.adoc instead
* No Getting Started header
* No Versions of Major Components header
* No "Upgrade Notes" for subsequent releases.  See solr-upgrade-notes.adoc
Closes #1202

(cherry picked from commit 46c0945614)
2020-02-08 23:04:41 -05:00
Kevin Risden 056e2cc5da
SOLR-14038: Admin UI display for "state.json" should be in a scrollable region
Signed-off-by: Kevin Risden <krisden@apache.org>
2020-02-08 12:03:19 -06:00
Kevin Risden 8df7f379a4
SOLR-14209: Upgrade JQuery to 3.4.1
* JQuery 2.1.3 to 3.4.1
* jstree 1.0-rc1 to v3.3.8

Closes #1209

Signed-off-by: Kevin Risden <krisden@apache.org>
2020-02-08 11:59:25 -06:00
Shalin Shekhar Mangar e623eb5320 SOLR-14248: Improve ClusterStateMockUtil and make its methods public
(cherry picked from commit f5c132be6d)
2020-02-08 12:01:11 +05:30
Nicholas Knize 0bd2496205 LUCENE-9149: Increase data dimension limit in BKD 2020-02-07 16:57:32 -06:00
Ignacio Vera 5c1f2ca22a LUCENE-9194: Simplify XYShapeXQuery API by adding a new abstract class called XYGeometry 2020-02-07 19:30:53 +01:00
Andrzej Bialecki f8163439ff SOLR-14245: Validate Replica / ReplicaInfo on creation. 2020-02-07 17:57:31 +01:00
Adrien Grand d62f630765 SOLR-12930: Exclude dev-docs from binary archive. 2020-02-07 10:37:56 +01:00
Adrien Grand 6a380798a2 LUCENE-9147: Make sure temporary files get deleted on all code paths. 2020-02-06 17:13:22 +01:00
Alan Woodward aa916bac3c LUCENE-9099: Correctly handle repeats in ORDERED and UNORDERED intervals (#1097)
If you have repeating intervals in an ordered or unordered interval source, you currently 
get somewhat confusing behaviour:

* `ORDERED(a, a, b)` will return an extra interval over just `a b` if it first matches `a a b`, meaning
that you can get incorrect results if used in a `CONTAINING` filter - 
`CONTAINING(ORDERED(x, y), ORDERED(a, a, b))` will match on the document `a x a b y`
* `UNORDERED(a, a)` will match on documents that contain just a single a.

This commit adds a RepeatingIntervalsSource that correctly handles repeats within 
ordered and unordered sources. It also changes the way that gaps are calculated within 
ordered and unordered sources, by using a new width() method on IntervalIterator. The 
default implementation just returns end() - start() + 1, but RepeatingIntervalsSource 
instead returns the sum of the widths of its child iterators. This preserves maxgaps filtering 
on ordered and unordered sources that contain repeats.

In order to correctly handle matches in this scenario, IntervalsSource#matches now always 
returns an explicit IntervalsMatchesIterator rather than a plain MatchesIterator, which adds 
gaps() and width() methods so that submatches can be combined in the same way that 
subiterators are. Extra checks have been added to checkIntervals() to ensure that the same 
intervals are returned by both iterator and matches, and a fix to 
DisjunctionIntervalIterator#matches() is also included - DisjunctionIntervalIterator minimizes 
its intervals, while MatchesUtils.disjunction does not, so there was a discrepancy between 
the two methods.
2020-02-06 14:45:27 +00:00
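
The width-based gap calculation described in the commit above can be sketched roughly as follows. This is a Python illustration, not Lucene's Java code. The class names `Interval` and `Repeating` and the helper `gaps` are hypothetical, and the formula "overall span minus the sum of child widths" is an assumption about how the widths are combined. Only the default width `end() - start() + 1` and the "sum of the widths of its child iterators" rule come from the commit message.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Interval:
    start: int
    end: int

    def width(self) -> int:
        # Default width named in the commit message: end() - start() + 1.
        return self.end - self.start + 1


@dataclass
class Repeating:
    """Hypothetical stand-in for a repeated term such as (a, a)."""
    children: List[Interval]

    @property
    def start(self) -> int:
        return self.children[0].start

    @property
    def end(self) -> int:
        return self.children[-1].end

    def width(self) -> int:
        # Sum of the child widths, so positions between repeats count as gaps.
        return sum(child.width() for child in self.children)


def gaps(subs) -> int:
    # Assumed formula: positions in the overall span not covered by any child.
    span = subs[-1].end - subs[0].start + 1
    return span - sum(sub.width() for sub in subs)


# Document "a x a b": a@0, x@1, a@2, b@3, queried with ORDERED(a, a, b).
b = Interval(3, 3)
with_repeat_width = gaps([Repeating([Interval(0, 0), Interval(2, 2)]), b])
with_default_width = gaps([Interval(0, 2), b])
print(with_repeat_width, with_default_width)
```

Under these assumptions the repeat-aware width reports one gap (the `x`), while treating the repeated `a a` as a plain interval from 0 to 2 would report none, so a maxgaps filter would let the match through incorrectly.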
Adrien Grand 3246b26058 LUCENE-9147: Fix codec excludes. 2020-02-06 10:34:35 +01:00
Houston Putman e0d35f9641 SOLR-13887: Use the default idleTimeout instead of 0 for HTTP2 (#991) 2020-02-05 12:45:14 -08:00
Chris Hostetter bbdfce944b SOLR-14241: New delete() Stream Decorator
(cherry picked from commit c5d0391df9)
2020-02-05 13:31:55 -07:00
Adrien Grand 597141df6b LUCENE-9147: Move the stored fields index off-heap. (#1179)
This replaces the index of stored fields and term vectors with two
`DirectMonotonic` arrays. `DirectMonotonicWriter` needs to know the number
of values to write up-front, so incoming doc IDs and file pointers are buffered
on disk using temporary files that never get fsynced, but have index headers
and footers to make sure any corruption in these files wouldn't propagate to the
index.

`DirectMonotonicReader` gets a specialized `binarySearch` implementation that
leverages the metadata in order to go to the IndexInput as rarely as
possible. Actually in the common case, it would only go to a single
sub `DirectReader` which, combined with the size of blocks of 1k values, helps
bound the number of page faults to 2.
2020-02-05 19:19:32 +01:00
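
The idea of searching in-memory metadata first and touching only one block can be sketched as below. This is a simplified Python illustration, not `DirectMonotonicReader`. The class `BlockedMonotonic` is hypothetical, and the metadata is reduced to each block's first value, which is an assumption made for the sketch. Only the block size of 1k values comes from the commit message.

```python
from bisect import bisect_left, bisect_right

BLOCK_SIZE = 1024  # the commit message mentions blocks of 1k values


class BlockedMonotonic:
    """Hypothetical model: sorted values split into blocks plus in-memory metadata."""

    def __init__(self, values):
        self.blocks = [values[i:i + BLOCK_SIZE]
                       for i in range(0, len(values), BLOCK_SIZE)]
        # Assumed metadata: the first value of every block, kept in memory.
        self.block_mins = [block[0] for block in self.blocks]
        self.block_reads = 0  # counts simulated trips to the underlying input

    def _read_block(self, idx):
        self.block_reads += 1
        return self.blocks[idx]

    def binary_search(self, target):
        """Return the index of target, or -1 if it is absent."""
        # Step 1: pick the candidate block using metadata only.
        block_idx = bisect_right(self.block_mins, target) - 1
        if block_idx < 0:
            return -1
        # Step 2: search inside that single block.
        block = self._read_block(block_idx)
        pos = bisect_left(block, target)
        if pos < len(block) and block[pos] == target:
            return block_idx * BLOCK_SIZE + pos
        return -1


reader = BlockedMonotonic(list(range(0, 30000, 3)))
found = reader.binary_search(15000)
reads_after_hit = reader.block_reads
missing = reader.binary_search(15001)
print(found, reads_after_hit, missing)
```

In this model each lookup reads exactly one block, which mirrors the "single sub `DirectReader`" behaviour the commit describes for the common case.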
Adrien Grand d007470bda SOLR-14242: HdfsDirectory#createTempOutput. (#1240) 2020-02-05 16:39:30 +01:00
Mike McCandless 3e63cd38ef LUCENE-9200: consistently use double (not float) math for TieredMergePolicy's decisions, to fix a corner-case bug uncovered by randomized tests 2020-02-05 09:52:19 -05:00
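
The commit above does not describe the corner case itself. As a general illustration of why single-precision math can mislead size-based decisions, the following Python sketch rounds two different byte counts to 32-bit floats. The helper `to_float32` and the sample values are assumptions made for the sketch and are not taken from TieredMergePolicy.

```python
import struct


def to_float32(x: float) -> float:
    # Round-trip through a 4-byte IEEE 754 float to mimic Java's float.
    return struct.unpack("f", struct.pack("f", x))[0]


# Two distinct sizes in bytes (hypothetical values).
size_a = 16_777_216  # 2**24
size_b = 16_777_217  # 2**24 + 1

equal_as_double = float(size_a) == float(size_b)
equal_as_float = to_float32(float(size_a)) == to_float32(float(size_b))
print(equal_as_double, equal_as_float)
```

Double precision keeps the two sizes distinct, while single precision collapses them to the same value, so comparisons that mix the two types can disagree.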
Tomas Fernandez Lobbe 37d4121770 SOLR-14219: Revert changes in OverseerSolrResponse and move serialization (#1227)
SOLR-14095 introduced an issue for rolling restarts (incompatible Java serialization). This change fixes the compatibility issue while keeping the functionality in SOLR-14095.
2020-02-04 11:07:38 -08:00
Adrien Grand d7859097ee SOLR-14238: Fix HdfsDirectory to no longer overwrite existing files. (#1237) 2020-02-04 19:35:52 +01:00
Munendra S N 358043d1f3 SOLR-14090: fix delete-copy-field when source is dynamic field 2020-02-04 21:48:56 +05:30
Munendra S N 5a3a05d953 SOLR-10567: add support for DateRangeField in JSON facet range 2020-02-04 21:47:54 +05:30
Andrzej Bialecki 4a002411fc SOLR-14239: Fix the behavior of CaffeineCache.computeIfAbsent on branch_8x. 2020-02-04 17:02:05 +01:00
Ignacio Vera 996945fff7 LUCENE-9197: fix wrong implementation on Point2D#withinTriangle (#1228) 2020-02-04 07:11:07 +01:00
Anshum Gupta 02f9b276b0
SOLR-14206: Annotate HttpSolrCall as thread-safe (#1205)
* SOLR-14206: Annotate HttpSolrCall and V2HttpCall as thread-safe
2020-02-03 10:00:43 -08:00
Mikhail Khludnev 34d299018e SOLR-12325: uniqueBlock({!v=foo:bar}) 2020-02-02 15:20:16 +03:00
Jan Høydahl e4721d9a2d SOLR-14221: Upgrade restlet to version 2.4.0 (#1211)
(cherry picked from commit 16b8d50284)
2020-02-02 11:45:01 +01:00
Kazuaki Hiraga 12242b52e6 LUCENE-9123: Add new JapaneseTokenizer constructors with discardCompoundToken option to control whether the tokenizer emits original tokens when the mode is not NORMAL. 2020-02-01 15:20:02 +09:00
Munendra S N 43d07db523 fix typo in schema-api documentation 2020-02-01 10:39:33 +05:30
Robert Muir 507ef67d5f
support ECJ linting on newer JDK versions
The entire precommit task will still fail with unsupported java version
(subsequent checks do not support the newer javadocs format).

But this allows the ECJ linter to run, which checks for things such as
unused imports.
2020-01-31 14:07:03 -05:00
Jason Gerlowski 68cfe27b68 SOLR-13892: Add 'top-level' docValues Join implementation (#1171) 2020-01-31 13:11:28 -05:00
Joel Bernstein d4a4b4413d SOLR-14139: Support backtick phrase queries in Streaming Expressions 2020-01-31 12:14:43 -05:00
Christine Poerschke fc3497d24c LUCENE-9195: precommit fix (remove unused import) 2020-01-31 16:53:12 +00:00
Christine Poerschke 53d8b5b1b8 LUCENE-8530: fix some 'rawtypes' javac warnings 2020-01-31 16:42:25 +00:00
Robert Muir 30b2cc0163
LUCENE-9195: more slow tests fixes 2020-01-31 09:27:01 -05:00
Chris Hostetter b2d8b784a3 New /stream test cases showing authn+authz edge cases in cloud mode
This triggers various places in the Streaming Expressions code that use background threads
to confirm that the expected credentials (or lack thereof) are propagated along.

Test currently has comments + workarounds for 2 known client issues:
 - SOLR-14226: SolrStream reports AuthN/AuthZ failures (401|403) as IOException w/o details
 - SOLR-14222: CloudSolrClient converts (update) 403 error to 500 error

(cherry picked from commit 517438e356)
2020-01-30 10:04:09 -07:00
Adrien Grand 744dec7275 LUCENE-4702: Improve performance for fuzzy queries.
Fuzzy queries with an edit distance of 1 or 2 must visit all blocks whose prefix
length is 1 or 2. By not compressing those, we can trade very little space (a
couple MBs in the case of the wikibigall index) for better query efficiency.
2020-01-30 10:40:44 +01:00
Ignacio Vera 46fa876c35 LUCENE-9141: Simplify LatLonShapeXQuery API by adding a new abstract class called LatLonGeometry. (#1170) 2020-01-30 08:04:09 +01:00
Robert Muir e258ab32f0
LUCENE-9192: speed up more slow tests 2020-01-29 14:33:05 -05:00
Robert Muir 16f240e740
LUCENE-9160: add params/docs to override jvm params in gradle build, default C2 off in tests.
Adds some build parameters to tune how tests run. There is an example
shown by "gradle helpLocalSettings"

Default C2 off in tests as it is wasteful locally and causes slowdown of
test runs. You can override this by setting tests.jvmargs for gradle,
or args for ant.

Some crazy lucene stress tests may need to be toned down after the
change, as they may have been doing too many iterations by default...
but this is not a new problem.
2020-01-29 13:59:07 -05:00
Robert Muir 5ee2a6fcae
fix merging difficulty while trying to give branch_8x some love 2020-01-29 13:57:48 -05:00
Robert Muir e1cc7eb4b7
LUCENE-9189: TestIndexWriterDelete.testDeletesOnDiskFull can run for minutes
The issue is that MockDirectoryWrapper's disk full check is horribly
inefficient. On every writeByte/etc, it totally recomputes disk space
across all files. This means it calls listAll() on the underlying
Directory (which sorts all the underlying files), then sums up fileLength()
for each of those files.

This leads to many pathological cases in the disk full tests... but the
number of tests impacted by this is minimal, and the logic is scary.
2020-01-29 13:47:05 -05:00
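
The cost pattern described in the commit above can be modelled with a small Python sketch. It is not MockDirectoryWrapper and it does not show the change the commit made. The class `NaiveDiskCheck` and its counter are hypothetical; the sketch only mirrors the described behaviour of listing (and sorting) all files and summing their lengths on every single-byte write.

```python
class NaiveDiskCheck:
    """Hypothetical model of a directory that recomputes usage on every write."""

    def __init__(self):
        self.files = {}          # file name -> length in bytes
        self.length_lookups = 0  # how many per-file length reads were made

    def list_all(self):
        # Like the described listAll(): returns all file names, sorted.
        return sorted(self.files)

    def used_bytes(self):
        total = 0
        for name in self.list_all():
            self.length_lookups += 1
            total += self.files[name]
        return total

    def write_byte(self, name, max_bytes):
        # Full recomputation before every single byte written.
        if self.used_bytes() + 1 > max_bytes:
            raise OSError("fake disk full")
        self.files[name] = self.files.get(name, 0) + 1


disk = NaiveDiskCheck()
disk.files = {f"_{i}.dat": 0 for i in range(10)}
for _ in range(100):
    for name in list(disk.files):
        disk.write_byte(name, max_bytes=10**6)
lookups_for_writes = disk.length_lookups
print(lookups_for_writes)
```

With 10 files and 1,000 single-byte writes, the model performs 10,000 length lookups plus 1,000 sorts, which shows how the work grows with the number of writes multiplied by the number of files.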
Robert Muir 3dd47cf9c7
LUCENE-9186: remove linefiledocs usage from BaseTokenStreamTestCase 2020-01-29 13:46:41 -05:00
Robert Muir 037cc5b1de
LUCENE-9172: nuke some compiler warnings 2020-01-29 13:45:59 -05:00