Commit Graph

32806 Commits

Author SHA1 Message Date
Nhat Nguyen 83ccb8d2a2 LUCENE-9228: Sort dvUpdates by terms before apply
With this change, we sort dvUpdates in the term order before applying if
they all update a single field to the same value. This optimization can
reduce the flush time by around 20% for the docValues update use cases.
2020-02-20 13:27:39 -05:00
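A minimal sketch of the idea described in LUCENE-9228, not the actual Lucene code: a batch of pending updates is reordered by term only when every update writes the same value to the same field, since in that case the order of application cannot change the outcome. The names `DvUpdate` and `sortIfUniform` are hypothetical and exist only for this illustration.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Objects;

public class DvUpdateSortSketch {
    // Hypothetical stand-in for a pending doc-values update: set `field` to
    // `value` on every document matching `term`.
    static final class DvUpdate {
        final String term;
        final String field;
        final long value;

        DvUpdate(String term, String field, long value) {
            this.term = term;
            this.field = field;
            this.value = value;
        }
    }

    // Returns a copy sorted by term when all updates target one field with one
    // value; otherwise returns the list unchanged, because reordering updates
    // with different values could change which value wins.
    static List<DvUpdate> sortIfUniform(List<DvUpdate> updates) {
        if (updates.isEmpty()) {
            return updates;
        }
        DvUpdate first = updates.get(0);
        for (DvUpdate u : updates) {
            if (!Objects.equals(u.field, first.field) || u.value != first.value) {
                return updates;
            }
        }
        List<DvUpdate> sorted = new ArrayList<>(updates);
        sorted.sort(Comparator.comparing((DvUpdate u) -> u.term));
        return sorted;
    }
}
```

Applying updates in term order presumably lets the terms dictionary be walked sequentially rather than with random seeks, which would be consistent with the flush-time reduction the commit reports.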
Shalin Shekhar Mangar d5e51bf994 SOLR-12550: Adding entry to CHANGES.txt
(cherry picked from commit 2fdd3b02bb)
2020-02-20 04:35:58 -08:00
Marc A. Morissette 8eb6b03d51 SOLR-12550: ConcurrentUpdateSolrClient doesn't respect timeouts for commits and optimize (#417)
ConcurrentUpdateSolrClient now propagates its connection and read timeouts to the private HttpSolrClient used to commit and optimize.

(cherry picked from commit 051133c13f)
2020-02-20 04:35:50 -08:00
iverase ac46213e83 LUCENE-8707: fix test bug. When the bounding box of a triangle
is within a circle, the triangle is within the circle as well.
2020-02-19 18:22:12 +01:00
Ignacio Vera 50168ab5bc LUCENE-8707: Add LatLonShape and XYShape distance query (#587) 2020-02-19 16:05:31 +01:00
Mikhail Khludnev 9f1aa427c0 SOLR-14263: stripping .adoc to fix build. 2020-02-19 14:31:48 +03:00
Jan Høydahl 42b725462f LUCENE-9229: Fix some broken links
Change some wiki -> cwiki links

Signed-off-by: Jan Høydahl <janhoy@apache.org>
(cherry picked from commit 8389b87e39)
2020-02-19 10:37:17 +01:00
markharwood 57b361bd6a Test fix - new binary doc values test could use invalid values. 2020-02-19 09:20:21 +00:00
Robert Muir b01d7cb79a
LUCENE-9230: explicitly call python version we want from builds
On newer linux distros, at least, 'python' now means python3. So
we can't rely on what version of python it will invoke (at least for a
few years).

For example in Fedora Linux:

https://fedoraproject.org/wiki/Changes/Python_means_Python3

For python2.x code, explicitly call 'python2.7' and for python3.x code,
explicitly call 'python3'.

Ant variable names are cleaned up, e.g. 'python.exe' is renamed to
'python2.exe' and 'python32.exe' is renamed to 'python3.exe'. This also
makes it easy to identify remaining python 2.x code that should be
migrated to python 3.x
2020-02-18 19:04:37 -05:00
Erick Erickson 0461999c92 SOLR-14263: Update jvm-settings.adoc
(cherry picked from commit aa130c4259)
2020-02-18 16:45:17 -05:00
Christine Poerschke 01c9c68cc8 SOLR-13041: Add hashCode for autoscaling.Condition to accompany the already present equals.
(Zsolt Gyulavari via Christine Poerschke)
2020-02-18 16:14:57 +00:00
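For context on why SOLR-13041 pairs a hashCode with the existing equals: objects that compare equal must produce the same hash code, or hash-based collections will fail to find them. The class below is a hypothetical stand-in, not the real autoscaling Condition; its fields are assumptions made for the example.

```java
import java.util.Objects;

public class ConditionSketch {
    // Hypothetical fields; the real class may hold different state.
    private final String name;
    private final Object value;

    public ConditionSketch(String name, Object value) {
        this.name = name;
        this.value = value;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof ConditionSketch)) return false;
        ConditionSketch that = (ConditionSketch) o;
        return Objects.equals(name, that.name) && Objects.equals(value, that.value);
    }

    // Hashes exactly the fields that equals() compares, so equal objects
    // always land in the same hash bucket.
    @Override
    public int hashCode() {
        return Objects.hash(name, value);
    }
}
```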
Eric Pugh f8e50a8fc2 SOLR-13965: s/StreamHandler/GraphHandler fix GraphHandler.getDescription()
(Eric Pugh via Christine Poerschke)
2020-02-18 16:14:47 +00:00
Eric Pugh d52e70a731 SOLR-13965: StreamHandler class-level javadoc edits
(Eric Pugh via Christine Poerschke)
2020-02-18 16:14:36 +00:00
markharwood bcdc21a001
LUCENE-9211 Add compression for Binary doc value fields (#1234)
Stores groups of 32 binary doc values in LZ4-compressed blocks.
(cherry picked from commit f549ee3535)
2020-02-18 14:40:44 +00:00
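The grouping described in LUCENE-9211 can be pictured with a short sketch. It shows only the block addressing that a group size of 32 implies (shift by 5 for the block, mask with 31 for the slot). It does not perform LZ4 compression, and the class and method names are hypothetical rather than taken from Lucene.

```java
import java.util.ArrayList;
import java.util.List;

public class BinaryBlockSketch {
    static final int BLOCK_SHIFT = 5;                // 2^5 = 32 values per block
    static final int BLOCK_SIZE = 1 << BLOCK_SHIFT;
    static final int BLOCK_MASK = BLOCK_SIZE - 1;

    // Splits values into consecutive groups of up to 32. A real writer would
    // compress each group as one unit; this sketch keeps them uncompressed.
    static List<List<byte[]>> toBlocks(List<byte[]> values) {
        List<List<byte[]>> blocks = new ArrayList<>();
        for (int i = 0; i < values.size(); i += BLOCK_SIZE) {
            int end = Math.min(i + BLOCK_SIZE, values.size());
            blocks.add(new ArrayList<>(values.subList(i, end)));
        }
        return blocks;
    }

    // Looks up the value with ordinal `ord`: the high bits select the block,
    // the low 5 bits select the slot inside it.
    static byte[] get(List<List<byte[]>> blocks, int ord) {
        return blocks.get(ord >>> BLOCK_SHIFT).get(ord & BLOCK_MASK);
    }
}
```

The trade-off of any such scheme is that reading one value means decompressing its whole block, so the block size balances compression ratio against per-lookup cost.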
Claire Pollard 8fcf500d30 Update README.txt (#1090)
Update the analysis-extras README to include reference to including solr-analysis-extras jar.

(cherry picked from commit 188f620208)
2020-02-15 22:59:52 +01:00
Chris Hostetter ea20c9a001 SOLR-13794: Replace redundant test only copy of '_default' configset with SolrTestCase logic to correctly set 'solr.default.confdir' system property
This change allows us to remove kludgy test only code from ZkController

(cherry picked from commit f549ee3535)
2020-02-14 12:00:13 -07:00
Ignacio Vera f3c81d76b0 Return CELL_CROSSES_QUERY when point inside the triangle (#1259) 2020-02-14 17:08:00 +01:00
Adrien Grand 210f2f83f7 Add back assertions removed by LUCENE-9187. (#1236)
This time they would only apply to TestFastLZ4/TestHighLZ4 and avoid slowing
down all tests.
2020-02-14 13:41:30 +01:00
Robert Muir da33e4aa6f LUCENE-9187: remove too-expensive assert from LZ4 HighCompressionHashTable 2020-02-14 13:41:30 +01:00
Ignacio Vera ca3319cdbc LUCENE-9218: XYGeometries should expose values as floats (#1252) 2020-02-14 11:40:04 +01:00
Chris Hostetter 3dd484ba29 SOLR-14245: Fix ReplicaListTransformerTest
Previous changes to this issue 'fixed' the way the test was creating mock Replica instances,
to ensure all properties were specified -- but these changes tickled a bug in the existing test
scaffolding that caused its "expectations" to be based on a regex check against only the base "url"
even though the test logic itself looked at the entire "core url"

The result is that there were reproducible failures if/when the randomly generated regex matched
".*1.*" because the existing test logic did not expect that to match the url or a Replica with
a core name of "core1" because it only considered the base url

(cherry picked from commit 49e20dbee4)
2020-02-12 11:13:55 -07:00
Erick Erickson 974c9ac8d9 Code comment only change
(cherry picked from commit 0767a9d4d7)
2020-02-11 19:33:34 -05:00
yonik c2125f331b SOLR-14058: fix peersync bounds check iterating over versions 2020-02-11 10:44:22 -08:00
David Smiley 8d95df364f
SOLR-14194: Highlighters now support docValues for the uniqueKey
and the original highlighter can highlight docValues.

(cherry picked from commit 9a4f7661e9)
2020-02-11 02:21:24 -05:00
Shalin Shekhar Mangar 78e567c57e SOLR-13996: Refactor HttpShardHandler.prepDistributed method (#1220)
SOLR-13996: Refactor HttpShardHandler.prepDistributed method into smaller pieces

This commit introduces an interface named ReplicaSource which is marked as experimental. It has two sub-classes named CloudReplicaSource (for solr cloud) and LegacyReplicaSource for non-cloud clusters. The prepDistributed method now calls out to these sub-classes depending on whether the cluster is running on cloud mode or not.

(cherry picked from commit c65b97665c)
2020-02-10 20:02:08 +05:30
Ignacio Vera 49a37708a0 LUCENE-9216: Make sure we index LEAST_DOUBLE_VALUE (#1246) 2020-02-10 11:50:55 +01:00
Mikhail Khludnev 07de70ba62 SOLR-14209: specify charset via String for Java 8. 2020-02-09 16:38:48 +03:00
David Smiley e6701680f4 SOLR-14149: CHANGES.txt Remove off-topic stuff
* No Introduction (to Solr) header.  Point at solr-upgrade-notes.adoc instead
* No Getting Started header
* No Versions of Major Components header
* No "Upgrade Notes" for subsequent releases.  See solr-upgrade-notes.adoc
Closes #1202

(cherry picked from commit 46c0945614)
2020-02-08 23:04:41 -05:00
Kevin Risden 056e2cc5da
SOLR-14038: Admin UI display for "state.json" should be in a scrollable region
Signed-off-by: Kevin Risden <krisden@apache.org>
2020-02-08 12:03:19 -06:00
Kevin Risden 8df7f379a4
SOLR-14209: Upgrade JQuery to 3.4.1
* JQuery 2.1.3 to 3.4.1
* jstree 1.0-rc1 to v3.3.8

Closes #1209

Signed-off-by: Kevin Risden <krisden@apache.org>
2020-02-08 11:59:25 -06:00
Shalin Shekhar Mangar e623eb5320 SOLR-14248: Improve ClusterStateMockUtil and make its methods public
(cherry picked from commit f5c132be6d)
2020-02-08 12:01:11 +05:30
Nicholas Knize 0bd2496205 LUCENE-9149: Increase data dimension limit in BKD 2020-02-07 16:57:32 -06:00
Ignacio Vera 5c1f2ca22a LUCENE-9194: Simplify XYShapeXQuery API by adding a new abstract class called XYGeometry 2020-02-07 19:30:53 +01:00
Andrzej Bialecki f8163439ff SOLR-14245: Validate Replica / ReplicaInfo on creation. 2020-02-07 17:57:31 +01:00
Adrien Grand d62f630765 SOLR-12930: Exclude dev-docs from binary archive. 2020-02-07 10:37:56 +01:00
Adrien Grand 6a380798a2 LUCENE-9147: Make sure temporary files get deleted on all code paths. 2020-02-06 17:13:22 +01:00
Alan Woodward aa916bac3c LUCENE-9099: Correctly handle repeats in ORDERED and UNORDERED intervals (#1097)
If you have repeating intervals in an ordered or unordered interval source, you currently 
get somewhat confusing behaviour:

* `ORDERED(a, a, b)` will return an extra interval over just a b if it first matches a a b, meaning
that you can get incorrect results if used in a `CONTAINING` filter - 
`CONTAINING(ORDERED(x, y), ORDERED(a, a, b))` will match on the document `a x a b y`
* `UNORDERED(a, a)` will match on documents that contain just a single a.

This commit adds a RepeatingIntervalsSource that correctly handles repeats within 
ordered and unordered sources. It also changes the way that gaps are calculated within 
ordered and unordered sources, by using a new width() method on IntervalIterator. The 
default implementation just returns end() - start() + 1, but RepeatingIntervalsSource 
instead returns the sum of the widths of its child iterators. This preserves maxgaps filtering 
on ordered and unordered sources that contain repeats.

In order to correctly handle matches in this scenario, IntervalsSource#matches now always 
returns an explicit IntervalsMatchesIterator rather than a plain MatchesIterator, which adds 
gaps() and width() methods so that submatches can be combined in the same way that 
subiterators are. Extra checks have been added to checkIntervals() to ensure that the same 
intervals are returned by both iterator and matches, and a fix to 
DisjunctionIntervalIterator#matches() is also included - DisjunctionIntervalIterator minimizes 
its intervals, while MatchesUtils.disjunction does not, so there was a discrepancy between 
the two methods.
2020-02-06 14:45:27 +00:00
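The width() rule from LUCENE-9099 can be illustrated with a small sketch. The `Interval` interface, the `term` and `repeating` factories, and the `gaps` formula (span minus the summed child widths) are assumptions made for this example; they are not the Lucene classes, and the commit message does not spell out the exact gap formula.

```java
import java.util.List;

public class IntervalWidthSketch {
    interface Interval {
        int start();
        int end();
        // Default width as stated in the commit message: end() - start() + 1.
        default int width() {
            return end() - start() + 1;
        }
    }

    // A single term occurrence at one position.
    static Interval term(final int pos) {
        return new Interval() {
            public int start() { return pos; }
            public int end() { return pos; }
        };
    }

    // A repeat such as (a, a): it spans from its first child to its last, but
    // its width is the sum of the child widths, so positions between the
    // repeats still count as gaps.
    static Interval repeating(final List<Interval> children) {
        return new Interval() {
            public int start() { return children.get(0).start(); }
            public int end() { return children.get(children.size() - 1).end(); }
            @Override
            public int width() {
                int sum = 0;
                for (Interval c : children) {
                    sum += c.width();
                }
                return sum;
            }
        };
    }

    // Assumed gap rule for an ordered source: positions spanned minus the
    // positions accounted for by the children.
    static int gaps(List<Interval> children) {
        int span = children.get(children.size() - 1).end() - children.get(0).start() + 1;
        int covered = 0;
        for (Interval c : children) {
            covered += c.width();
        }
        return span - covered;
    }
}
```

For the document `a x a b`, the repeat (a, a) matches positions 0 and 2 and b matches position 3. With the summed width the repeat counts as 2 positions, so the ordered match over positions 0 to 3 reports one gap (the x). With the default width the repeat would count as 3 positions and the gap would disappear, which is why a maxgaps filter needs the overridden width.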
Adrien Grand 3246b26058 LUCENE-9147: Fix codec excludes. 2020-02-06 10:34:35 +01:00
Houston Putman e0d35f9641 SOLR-13887: Use the default idleTimeout instead of 0 for HTTP2 (#991) 2020-02-05 12:45:14 -08:00
Chris Hostetter bbdfce944b SOLR-14241: New delete() Stream Decorator
(cherry picked from commit c5d0391df9)
2020-02-05 13:31:55 -07:00
Adrien Grand 597141df6b LUCENE-9147: Move the stored fields index off-heap. (#1179)
This replaces the index of stored fields and term vectors with two
`DirectMonotonic` arrays. `DirectMonotonicWriter` needs to know the number
of values to write up-front, so incoming doc IDs and file pointers are buffered
on disk using temporary files that never get fsynced, but have index headers
and footers to make sure any corruption in these files wouldn't propagate to the
index.

`DirectMonotonicReader` gets a specialized `binarySearch` implementation that
leverages the metadata in order to avoid going to the IndexInput as much as
possible. Actually in the common case, it would only go to a single
sub `DirectReader` which, combined with the size of blocks of 1k values, helps
bound the number of page faults to 2.
2020-02-05 19:19:32 +01:00
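A simplified sketch of a metadata-guided search of the kind described above. It assumes strictly increasing values split into fixed-size blocks and uses only each block's first value as in-memory metadata; the real `DirectMonotonicReader` keeps different per-block metadata and is not reproduced here. All names are hypothetical.

```java
import java.util.Arrays;

public class BlockedMonotonicSearchSketch {
    // Searches strictly increasing values stored in blocks of `blockSize`
    // entries (only the last block may be shorter). `blockMins[i]` is the
    // first value of block i and stands in for metadata held in memory.
    // Returns the global index of `target`, or -(insertionPoint) - 1 if it
    // is absent, mirroring Arrays.binarySearch.
    static long search(long[][] blocks, long[] blockMins, int blockSize, long target) {
        int b = Arrays.binarySearch(blockMins, target);
        if (b >= 0) {
            // The target is the first value of block b: no block access needed.
            return (long) b * blockSize;
        }
        int block = -b - 2; // last block whose first value is below the target
        if (block < 0) {
            return -1; // smaller than every stored value: insertion point 0
        }
        // Only this one block has to be read, which bounds the "disk" accesses.
        int i = Arrays.binarySearch(blocks[block], target);
        long base = (long) block * blockSize;
        return i >= 0 ? base + i : -(base + (-i - 1)) - 1;
    }
}
```

The point of the two-level layout is that the first step touches only in-memory metadata, so at most one block of stored values is consulted per lookup.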
Adrien Grand d007470bda SOLR-14242: HdfsDirectory#createTempOutput. (#1240) 2020-02-05 16:39:30 +01:00
Mike McCandless 3e63cd38ef LUCENE-9200: consistently use double (not float) math for TieredMergePolicy's decisions, to fix a corner-case bug uncovered by randomized tests 2020-02-05 09:52:19 -05:00
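The commit message for LUCENE-9200 does not say which computation misbehaved. As general background only, the sketch below shows why float arithmetic is fragile for large quantities such as segment sizes in bytes: a float has a 24-bit significand, so distinct long values above 2^24 can collapse to the same float, while a double keeps them apart up to 2^53. The class and method names are hypothetical.

```java
public class FloatVsDoubleSketch {
    // True if the two longs are still different after conversion to float.
    static boolean floatDistinguishes(long a, long b) {
        return (float) a != (float) b;
    }

    // True if the two longs are still different after conversion to double.
    static boolean doubleDistinguishes(long a, long b) {
        return (double) a != (double) b;
    }
}
```

For example, 16777216 (2^24) and 16777217 convert to the same float but to different doubles, so a comparison that mixes float and double results can disagree with itself at such boundaries.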
Tomas Fernandez Lobbe 37d4121770 SOLR-14219: Revert changes in OverseerSolrResponse and move serialization (#1227)
SOLR-14095 introduced an issue for rolling restarts (incompatible Java serialization). This change fixes the compatibility issue while keeping the functionality in SOLR-14095
2020-02-04 11:07:38 -08:00
Adrien Grand d7859097ee SOLR-14238: Fix HdfsDirectory to no longer overwrite existing files. (#1237) 2020-02-04 19:35:52 +01:00
Munendra S N 358043d1f3 SOLR-14090: fix delete-copy-field when source is dynamic field 2020-02-04 21:48:56 +05:30
Munendra S N 5a3a05d953 SOLR-10567: add support for DateRangeField in JSON facet range 2020-02-04 21:47:54 +05:30
Andrzej Bialecki 4a002411fc SOLR-14239: Fix the behavior of CaffeineCache.computeIfAbsent on branch_8x. 2020-02-04 17:02:05 +01:00
Ignacio Vera 996945fff7 LUCENE-9197: fix wrong implementation on Point2D#withinTriangle (#1228) 2020-02-04 07:11:07 +01:00
Anshum Gupta 02f9b276b0
SOLR-14206: Annotate HttpSolrCall as thread-safe (#1205)
* SOLR-14206: Annotate HttpSolrCall and V2HttpCall as thread-safe
2020-02-03 10:00:43 -08:00