32819 Commits

Author SHA1 Message Date
Chris Hostetter d91f7856ba Some additional testing to prove 'deferred function computation' in a syntax variant I was recently asked about
(cherry picked from commit 852f02b4b7)
2020-02-25 13:21:36 -07:00
Namgyu Kim b2796bf0c8
LUCENE-8954: refactor Nori analyzer 2020-02-25 22:46:26 +09:00
Namgyu Kim 8037234142
Revert "LUCENE-8954: refactor Nori analyzer"
This reverts commit 29b7e1a95c.
2020-02-25 02:44:23 +09:00
Namgyu Kim 29b7e1a95c
LUCENE-8954: refactor Nori analyzer
Signed-off-by: Namgyu Kim <namgyu@apache.org>
2020-02-25 00:08:46 +09:00
Alan Woodward 90028a7b93 LUCENE-9212: Intervals.multiterm() should take CompiledAutomaton 2020-02-24 11:11:19 +00:00
Alessandro Benedetti 2752d50dd1 SOLR-12238: Handle boosts in QueryBuilder
QueryBuilder now detects per-term boosts supplied by a BoostAttribute when
building queries using a TokenStream.  This commit also adds a DelimitedBoostTokenFilter
that parses boosts from tokens using a delimiter token, and exposes this in Solr
2020-02-24 10:33:51 +00:00
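
The delimiter-based boost parsing described in this commit can be sketched in plain Java. This is a minimal illustration only, not the Lucene DelimitedBoostTokenFilter: the class DelimitedBoostSketch, the '|' delimiter used in the usage note, and the default boost of 1.0 for tokens without a delimiter are all assumptions made for the example.

```java
/** Hypothetical illustration only; this is not the Lucene DelimitedBoostTokenFilter. */
final class DelimitedBoostSketch {

    /** A term paired with the boost parsed from its token. */
    static final class BoostedTerm {
        final String term;
        final float boost;

        BoostedTerm(String term, float boost) {
            this.term = term;
            this.boost = boost;
        }
    }

    /**
     * Splits a token of the form term + delimiter + boost.
     * Tokens without the delimiter keep a boost of 1.0 (an assumed default).
     */
    static BoostedTerm parse(String token, char delimiter) {
        int idx = token.lastIndexOf(delimiter);
        if (idx < 0) {
            return new BoostedTerm(token, 1.0f);
        }
        // Everything after the last delimiter is read as the boost value.
        float boost = Float.parseFloat(token.substring(idx + 1));
        return new BoostedTerm(token.substring(0, idx), boost);
    }
}
```

Under these assumptions, DelimitedBoostSketch.parse("solr|0.5", '|') yields the term "solr" with a boost of 0.5, while a token with no delimiter is returned unchanged with a boost of 1.0.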
Jan Høydahl e49dd55c47 SOLR-14114: Add WARN to Solr log that embedded ZK is not supported in production (#1273)
(cherry picked from commit 57c7139ea3)
2020-02-24 10:00:15 +01:00
Jan Høydahl 365728f7f4 SOLR-10306: Document in Reference Guide how to disable or reduce swapping (#1256)
(cherry picked from commit 8b98befe93)
2020-02-24 09:55:01 +01:00
iverase 799ab23bc5 LUCENE-9238: Do not use Math#toRadians() in branch_8x 2020-02-21 17:40:03 +01:00
Erick Erickson a79a90dcd3 SOLR-11035: (at least) 2 distinct failures possible when clients attempt searches during SolrCore reload
(cherry picked from commit 292bed0c42)
2020-02-21 10:58:37 -05:00
Ignacio Vera 53cb6acf91 LUCENE-9238: Add new XYPointField, queries and sorting capabilities (#1272)
New XYPointField field and Queries for indexing, searching and sorting cartesian points.
2020-02-21 11:33:42 +01:00
Jan Høydahl 8ea947eeb2 SOLR-14250: Do not log error when trying to consume non-existing input stream due to Expect: 100-continue (#1250)
(cherry picked from commit 89b13377a1)
2020-02-21 10:31:38 +01:00
Noble Paul fb9024f0e5 SOLR-14270 export command to have an option to write to a zip file (#1266) 2020-02-21 13:42:57 +11:00
Nhat Nguyen 83ccb8d2a2 LUCENE-9228: Sort dvUpdates by terms before apply
With this change, we sort dvUpdates in term order before applying them if
they all update a single field to the same value. This optimization can
reduce the flush time by around 20% for docValues update use cases.
2020-02-20 13:27:39 -05:00
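
The condition described in this commit (sort only when every update targets the same field with the same value) can be sketched as follows. The Update type and the prepare method are hypothetical stand-ins, not the Lucene internals. The sketch rests on the observation that, when all updates write the same value to the same field, the order in which they are applied should not change the result, so they can be reordered freely.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical illustration only; these are not the Lucene buffered-update classes. */
final class DvUpdateSortSketch {

    /** One doc-values update: documents matching the term get the value in the field. */
    static final class Update {
        final String term;
        final String field;
        final long value;

        Update(String term, String field, long value) {
            this.term = term;
            this.field = field;
            this.value = value;
        }
    }

    /** Returns a copy sorted by term if all updates share one field and value, else an unsorted copy. */
    static List<Update> prepare(List<Update> updates) {
        List<Update> copy = new ArrayList<>(updates);
        if (copy.isEmpty()) {
            return copy;
        }
        Update first = copy.get(0);
        for (Update u : copy) {
            if (!u.field.equals(first.field) || u.value != first.value) {
                return copy; // mixed fields or values: keep the original order
            }
        }
        copy.sort(Comparator.comparing((Update u) -> u.term));
        return copy;
    }
}
```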
Shalin Shekhar Mangar d5e51bf994 SOLR-12550: Adding entry to CHANGES.txt
(cherry picked from commit 2fdd3b02bb)
2020-02-20 04:35:58 -08:00
Marc A. Morissette 8eb6b03d51 SOLR-12550: ConcurrentUpdateSolrClient doesn't respect timeouts for commits and optimize (#417)
ConcurrentUpdateSolrClient now propagates its connection and read timeouts to the private HttpSolrClient used to commit and optimize.

(cherry picked from commit 051133c13f)
2020-02-20 04:35:50 -08:00
iverase ac46213e83 LUCENE-8707: fix test bug. When the bounding box of a triangle
is within a circle, the triangle is within the circle as well.
2020-02-19 18:22:12 +01:00
Ignacio Vera 50168ab5bc LUCENE-8707: Add LatLonShape and XYShape distance query (#587) 2020-02-19 16:05:31 +01:00
Mikhail Khludnev 9f1aa427c0 SOLR-14263: stripping .adoc to fix build. 2020-02-19 14:31:48 +03:00
Jan Høydahl 42b725462f LUCENE-9229: Fix some broken links
Change some wiki -> cwiki links

Signed-off-by: Jan Høydahl <janhoy@apache.org>
(cherry picked from commit 8389b87e39)
2020-02-19 10:37:17 +01:00
markharwood 57b361bd6a Test fix - new binary doc values test could use invalid values. 2020-02-19 09:20:21 +00:00
Robert Muir b01d7cb79a
LUCENE-9230: explicitly call python version we want from builds
On newer linux distros, at least, 'python' now means python3. So
we can't rely on what version of python it will invoke (at least for a
few years).

For example in Fedora Linux:

https://fedoraproject.org/wiki/Changes/Python_means_Python3

For python2.x code, explicitly call 'python2.7' and for python3.x code,
explicitly call 'python3'.

Ant variable names are cleaned up, e.g. 'python.exe' is renamed to
'python2.exe' and 'python32.exe' is renamed to 'python3.exe'. This also
makes it easy to identify remaining python 2.x code that should be
migrated to python 3.x
2020-02-18 19:04:37 -05:00
Erick Erickson 0461999c92 SOLR-14263: Update jvm-settings.adoc
(cherry picked from commit aa130c4259)
2020-02-18 16:45:17 -05:00
Christine Poerschke 01c9c68cc8 SOLR-13041: Add hashCode for autoscaling.Condition to accompany the already present equals.
(Zsolt Gyulavari via Christine Poerschke)
2020-02-18 16:14:57 +00:00
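
The general pattern behind this change, a hashCode that accompanies an existing equals, might look like the sketch below. The class ConditionSketch and its fields name and value are hypothetical and do not reflect the actual fields of autoscaling.Condition. The point is only that both methods are derived from the same fields, so equal objects always report equal hash codes.

```java
import java.util.Objects;

/** Hypothetical illustration only; not the Solr autoscaling Condition class. */
final class ConditionSketch {
    final String name;
    final Object value;

    ConditionSketch(String name, Object value) {
        this.name = name;
        this.value = value;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof ConditionSketch)) return false;
        ConditionSketch that = (ConditionSketch) o;
        return Objects.equals(name, that.name) && Objects.equals(value, that.value);
    }

    /** Uses exactly the fields that equals compares. */
    @Override
    public int hashCode() {
        return Objects.hash(name, value);
    }
}
```

Without a matching hashCode, two equal instances could land in different buckets of a HashSet or HashMap, so lookups by an equal key could fail.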
Eric Pugh f8e50a8fc2 SOLR-13965: s/StreamHandler/GraphHandler fix GraphHandler.getDescription()
(Eric Pugh via Christine Poerschke)
2020-02-18 16:14:47 +00:00
Eric Pugh d52e70a731 SOLR-13965: StreamHandler class-level javadoc edits
(Eric Pugh via Christine Poerschke)
2020-02-18 16:14:36 +00:00
markharwood bcdc21a001
LUCENE-9211 Add compression for Binary doc value fields (#1234)
Stores groups of 32 binary doc values in LZ4-compressed blocks.
(cherry picked from commit f549ee3535)
2020-02-18 14:40:44 +00:00
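
Grouping values into fixed blocks of 32 implies simple index arithmetic for locating a value, sketched below. This shows only the addressing, not the LZ4 compression itself, and the names BlockAddressingSketch, blockIndex, offsetInBlock and blockCount are hypothetical rather than taken from the Lucene codec.

```java
/** Hypothetical illustration of block addressing; not the Lucene doc values codec. */
final class BlockAddressingSketch {
    static final int BLOCK_SHIFT = 5;                      // 2^5 = 32 values per block
    static final int VALUES_PER_BLOCK = 1 << BLOCK_SHIFT;

    /** Index of the block that holds the given value. */
    static int blockIndex(int valueIndex) {
        return valueIndex >>> BLOCK_SHIFT;
    }

    /** Position of the value inside its block, in the range 0..31. */
    static int offsetInBlock(int valueIndex) {
        return valueIndex & (VALUES_PER_BLOCK - 1);
    }

    /** Number of blocks needed for numValues values, rounding up. */
    static int blockCount(int numValues) {
        return (numValues + VALUES_PER_BLOCK - 1) >>> BLOCK_SHIFT;
    }
}
```

With this layout, reading a single value means decompressing one block of at most 32 values rather than the whole field.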
Claire Pollard 8fcf500d30 Update README.txt (#1090)
Update the analysis-extras README to include reference to including solr-analysis-extras jar.

(cherry picked from commit 188f620208)
2020-02-15 22:59:52 +01:00
Chris Hostetter ea20c9a001 SOLR-13794: Replace redundant test only copy of '_default' configset with SolrTestCase logic to correctly set 'solr.default.confdir' system property
This change allows us to remove kludgy test only code from ZkController

(cherry picked from commit f549ee3535)
2020-02-14 12:00:13 -07:00
Ignacio Vera f3c81d76b0 Return CELL_CROSSES_QUERY when point inside the triangle (#1259) 2020-02-14 17:08:00 +01:00
Adrien Grand 210f2f83f7 Add back assertions removed by LUCENE-9187. (#1236)
This time they would only apply to TestFastLZ4/TestHighLZ4 and avoid slowing
down all tests.
2020-02-14 13:41:30 +01:00
Robert Muir da33e4aa6f LUCENE-9187: remove too-expensive assert from LZ4 HighCompressionHashTable 2020-02-14 13:41:30 +01:00
Ignacio Vera ca3319cdbc LUCENE-9218: XYGeometries should expose values as floats (#1252) 2020-02-14 11:40:04 +01:00
Chris Hostetter 3dd484ba29 SOLR-14245: Fix ReplicaListTransformerTest
Previous changes to this issue 'fixed' the way the test was creating mock Replica instances,
to ensure all properties were specified -- but these changes tickled a bug in the existing test
scaffolding that caused its "expectations" to be based on a regex check against only the base "url"
even though the test logic itself looked at the entire "core url"

The result is that there were reproducible failures if/when the randomly generated regex matched
".*1.*" because the existing test logic did not expect that to match the url of a Replica with
a core name of "core1" because it only considered the base url

(cherry picked from commit 49e20dbee4)
2020-02-12 11:13:55 -07:00
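
The mismatch described in this commit can be reproduced with java.util.regex alone. The URLs in the usage note are hypothetical examples chosen so that the base URL contains no '1'; the actual URLs generated by the test are not shown in the commit message.

```java
import java.util.regex.Pattern;

/** Hypothetical illustration only; not the ReplicaListTransformerTest code. */
final class RegexUrlMismatchSketch {

    /** True if the whole url matches the regex, as Pattern.matches requires. */
    static boolean matches(String regex, String url) {
        return Pattern.matches(regex, url);
    }
}
```

With the regex ".*1.*", the assumed base URL "http://host/solr" does not match, but the core URL "http://host/solr/core1" does. An expectation computed from the base URL therefore disagrees with logic that inspects the core URL.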
Erick Erickson 974c9ac8d9 Code comment only change
(cherry picked from commit 0767a9d4d7)
2020-02-11 19:33:34 -05:00
yonik c2125f331b SOLR-14058: fix peersync bounds check iterating over versions 2020-02-11 10:44:22 -08:00
David Smiley 8d95df364f
SOLR-14194: Highlighters now support docValues for the uniqueKey
and the original highlighter can highlight docValues.

(cherry picked from commit 9a4f7661e9)
2020-02-11 02:21:24 -05:00
Shalin Shekhar Mangar 78e567c57e SOLR-13996: Refactor HttpShardHandler.prepDistributed method (#1220)
SOLR-13996: Refactor HttpShardHandler.prepDistributed method into smaller pieces

This commit introduces an interface named ReplicaSource, which is marked as experimental. It has two implementations: CloudReplicaSource (for SolrCloud) and LegacyReplicaSource (for non-cloud clusters). The prepDistributed method now delegates to these implementations depending on whether the cluster is running in cloud mode.

(cherry picked from commit c65b97665c)
2020-02-10 20:02:08 +05:30
Ignacio Vera 49a37708a0 LUCENE-9216: Make sure we index LEAST_DOUBLE_VALUE (#1246) 2020-02-10 11:50:55 +01:00
Mikhail Khludnev 07de70ba62 SOLR-14209: specify charset via String for Java 8. 2020-02-09 16:38:48 +03:00
David Smiley e6701680f4 SOLR-14149: CHANGES.txt Remove off-topic stuff
* No Introduction (to Solr) header.  Point at solr-upgrade-notes.adoc instead
* No Getting Started header
* No Versions of Major Components header
* No "Upgrade Notes" for subsequent releases.  See solr-upgrade-notes.adoc
Closes #1202

(cherry picked from commit 46c0945614)
2020-02-08 23:04:41 -05:00
Kevin Risden 056e2cc5da
SOLR-14038: Admin UI display for "state.json" should be in a scrollable region
Signed-off-by: Kevin Risden <krisden@apache.org>
2020-02-08 12:03:19 -06:00
Kevin Risden 8df7f379a4
SOLR-14209: Upgrade JQuery to 3.4.1
* JQuery 2.1.3 to 3.4.1
* jstree 1.0-rc1 to v3.3.8

Closes #1209

Signed-off-by: Kevin Risden <krisden@apache.org>
2020-02-08 11:59:25 -06:00
Shalin Shekhar Mangar e623eb5320 SOLR-14248: Improve ClusterStateMockUtil and make its methods public
(cherry picked from commit f5c132be6d)
2020-02-08 12:01:11 +05:30
Nicholas Knize 0bd2496205 LUCENE-9149: Increase data dimension limit in BKD 2020-02-07 16:57:32 -06:00
Ignacio Vera 5c1f2ca22a LUCENE-9194: Simplify XYShapeXQuery API by adding a new abstract class called XYGeometry 2020-02-07 19:30:53 +01:00
Andrzej Bialecki f8163439ff SOLR-14245: Validate Replica / ReplicaInfo on creation. 2020-02-07 17:57:31 +01:00
Adrien Grand d62f630765 SOLR-12930: Exclude dev-docs from binary archive. 2020-02-07 10:37:56 +01:00
Adrien Grand 6a380798a2 LUCENE-9147: Make sure temporary files get deleted on all code paths. 2020-02-06 17:13:22 +01:00
Alan Woodward aa916bac3c LUCENE-9099: Correctly handle repeats in ORDERED and UNORDERED intervals (#1097)
If you have repeating intervals in an ordered or unordered interval source, you currently 
get somewhat confusing behaviour:

* `ORDERED(a, a, b)` will return an extra interval over just a b if it first matches a a b, meaning
that you can get incorrect results if used in a `CONTAINING` filter - 
`CONTAINING(ORDERED(x, y), ORDERED(a, a, b))` will match on the document `a x a b y`
* `UNORDERED(a, a)` will match on documents that contain just a single a.

This commit adds a RepeatingIntervalsSource that correctly handles repeats within 
ordered and unordered sources. It also changes the way that gaps are calculated within 
ordered and unordered sources, by using a new width() method on IntervalIterator. The 
default implementation just returns end() - start() + 1, but RepeatingIntervalsSource 
instead returns the sum of the widths of its child iterators. This preserves maxgaps filtering 
on ordered and unordered sources that contain repeats.

In order to correctly handle matches in this scenario, IntervalsSource#matches now always 
returns an explicit IntervalsMatchesIterator rather than a plain MatchesIterator, which adds 
gaps() and width() methods so that submatches can be combined in the same way that 
subiterators are. Extra checks have been added to checkIntervals() to ensure that the same 
intervals are returned by both iterator and matches, and a fix to 
DisjunctionIntervalIterator#matches() is also included - DisjunctionIntervalIterator minimizes 
its intervals, while MatchesUtils.disjunction does not, so there was a discrepancy between 
the two methods.
2020-02-06 14:45:27 +00:00
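
The two width() rules named in this commit message can be sketched as follows: the default of end() - start() + 1, and the repeating case that sums the widths of its children. The classes Interval and Repeating are hypothetical stand-ins that use plain fields instead of the IntervalIterator API, so this is an illustration of the arithmetic rather than the actual implementation.

```java
import java.util.List;

/** Hypothetical illustration of the two width() rules; not the Lucene classes. */
final class IntervalWidthSketch {

    /** A minimal interval over token positions, inclusive at both ends. */
    static class Interval {
        final int start;
        final int end;

        Interval(int start, int end) {
            this.start = start;
            this.end = end;
        }

        /** Default rule from the commit message: end() - start() + 1. */
        int width() {
            return end - start + 1;
        }
    }

    /** Stand-in for a repeating source: spans its children but sums their widths. */
    static final class Repeating extends Interval {
        final List<Interval> children;

        Repeating(List<Interval> children) {
            super(children.get(0).start, children.get(children.size() - 1).end);
            this.children = children;
        }

        @Override
        int width() {
            int sum = 0;
            for (Interval child : children) {
                sum += child.width();
            }
            return sum;
        }
    }
}
```

For two matches of `a` at positions 0 and 2, the default rule applied to the enclosing span gives 2 - 0 + 1 = 3, whereas the summed width is 1 + 1 = 2. The position between the two matches is therefore not counted as part of the width.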