32112 Commits

Author SHA1 Message Date
Adrien Grand
a80b5164d1 LUCENE-8311: Add CHANGES entry. 2019-07-09 17:05:47 +02:00
Adrien Grand
aad4b411cd LUCENE-8900: Simplify MultiSorter. 2019-07-09 16:20:29 +02:00
Adrien Grand
d271770ed1 LUCENE-8311: Phrase impacts (#760) 2019-07-09 16:10:34 +02:00
Christine Poerschke
439d43056c Fix two 'against' typos. 2019-07-09 11:14:59 +01:00
Christine Poerschke
198cf0861d LUCENE-8766: fix in[di]vidually typo in package-info.java 2019-07-09 11:14:58 +01:00
Nicholas Knize
81c88e2df3 LUCENE-8632: New XYShape Field and Queries for indexing and searching general cartesian geometries
The LatLonShape field and LatLonShape query classes added the ability to index and search geospatial
geometries in the WGS-84 latitude, longitude coordinate reference system. The foundation for this
capability is provided by the Tessellator that converts an array of vertices describing a Point Line
or Polygon into a stream of 3 vertex triangles that are encoded as a seven dimension point and
indexed using the BKD POINT structure. A nice property of the Tessellator is that lat, lon
restrictions are artificial and really only bound by the API.

This commit builds on top of / abstracts the Tessellator LatLonShape and LatLonShapeQuery classes to
provide the ability to index & search general cartesian (non WGS84 lat,lon restricted) geometry.
It does so by introducing two new base classes: ShapeField and ShapeQuery that provide the indexing
and search foundation for LatLonShape and the LatLonShape derived query classes
(LatLonShapeBoundingBoxQuery, LatLonShapeLineQuery, LatLonShapePolygonQuery) and introducing a new
XYShape factory class along with XYShape derived query classes (XYShapeBoundingBoxQuery,
XYShapeLineQuery, XYShapePolygonQuery). The heart of the cartesian indexing is achieved through
XYShapeEncodingUtils that converts the double precision vertices into an integer encoded seven
dimension point (similar to LatLonShape).

The test framework is also further abstracted and extended to provide a full test suite for the
new XYShape capability that works the same way as the LatLonShape test suite (but applied to non
GIS geometries).
2019-07-08 15:19:03 -05:00
Chris Hostetter
4fd1850d2e SOLR-13599: additional 'checkpoint' logging to try and help diagnose strange failures
(cherry picked from commit b4a602f6b24196273adbdb7d47bf42fa8d08d807)
2019-07-08 08:42:07 -07:00
Adrien Grand
a329953429 LUCENE-8803: Change the way that reverse ordering is implemented.
This addresses some test failures when IndexSearcher is created with an executor
and merges hits with TopDocs#merge.
2019-07-08 10:25:48 +02:00
Anshum Gupta
5d3a84fcd0
SOLR-13507: Remove support for addr parameter from the /solr/admin/zookeeper endpoint. (#759) (#766) 2019-07-05 10:15:13 -07:00
Andrzej Bialecki
e616ed49a6 SOLR-13583: Return 400 Bad Request instead of 500 Server Error when a complex
alias is found but a simple alias was expected.
2019-07-05 09:22:00 +02:00
Adrien Grand
a0a1604352 LUCENE-8803: Ensure doc ID order is preserved in tests. 2019-07-05 09:12:50 +02:00
Noble Paul
cb1b86b80a SOLR-13538: toNativeType () TrieDate & EnumField do not handle CharSequence properly 2019-07-05 14:50:13 +10:00
S N Munendra
7ce347c573 SOLR-13538: Fix classcastEx in TrieDateField for atomic updates (#764)
SOLR-13538: toNativeType () TrieDate & EnumField do not handle CharSequence properly
2019-07-05 14:49:59 +10:00
Jan Høydahl
07cb7cf593 LUCENE-8852: Fix wording and git commit msg
(cherry picked from commit 36af839eb110bcf9174dbd2f6de5f20a9705318e)
2019-07-04 19:05:10 +02:00
Ignacio Vera
ef64f7af3e LUCENE-8888: Improve distribution of points with data dimensions in BKD tree leaves (#747) 2019-07-04 11:17:14 +02:00
Michael Sokolov
792871c480 LUCENE-8895: deprecate the other util.fst.Util.getByOutput override, too 2019-07-03 21:34:44 -04:00
Andrzej Bialecki
bf1d14539c SOLR-13003: Fix WrapperModel method delegation. 2019-07-03 20:18:27 +02:00
Andrzej Bialecki
9d52e5be3b SOLR-13003: Query Result Cache does not honour maxRamBytes parameter. 2019-07-03 18:27:21 +02:00
Tomoko Uchida
a471ece05b SOLR-13602: Update _default schema for unittests. 2019-07-04 00:10:30 +09:00
Munendra S N
fd93b43cf9 SOLR-13603: remove deprecated groupSpec methods usage
This is precursor to removing deprecated methods
2019-07-03 19:34:11 +05:30
Tomoko Uchida
d586c44a33 SOLR-13588: Document Estonian analyzer in Solr Ref Guide 2019-07-03 22:04:47 +09:00
Tomoko Uchida
bb074f6a8a SOLR-13602: Add a field type for Estonian language to default managed_schema 2019-07-03 21:00:43 +09:00
Andrzej Bialecki
30a5b2cd87 SOLR-13583: Impossible to delete a collection with the same name as an existing alias. 2019-07-03 13:36:01 +02:00
David Smiley
1c1d77a791 SOLR-13158: DIH: Add System property toggle for use of dataConfig param
(cherry picked from commit 325824cd391c8e71f36f17d687f52344e50e9715)
2019-07-03 00:03:29 -04:00
Michael Sokolov
ba8366facf LUCENE-8895: switch all FST usage to enable array-with-gaps encoding
also, deprecate unused Util.getByOutput
2019-07-02 18:26:20 -04:00
Mayya Sharipova
5974586b87 Load freqs lazily in Postings (#595)
Ensure that freqs are loaded lazily,
and loaded only when they are needed and
accessed through `freq()` method.
2019-07-02 18:59:36 +02:00
Andrzej Bialecki
c645997eb3 LUCENE-8898: Adjust the assertions - when CompressedOops are in use the estimation is less
accurate because RamUsageEstimator doesn't know the exact layout of data structures used in the map.
2019-07-02 15:39:55 +02:00
Joel Bernstein
240a94a3e3 SOLR-13589: Update CHANGES.txt 2019-07-02 08:56:55 -04:00
Ignacio Vera
d771d9109e LUCENE-8896: Override default implementation of IntersectVisitor#visit(DocIDSetBuilder, byte[]) for several queries (#756) 2019-07-02 09:12:00 +02:00
Adrien Grand
b7cec0a6fd LUCENE-8898: Mute TestRamUsageEstimator#testMap. 2019-07-02 07:58:13 +02:00
Adrien Grand
8448a33ed8 LUCENE-8757: Revert on 8.x. 2019-07-02 07:57:22 +02:00
Munendra S N
00d931aaa7 SOLR-13404: Fix NPE when group=true and no group.field is present
* This was introduced in SOLR-12249
2019-07-02 10:31:32 +05:30
Munendra S N
d811f86342 SOLR-13404: support group.query in multishard env with group.main=true
group.query after execution forms QueryCommandResult. In case of
group.main=true or group.format=simple, QueryCommandResult was not
consumed in EndResultTransformer. Also, MainEndResultTransformer assumed
that always group.field would be specified. When group.field not specified
it failed with AIOOBE. After adding suppport for QueryCommandResult in
EndResultTransformers and handling AIOOBE, group.query started giving results

Working on tests exposed few other issues. Results differed b/w standalone
& distributed mode.
* One of the reason is that TopGroupShardResponseProcessor doesn't consider correct
  limit and offset when group format is simple. In case of simple, start and rows should be used
  as limit and offset instead of group.limit and group.offset.
* Secondly, In distributed second phase grouping, computing docsToCollect didn't consider
  group response format. This issue is again similar to above issue
* offset(group.offset or start) not being considered during TopDocs#merge caused
  different results. The fix was to use to offset in merge process
* group.offset doesn't support negative values but there is no checks on the value.
  In case of negative values AIOOBE. Now, checks are added for negative values and
  returns proper error message(this change is for both standalone and distrbuted).
  Validation is done only in case of group.format=grouped as that is only case when
  group.offset is consumed.

Fixing above issues resolved the differences b/w standalone and distributed mode.
2019-07-02 08:24:00 +05:30
Joel Bernstein
36ac878ea7 SOLR-13589: Fix precommit 2019-07-01 21:57:25 -04:00
Joel Bernstein
7e0df16220 SOLR-13589: Add zplot cluster test case 2019-07-01 21:57:09 -04:00
Joel Bernstein
c6cc2fd9fd SOLR-13589: Allow zplot to visualize clusters and convex hulls 2019-07-01 21:56:47 -04:00
Munendra S N
fc15cd79f7 SOLR-12554: Expose IndexWriterConfig's ramPerThreadHardLimitMB
* When ramPerThreadHardLimitMB is not specified, then Lucene's
  default value 1945 is used. The specified value should be
  greater than 0 and less than 2048MB
2019-07-01 23:47:26 +05:30
Munendra S N
c8190e9c3b LUCENE-8892: add missing closing parentheses in MultiBoolFunction's description() 2019-07-01 22:25:21 +05:30
Cassandra Targett
57430f6b9f Update Ruby version in Ref Guide build script 2019-07-01 11:37:11 -05:00
Jason Gerlowski
5abafaa4a3 SOLR-13539: Fix mv update of UUID, enum, bool and binary fields
Co-Authored-By: Thomas Wockinger
2019-07-01 08:54:45 -04:00
Atri Sharma
3a2f1fd19b LUCENE-8862: Introduce Collector Level Memory Accounting (#729)
This commit introduces a mechanism to track the dynamic memory
utilization of Collectors and allow setting collector level limits
for memory usage. A new bitset collecting Collector is introduced
which supports the memory tracker functionality
2019-07-01 13:47:34 +02:00
Alan Woodward
518b6b5aac LUCENE-8893: Intervals.prefix() and Intervals.wildcard() should take BytesRef 2019-07-01 09:06:20 +01:00
Ignacio Vera
f5991d96ba LUCENE-8885: Optimise BKD reader by exploiting cardinality information stored on leaves (#746)
The commit adds the method InstersectVisitor#visit(DocIdSetIterator, byte[]).
2019-07-01 06:16:06 +02:00
Jason Gerlowski
60ec073b27 Add missing predefined RBAP perms to ref-guide 2019-06-30 07:29:09 -04:00
Mikhail Khludnev
9b515deb64 SOLR-13577: wait jetty to stop as @hossman advised. 2019-06-30 09:46:46 +03:00
Tomoko Uchida
2804d00137 LUCENE-8874: Show SPI names instead of class names in Luke Analysis tab. 2019-06-30 13:40:12 +09:00
Tomoko Uchida
7e05bd7173 LUCENE-8894: Add APIs to tokenizer/charfilter/tokenfilter factories to get their SPI names from concrete classes 2019-06-30 12:24:11 +09:00
Gert Morten Paimla
2df6ea2305 LUCENE-8891: Add snowball stemmer and analyzer for Estonian language.
Signed-off-by: Tomoko Uchida <tomoko@apache.org>
2019-06-30 09:17:32 +09:00
Michael Sokolov
a9e37a6a7f LUCENE-8871: promote kuromoji tools to main jar 2019-06-29 16:20:33 -04:00
Munendra S N
9e40748486 SOLR-12364: add test cases for edismax boost
* This adds tests for bf and boosts
* Use expectThrows in edismax tests to verify exception
2019-06-29 22:34:03 +05:30