Commit Graph

32158 Commits

Author SHA1 Message Date
Christine Poerschke 198cf0861d LUCENE-8766: fix in[di]vidually typo in package-info.java 2019-07-09 11:14:58 +01:00
Nicholas Knize 81c88e2df3 LUCENE-8632: New XYShape Field and Queries for indexing and searching general cartesian geometries
The LatLonShape field and LatLonShape query classes added the ability to index and search geospatial
geometries in the WGS-84 latitude, longitude coordinate reference system. The foundation for this
capability is provided by the Tessellator, which converts an array of vertices describing a Point, Line,
or Polygon into a stream of 3-vertex triangles that are encoded as a seven-dimension point and
indexed using the BKD POINT structure. A nice property of the Tessellator is that lat, lon
restrictions are artificial and really only bound by the API.

This commit builds on and abstracts the Tessellator, LatLonShape, and LatLonShapeQuery classes to
provide the ability to index and search general cartesian (non WGS84 lat,lon restricted) geometry.
It does so by introducing two new base classes, ShapeField and ShapeQuery, that provide the indexing
and search foundation for LatLonShape and the LatLonShape derived query classes
(LatLonShapeBoundingBoxQuery, LatLonShapeLineQuery, LatLonShapePolygonQuery), and by introducing a new
XYShape factory class along with XYShape derived query classes (XYShapeBoundingBoxQuery,
XYShapeLineQuery, XYShapePolygonQuery). The heart of the cartesian indexing is
XYShapeEncodingUtils, which converts the double precision vertices into an integer-encoded
seven-dimension point (similar to LatLonShape).

The test framework is also further abstracted and extended to provide a full test suite for the
new XYShape capability that works the same way as the LatLonShape test suite (but applied to non
GIS geometries).
2019-07-08 15:19:03 -05:00
Chris Hostetter 4fd1850d2e SOLR-13599: additional 'checkpoint' logging to try and help diagnose strange failures
(cherry picked from commit b4a602f6b2)
2019-07-08 08:42:07 -07:00
Adrien Grand a329953429 LUCENE-8803: Change the way that reverse ordering is implemented.
This addresses some test failures when IndexSearcher is created with an executor
and merges hits with TopDocs#merge.
2019-07-08 10:25:48 +02:00
Anshum Gupta 5d3a84fcd0 SOLR-13507: Remove support for addr parameter from the /solr/admin/zookeeper endpoint. (#759) (#766) 2019-07-05 10:15:13 -07:00
Andrzej Bialecki e616ed49a6 SOLR-13583: Return 400 Bad Request instead of 500 Server Error when a complex
alias is found but a simple alias was expected.
2019-07-05 09:22:00 +02:00
Adrien Grand a0a1604352 LUCENE-8803: Ensure doc ID order is preserved in tests. 2019-07-05 09:12:50 +02:00
Noble Paul cb1b86b80a SOLR-13538: toNativeType() in TrieDate & EnumField does not handle CharSequence properly 2019-07-05 14:50:13 +10:00
S N Munendra 7ce347c573 SOLR-13538: Fix ClassCastException in TrieDateField for atomic updates (#764)
SOLR-13538: toNativeType() in TrieDate & EnumField does not handle CharSequence properly
2019-07-05 14:49:59 +10:00
Jan Høydahl 07cb7cf593 LUCENE-8852: Fix wording and git commit msg
(cherry picked from commit 36af839eb1)
2019-07-04 19:05:10 +02:00
Ignacio Vera ef64f7af3e LUCENE-8888: Improve distribution of points with data dimensions in BKD tree leaves (#747) 2019-07-04 11:17:14 +02:00
Michael Sokolov 792871c480 LUCENE-8895: deprecate the other util.fst.Util.getByOutput override, too 2019-07-03 21:34:44 -04:00
Andrzej Bialecki bf1d14539c SOLR-13003: Fix WrapperModel method delegation. 2019-07-03 20:18:27 +02:00
Andrzej Bialecki 9d52e5be3b SOLR-13003: Query Result Cache does not honour maxRamBytes parameter. 2019-07-03 18:27:21 +02:00
Tomoko Uchida a471ece05b SOLR-13602: Update _default schema for unittests. 2019-07-04 00:10:30 +09:00
Munendra S N fd93b43cf9 SOLR-13603: remove deprecated groupSpec methods usage
This is a precursor to removing the deprecated methods
2019-07-03 19:34:11 +05:30
Tomoko Uchida d586c44a33 SOLR-13588: Document Estonian analyzer in Solr Ref Guide 2019-07-03 22:04:47 +09:00
Tomoko Uchida bb074f6a8a SOLR-13602: Add a field type for Estonian language to default managed_schema 2019-07-03 21:00:43 +09:00
Andrzej Bialecki 30a5b2cd87 SOLR-13583: Impossible to delete a collection with the same name as an existing alias. 2019-07-03 13:36:01 +02:00
David Smiley 1c1d77a791 SOLR-13158: DIH: Add System property toggle for use of dataConfig param
(cherry picked from commit 325824cd39)
2019-07-03 00:03:29 -04:00
Michael Sokolov ba8366facf LUCENE-8895: switch all FST usage to enable array-with-gaps encoding
also, deprecate unused Util.getByOutput
2019-07-02 18:26:20 -04:00
Mayya Sharipova 5974586b87 Load freqs lazily in Postings (#595)
Ensure that freqs are loaded lazily,
and loaded only when they are needed and
accessed through `freq()` method.
2019-07-02 18:59:36 +02:00
Andrzej Bialecki c645997eb3 LUCENE-8898: Adjust the assertions - when CompressedOops are in use the estimation is less
accurate because RamUsageEstimator doesn't know the exact layout of data structures used in the map.
2019-07-02 15:39:55 +02:00
Joel Bernstein 240a94a3e3 SOLR-13589: Update CHANGES.txt 2019-07-02 08:56:55 -04:00
Ignacio Vera d771d9109e LUCENE-8896: Override default implementation of IntersectVisitor#visit(DocIDSetBuilder, byte[]) for several queries (#756) 2019-07-02 09:12:00 +02:00
Adrien Grand b7cec0a6fd LUCENE-8898: Mute TestRamUsageEstimator#testMap. 2019-07-02 07:58:13 +02:00
Adrien Grand 8448a33ed8 LUCENE-8757: Revert on 8.x. 2019-07-02 07:57:22 +02:00
Munendra S N 00d931aaa7 SOLR-13404: Fix NPE when group=true and no group.field is present
* This was introduced in SOLR-12249
2019-07-02 10:31:32 +05:30
Munendra S N d811f86342 SOLR-13404: support group.query in multishard env with group.main=true
group.query after execution forms a QueryCommandResult. In case of
group.main=true or group.format=simple, QueryCommandResult was not
consumed in EndResultTransformer. Also, MainEndResultTransformer assumed
that group.field would always be specified. When group.field was not specified,
it failed with AIOOBE. After adding support for QueryCommandResult in
EndResultTransformers and handling the AIOOBE, group.query started giving results.

Working on tests exposed a few other issues. Results differed between standalone
and distributed mode.
* One reason is that TopGroupShardResponseProcessor doesn't consider the correct
  limit and offset when the group format is simple. In the simple case, start and rows should be used
  as limit and offset instead of group.limit and group.offset.
* Secondly, in distributed second-phase grouping, computing docsToCollect didn't consider
  the group response format. This issue is similar to the one above.
* The offset (group.offset or start) was not considered during TopDocs#merge, which caused
  different results. The fix was to use the offset in the merge process.
* group.offset doesn't support negative values, but there were no checks on the value.
  Negative values caused an AIOOBE. Now checks are added for negative values, and
  a proper error message is returned (this change is for both standalone and distributed).
  Validation is done only in case of group.format=grouped, as that is the only case where
  group.offset is consumed.

Fixing the above issues resolved the differences between standalone and distributed mode.
2019-07-02 08:24:00 +05:30
Joel Bernstein 36ac878ea7 SOLR-13589: Fix precommit 2019-07-01 21:57:25 -04:00
Joel Bernstein 7e0df16220 SOLR-13589: Add zplot cluster test case 2019-07-01 21:57:09 -04:00
Joel Bernstein c6cc2fd9fd SOLR-13589: Allow zplot to visualize clusters and convex hulls 2019-07-01 21:56:47 -04:00
Munendra S N fc15cd79f7 SOLR-12554: Expose IndexWriterConfig's ramPerThreadHardLimitMB
* When ramPerThreadHardLimitMB is not specified, then Lucene's
  default value 1945 is used. The specified value should be
  greater than 0 and less than 2048MB
2019-07-01 23:47:26 +05:30
Munendra S N c8190e9c3b LUCENE-8892: add missing closing parentheses in MultiBoolFunction's description() 2019-07-01 22:25:21 +05:30
Cassandra Targett 57430f6b9f Update Ruby version in Ref Guide build script 2019-07-01 11:37:11 -05:00
Jason Gerlowski 5abafaa4a3 SOLR-13539: Fix mv update of UUID, enum, bool and binary fields
Co-Authored-By: Thomas Wockinger
2019-07-01 08:54:45 -04:00
Atri Sharma 3a2f1fd19b LUCENE-8862: Introduce Collector Level Memory Accounting (#729)
This commit introduces a mechanism to track the dynamic memory
utilization of Collectors and allow setting collector level limits
for memory usage. A new bitset collecting Collector is introduced
which supports the memory tracker functionality
2019-07-01 13:47:34 +02:00
Alan Woodward 518b6b5aac LUCENE-8893: Intervals.prefix() and Intervals.wildcard() should take BytesRef 2019-07-01 09:06:20 +01:00
Ignacio Vera f5991d96ba LUCENE-8885: Optimise BKD reader by exploiting cardinality information stored on leaves (#746)
The commit adds the method IntersectVisitor#visit(DocIdSetIterator, byte[]).
2019-07-01 06:16:06 +02:00
Jason Gerlowski 60ec073b27 Add missing predefined RBAP perms to ref-guide 2019-06-30 07:29:09 -04:00
Mikhail Khludnev 9b515deb64 SOLR-13577: wait for jetty to stop as @hossman advised. 2019-06-30 09:46:46 +03:00
Tomoko Uchida 2804d00137 LUCENE-8874: Show SPI names instead of class names in Luke Analysis tab. 2019-06-30 13:40:12 +09:00
Tomoko Uchida 7e05bd7173 LUCENE-8894: Add APIs to tokenizer/charfilter/tokenfilter factories to get their SPI names from concrete classes 2019-06-30 12:24:11 +09:00
Gert Morten Paimla 2df6ea2305 LUCENE-8891: Add snowball stemmer and analyzer for Estonian language.
Signed-off-by: Tomoko Uchida <tomoko@apache.org>
2019-06-30 09:17:32 +09:00
Michael Sokolov a9e37a6a7f LUCENE-8871: promote kuromoji tools to main jar 2019-06-29 16:20:33 -04:00
Munendra S N 9e40748486 SOLR-12364: add test cases for edismax boost
* This adds tests for bf and boosts
* Use expectThrows in edismax tests to verify exception
2019-06-29 22:34:03 +05:30
Munendra S N 6c83e39dcc SOLR-9409: improve error message on unsupported types in collapsing
* Improve error message when collapsing is not supported on the given
  fieldtype
* Return 400 error code when unsupported values are passed for max, min
  or in case of syntax error
2019-06-29 21:53:43 +05:30
Chris Hostetter 881aabe28a SOLR-13580: update test to account for different versions of java using different locale specific numeric formatting characters
(cherry picked from commit 8b72e91df7)
2019-06-28 12:11:22 -07:00
Christine Poerschke 328db38d71 SOLR-13576: Factor out a TopGroupsShardResponseProcessor.fillResultIds method. (Christine Poerschke, Diego Ceccarelli) 2019-06-28 19:00:08 +01:00
Christine Poerschke 07cf48816f SOLR-13280: Strengthen ScheduledTrigger's preferredOperation parameter validation. 2019-06-28 19:00:08 +01:00