Commit Graph

32553 Commits

Author SHA1 Message Date
Adrien Grand fb30ded643 Ensure tokens have at least 2 chars, otherwise norms are incorrect. 2019-07-10 09:09:54 +02:00
Munendra S N 8322ea55f4 SOLR-11263: add test for payload func with undefined field
Payload func with undefined used to throw NPE. In SOLR-11610, this
was fixed to return proper error but there are no tests to verify
changed the behavior.
This add simple test to verify error code and error message
2019-07-10 10:17:55 +05:30
Nicholas Knize 14b67012d0 LUCENE-8632: Fix EdgeTree.internalComponentRelateTriangle to pass correct line segment vertices 2019-07-09 13:49:54 -05:00
Adrien Grand 437090c302 LUCENE-8311: Add CHANGES entry. 2019-07-09 16:20:06 +02:00
Adrien Grand 17f2f96a5c LUCENE-8900: Simplify MultiSorter. 2019-07-09 16:18:47 +02:00
Adrien Grand cfac486afd
LUCENE-8311: Phrase impacts (#760) 2019-07-09 16:01:29 +02:00
Christine Poerschke bf9a7e2626 Fix two 'against' typos. 2019-07-09 10:56:49 +01:00
Christine Poerschke b6add7ac90 LUCENE-8766: fix in[di]vidually typo in package-info.java 2019-07-09 10:56:49 +01:00
Nicholas Knize 0c09481374 LUCENE-8632: New XYShape Field and Queries for indexing and searching general cartesian geometries
The LatLonShape field and LatLonShape query classes added the ability to index and search geospatial
geometries in the WGS-84 latitude, longitude coordinate reference system. The foundation for this
capability is provided by the Tessellator that converts an array of vertices describing a Point Line
or Polygon into a stream of 3 vertex triangles that are encoded as a seven dimension point and
indexed using the BKD POINT structure. A nice property of the Tessellator is that lat, lon
restrictions are artificial and really only bound by the API.

This commit builds on top of / abstracts the Tessellator LatLonShape and LatLonShapeQuery classes to
provide the ability to index & search general cartesian (non WGS84 lat,lon restricted) geometry.
It does so by introducing two new base classes: ShapeField and ShapeQuery that provide the indexing
and search foundation for LatLonShape and the LatLonShape derived query classes
(LatLonShapeBoundingBoxQuery, LatLonShapeLineQuery, LatLonShapePolygonQuery) and introducing a new
XYShape factory class along with XYShape derived query classes (XYShapeBoundingBoxQuery,
XYShapeLineQuery, XYShapePolygonQuery). The heart of the cartesian indexing is achieved through
XYShapeEncodingUtils that converts the double precision vertices into an integer encoded seven
dimension point (similar to LatLonShape).

The test framework is also further abstracted and extended to provide a full test suite for the
new XYShape capability that works the same way as the LatLonShape test suite (but applied to non
GIS geometries).
2019-07-08 14:59:37 -05:00
Adrien Grand ac209b637d LUCENE-8803: Change the way that reverse ordering is implemented.
This addresses some test failures when IndexSearcher is created with an executor
and merges hits with TopDocs#merge.
2019-07-08 10:25:55 +02:00
Andrzej Bialecki dd4813d5b8 SOLR-13583: Return 400 Bad Request instead of 500 Server Error when a complex
alias is found but a simple alias was expected.
2019-07-05 09:19:32 +02:00
Adrien Grand eff574f8b3 LUCENE-8803: Ensure doc ID order is preserved in tests. 2019-07-05 09:13:22 +02:00
Noble Paul 404cf4947d
SOLR-13538: toNativeType () TrieDate & EnumField do not handle CharSequence properly 2019-07-05 14:47:31 +10:00
S N Munendra 5f1d342a96 SOLR-13538: Fix classcastEx in TrieDateField for atomic updates (#764)
SOLR-13538: toNativeType () TrieDate & EnumField do not handle CharSequence properly
2019-07-05 14:43:12 +10:00
Jan Høydahl 36af839eb1 LUCENE-8852: Fix wording and git commit msg 2019-07-04 19:04:24 +02:00
Ignacio Vera 5bf6cf2edd
LUCENE-8888: Improve distribution of points with data dimensions in BKD tree leaves (#747) 2019-07-04 10:50:23 +02:00
Michael Sokolov 8bf5e8be64 LUCENE-8895: deprecate the other util.fst.Util.getByOutput override, too 2019-07-03 21:33:33 -04:00
Andrzej Bialecki 66812b2a67 SOLR-13003: Fix WrapperModel method delegation. 2019-07-03 20:17:46 +02:00
Anshum Gupta b7090d9c25
SOLR-13507: Remove support for addr parameter from the /solr/admin/zookeeper endpoint. (#759) 2019-07-03 10:50:01 -07:00
Munendra S N 96860eb181 SOLR-13596: remove deprecated groupSpec methods 2019-07-03 20:55:56 +05:30
Tomoko Uchida 2e3e6c5944 SOLR-13602: Update _default schema for unittests. 2019-07-04 00:08:13 +09:00
Andrzej Bialecki 5897787291 SOLR-13003: Query Result Cache does not honour maxRamBytes parameter. 2019-07-03 17:03:43 +02:00
Munendra S N e3d247f288 SOLR-13603: remove deprecated groupSpec methods usage
This is precursor to removing deprecated methods
2019-07-03 19:33:08 +05:30
Andrzej Bialecki 717afb11fc SOLR-13583: Impossible to delete a collection with the same name as an existing alias. 2019-07-03 15:52:36 +02:00
Tomoko Uchida 9d2f516357 SOLR-13588: Document Estonian analyzer in Solr Ref Guide 2019-07-03 22:02:47 +09:00
Tomoko Uchida 45ea46a425 SOLR-13602: Add a field type for Estonian language to default managed_schema 2019-07-03 20:57:38 +09:00
David Smiley 325824cd39 SOLR-13158: DIH: Add System property toggle for use of dataConfig param 2019-07-02 23:59:39 -04:00
Michael Sokolov 81e63e8fec LUCENE-8895: switch all FST usage to enable array-with-gaps encoding
also, deprecate unused Util.getByOutput
2019-07-02 18:21:13 -04:00
Chris Hostetter b4a602f6b2 SOLR-13599: additional 'checkpoint' logging to try and help diagnose strange failures 2019-07-02 14:52:13 -07:00
Atri Sharma f686167e71 LUCENE-8857: Introduce Custom Tiebreakers in TopDocs#merge (#757)
This commit introduces custom tiebreakers which allows users to
specify custom tiebreakers when ordering hits to return. A
default tiebreaker is introduced for tie breaking on shard index
first and then docID.
2019-07-02 20:35:18 +02:00
Mayya Sharipova 13ef5032fe Load freqs lazily in Postings (#595)
Ensure that freqs are loaded lazily,
and loaded only when they are needed and
accessed through `freq()` method.
2019-07-02 18:58:58 +02:00
Andrzej Bialecki 7ec41d13e6 LUCENE-8898: Adjust the assertions - when CompressedOops are in use the estimation is less
accurate because RamUsageEstimator doesn't know the exact layout of data structures used in the map.
2019-07-02 15:37:29 +02:00
Joel Bernstein c478193004 SOLR-13589: Update CHANGES.txt 2019-07-02 08:55:48 -04:00
Ignacio Vera 7fc9b4976e
LUCENE-8896: Override default implementation of IntersectVisitor#visit(DocIDSetBuilder, byte[]) for several queries (#756) 2019-07-02 08:51:41 +02:00
Adrien Grand 5e109fb0a7 LUCENE-8757: Move changes entry. 2019-07-02 07:59:01 +02:00
Adrien Grand 2ae69333bf LUCENE-8898: Mute TestRamUsageEstimator#testMap. 2019-07-02 07:57:38 +02:00
Munendra S N 48b026d5ee SOLR-13404: Fix NPE when group=true and no group.field is present
* This was introduced in SOLR-12249
2019-07-02 10:27:54 +05:30
Munendra S N cfd22cd493 SOLR-13404: support group.query in multishard env with group.main=true
group.query after execution forms QueryCommandResult. In case of
group.main=true or group.format=simple, QueryCommandResult was not
consumed in EndResultTransformer. Also, MainEndResultTransformer assumed
that always group.field would be specified. When group.field not specified
it failed with AIOOBE. After adding suppport for QueryCommandResult in
EndResultTransformers and handling AIOOBE, group.query started giving results

Working on tests exposed few other issues. Results differed b/w standalone
& distributed mode.
* One of the reason is that TopGroupShardResponseProcessor doesn't consider correct
  limit and offset when group format is simple. In case of simple, start and rows should be used
  as limit and offset instead of group.limit and group.offset.
* Secondly, In distributed second phase grouping, computing docsToCollect didn't consider
  group response format. This issue is again similar to above issue
* offset(group.offset or start) not being considered during TopDocs#merge caused
  different results. The fix was to use to offset in merge process
* group.offset doesn't support negative values but there is no checks on the value.
  In case of negative values AIOOBE. Now, checks are added for negative values and
  returns proper error message(this change is for both standalone and distrbuted).
  Validation is done only in case of group.format=grouped as that is only case when
  group.offset is consumed.

Fixing above issues resolved the differences b/w standalone and distributed mode.
2019-07-02 08:11:13 +05:30
Joel Bernstein 96d11063a7 SOLR-13589: Fix precommit 2019-07-01 21:54:25 -04:00
Joel Bernstein 6a99151eae SOLR-13589: Add zplot cluster test case 2019-07-01 21:39:45 -04:00
Joel Bernstein 2f6a681b39 SOLR-13589: Allow zplot to visualize clusters and convex hulls 2019-07-01 21:39:45 -04:00
Munendra S N 0e877aac34 SOLR-12554: Expose IndexWriterConfig's ramPerThreadHardLimitMB
* When ramPerThreadHardLimitMB is not specified, then Lucene's
  default value 1945 is used. The specified value should be
  greater than 0 and less than 2048MB
2019-07-01 22:42:19 +05:30
Munendra S N dc16e2707b LUCENE-8892: add missing closing parentheses in MultiBoolFunction's description() 2019-07-01 21:55:34 +05:30
Cassandra Targett 82bf95727e Update Ruby version in Ref Guide build script 2019-07-01 10:25:16 -05:00
Adrien Grand 3f0ecfa9c4 Revert "LUCENE-8857: Introduce Custom Tiebreakers in TopDocs#merge (#734)"
This reverts commit e70b43c39a.
2019-07-01 16:53:23 +02:00
Atri Sharma e70b43c39a LUCENE-8857: Introduce Custom Tiebreakers in TopDocs#merge (#734)
This commit introduces custom tiebreakers which allows users to
specify custom tiebreakers when ordering hits to return. A
default tiebreaker is introduced for tie breaking on shard index
first and then docID.
2019-07-01 15:56:52 +02:00
Jason Gerlowski 8242e6ce1d SOLR-13539: Fix mv update of UUID, enum, bool and binary fields
Co-Authored-By: Thomas Wockinger
2019-07-01 08:53:54 -04:00
Atri Sharma 6799ca6efd LUCENE-8862: Introduce Collector Level Memory Accounting (#729)
This commit introduces a mechanism to track the dynamic memory
utilization of Collectors and allow setting collector level limits
for memory usage. A new bitset collecting Collector is introduced
which supports the memory tracker functionality
2019-07-01 13:47:07 +02:00
Alan Woodward 6966b49a6b LUCENE-8893: Intervals.prefix() and Intervals.wildcard() should take BytesRef 2019-07-01 08:52:56 +01:00
Ignacio Vera db68634c67
LUCENE-8885: Optimise BKD reader by exploiting cardinality information stored on leaves (#746)
The commit adds the method InstersectVisitor#visit(DocIdSetIterator, byte[]).
2019-07-01 06:15:03 +02:00