Commit Graph

11342 Commits

Author SHA1 Message Date
Atri Sharma 42fadbff79 LUCENE-8915 : Improve Javadocs for RateLimiter and SimpleRateLimiter (#789) 2019-07-26 11:29:21 +02:00
iverase 846d18814a Add back-compat indices for 8.2.0 2019-07-26 10:23:21 +02:00
Ishan Chattopadhyaya 90dd3f768f LUCENE-8927: Fixing precommit / removing unused import 2019-07-25 19:02:03 -04:00
Atri Sharma 3e2ca05b8e LUCENE-8927: Set.copyOf and Set.of instead of Collections.unmodifiabl… (#796) 2019-07-25 20:15:29 +02:00
Atri Sharma 29f941baa2 LUCENE-8931: Remove Custom ScoreDoc Equality Method (#806) 2019-07-25 19:59:40 +02:00
Adrien Grand 1ea8419336 LUCENE-8922: Better impacts for DisjunctionMaxQuery. (#791) 2019-07-24 15:12:57 +02:00
Namgyu Kim b756cccf44 LUCENE-8912: remove nori/tools dependency on ICU 2019-07-23 23:59:30 +09:00
Namgyu Kim c5741c201f Revert "remove nori/tools dependency on ICU"
This reverts commit dd7b3d8d95.
2019-07-23 23:36:26 +09:00
Namgyu Kim dd7b3d8d95 remove nori/tools dependency on ICU 2019-07-23 00:08:01 +09:00
Alan Woodward 1ccef96767 LUCENE-8916: GraphTokenStreamFiniteStrings preserves all attributes 2019-07-19 11:46:25 +01:00
Alan Woodward 15b08f9a06 LUCENE-8918: Disallow null terms in PhraseQuery 2019-07-18 10:10:36 +01:00
iverase 0de627ee26 LUCENE-8913: Fix test bug in BaseLatLonShapeTestCase#testBoxQueryEqualsAndHashcode 2019-07-18 07:20:51 +02:00
Munendra S N 6104f55ac0 LUCENE-8909: remove deprecated IndexWriter#getFieldNames() 2019-07-18 10:03:17 +05:30
Ignacio Vera f026053d4d LUCENE-8914: Move compare logic to IntersectVisitor in FloatPointNearestNeighbor (#783)
Move the logic for discarding inner nodes to the IntersectVisitor so we take advantage of the change introduced in LUCENE-7862
2019-07-17 14:14:17 +02:00
iverase 41ae03a9a0 LUCENE-8923: Add Lucene-8.3 entry in CHANGES.txt 2019-07-17 13:53:21 +02:00
Adrien Grand 607c46c997 LUCENE-8811: Undo removal of deprecations. 2019-07-15 13:52:45 +02:00
Namgyu Kim 2ac5fb668a LUCENE-8904: enhance Nori DictionaryBuilder tool 2019-07-11 23:59:34 +09:00
Tomoko Uchida 6d79cc9e44 LUCENE-8907: Move change logs for LUCENE-8778 and following issues to the 9.0.0 updates section. 2019-07-11 23:50:34 +09:00
Atri Sharma ee79a20174 LUCENE-8875: Introduce Optimized Collector For Large Number Of Hits (#754)
This commit introduces a new collector which is optimized for
cases when the number of hits is large and/or the actual hits
collected are sparse in comparison to the number of hits
requested.
2019-07-10 19:17:52 +02:00
Adrien Grand fb30ded643 Ensure tokens have at least 2 chars, otherwise norms are incorrect. 2019-07-10 09:09:54 +02:00
Nicholas Knize 14b67012d0 LUCENE-8632: Fix EdgeTree.internalComponentRelateTriangle to pass correct line segment vertices 2019-07-09 13:49:54 -05:00
Adrien Grand 437090c302 LUCENE-8311: Add CHANGES entry. 2019-07-09 16:20:06 +02:00
Adrien Grand 17f2f96a5c LUCENE-8900: Simplify MultiSorter. 2019-07-09 16:18:47 +02:00
Adrien Grand cfac486afd LUCENE-8311: Phrase impacts (#760) 2019-07-09 16:01:29 +02:00
Christine Poerschke bf9a7e2626 Fix two 'against' typos. 2019-07-09 10:56:49 +01:00
Christine Poerschke b6add7ac90 LUCENE-8766: fix in[di]vidually typo in package-info.java 2019-07-09 10:56:49 +01:00
Nicholas Knize 0c09481374 LUCENE-8632: New XYShape Field and Queries for indexing and searching general cartesian geometries
The LatLonShape field and LatLonShape query classes added the ability to index and search geospatial
geometries in the WGS-84 latitude, longitude coordinate reference system. The foundation for this
capability is provided by the Tessellator that converts an array of vertices describing a Point Line
or Polygon into a stream of 3 vertex triangles that are encoded as a seven dimension point and
indexed using the BKD POINT structure. A nice property of the Tessellator is that lat, lon
restrictions are artificial and really only bound by the API.

This commit builds on top of / abstracts the Tessellator LatLonShape and LatLonShapeQuery classes to
provide the ability to index & search general cartesian (non WGS84 lat,lon restricted) geometry.
It does so by introducing two new base classes: ShapeField and ShapeQuery that provide the indexing
and search foundation for LatLonShape and the LatLonShape derived query classes
(LatLonShapeBoundingBoxQuery, LatLonShapeLineQuery, LatLonShapePolygonQuery) and introducing a new
XYShape factory class along with XYShape derived query classes (XYShapeBoundingBoxQuery,
XYShapeLineQuery, XYShapePolygonQuery). The heart of the cartesian indexing is achieved through
XYShapeEncodingUtils that converts the double precision vertices into an integer encoded seven
dimension point (similar to LatLonShape).

The test framework is also further abstracted and extended to provide a full test suite for the
new XYShape capability that works the same way as the LatLonShape test suite (but applied to non
GIS geometries).
2019-07-08 14:59:37 -05:00
Adrien Grand ac209b637d LUCENE-8803: Change the way that reverse ordering is implemented.
This addresses some test failures when IndexSearcher is created with an executor
and merges hits with TopDocs#merge.
2019-07-08 10:25:55 +02:00
Adrien Grand eff574f8b3 LUCENE-8803: Ensure doc ID order is preserved in tests. 2019-07-05 09:13:22 +02:00
Ignacio Vera 5bf6cf2edd LUCENE-8888: Improve distribution of points with data dimensions in BKD tree leaves (#747) 2019-07-04 10:50:23 +02:00
Michael Sokolov 8bf5e8be64 LUCENE-8895: deprecate the other util.fst.Util.getByOutput override, too 2019-07-03 21:33:33 -04:00
Andrzej Bialecki 5897787291 SOLR-13003: Query Result Cache does not honour maxRamBytes parameter. 2019-07-03 17:03:43 +02:00
Michael Sokolov 81e63e8fec LUCENE-8895: switch all FST usage to enable array-with-gaps encoding
also, deprecate unused Util.getByOutput
2019-07-02 18:21:13 -04:00
Atri Sharma f686167e71 LUCENE-8857: Introduce Custom Tiebreakers in TopDocs#merge (#757)
This commit introduces custom tiebreakers which allows users to
specify custom tiebreakers when ordering hits to return. A
default tiebreaker is introduced for tie breaking on shard index
first and then docID.
2019-07-02 20:35:18 +02:00
Mayya Sharipova 13ef5032fe Load freqs lazily in Postings (#595)
Ensure that freqs are loaded lazily,
and loaded only when they are needed and
accessed through `freq()` method.
2019-07-02 18:58:58 +02:00
Andrzej Bialecki 7ec41d13e6 LUCENE-8898: Adjust the assertions - when CompressedOops are in use the estimation is less
accurate because RamUsageEstimator doesn't know the exact layout of data structures used in the map.
2019-07-02 15:37:29 +02:00
Ignacio Vera 7fc9b4976e LUCENE-8896: Override default implementation of IntersectVisitor#visit(DocIDSetBuilder, byte[]) for several queries (#756) 2019-07-02 08:51:41 +02:00
Adrien Grand 5e109fb0a7 LUCENE-8757: Move changes entry. 2019-07-02 07:59:01 +02:00
Adrien Grand 2ae69333bf LUCENE-8898: Mute TestRamUsageEstimator#testMap. 2019-07-02 07:57:38 +02:00
Munendra S N dc16e2707b LUCENE-8892: add missing closing parentheses in MultiBoolFunction's description() 2019-07-01 21:55:34 +05:30
Adrien Grand 3f0ecfa9c4 Revert "LUCENE-8857: Introduce Custom Tiebreakers in TopDocs#merge (#734)"
This reverts commit e70b43c39a.
2019-07-01 16:53:23 +02:00
Atri Sharma e70b43c39a LUCENE-8857: Introduce Custom Tiebreakers in TopDocs#merge (#734)
This commit introduces custom tiebreakers which allows users to
specify custom tiebreakers when ordering hits to return. A
default tiebreaker is introduced for tie breaking on shard index
first and then docID.
2019-07-01 15:56:52 +02:00
Atri Sharma 6799ca6efd LUCENE-8862: Introduce Collector Level Memory Accounting (#729)
This commit introduces a mechanism to track the dynamic memory
utilization of Collectors and allow setting collector level limits
for memory usage. A new bitset collecting Collector is introduced
which supports the memory tracker functionality
2019-07-01 13:47:07 +02:00
Alan Woodward 6966b49a6b LUCENE-8893: Intervals.prefix() and Intervals.wildcard() should take BytesRef 2019-07-01 08:52:56 +01:00
Ignacio Vera db68634c67 LUCENE-8885: Optimise BKD reader by exploiting cardinality information stored on leaves (#746)
The commit adds the method IntersectVisitor#visit(DocIdSetIterator, byte[]).
2019-07-01 06:15:03 +02:00
Tomoko Uchida 299bfe55b3 LUCENE-8874: Show SPI names instead of class names in Luke Analysis tab. 2019-06-30 13:27:24 +09:00
Tomoko Uchida 7ac77ce441 LUCENE-8894: Add APIs to tokenizer/charfilter/tokenfilter factories to get their SPI names from concrete classes 2019-06-30 12:19:48 +09:00
Gert Morten Paimla 42a1eb0403 LUCENE-8891: Add snowball stemmer and analyzer for Estonian language.
Signed-off-by: Tomoko Uchida <tomoko@apache.org>
2019-06-30 09:12:55 +09:00
Alan Woodward 46cc24d77e LUCENE-8856: Promote intervals queries from sandbox to queries module 2019-06-28 12:28:53 +01:00
Andrzej Bialecki b48a02b1f1 LUCENE-8855: Move the CHANGES entry to the right version. 2019-06-28 10:59:50 +02:00