OpenSearch

History

Mayya Sharipova 7cf170830c Optimize sort on numeric long and date fields. (#49732 ) This rewrites long sort as a `DistanceFeatureQuery`, which can efficiently skip non-competitive blocks and segments of documents. Depending on the dataset, the speedups can be 2 - 10 times. The optimization can be disabled with setting the system property `es.search.rewrite_sort` to `false`. Optimization is skipped when an index has 50% or more data with the same value. Optimization is done through: 1. Rewriting sort as `DistanceFeatureQuery` which can efficiently skip non-competitive blocks and segments of documents. 2. Sorting segments according to the primary numeric sort field(#44021) This allows to skip non-competitive segments. 3. Using collector manager. When we optimize sort, we sort segments by their min/max value. As a collector expects to have segments in order, we can not use a single collector for sorted segments. We use collectorManager, where for every segment a dedicated collector will be created. 4. Using Lucene's shared TopFieldCollector manager This collector manager is able to exchange minimum competitive score between collectors, which allows us to efficiently skip the whole segments that don't contain competitive scores. 5. When index is force merged to a single segment, #48533 interleaving old and new segments allows for this optimization as well, as blocks with non-competitive docs can be skipped. Backport for #48804 Co-authored-by: Jim Ferenczi <jim.ferenczi@elastic.co>		2019-11-29 15:37:40 -05:00
..
main	Optimize sort on numeric long and date fields. (#49732 )	2019-11-29 15:37:40 -05:00
minimumRuntime/java/org/elasticsearch/gradle	Provision the correct JDK for test tasks (#48561 )	2019-11-18 10:28:02 +02:00
test	Fix testUnkownPlatform (#49235 )	2019-11-18 13:03:23 +01:00
testKit	Apply 2-space indent to all gradle scripts (#49071 )	2019-11-14 11:01:23 +00:00