OpenSearch

Commit Graph

Author	SHA1	Message	Date
Areek Zillur	f39d4e1f89	PhraseSuggester: Collate option should allow returning phrases with no matching docs A new option `prune` has been added to allow users to control phrase suggestion pruning when `collate` is set. If the new option is set, the phrase suggestion option will contain a boolean `collate_match` indicating whether the respective result had hits in collation. CLoses #6927	2014-07-22 17:17:15 -04:00
Adrien Grand	abeefbddea	Docs: Update documentation about execution hints for the terms aggregation.	2014-07-21 11:55:57 +02:00
Clinton Gormley	6a7a77eada	Docs: Add links to client helper classes for bulk/scroll/reindexing	2014-07-18 13:55:47 +02:00
Simon Willnauer	f9a9348508	[DOCS] Move benchmark API to 1.4	2014-07-16 15:02:20 +02:00
Brian Murphy	d6cd2c2b73	[DOCS][FIX] Fix reference check in indexed scripts/templates doc.	2014-07-16 11:24:18 +01:00
Brian Murphy	bc570919ee	[DOCS][FIX] Fix doc parsing, broken closing block	2014-07-16 11:18:21 +01:00
Brian Murphy	cbd2a97abd	[DOCS] : Indexed scripts/templates These are the docs for the indexed scripts/templates feature. Also moved the namespace for the REST endpoints. Closes #6851	2014-07-16 10:49:02 +01:00
Areek Zillur	76343899ea	Phrase Suggester: Add collate option to PhraseSuggester The newly added collate option will let the user provide a template query/filter which will be executed for every phrase suggestions generated to ensure that the suggestion matches at least one document for the filter/query. The user can also add routing preference `preference` to route the collate query/filter and additional `params` to inject into the collate template. Closes #3482	2014-07-14 16:07:52 -04:00
Britta Weber	74927adced	significant terms: infrastructure for changing easily the significance heuristic This commit adds the infrastructure to allow pluging in different measures for computing the significance of a term. Significance measures can be provided externally by overriding - SignificanceHeuristic - SignificanceHeuristicBuilder - SignificanceHeuristicParser closes #6561	2014-07-14 11:00:50 +02:00
Florian Hopf	3689f67a76	Docs: Fixed invalid word count in geodistance agg doc Closes #6838	2014-07-11 18:35:36 +02:00
Clinton Gormley	b6baa4be4a	Update preference.asciidoc Clarify that `preference` is a query string parameter only and provide an example.	2014-07-09 11:13:17 +02:00
Clinton Gormley	feb81e228b	Docs: Rewrote the scroll/scan docs Closes #6774	2014-07-08 11:54:53 +02:00
Andrii Gakhov	80321d89d9	Docs: Update histogram-aggregation.asciidoc filter in a filtered query should be under "filter" key Closes #6738	2014-07-07 10:44:11 +02:00
Carsten Brandt	bd4699da7e	Docs: fixed a typo in the docs Closes: #6718	2014-07-07 10:41:36 +02:00
Duncan Angus Wilkie	60a8515fb7	Update histogram-facet.asciidoc Spotted a typo, which I've fixed.	2014-07-01 10:49:43 +02:00
Clinton Gormley	64a4acc49b	Docs: Added IDs to the highlighters for linking	2014-06-22 16:46:42 +02:00
Chris	011e20678d	[DOCS] Fixed json example in nested-aggregation.asciidoc	2014-06-18 19:38:02 +02:00
Colin Goodheart-Smithe	7423ce0560	Aggregations: Added percentile rank aggregation Percentile Rank Aggregation is the reverse of the Percetiles aggregation. It determines the percentile rank (the proportion of values less than a given value) of the provided array of values. Closes #6386	2014-06-18 12:02:08 +01:00
stephlag	13d910f016	Added missing comma in suggester example	2014-06-13 16:01:04 +02:00
Adrien Grand	01327d7136	Facets: deprecation. Users are encouraged to move to the new aggregation framework that was introduced in Elasticsearch 1.0. Close #6485	2014-06-13 13:13:44 +02:00
Luke Fender	f9da5259bc	[DOCS] Fixed typo in post-filter.asciidoc Remove 'be' where it is not needed	2014-06-12 12:09:19 +02:00
Martijn van Groningen	5e408f3d40	Change the top_hits to be a metric aggregation instead of a bucket aggregation (which can't have an sub aggs) Closes #6395 Closes #6434	2014-06-10 09:09:50 +02:00
markharwood	724129e6ce	Aggregations optimisation for memory usage. Added changes to core Aggregator class to support a new mode of deferred collection. A new "breadth_first" results collection mode allows upper branches of aggregation tree to be calculated and then pruned to a smaller selection before advancing into executing collection on child branches. Closes #6128	2014-06-06 15:59:51 +01:00
fransflippo	cdbde4a578	[DOCS] Reworded note about shorthand suggest syntax The existing Note about the shorthand suggest syntax was poorly worded and confusing. Please check whether the way I've phrased it now is still correct as to what the shorthand form actually does and doesn't do: the original wording did not provide me enough information to be sure. Thanks!	2014-06-06 10:21:01 +02:00
Jad Naous	5aa84c9aab	[DOCS] Fixed typos in aggregations.asciidoc Fix plural/singular forms.	2014-06-05 19:47:01 +02:00
Colin Goodheart-Smithe	b9f4d44b14	Aggregations: Adds GeoBounds Aggregation The GeoBounds Aggregation is a new single bucket aggregation which outputs the coordinates of a bounding box containing all the points from all the documents passed to the aggregation as well as the doc count. Geobound Aggregation also use a wrap_logitude parameter which specifies whether the resulting bounding box is permitted to overlap the international date line. This option defaults to true. This aggregation introduces the idea of MetricsAggregation which do not return double values and cannot be used for sorting. The existing MetricsAggregation has been renamed to NumericMetricsAggregation and is a subclass of MetricsAggregation. MetricsAggregations do not store doc counts and do not support child aggregations. Closes #5634	2014-06-03 15:59:56 +01:00
javanna	5a1ad7b42e	[DOCS] fixed curl requests in benchmark docs	2014-06-03 11:47:13 +02:00
leonardo menezes	f3eca05c3b	[DOCS] removed slowest on single query benchmark requests Relates to #5904	2014-06-03 11:47:13 +02:00
Clinton Gormley	7fff6f1f43	Docs: Tidied percolate.asciidoc	2014-05-30 11:56:06 +02:00
Martijn van Groningen	aab38fb2e6	Aggregations: added pagination support to `top_hits` aggregation by adding `from` option. Closes #6299	2014-05-30 11:45:31 +02:00
Martijn van Groningen	5fafd2451a	Added `top_hits` aggregation that keeps track of the most relevant document being aggregated per bucket. Closes #6124	2014-05-23 16:01:18 +02:00
Nik Everett	3573822b7e	Highlight fields in request order Because json objects are unordered this also adds an explicit order syntax that looks like "highlight": { "fields": [ {"title":{ /params/ }}, {"text":{ /params/ }} ] } This is not useful for any of the builtin highlighters but will be useful in plugins. Closes #4649	2014-05-22 16:44:14 +02:00
Simon Willnauer	9d5507047f	Update Documentation Feature Flags [1.2.0]	2014-05-22 15:06:42 +02:00
Clinton Gormley	f950344546	[DOCS] Fixed title levels in context suggester	2014-05-21 20:47:25 +02:00
Simon Willnauer	ec3b1c57ac	Move Benchmark release to 1.3	2014-05-21 10:17:59 +02:00
Britta Weber	08e57890f8	use shard_min_doc_count also in TermsAggregation This was discussed in issue #6041 and #5998 . closes #6143	2014-05-14 14:10:04 +02:00
Clinton Gormley	ff12585fea	Improved wording in search-type.asciidoc Closes #5951	2014-05-14 12:15:48 +02:00
David Pilato	1cb2c3bdd3	[DOCS] reverse-nested aggs are added in 1.2.0	2014-05-13 20:00:42 +02:00
Tiago Alves Macambira	a8242e6c8c	Clarify `missing` behavior.	2014-05-13 15:49:46 +02:00
Adrien Grand	cc530b9037	Use t-digest as a dependency. Our improvements to t-digest have been pushed upstream and t-digest also got some additional nice improvements around memory usage and speedups of quantile estimation. So it makes sense to use it as a dependency now. This also allows to remove the test dependency on Apache Mahout. Close #6142	2014-05-13 10:38:08 +02:00
Clinton Gormley	3aac594503	[DOCS] Fix typos in context suggest	2014-05-13 10:34:16 +02:00
markharwood	1e560b0d92	Significant_terms agg: added option for a background_filter to define background context for analysis of term frequencies Closes #5944	2014-05-13 09:10:30 +01:00
Clinton Gormley	5b93255ec8	[DOCS] Added "Aggregation" to all aggs titles	2014-05-13 01:35:58 +02:00
Rashid Khan	233aaa63c9	Change key to keyed	2014-05-12 13:15:07 -07:00
Alex Ksikes	dae48d9fe8	Added the ability to include the queried document for More Like This API. By default More Like This API excludes the queried document from the response. However, when debugging or when comparing scores across different queries, it could be useful to have the best possible matched hit. So this option lets users explicitly specify the desired behavior. Closes #6067	2014-05-09 12:59:39 +02:00
Alex Ksikes	48b7172ee7	Provided some insights as to how More Like This works internally. In the Google Groups forum there appears to be some confusion as to what mlt does. This documentation update should hopefully help demystifying this feature, and provide some understanding as to how to use its parameters. Closes #6092	2014-05-09 12:13:29 +02:00
Andrew Selden	f23274523a	Integration tests for benchmark API. - Randomized integration tests for the benchmark API. - Negative tests for cases where the cluster cannot run benchmarks. - Return 404 on missing benchmark name. - Allow to specify 'types' as an array in the JSON syntax when describing a benchmark competition. - Don't record slowest for single-request competitions. Closes #6003, #5906, #5903, #5904	2014-05-07 14:14:54 -07:00
uboness	fc52db1209	Changed the respnose structure of the percentiles aggregation where now all the percentiles are placed under a `values` object (or `values` array in case the `keyed` flag is set to `false` Closes #5870	2014-05-07 18:35:24 +02:00
Britta Weber	7944369fd1	Add `shard_min_doc_count` parameter for significant terms similar to `shard_size` Significant terms internally maintain a priority queue per shard with a size potentially lower than the number of terms. This queue uses the score as criterion to determine if a bucket is kept or not. If many terms with low subsetDF score very high but the `min_doc_count` is set high, this might result in no terms being returned because the pq is filled with low frequent terms which are all sorted out in the end. This can be avoided by increasing the `shard_size` parameter to a higher value. However, it is not immediately clear to which value this parameter must be set because we can not know how many terms with low frequency are scored higher that the high frequent terms that we are actually interested in. On the other hand, if there is no routing of docs to shards involved, we can maybe assume that the documents of classes and also the terms therein are distributed evenly across shards. In that case it might be easier to not add documents to the pq that have subsetDF <= `shard_min_doc_count` which can be set to something like `min_doc_count`/number of shards because we would assume that even when summing up the subsetDF across shards `min_doc_count` will not be reached. closes #5998 closes #6041	2014-05-07 18:02:56 +02:00
gabriel-tessier	7b0efcbd96	fix typo	2014-05-06 15:54:36 +02:00

1 2 3 4

191 Commits