OpenSearch

Author	SHA1	Message	Date
Britta Weber	a3cefd919e	significant terms: add google normalized distance, add chi square closes #6858	2014-08-04 08:15:26 +02:00
uboness	3c9c9f33e2	Aggregations Added Filters aggregation A multi-bucket aggregation where multiple filters can be defined (each filter defines a bucket). The buckets will collect all the documents that match their associated filter. This aggregation can be very useful when one wants to compare analytics between different criterias. It can also be accomplished using multiple definitions of the single filter aggregation, but here, the user will only need to define the sub-aggregations only once. Closes #6118	2014-08-01 16:01:08 +01:00
Adrien Grand	d9d5b35be9	Sort: Make `ignore_unmapped` work for cross-index queries. Close #2255	2014-08-01 15:30:17 +02:00
Stefan Antoni	8e862f15c1	[DOCS] fixed small typo in percolate.asciidoc	2014-08-01 12:38:35 +02:00
Britta Weber	d6a18ab2ba	Docs: add 1.4.0 label to many to many geo distance sort	2014-08-01 12:30:08 +02:00
Kurt Hurtado	66560acebb	Update fielddata-fields.asciidoc	2014-08-01 09:20:19 +02:00
Areek Zillur	1d581e6286	Search Exists API: Checks if any matching documents exist for a given query Implements a new Exists API allowing users to do fast exists check on any matched documents for a given query. This API should be faster then using the Count API as it will: - early terminate the search execution once any document is found to exist - return the response as soon as the first shard reports matched documents closes #6995	2014-07-31 15:42:30 -04:00
Britta Weber	fe86c8bc88	_geo_distance sort: allow many to many geo point distance Add computation of disyance to many geo points. Example request: ``` { "sort": [ { "_geo_distance": { "location": [ { "lat":1.2, "lon":3 }, { "lat":1.2, "lon":3 } ], "order": "desc", "unit": "km", "sort_mode": "max" } } ] } ``` closes #3926	2014-07-31 17:33:45 +02:00
Clinton Gormley	36e1c7928c	Rewrote post-filter.asciidoc Closes #5166	2014-07-31 12:56:11 +02:00
Adrien Grand	1fe76b891b	Docs: Add links to the equivalent aggs in facets documentation.	2014-07-28 15:22:49 +02:00
Clinton Gormley	be86556946	Update request-body.asciidoc Added link from `timeout` to time-units Closes #6361	2014-07-28 11:08:59 +02:00
Clinton Gormley	10b4177def	Docs: Fixed path to search-shards	2014-07-26 15:05:53 +02:00
Clinton Gormley	88c8754a3c	Docs: Removed search-shards from request-body	2014-07-26 14:52:50 +02:00
Colin Goodheart-Smithe	655157c83a	Aggregations: Added an option to show the upper bound of the error for the terms aggregation. This is only applicable when the order is set to _count. The upper bound of the error in the doc count is calculated by summing the doc count of the last term on each shard which did not return the term. The implementation calculates the error by summing the doc count for the last term on each shard for which the term IS returned and then subtracts this value from the sum of the doc counts for the last term from ALL shards. Closes #6696	2014-07-25 14:24:24 +01:00
Areek Zillur	5487c56c70	Search & Count: Add option to early terminate doc collection Allow users to control document collection termination, if a specified terminate_after number is set. Upon setting the newly added parameter, the response will include a boolean terminated_early flag, indicating if the document collection for any shard terminated early. closes #6876	2014-07-23 15:10:15 -04:00
Clinton Gormley	0f943850a0	Update named-queries-and-filters.asciidoc	2014-07-23 17:28:49 +02:00
Simon Willnauer	5bfea56457	[DOCS] move all coming tags to added in master	2014-07-23 16:37:19 +02:00
Areek Zillur	f39d4e1f89	PhraseSuggester: Collate option should allow returning phrases with no matching docs A new option `prune` has been added to allow users to control phrase suggestion pruning when `collate` is set. If the new option is set, the phrase suggestion option will contain a boolean `collate_match` indicating whether the respective result had hits in collation. CLoses #6927	2014-07-22 17:17:15 -04:00
Adrien Grand	abeefbddea	Docs: Update documentation about execution hints for the terms aggregation.	2014-07-21 11:55:57 +02:00
Clinton Gormley	6a7a77eada	Docs: Add links to client helper classes for bulk/scroll/reindexing	2014-07-18 13:55:47 +02:00
Simon Willnauer	f9a9348508	[DOCS] Move benchmark API to 1.4	2014-07-16 15:02:20 +02:00
Brian Murphy	d6cd2c2b73	[DOCS][FIX] Fix reference check in indexed scripts/templates doc.	2014-07-16 11:24:18 +01:00
Brian Murphy	bc570919ee	[DOCS][FIX] Fix doc parsing, broken closing block	2014-07-16 11:18:21 +01:00
Brian Murphy	cbd2a97abd	[DOCS] : Indexed scripts/templates These are the docs for the indexed scripts/templates feature. Also moved the namespace for the REST endpoints. Closes #6851	2014-07-16 10:49:02 +01:00
Areek Zillur	76343899ea	Phrase Suggester: Add collate option to PhraseSuggester The newly added collate option will let the user provide a template query/filter which will be executed for every phrase suggestions generated to ensure that the suggestion matches at least one document for the filter/query. The user can also add routing preference `preference` to route the collate query/filter and additional `params` to inject into the collate template. Closes #3482	2014-07-14 16:07:52 -04:00
Britta Weber	74927adced	significant terms: infrastructure for changing easily the significance heuristic This commit adds the infrastructure to allow pluging in different measures for computing the significance of a term. Significance measures can be provided externally by overriding - SignificanceHeuristic - SignificanceHeuristicBuilder - SignificanceHeuristicParser closes #6561	2014-07-14 11:00:50 +02:00
Florian Hopf	3689f67a76	Docs: Fixed invalid word count in geodistance agg doc Closes #6838	2014-07-11 18:35:36 +02:00
Clinton Gormley	b6baa4be4a	Update preference.asciidoc Clarify that `preference` is a query string parameter only and provide an example.	2014-07-09 11:13:17 +02:00
Clinton Gormley	feb81e228b	Docs: Rewrote the scroll/scan docs Closes #6774	2014-07-08 11:54:53 +02:00
Andrii Gakhov	80321d89d9	Docs: Update histogram-aggregation.asciidoc filter in a filtered query should be under "filter" key Closes #6738	2014-07-07 10:44:11 +02:00
Carsten Brandt	bd4699da7e	Docs: fixed a typo in the docs Closes: #6718	2014-07-07 10:41:36 +02:00
Duncan Angus Wilkie	60a8515fb7	Update histogram-facet.asciidoc Spotted a typo, which I've fixed.	2014-07-01 10:49:43 +02:00
Clinton Gormley	64a4acc49b	Docs: Added IDs to the highlighters for linking	2014-06-22 16:46:42 +02:00
Chris	011e20678d	[DOCS] Fixed json example in nested-aggregation.asciidoc	2014-06-18 19:38:02 +02:00
Colin Goodheart-Smithe	7423ce0560	Aggregations: Added percentile rank aggregation Percentile Rank Aggregation is the reverse of the Percetiles aggregation. It determines the percentile rank (the proportion of values less than a given value) of the provided array of values. Closes #6386	2014-06-18 12:02:08 +01:00
stephlag	13d910f016	Added missing comma in suggester example	2014-06-13 16:01:04 +02:00
Adrien Grand	01327d7136	Facets: deprecation. Users are encouraged to move to the new aggregation framework that was introduced in Elasticsearch 1.0. Close #6485	2014-06-13 13:13:44 +02:00
Luke Fender	f9da5259bc	[DOCS] Fixed typo in post-filter.asciidoc Remove 'be' where it is not needed	2014-06-12 12:09:19 +02:00
Martijn van Groningen	5e408f3d40	Change the top_hits to be a metric aggregation instead of a bucket aggregation (which can't have an sub aggs) Closes #6395 Closes #6434	2014-06-10 09:09:50 +02:00
markharwood	724129e6ce	Aggregations optimisation for memory usage. Added changes to core Aggregator class to support a new mode of deferred collection. A new "breadth_first" results collection mode allows upper branches of aggregation tree to be calculated and then pruned to a smaller selection before advancing into executing collection on child branches. Closes #6128	2014-06-06 15:59:51 +01:00
fransflippo	cdbde4a578	[DOCS] Reworded note about shorthand suggest syntax The existing Note about the shorthand suggest syntax was poorly worded and confusing. Please check whether the way I've phrased it now is still correct as to what the shorthand form actually does and doesn't do: the original wording did not provide me enough information to be sure. Thanks!	2014-06-06 10:21:01 +02:00
Jad Naous	5aa84c9aab	[DOCS] Fixed typos in aggregations.asciidoc Fix plural/singular forms.	2014-06-05 19:47:01 +02:00
Colin Goodheart-Smithe	b9f4d44b14	Aggregations: Adds GeoBounds Aggregation The GeoBounds Aggregation is a new single bucket aggregation which outputs the coordinates of a bounding box containing all the points from all the documents passed to the aggregation as well as the doc count. Geobound Aggregation also use a wrap_logitude parameter which specifies whether the resulting bounding box is permitted to overlap the international date line. This option defaults to true. This aggregation introduces the idea of MetricsAggregation which do not return double values and cannot be used for sorting. The existing MetricsAggregation has been renamed to NumericMetricsAggregation and is a subclass of MetricsAggregation. MetricsAggregations do not store doc counts and do not support child aggregations. Closes #5634	2014-06-03 15:59:56 +01:00
javanna	5a1ad7b42e	[DOCS] fixed curl requests in benchmark docs	2014-06-03 11:47:13 +02:00
leonardo menezes	f3eca05c3b	[DOCS] removed slowest on single query benchmark requests Relates to #5904	2014-06-03 11:47:13 +02:00
Clinton Gormley	7fff6f1f43	Docs: Tidied percolate.asciidoc	2014-05-30 11:56:06 +02:00
Martijn van Groningen	aab38fb2e6	Aggregations: added pagination support to `top_hits` aggregation by adding `from` option. Closes #6299	2014-05-30 11:45:31 +02:00
Martijn van Groningen	5fafd2451a	Added `top_hits` aggregation that keeps track of the most relevant document being aggregated per bucket. Closes #6124	2014-05-23 16:01:18 +02:00
Nik Everett	3573822b7e	Highlight fields in request order Because json objects are unordered this also adds an explicit order syntax that looks like "highlight": { "fields": [ {"title":{ /params/ }}, {"text":{ /params/ }} ] } This is not useful for any of the builtin highlighters but will be useful in plugins. Closes #4649	2014-05-22 16:44:14 +02:00
Simon Willnauer	9d5507047f	Update Documentation Feature Flags [1.2.0]	2014-05-22 15:06:42 +02:00
Clinton Gormley	f950344546	[DOCS] Fixed title levels in context suggester	2014-05-21 20:47:25 +02:00
Simon Willnauer	ec3b1c57ac	Move Benchmark release to 1.3	2014-05-21 10:17:59 +02:00
Britta Weber	08e57890f8	use shard_min_doc_count also in TermsAggregation This was discussed in issue #6041 and #5998 . closes #6143	2014-05-14 14:10:04 +02:00
Clinton Gormley	ff12585fea	Improved wording in search-type.asciidoc Closes #5951	2014-05-14 12:15:48 +02:00
David Pilato	1cb2c3bdd3	[DOCS] reverse-nested aggs are added in 1.2.0	2014-05-13 20:00:42 +02:00
Tiago Alves Macambira	a8242e6c8c	Clarify `missing` behavior.	2014-05-13 15:49:46 +02:00
Adrien Grand	cc530b9037	Use t-digest as a dependency. Our improvements to t-digest have been pushed upstream and t-digest also got some additional nice improvements around memory usage and speedups of quantile estimation. So it makes sense to use it as a dependency now. This also allows to remove the test dependency on Apache Mahout. Close #6142	2014-05-13 10:38:08 +02:00
Clinton Gormley	3aac594503	[DOCS] Fix typos in context suggest	2014-05-13 10:34:16 +02:00
markharwood	1e560b0d92	Significant_terms agg: added option for a background_filter to define background context for analysis of term frequencies Closes #5944	2014-05-13 09:10:30 +01:00
Clinton Gormley	5b93255ec8	[DOCS] Added "Aggregation" to all aggs titles	2014-05-13 01:35:58 +02:00
Rashid Khan	233aaa63c9	Change key to keyed	2014-05-12 13:15:07 -07:00
Alex Ksikes	dae48d9fe8	Added the ability to include the queried document for More Like This API. By default More Like This API excludes the queried document from the response. However, when debugging or when comparing scores across different queries, it could be useful to have the best possible matched hit. So this option lets users explicitly specify the desired behavior. Closes #6067	2014-05-09 12:59:39 +02:00
Alex Ksikes	48b7172ee7	Provided some insights as to how More Like This works internally. In the Google Groups forum there appears to be some confusion as to what mlt does. This documentation update should hopefully help demystifying this feature, and provide some understanding as to how to use its parameters. Closes #6092	2014-05-09 12:13:29 +02:00
Andrew Selden	f23274523a	Integration tests for benchmark API. - Randomized integration tests for the benchmark API. - Negative tests for cases where the cluster cannot run benchmarks. - Return 404 on missing benchmark name. - Allow to specify 'types' as an array in the JSON syntax when describing a benchmark competition. - Don't record slowest for single-request competitions. Closes #6003, #5906, #5903, #5904	2014-05-07 14:14:54 -07:00
uboness	fc52db1209	Changed the respnose structure of the percentiles aggregation where now all the percentiles are placed under a `values` object (or `values` array in case the `keyed` flag is set to `false` Closes #5870	2014-05-07 18:35:24 +02:00
Britta Weber	7944369fd1	Add `shard_min_doc_count` parameter for significant terms similar to `shard_size` Significant terms internally maintain a priority queue per shard with a size potentially lower than the number of terms. This queue uses the score as criterion to determine if a bucket is kept or not. If many terms with low subsetDF score very high but the `min_doc_count` is set high, this might result in no terms being returned because the pq is filled with low frequent terms which are all sorted out in the end. This can be avoided by increasing the `shard_size` parameter to a higher value. However, it is not immediately clear to which value this parameter must be set because we can not know how many terms with low frequency are scored higher that the high frequent terms that we are actually interested in. On the other hand, if there is no routing of docs to shards involved, we can maybe assume that the documents of classes and also the terms therein are distributed evenly across shards. In that case it might be easier to not add documents to the pq that have subsetDF <= `shard_min_doc_count` which can be set to something like `min_doc_count`/number of shards because we would assume that even when summing up the subsetDF across shards `min_doc_count` will not be reached. closes #5998 closes #6041	2014-05-07 18:02:56 +02:00
gabriel-tessier	7b0efcbd96	fix typo	2014-05-06 15:54:36 +02:00
Audrey	52d2f2d229	[DOCS] Update phrase-suggest.asciidoc Grammatical error Close #5993	2014-05-06 10:28:13 +02:00
Martijn van Groningen	013b319415	Added `reverse_nested` aggregation. The `reverse_nested` aggregation allows to aggregate on properties outside of the nested scope of a `nested` aggregation. Closes #5507	2014-05-01 00:23:05 +07:00
Lee Hinman	57bee03193	[DOCS] Add /_search_shards documentation	2014-04-22 08:54:32 -06:00
Clinton Gormley	3ba8fbbef8	Update benchmark.asciidoc Fixed incorrect parameter spec for benchmark nodes	2014-04-22 14:16:10 +02:00
Clinton Gormley	0e782331be	Update benchmark.asciidoc	2014-04-21 20:39:33 +02:00
David Pilato	f3fe50aac4	[DOCS] fix typo	2014-04-19 22:44:44 +02:00
Scott Wilkerson	9ea0e3a95b	Update percolate.asciidoc fix typo	2014-04-15 16:01:44 +02:00
Andrew Selden	2cf66c4115	Benchmark documentation Moving benchmark documentation under the search section. Closes #5786	2014-04-14 14:08:41 -07:00
Malte Schirnacher	8ce3bba010	Fix typos in percolate.asciidoc Close #5762 #5763 #5764	2014-04-11 18:09:16 +02:00
Andrew O'Brien	48031b6236	Fixes typo in "Scan" search type documention	2014-04-07 16:01:37 -06:00
gabriel-tessier	000c33aac3	fix typo	2014-04-07 09:23:46 +02:00
Martijn van Groningen	ade1d0ef57	Added global ordinals (unique incremental numbering for terms) to fielddata. Added a terms aggregation implementations that work on global ordinals, which is also the default. Closes #5672	2014-04-07 11:06:41 +07:00
Karl Meisterheim	6d993bc810	[DOCS] A few grammar and word use corrections	2014-04-04 19:26:38 +02:00
Alexander Reelsen	e547e113e1	Geo context suggester: Require precision in mapping The default precision was way too exact and could lead people to think that geo context suggestions are not working. This patch now requires you to set the precision in the mapping, as elasticsearch itself can never tell exactly, what the required precision for the users suggestions are. Closes #5621	2014-04-02 23:51:14 +02:00
Hannes Korte	c11293ad78	Fix some typos in documentation.	2014-03-31 13:48:17 +02:00
bleskes	5d832374dd	Update Documentation Feature Flags [1.1.0]	2014-03-25 17:51:30 +01:00
Boaz Leskes	fc8dc3f733	[Docs] updated the search template and query template docs	2014-03-25 15:25:02 +01:00
Alexander Reelsen	4fc461a97c	[DOCS] Moved the template query documentation into search section	2014-03-25 10:01:41 +01:00
Simon Willnauer	b4e504df99	[Docs] Add coming tag for context suggester docs	2014-03-25 09:46:49 +01:00
uboness	7d6ad8d91c	Added extended_bounds support for date_/histogram aggs By default the date_/histogram returns all the buckets within the range of the data itself, that is, the documents with the smallest values (on which with histogram) will determine the min bucket (the bucket with the smallest key) and the documents with the highest values will determine the max bucket (the bucket with the highest key). Often, when when requesting empty buckets (min_doc_count : 0), this causes a confusion, specifically, when the data is also filtered. To understand why, let's look at an example: Lets say the you're filtering your request to get all docs from the last month, and in the date_histogram aggs you'd like to slice the data per day. You also specify min_doc_count:0 so that you'd still get empty buckets for those days to which no document belongs. By default, if the first document that fall in this last month also happen to fall on the first day of the second week of the month, the date_histogram will not return empty buckets for all those days prior to that second week. The reason for that is that by default the histogram aggregations only start building buckets when they encounter documents (hence, missing on all the days of the first week in our example). With extended_bounds, you now can "force" the histogram aggregations to start building buckets on a specific min values and also keep on building buckets up to a max value (even if there are no documents anymore). Using extended_bounds only makes sense when min_doc_count is 0 (the empty buckets will never be returned if the min_doc_count is greater than 0). Note that (as the name suggest) extended_bounds is not filtering buckets. Meaning, if the min bounds is higher than the values extracted from the documents, the documents will still dictate what the min bucket will be (and the same goes to the extended_bounds.max and the max bucket). For filtering buckets, one should nest the histogram agg under a range filter agg with the appropriate min/max. Closes #5224	2014-03-20 14:48:27 +01:00
markharwood	5f1d9af9fe	Documentation fix for significant_terms heading levels	2014-03-17 12:17:54 +00:00
Randy Stauner	1486188a3b	[DOCS] Reword clear-scroll sentence	2014-03-17 12:08:49 +01:00
Boaz Leskes	ee8743f3f2	[Docs] added a missing reference to significantterms-aggergations Also fix header level mismatch issue reported by the build	2014-03-17 11:45:55 +01:00
rphadake	36a0cb99d7	[Doc] doc updates for date histogram interval Close #5308	2014-03-14 18:55:32 +01:00
Adrien Grand	eef71da650	[Doc] Add a chart about the relative error of the percentiles aggregation.	2014-03-14 12:23:23 +01:00
markharwood	767bef0596	Significant_terms aggregation identifies terms that are significant rather than merely popular in a set. Significance is related to the changes in document frequency observed between everyday use in the corpus and frequency observed in the result set. The asciidocs include extensive details on the applications of this feature. Closes #5146	2014-03-14 10:34:24 +00:00
Adrien Grand	5821fa042c	Cardinality aggregation. This aggregation computes unique term counts using the hyperloglog++ algorithm which uses linear counting to estimate low cardinalities and hyperloglog on higher cardinalities. Since this algorithm works on hashes, it is useful for high-cardinality fields to store the hash of values directly in the index, which is the purpose of the new `murmur3` field type. This is less necessary on low-cardinality string fields because the aggregator is smart enough to only compute the hash once per unique value per segment thanks to ordinals, or on numeric fields since hashing them is very fast. Close #5426	2014-03-13 19:19:56 +01:00
Florian Schilling	81e537bd5e	ContextSuggester ================ This commit extends the `CompletionSuggester` by context informations. In example such a context informations can be a simple string representing a category reducing the suggestions in order to this category. Three base implementations of these context informations have been setup in this commit. - a Category Context - a Geo Context All the mapping for these context informations are specified within a context field in the completion field that should use this kind of information.	2014-03-13 11:24:46 +01:00
Kurt Hurtado	ca6a2bb790	[DOCS] Various aggregation doc fixes	2014-03-13 09:05:25 +01:00
Boaz Leskes	b7a95d11a7	Introduced VersionType.FORCE & VersionType.EXTERNAL_GTE Also added "external_gt" as an alias name for VersionType.EXTERNAL , accessible for the rest layer. Closes #4213 , Closes #2946	2014-03-10 21:07:17 +01:00
Simon Willnauer	fbb8c0fafa	[DOCS] Add `coming` tag to multiple rescores Closes #5365	2014-03-10 09:27:44 +01:00
Benjamin Devèze	2affa5004f	Fix small typo in percentiles doc	2014-03-07 10:10:19 +01:00
Adrien Grand	f359b7f38b	[DOC] The percentiles aggregation is coming in 1.1.0.	2014-03-07 10:03:15 +01:00

258 Commits