OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	63bdd01eb7	Expose WordDelimiterGraphTokenFilter (#23327 ) This change exposes the new Lucene graph based word delimiter token filter in the analysis filters. Unlike the `word_delimiter` this token filter named `word_delimiter_graph` correctly handles multi terms expansion at query time. Closes #23104	2017-02-24 00:53:38 +01:00
Shai Erera	eeac6d27f2	Add BreakIteratorBoundaryScanner support for FVH (#23248 ) This commit adds a boundary_scanner property to the search highlight request so the user can specify different boundary scanners: * `chars` (default, current behavior) * `word` Use a WordBreakIterator * `sentence` Use a SentenceBreakIterator This commit also adds "boundary_scanner_locale" to define which locale should be used when scanning the text.	2017-02-23 23:32:22 +01:00
Lee Hinman	6c9b89b882	[TEST] Fix incorrect test cluster name in cluster health doc tests	2017-02-22 17:18:11 -07:00
Christopher Best	eeaa0ccec2	Update getting-started.asciidoc (#23296 )	2017-02-22 11:06:27 +01:00
Alexander Reelsen	6781c4320c	Documentation: Consoleify cat shards/recovery API docs (#23116 ) Relates #23001	2017-02-22 09:18:10 +01:00
Glen Smith	a590a22ea3	Add note and link to 'tune for disk usage' (#23252 ) * Add note and link to 'tune for disk usage' * Changed formatting as suggested Thanks, @ clintongormley!	2017-02-20 20:31:19 +01:00
Jay Modi	b234644035	Enforce Content-Type requirement on the rest layer and remove deprecated methods (#23146 ) This commit enforces the requirement of Content-Type for the REST layer and removes the deprecated methods in transport requests and their usages. While doing this, it turns out that there are many places where *Entity classes are used from the apache http client libraries and many of these usages did not specify the content type. The methods that do not specify a content type explicitly have been added to forbidden apis to prevent more of these from entering our code base. Relates #19388	2017-02-17 14:45:41 -05:00
Jason Tedor	21181d7865	Fix more missing markup in configuration docs This commit fixes some missing markup in the configuration docs in the logging configuration section.	2017-02-16 19:00:49 -05:00
Andreas Roussos	788c64848b	[DOCS] Fixed various typos in the 'cat APIs' section (#23216 )	2017-02-16 20:41:42 +01:00
Jason Tedor	0e3e111921	Replace HTTP link with HTTPS link This commit replaces an HTTP link in the configuration docs with an HTTPS link.	2017-02-16 12:19:30 -05:00
Jason Tedor	69dc730341	Fix missing markup in configuration docs This commit adds a missing backtick in the configuration docs.	2017-02-16 12:19:30 -05:00
markwalkom	ced99dde50	Update stop-analyzer.asciidoc (#23195 ) Clarified where the stopwords file needs to live	2017-02-16 13:36:15 +01:00
Lee Hinman	5443f7d625	Console-ify curl statements for allocation explain API docs (#23190 ) * Console-ify curl statements for allocation explain API docs Relates to #23001 * Fix tests * Remove exclusion from build.gradle * Call out index creation in prose * Add console back and skip test	2017-02-15 17:18:07 -07:00
Nik Everett	a6cf170384	Add content-type header to curl in getting-started Relates to #23001	2017-02-15 18:14:25 -05:00
Ali Beyad	71739623d3	Consolify snapshot documentation (#23189 ) This commit brings the snapshot documentation in conformity with the CONSOLE format, and fixes the docs so that the documentation tests can be run against them.	2017-02-15 18:13:27 -05:00
Lee Hinman	ac3cc9641b	[DOCS] Console-ify shadow replica documentation Relates to #23001	2017-02-15 11:58:53 -07:00
Areek Zillur	2a38f155de	Docs: CONSOLEify multi-get.asciidoc (#23122 ) * Docs: CONSOLEify multi-get.asciidoc relates #23001 * incorporate feedback	2017-02-15 13:36:07 -05:00
sungjunyoung	ec686b52c1	Update getting-started.asciidoc (#23180 ) The "sort" and "_score" fields are contained within the "hits" field.	2017-02-15 10:55:09 +01:00
AlexNodex	139eb69fe4	Typo (#23179 ) autoGeneratePhraseQueries should be auto_generate_phrase_queries	2017-02-15 10:10:06 +01:00
Catherine Snow	51bad4300c	Fix typo (#23171 )	2017-02-15 09:38:10 +01:00
Adrien Grand	8d6a41f671	Nested queries should avoid adding unnecessary filters when possible. (#23079 ) When nested objects are present in the mappings, many queries get deoptimized due to the need to exclude documents that are not in the right space. For instance, a filter is applied to all queries that prevents them from matching non-root documents (`+: -_type:__`). Moreover, a filter is applied to all child queries of `nested` queries in order to make sure that the child query only matches child documents (`_type:__nested_path`), which is required by `ToParentBlockJoinQuery` (the Lucene query behing Elasticsearch's `nested` queries). These additional filters slow down `nested` queries. In 1.7-, the cost was somehow amortized by the fact that we cached filters very aggressively. However, this has proven to be a significant source of slow downs since 2.0 for users of `nested` mappings and queries, see #20797. This change makes the filtering a bit smarter. For instance if the query is a `match_all` query, then we need to exclude nested docs. However, if the query is `foo: bar` then it may only match root documents since `foo` is a top-level field, so no additional filtering is required. Another improvement is to use a `FILTER` clause on all types rather than a `MUST_NOT` clause on all nested paths when possible since `FILTER` clauses are more efficient. Here are some examples of queries and how they get rewritten: ``` "match_all": {} ``` This query gets rewritten to `ConstantScore(+:* -_type:__)` on master and `ConstantScore(_type:AutomatonQuery {\norg.apache.lucene.util.automaton.Automaton@4371da44})` with this change. The automaton is the complement of `_type:__` so it matches the same documents, but is faster since it is now a positive clause. Simplistic performance testing on a 10M index where each root document has 5 nested documents on average gave a latency of 420ms on master and 90ms with this change applied. ``` "term": { "foo": { "value": "0" } } ``` This query is rewritten to `+foo:0 #(ConstantScore(+: -_type:__))^0.0` on master and `foo:0` with this change: we do not need to filter nested docs out since the query cannot match nested docs. While doing performance testing in the same conditions as above, response times went from 250ms to 50ms. ``` "nested": { "path": "nested", "query": { "term": { "nested.foo": { "value": "0" } } } } ``` This query is rewritten to `+ToParentBlockJoinQuery (+nested.foo:0 #_type:__nested) #(ConstantScore(+:* -_type:__))^0.0` on master and `ToParentBlockJoinQuery (nested.foo:0)` with this change. The top-level filter (`-_type:__`) could be removed since `nested` queries only match documents of the parent space, as well as the child filter (`#_type:__nested`) since the child query may only match nested docs since the `nested` object has both `include_in_parent` and `include_in_root` set to `false`. While doing performance testing in the same conditions as above, response times went from 850ms to 270ms.	2017-02-14 16:05:19 +01:00
Boaz Leskes	70a3ac1767	Add a note about `cluster.routing.allocation.node_concurrent_recoveries` (#23160 ) Closes #23152	2017-02-14 14:14:41 +02:00
Loek van Gool	214a3536aa	Update redirects.asciidoc (#23148 )	2017-02-13 16:23:25 +01:00
Giuseppe	ecbeffcb1e	Add note about min_score filtering efficiency (#23109 ) * Add note about min_score filtering efficiency * Reword to mention 'HAVING' * Remove reference to HAVING	2017-02-13 12:15:01 +01:00
Adrien Grand	f3509b8003	Consolify docs/reference/analysis/tokenfilters/pattern-capture-tokenfilter.asciidoc. (#23050 )	2017-02-13 11:00:12 +01:00
Ryan Ernst	c91848e6a7	Docs: Consoleify cluster and indices settings docs (#23030 ) relates #23001	2017-02-10 14:57:43 -08:00
Tanguy Leroux	e2e5937455	Use `typed_keys` parameter to prefix suggester names by type in search responses (#23080 ) This pull request reuses the typed_keys parameter added in #22965, but this time it applies it to suggesters. When set to true, the suggester names in the search response will be prefixed with a prefix that reflects their type.	2017-02-10 10:53:38 +01:00
Tanguy Leroux	63ea6f7168	[Docs] Remove unnecessary // TEST[continued] in search-template doc It has been explained in `e39b96f257`	2017-02-10 10:08:24 +01:00
Clinton Gormley	d43417ef47	Docs: Deleted redundant word in scripting	2017-02-09 22:02:42 +01:00
Jonathan D Strootman	cb35b3785a	Adding `ansible-elasticsearch` to list of CM tools (#23058 )	2017-02-09 21:14:30 +01:00
Clinton Gormley	78d3028bb7	Fixed bad asciidoc in delete-by-query	2017-02-09 20:14:56 +01:00
Jim Ferenczi	94087b3274	Removes ExpandCollapseSearchResponseListener, search response listeners and blocking calls This changes removes the SearchResponseListener that was used by the ExpandCollapseSearchResponseListener to expand collapsed hits. The removal of SearchResponseListener is not a breaking change because it was never released. This change also replace the blocking call in ExpandCollapseSearchResponseListener by a single asynchronous multi search request. The parallelism of the expand request can be set via CollapseBuilder#max_concurrent_group_searches Closes #23048	2017-02-09 18:06:10 +01:00
Tanguy Leroux	3553522328	Add parameter to prefix aggs name with type in search responses (#22965 ) This pull request adds a new parameter to the REST Search API named `typed_keys`. When set to true, the aggregation names in the search response will be prefixed with a prefix that reflects the internal type of the aggregation. Here is a simple example: ``` GET /_search?typed_keys { "aggs": { "tweets_per_user": { "terms": { "field": "user" } } }, "size": 0 } ``` And the response: ``` { "aggs": { "sterms:tweets_per_user": { ... } } } ``` This parameter is intended to make life easier for REST clients that could parse back the prefix and could detect the type of the aggregation to parse. It could also be implemented for suggesters.	2017-02-09 11:19:04 +01:00
Tanguy Leroux	832952cb29	[Docs] Fix consoleify search-template.asciidoc It does not reproduce well, hopefully this will fix the failure on DELETE _search/template/<templatename>.	2017-02-08 21:23:38 +01:00
Igor Motov	1fc4fa5729	Docs: CONSOLEify native script docs Relates #23001	2017-02-08 13:30:39 -05:00
Jay Modi	7f3769c745	Remove ldjson support and document ndjson for bulk/msearch (#23049 ) This commit removes support for the `application/x-ldjson` Content-Type header as this was only used in the first draft of the spec and had very little uptake. Additionally, the docs for bulk and msearch have been updated to specifically call out ndjson and mention that the newline character may be preceded by a carriage return. Finally, the bulk request handling of the carriage return has been improved to remove this character from the source. Closes #23025	2017-02-08 11:55:50 -05:00
Clinton Gormley	40f40d7676	Docs: Fix termvectors by removing example blocks with embedded CONSOLE tests	2017-02-08 17:12:40 +01:00
Tanguy Leroux	477d1aa8bf	[Docs] Consoleify multi-search and search-template docs (#23047 ) Relates #23001	2017-02-08 17:05:22 +01:00
Christoph Büscher	e177d2ca40	Docs: CONSOLEify termvectors.asciidoc (#23046 )	2017-02-08 16:06:11 +01:00
Daniel Mitterdorfer	88c5627a1b	CONSOLEify multi-termvectors docs Relates #23001	2017-02-08 11:47:42 +01:00
Yannick Welsch	9154686623	Remove legacy primary shard allocation mode based on versions (#23016 ) Elasticsearch v5.0.0 uses allocation IDs to safely allocate primary shards whereas prior versions of ES used a version-based mode instead. Elasticsearch v5 still has support for version-based primary shard allocation as it needs to be able to load 2.x shards. ES v6 can drop the legacy support.	2017-02-08 10:00:55 +01:00
Lee Hinman	b3c27a7fdd	Disallow include_in_all for 6.0+ indices Since `_all` is now deprecated and cannot be set for new indices, we should also disallow any field that has the `include_in_all` parameter set. Resolves #22923	2017-02-07 19:31:51 -07:00
Tim Brooks	ad4bfa2307	Docs: CONSOLEify transport docs (#23027 ) This is related to #23001.	2017-02-07 20:06:28 -06:00
Jordan Robinson	693b0017af	Small typo fix in Windows service documentation This commit removes a duplicate definite article in the Windows service documentation. Relates #23028	2017-02-07 17:25:46 -05:00
Nik Everett	0e98c9107a	Docs: CONSOLEify some more docs These need to be CONSOLEified now because we're starting to require Content-Type headers and they didn't have any. * cluster/reroute: Marked as CONSOLE but skipped because the docs build runs with a single node. * docs/bulk: Marked as NOTCONSOLE because the snippets describe either examples or `curl` commands. Fixed the `curl` command to include the `Content-Type` header. * query-dsl/terms-query: Marked as CONSOLE. * search/request/rescore: Marked as CONSOLE. Fixed deprecated syntax. Relates #23001 Relates #18160	2017-02-07 16:49:01 -05:00
Nik Everett	0c011cb290	Docs: CONSOLEify histogram aggregation docs This adds the `COPY AS CURL` and `VIEW IN CONSOLE` links to the docs and causes the snippets to be tested during Elasticsearch's build. Relates to #18160	2017-02-07 16:09:32 -05:00
Nik Everett	245aa0404a	Docs: CONSOLEify sum aggregation docs This adds the `COPY AS CURL` and `VIEW IN CONSOLE` buttons to the docs and makes the build execute the snippets as part of `docs:check`. Relates to #18160	2017-02-07 14:18:54 -05:00
Jim Ferenczi	bbf62e3472	CONSOLify indices/analyze.asciidoc and search/field-stats.asciidoc Relates #23001	2017-02-07 20:15:09 +01:00
Nik Everett	274ee30d34	Docs: CONSOLEify the avg aggregation docs This creates the `COPY AS CURL` and `VIEW IN CONSOLE` buttons and makes the build test the examples. Relates to #18160	2017-02-07 13:48:27 -05:00
Nik Everett	a2ed676862	Docs: Explain painless's method dispatch (#23021 ) Painless uses Ruby-like method dispatch (reciever type, method name, and arity) rather than Java-like (reciever type, method name, and argument compile time types) or Groovy-like method dispatch (receiver type, method name, and argument run time types). We do this for mostly good reasons but we never documented it. Relates to #22720	2017-02-07 12:09:22 -05:00

1 2 3 4 5 ...

3480 Commits