OpenSearch

Commit Graph

Author	SHA1	Message	Date
Daniel Mitterdorfer	f174f72fee	Circuit-break based on real memory usage With this commit we introduce a new circuit-breaking strategy to the parent circuit breaker. Contrary to the current implementation which only accounts for memory reserved via child circuit breakers, the new strategy measures real heap memory usage at the time of reservation. This allows us to be much more aggressive with the circuit breaker limit so we bump it to 95% by default. The new strategy is turned on by default and can be controlled with the new cluster setting `indices.breaker.total.userealmemory`. Note that we turn it off for all integration tests with an internal test cluster because it leads to spurious test failures which are of no value (we cannot fully control heap memory usage in tests). All REST tests, however, will make use of the real memory circuit breaker. Relates #31767	2018-07-13 10:08:28 +02:00
Jim Ferenczi	584fa261cc	Remove the ability to index or query context suggestions without context (#31007 ) This is a follow-up to #30712 that removes the ability to index or query a context-enabled completion field without context. Relates #30712	2018-07-09 16:01:01 +02:00
Sohaib Iftikhar	40b822c878	Scripting: Remove support for deprecated StoredScript contexts (#31394 ) Removes support for storing scripts without the usual json around the script. So you can no longer do: ``` POST _scripts/<templatename> { "query": { "match": { "title": "{{query_string}}" } } } ``` and must instead do: ``` POST _scripts/<templatename> { "script": { "lang": "mustache", "source": { "query": { "match": { "title": "{{query_string}}" } } } } } ``` This improves error reporting when you attempt to store a script but don't quite get the syntax right. Before, there was a good chance that we'd think of it as a "raw" template and just store it. Now we won't do that. Nice.	2018-07-05 09:30:08 -04:00
Daniel Mitterdorfer	3d53daeb2f	Account for XContent overhead in in-flight breaker So far the in-flight request circuit breaker has only accounted for the on-the-wire representation of a request. However, we convert the raw request into XContent internally which increases the overhead. Therefore, we increase the value of the corresponding setting `network.breaker.inflight_requests.overhead` from one to two. While this value is still rather conservative (we assume that the representation as structured objects has no overhead compared to the byte[]), it is closer to reality than the current value. Relates #31613	2018-07-03 09:17:16 +02:00
Jonathan Little	8e4768890a	Migrate scripted metric aggregation scripts to ScriptContext design (#30111 ) * Migrate scripted metric aggregation scripts to ScriptContext design #29328 * Rename new script context container class and add clarifying comments to remaining references to params._agg(s) * Misc cleanup: make mock metric agg script inner classes static * Move _score to an accessor rather than an arg for scripted metric agg scripts This causes the score to be evaluated only when it's used. * Documentation changes for params._agg -> agg * Migration doc addition for scripted metric aggs _agg object change * Rename "agg" Scripted Metric Aggregation script context variable to "state" * Rename a private base class from ...Agg to ...State that I missed in my last commit * Clean up imports after merge	2018-06-25 12:01:33 +01:00
Ryan Ernst	c0961b79be	Docs: Add note about removing prepareExecute from the java client (#31401 ) relates #30966	2018-06-19 07:21:58 -07:00
Ryan Ernst	f3297ed23a	Packaging: Remove windows bin files from the tar distribution (#30596 ) This commit removes windows specific files from the tar distribution. Windows users use the zip, linux users use the tar.	2018-06-18 19:02:51 +02:00
Luca Cavanna	24163d10b7	REST hl client: cluster health to default to cluster level (#31268 ) With #29331 we added support for the cluster health API to the high-level REST client. The transport client does not support the level parameter, and it always returns all the info needed for shards level rendering. We have maintained that behaviour when adding support for cluster health to the high-level REST client, to ease migration, but the correct thing to do is to default the high-level REST client to `cluster` level, which is the same default as when going through the Elasticsearch REST layer.	2018-06-13 15:06:13 +02:00
Luca Cavanna	92eb324776	REST high-level Client: remove deprecated API methods (#31200 ) This commit removes all the API methods that accept a `Header` varargs argument, in favour of the newly introduced API methods that accept a `RequestOptions` argument. Relates to #31069	2018-06-12 21:00:06 +02:00
Simon Willnauer	f825a530b8	Limit the number of concurrent requests per node (#31206 ) With `max_concurrent_shard_requests` we used to throttle / limit the number of concurrent shard requests a high level search request can execute per node. This had several problems since it limited the number on a global level based on the number of nodes. This change now throttles the number of concurrent requests per node while still allowing concurrency across multiple nodes. Closes #31192	2018-06-11 08:49:18 +02:00
Alan Woodward	852df128a5	Match phrase queries against non-indexed fields should throw an exception (#31060 ) When `lenient=false`, attempts to create match phrase queries with custom analyzers against non-text fields will throw an IllegalArgumentException. Also changes `MatchQueryBuilderTests` so that it avoids this scenario Fixes #31061	2018-06-04 19:12:45 +01:00
Christoph Büscher	1ea9f11b03	Change ScriptException status to 400 (bad request) (#30861 ) Currently failures to compile a script usually lead to a ScriptException, which inherits the 500 INTERNAL_SERVER_ERROR from ElasticsearchException if it does not contain another root cause. Instead, this should be a 400 Bad Request error. This PR changes this more generally for script compilation errors by changing ScriptException to return 400 (bad request) as status code. Closes #12315	2018-05-30 14:00:07 +02:00
Jim Ferenczi	f582418ada	Fix missing option serialization after backport Relates #29465	2018-05-30 12:55:31 +02:00
Vladimir Dolzhenko	81eb8ba0f0	Include size of snapshot in snapshot metadata (#29602 ) Include size of snapshot in snapshot metadata Adds difference of number of files (and file sizes) between prev and current snapshot. Total number/size reflects total number/size of files in snapshot. Closes #18543	2018-05-25 21:04:50 +02:00
Igor Motov	cf0e0606af	Use geohash cell instead of just a corner in geo_bounding_box (#30698 ) Treats geohashes as grid cells instead of just points when the geohashes are used to specify the edges in the geo_bounding_box query. For example, if a geohash is used to specify the top_left corner, the top left corner of the geohash cell will be used as the corner of the bounding box. Closes #25154	2018-05-24 14:46:15 -04:00
Tim Brooks	d7040ad7b4	Reintroduce mandatory http pipelining support (#30820 ) This commit reintroduces `31251c9` and `63a5799`. These commits introduced a memory leak and were reverted. This commit brings those commits back and fixes the memory leak by removing unnecessary retain method calls.	2018-05-23 14:38:52 -06:00
Colin Goodheart-Smithe	4fd0a3e492	Revert "Make http pipelining support mandatory (#30695 )" (#30813 ) This reverts commit `31251c9` introduced in #30695. We suspect this commit is causing the OOME's reported in #30811 and we will use this PR to test this assertion.	2018-05-23 10:54:46 -06:00
Tim Brooks	31251c9a6d	Make http pipelining support mandatory (#30695 ) This is related to #29500 and #28898. This commit removes the ability to disable http pipelining. After this commit, any elasticsearch node will support pipelined requests from a client. Additionally, it extracts some of the http pipelining work to the server module. This extracted work is used to implement pipelining for the nio plugin.	2018-05-22 09:29:31 -06:00
Tanguy Leroux	74474e99d6	[Docs] Fix broken cross link in documentation	2018-05-22 16:03:33 +02:00
Ryan Ernst	34180f2285	Scripting: Remove getDate methods from ScriptDocValues (#30690 ) The getDate() and getDates() existed prior to 5.x on long fields in scripting. In 5.x, a new Date type for ScriptDocValues was added. The getDate() and getDates() methods were left on long fields and added to date fields to ease the transition. This commit removes those methods for 7.0.	2018-05-18 21:26:26 -07:00
Jason Tedor	d68c44b76c	Default copy settings to true and deprecate on the REST layer (#30598 ) This commit defaults the copy_settings REST parameter to the shrink and split APIs to true, and deprecates the parameter.	2018-05-18 10:12:08 -04:00
Ryan Ernst	fb0aa562a5	Network: Remove http.enabled setting (#29601 ) This commit removes the http.enabled setting. While all real nodes (started with bin/elasticsearch) will always have an http binding, there are many tests that rely on the quickness of not actually needing to bind to 2 ports. For this case, the MockHttpTransport.TestPlugin provides a dummy http transport implementation which is used by default in ESIntegTestCase. closes #12792	2018-05-02 11:42:05 -07:00
Ryan Ernst	fba2f00a73	Packaging: Unmark systemd service file as a config file (#29004 ) Systemd overrides should happen through /etc/systemd/system, not directly editing the service file. This commit removes marking the service file as configuration for rpm and deb packages.	2018-05-02 09:48:49 -07:00
Jason Tedor	5de6f4ff7b	Adjust copy settings on resize BWC version This commit adjusts the BWC version for copy settings on resize operations after the behavior was backported to 6.x.	2018-05-01 08:49:16 -04:00
Jason Tedor	50535423ff	Allow copying source settings on resize operation (#30255 ) Today when an index is created from shrinking or splitting an existing index, the target index inherits almost none of the source index settings. This is surprising and a hassle for operators managing such indices. Given this is the default behavior, we can not simply change it. Instead, we start by introducing the ability to copy settings. This flag can be set on the REST API or on the transport layer and it has the behavior that it copies all settings from the source except non-copyable settings (a property of a setting introduced in this change). Additionally, settings on the request will always override. This change is the first step in our adventure: - this flag is added here in 7.0.0 and immediately deprecated - this flag will be backported to 6.4.0 and remain deprecated - then, we will remove the ability to set this flag to false in 7.0.0 - finally, in 8.0.0 we will remove this flag and the only behavior will be for settings to be copied	2018-05-01 08:48:19 -04:00
Jason Tedor	f381e2a00c	Add migration note on thread pool API changes (#29192 ) A previous change modified the output of the thread pool info contained in the nodes info API. This commit adds a note to the migration docs for this change.	2018-04-28 00:11:17 -04:00
Julie Tibshirani	f5978d6d33	In the field capabilities API, remove support for providing fields in the request body. (#30185 )	2018-04-27 16:14:11 -07:00
Jason Tedor	2c3e71f116	Remove the suggest metric from stats APIs (#29635 ) This metric previously existed for backwards compatibility reasons although the suggest stats were folded into search stats. This metric was deprecated in 6.3.0 and this commit removes them for 7.0.0.	2018-04-24 19:03:48 -04:00
Jason Tedor	5d767e449a	Remove bulk fallback for write thread pool (#29609 ) The name of the bulk thread pool was renamed to "write" with "bulk" as a fallback name. This change was made in 6.x for BWC reasons yet in 7.0.0 we are removing this fallback. This commit removes this fallback for the write thread pool.	2018-04-19 16:59:58 -04:00
Jason Tedor	2b47d67d95	Remove the index thread pool (#29556 ) Now that single-document indexing requests are executed on the bulk thread pool the index thread pool is no longer needed. This commit removes this thread pool from Elasticsearch.	2018-04-18 09:18:08 -04:00
Ke Li	0bfb59dcf2	Using ObjectParser in UpdateRequest (#29293 ) CRUD: Parsing changes for UpdateRequest (#29293) Use `ObjectParser` to parse `UpdateRequest` so we reject unknown fields and drop support for the `_fields` parameter because it was deprecated in 5.x.	2018-04-16 08:39:35 -04:00
Jim Ferenczi	1b6d5e531b	Fail _search request with trailing tokens (#29428 ) This change validates that the `_search` request does not have trailing tokens after the main object and fails the request with a parsing exception otherwise. Closes #28995	2018-04-11 13:10:22 +02:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows to remove quite some code, which in-turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Adrien Grand	aeac682869	Make purely negative queries return scores of 0. (#26015 ) It would make them consistent with queries that are only made of filters. Closes #23449	2018-04-10 14:31:06 +02:00
Christoph Büscher	231fd4eb18	Remove `delimited_payload_filter` (#27705 ) From 7.0 on, using `delimited_payload_filter` should throw an error. It was deprecated in 6.2 in favour of `delimited_payload` (#26625). Relates to #27704	2018-04-05 18:41:04 +02:00
Adrien Grand	569d0c0e89	Improve similarity integration. (#29187 ) This improves the way similarities are plugged in in order to: - reject the classic similarity on 7.x indices and emit a deprecation warning otherwise - reject unknown parameters on 7.x indices and emit a deprecation warning otherwise Even though this breaks the plugin API, I'd like to backport to 7.x so that users can get deprecation warnings when they are doing something that will become unsupported in the future. Closes #23208 Closes #29035	2018-04-03 16:45:25 +02:00
Jim Ferenczi	c93c7f3121	Remove deprecated options for query_string (#29203 ) This commit removes some parameters deprecated in 6.x (or 5.x): `use_dismax`, `split_on_whitespace`, `all_fields` and `lowercase_expanded_terms`. Closes #25551	2018-03-22 18:37:08 +01:00
Adrien Grand	8f9d2ee4e2	Reject updates to the `_default_` mapping. (#29165 ) This will reject mapping updates to the `_default_` mapping with 7.x indices and still emit a deprecation warning with 6.x indices. Relates #15613 Supersedes #28248	2018-03-21 10:44:11 +01:00
joadha	08c530907a	[Docs] Update api.asciidoc (#29166 ) The parent page has the same title, and the URL path indicates this is about API changes, so include "API" in the title.	2018-03-21 10:14:26 +01:00
olcbean	3d81497f25	REST: Clear Indices Cache API remove deprecated url params (#29068 ) By the time the master branch is released the deprecated url parameters in the `/_cache/clear` API will have been deprecated for a couple of minor releases. Since master will be the next major release we are fine with removing these parameters.	2018-03-14 16:37:50 -04:00
Mayya Sharipova	f53d159aa1	Limit analyzed text for highlighting (improvements) (#28808 ) Increase the default limit of `index.highlight.max_analyzed_offset` to 1M instead of previous 10K. Enhance an error message when offset increased to include field name, index name and doc_id. Relates to https://github.com/elastic/kibana/issues/16764	2018-03-02 08:09:05 -08:00
Ke Li	a77273fc01	Reject regex search if regex string is too long (#28542 ) * Reject regex search if regex string is too long (#28344) * Add docs * Introduce index level setting `index.max_regex_length` to control the maximum length of the regular expression Closes #28344	2018-02-23 10:41:24 -08:00
Tanguy Leroux	a6a138905d	Use client settings in repository-gcs (#28575 ) Similarly to what has been done for s3 and azure, this commit removes the repository settings `application_name` and `connect/read_timeout` in favor of client settings. It introduces a GoogleCloudStorageClientSettings class (similar to S3ClientSettings) and a bunch of unit tests for that, it aligns the documentation to be more coherent with the S3 one, it documents the connect/read timeouts that were not documented at all and also adds a new client setting that allows to define a custom endpoint.	2018-02-22 15:40:20 +01:00
Martijn van Groningen	ecb1d07d00	percolator: remove deprecated map_unmapped_fields_as_string setting	2018-02-01 11:11:22 +01:00
olcbean	0c83240b5f	Java Api clean up: remove deprecated `isShardsAcked` (#28311 ) This PR removes previously deprecated `isShardsAcked()` method in favour of `isShardsAcknowledged()` on `CreateIndexResponse`, `CreateIndexClusterStateUpdateResponse` and `RolloverResponse` Related to #27784 Follow-up of #27819	2018-01-25 14:13:20 +01:00
Adrien Grand	700d9ecc95	Remove the `update_all_types` option. (#28288 ) This option is not useful in 7.x since no indices may have more than one type anymore.	2018-01-22 12:03:07 +01:00
Mayya Sharipova	dcde895f49	Introduce limit to the number of terms in Terms Query (#27968 ) - Introduce index level settings to control the maximum number of terms that can be used in a Terms Query - Throw an error if a request exceeds this max number Closes #18829	2017-12-28 17:36:29 -05:00
Mayya Sharipova	cbd271e497	Limit the analyzed text for highlighting (#27934 ) * Limit the analyzed text for highlighting - Introduce index level settings to control the max number of character to be analyzed for highlighting - Throw an error if analysis is required on a larger text Closes #27517	2017-12-21 10:19:58 -05:00
olcbean	25c606cf09	Remove deprecated names for string distance algorithms (#27640 ) #27409 deprecated the incorrectly-spelled `levenstein` in favour of `levenshtein`. #27526 deprecated the inconsistent `jarowinkler` in favour of `jaro_winkler`. These changes were merged into 6.2, and this change removes them entirely in 7.0.	2017-12-11 12:16:04 +00:00
Jim Ferenczi	caea6b70fa	Add a new cluster setting to limit the total number of buckets returned by a request (#27581 ) This commit adds a new dynamic cluster setting named `search.max_buckets` that can be used to limit the number of buckets created per shard or by the reduce phase. Each multi bucket aggregator can consume buckets during the final build of the aggregation at the shard level or during the reduce phase (final or not) in the coordinating node. When an aggregator consumes a bucket, a global count for the request is incremented and if this number is greater than the limit an exception is thrown (TooManyBuckets exception). This change adds the ability for multi bucket aggregator to "consume" buckets in the global limit, the default is 10,000. It's an opt-in consumer so each multi-bucket aggregator must explicitly call the consumer when a bucket is added in the response. Closes #27452 #26012	2017-12-06 09:15:28 +01:00
Mayya Sharipova	c6b73239ae	Limit the number of tokens produced by _analyze (#27529 ) Add an index level setting `index.analyze.max_token_count` to control the number of generated tokens in the _analyze endpoint. Defaults to 10000. Throw an error if the number of generated tokens exceeds this limit. Closes #27038	2017-11-30 11:54:39 -05:00
Simon Willnauer	f23ed6188d	Skip shard refreshes if shard is `search idle` (#27500 ) Today we refresh automatically in the background by default every second. This default behavior has a significant impact on indexing performance if the refreshes are not needed. This change introduces a notion of a shard being `search idle` which a shard transitions to after (default) `30s` without any access to an external searcher. Once a shard is search idle all scheduled refreshes will be skipped unless there are any refresh listeners registered. If a search happens on a `search idle` shard the search request will _park_ on a refresh listener and will be executed once the next scheduled refresh occurs. This will also turn the shard into the `non-idle` state immediately. This behavior is only applied if there is no explicit refresh interval set.	2017-11-27 18:16:10 +01:00
kel	4885acb048	Replace `delimited_payload_filter` by `delimited_payload` (#26625 ) The `delimited_payload_filter` is renamed to `delimited_payload`, the old name is deprecated and should be replaced by `delimited_payload`. Closes #21978	2017-11-24 13:03:19 +01:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routing shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificially changing the number of buckets in the consistent hashing space documents might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
Mayya Sharipova	57e4d10007	Limit the number of nested documents (#27405 ) Add an index level setting `index.mapping.nested_objects.limit` to control the number of nested json objects that can be in a single document across all fields. Defaults to 10000. Throw an error if the number of created nested documents exceed this limit during the parsing of a document. Closes #26962	2017-11-22 10:16:28 -05:00
Mayya Sharipova	858b2c7cb8	Standardize underscore requirements in parameters (#27414 ) Standardize underscore requirements in parameters across different types of requests: _index, _type, _source, _id keep their underscores; params like version and retry_on_conflict will be without underscores. Throw an error if older versions of parameters are used. BulkRequest, MultiGetRequest, TermVectorsRequest, MoreLikeThisQuery were changed. Closes #26886	2017-11-17 15:31:52 -05:00
Jim Ferenczi	29331f1127	Fail queries with scroll that explicitly set request_cache (#27342 ) Queries that create a scroll context cannot use the cache. They modify the search context during their execution so using the cache can lead to duplicate results for the next scroll query. This change fails the entire request if the request_cache option is explicitly set on a query that creates a scroll context (`scroll=1m`) and makes sure internally that we never use the cache for these queries when the option is not explicitly used. For 6.x a deprecation log will be printed instead of failing the entire request and the request_cache hint will be ignored (forced to false).	2017-11-10 16:02:06 +01:00
Mayya Sharipova	abbe853f1e	Add limits for ngram and shingle settings (#27211 ) (#27318 ) Relates to #25887	2017-11-08 10:12:57 -05:00
javanna	34666844b3	[DOCS] Clarify migrate guide and search request validation Relates to #26811	2017-10-31 12:36:00 +01:00
kel	c3e2bdf20c	Raise IllegalArgumentException if query validation failed (#26811 ) Closes #26799	2017-10-31 12:17:27 +01:00
Simon Willnauer	8dda827ff4	Don't refresh on `_flush` `_force_merge` and `_upgrade` (#27000 ) Today all these API calls have a side effect of making documents visible to search requests. While this is sometimes desired it's an unnecessary side effect and now that we have an internal (engine-private) index reader (#26972) we artificially add a refresh call for bwc. This change removes this side effect in 7.0.	2017-10-16 10:16:35 +02:00
Alexander Kazakov	592ab043dd	Change default value to true for transpositions parameter of fuzzy query (#26901 )	2017-10-11 15:31:48 +02:00
Nhat	bf4c3642b2	remove _primary and _replica shard preferences (#26791 ) The shard preference _primary, _replica and its variants were useful for the asynchronous replication. However, with the current impl, they are no longer useful and should be removed. Closes #26335	2017-10-08 11:03:06 -04:00
David Turner	8fe9a20982	Forbid negative values for index.unassigned.node_left.delayed_timeout (#26828 ) Change delayed_timeout to be a positiveTimeSetting, and add note that this is a breaking change	2017-09-29 14:44:43 +01:00
Christoph Büscher	6189c54c84	Reject the `index_options` parameter for numeric fields (#26668 ) Numeric fields no longer support the index_options parameter. This changes the parameter to be rejected in numeric field types after it was deprecated in 6.0. Closes #21475	2017-09-25 23:43:14 +02:00
Yannick Welsch	df5c450e89	Add v6.1 BWC layer for adding wait_for_active_shards to index open command This commit disables BWC tests while adding a v6.1 BWC layer for the PR #26682	2017-09-22 16:30:07 +02:00
David Pilato	b01b1c2a58	Remove azure deprecated settings (#26099 ) Follow up for #23405. We remove azure deprecated settings in 7.0: * The legacy azure settings which started with the `cloud.azure.storage.` prefix have been removed. This includes `account`, `key`, `default` and `timeout`. You need to use settings that start with the `azure.client.` prefix instead. * Global timeout setting `cloud.azure.storage.timeout` has been removed. You must set it per azure client instead. Like `azure.client.default.timeout: 10s` for example.	2017-09-12 16:51:44 +02:00
Lee Hinman	cff904bf97	Enable adaptive replica selection by default (#26522 ) Relates to #24915	2017-09-07 09:25:05 -06:00
Jim Ferenczi	86d97971a4	Remove the _all metadata field (#26356 ) * Remove the _all metadata field This change removes the `_all` metadata field. This field is deprecated in 6 and cannot be activated for indices created in 6 so it can be safely removed in the next major version (e.g. 7).	2017-08-28 17:43:59 +02:00
Jim Ferenczi	a48616272f	#26173 : Removed global_ordinals_hash and global_ordinals_low_cardinality execution hints deprecated in 6.1	2017-08-21 20:44:34 +02:00
Lee Hinman	f18ec511ca	Disallow : in cluster and index/alias names (#26247 ) We use `:` for cross-cluster search (eg `cluster:index`), therefore, we should not allow the ambiguity when allowing cluster or index names. Relates to #23892	2017-08-17 14:57:26 -06:00

171 Commits