OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-09 14:34:43 +00:00

Author	SHA1	Message	Date
Jason Tedor	7c8860b7e6	Update number of replicas when removing setting (#56723 ) We previously rejected removing the number of replicas setting, which prevents users from reverting this setting to its default the natural way. To fix this, we put back the setting with the default value in the cases that the user is trying to remove it. Yet, we also need to do the work of updating the routing table and so on appropriately. This case was missed because when the setting is being removed, we were defaulting to -1 in this code path, which is treated as not being updated. Instead, we must treat the case when we are removing this setting as if the setting is being updated, too. This commit does that.	2020-05-13 20:13:25 -04:00
debadair	60f8a32dba	[DOCS] Add info about ILM and unallocated shards. (#56655 ) (#56724 ) * [DOCS] Add info about ILM and unallocated shards. * Incorporated review feedback. * Update docs/reference/ilm/actions/ilm-allocate.asciidoc Co-authored-by: James Rodewig <james.rodewig@elastic.co> * Apply suggestions from code review Co-authored-by: James Rodewig <james.rodewig@elastic.co> * Fix xref Co-authored-by: James Rodewig <james.rodewig@elastic.co> Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-05-13 16:12:37 -07:00
David Roberts	ab40466bfb	Prevent unexpected native controller output hanging the process (#56685 ) In normal operation native controllers are not expected to write anything to stdout or stderr. However, if due to an error or something unexpected with the environment a native controller does write something to stdout or stderr then it will block if nothing is reading that output. This change makes the stdout and stderr of native controllers reuse the same stdout and stderr as the Elasticsearch JVM (which are by default redirected to es.stdout.log and es.stderr.log) so that if something unexpected is written to native controller output then: 1. The native controller process does not block, waiting for something to read the output 2. We can see what the output was, making it easier to debug obscure environmental problems Backport of #56491 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-05-13 22:57:00 +01:00
Nik Everett	b98b260048	Merge significant_terms into the terms package (backport of #56699 ) (#56715 ) This merges the code for the `significant_terms` agg into the package for the code for the `terms` agg. They are super entangled already, this mostly just admits that to ourselves. Precondition for the terms work in #56487	2020-05-13 17:36:21 -04:00
Luca Cavanna	34410814b9	Don't omit empty arrays when filtering _source (#56527 ) When using source filtering exclusions, empty arrays are not preserved in documents, and no empty arrays are returned if arrays are empty after applying exclusions. We have special treatment to make sure that we preserve empty objects, but the behaviour for arrays is different. It looks like this regression was introduced by #22593, shortly after we refactored source filtering to use automata (#20736). Note that this change affects what the search API returns when using source exclusions, as well as what gets indexed when using source exclusions for the _source field. Closes #23796	2020-05-13 23:24:21 +02:00
Nik Everett	126619ae3c	Add list of defered aggregations to the profiler (backport of #56208 ) (#56682 ) This adds a few things to the `breakdown` of the profiler: * `histogram` aggregations now contain `total_buckets` which is the count of buckets that they collected. This could be useful when debugging a histogram inside of another bucketing agg that is fairly selective. * All bucketing aggs that can delay their sub-aggregations will now add a list of delayed sub-aggregations. This is useful because we sometimes have fairly involved logic around which sub-aggregations get delayed and this will save you from having to guess. * Aggregtations wrapped in the `MultiBucketAggregatorWrapper` can't accurately add anything to the breakdown. Instead they the wrapper adds a marker entry `"multi_bucket_aggregator_wrapper": true` so we can be quickly pick out such aggregations when debugging. It also fixes a bug where `_count` breakdown entries were contributing to the overall `time_in_nanos`. They didn't add a large amount of time so it is unlikely that this caused a big problem, but I was there. To support the arbitrary breakdown data this reworks the profiler so that the `breakdown` can contain any data that is supported by `StreamOutput#writeGenericValue(Object)` and `XContentBuilder#value(Object)`.	2020-05-13 16:33:22 -04:00
Julie Tibshirani	1ad83c37c4	Use index sort range query when possible. (#56710 ) This PR proposes to use `IndexSortSortedNumericDocValuesRangeQuery` when possible to speed up certain range queries. Points-based queries are already very efficient, the only time this query makes a difference is when the range matches a large number of documents. Relates to #48665.	2020-05-13 13:24:45 -07:00
Ross Wolf	61e2cf89b5	EQL: Add number function (#55084 ) * EQL: Add number function * EQL: Fix the locale used for number for deterministic functionality * EQL: Add more ToNumber tests * EQL: Add more number ToNumberProcessor unit tests * EQL: Remove unnecessary overrides, fix processor methods * EQL: Remove additional unnecessary overrides * EQL: Lint fixes for ToNumber * EQL: ToNumber renames from PR feedback * EQL: Remove NumberFormat locale handling * EQL: Removed NumberFormat from ToNumber * EQL: Add number function tests * EQL: ToNumberProcessorTests formatting * EQL: Remove newline in ToNumberProcessorTests * EQL: Add number(..., null) test * EQL: Create expression.function.scalar.math package * EQL: Remove painless whitespace for ToNumber.asScript * EQL: Add Long support	2020-05-13 14:09:06 -06:00
Jason Tedor	5ca2ea2dde	Allow removing replicas setting on closed indices (#56680 ) This is similar to a previous change that allowed removing the number of replicas settings (so setting it to its default) on open indices. This commit allows the same for closed indices. It is unfortunate that we have separate branches for handling open and closed indices here, but I do not see a clean way to merge these two together without making a rather unnatural method (note that they invoke different methods for doing the settings updates). For now, we leave this as-is even though it led to the miss here.	2020-05-13 15:56:58 -04:00
Mark Vieira	e3be18a443	Add version 6.8.10	2020-05-13 11:27:40 -07:00
Bogdan Pintea	ee437bef27	Docs: forward port release docs of 7.7.0 (#56706 ) Forward port the release docs of 7.7.0: breaking changes, release notes, release highlights.	2020-05-13 20:08:14 +02:00
Julie Tibshirani	a92d138c77	Correct the type of the 'analyzer' parameter in the _analyze docs. (#56650 ) This optional parameter can only be a string. To test out a transient custom analysis chain, users are expected to use the 'tokenizer', 'filter', and 'char_filter' parameters.	2020-05-13 11:05:06 -07:00
Bogdan Pintea	2f0663c490	Add the 7.7.1 Version Add the bumped 7.7 branch new version, 7.7.1	2020-05-13 18:46:07 +02:00
David Turner	26382dff19	Clarify doc count stats (#56665 ) Today we report some statistics in terms of Lucene-level documents, which differ from Elasticsearch-level documents in a number of ways and include things like document tombstones which users cannot directly observe. This commit clarifies the internal nature of these statistics. Closes #56497	2020-05-13 15:07:44 +01:00
Costin Leau	9f1ecd52eb	EQL: Introduce support for sequences (#56300 ) Initial support for EQL sequences The current algorithm is focused on correctness and does not contain any optimization which is left for the future. The current implementation uses a state machine approach which moves ascending and runs each query one after the other working on computing sequences as the data comes in. For each result, the key and its timestamp are being extracted which are then used for matching/building a sequence. (cherry picked from commit 4f3e18c894a1841d333022361ad9d1fdf1477dc3)	2020-05-13 15:42:31 +03:00
James Rodewig	c859fafcbd	[DOCS] Correct `query` datatype in enrich policy definition (#56224 ) Corrects the datatype for the `query` property of an enrich policy object. The `query` property is a query object, not a string.	2020-05-13 08:35:17 -04:00
Ignacio Vera	b4521d5183	upgrade to Lucene 8.6.0 snapshot (#56661 )	2020-05-13 14:25:16 +02:00
Marios Trivyzas	cbbbd499bf	SQL/EQL: Add support for scalars within LIKE/RLIKE (#56495 ) (#56674 ) - Add support for scalar functions on the field of SQL's LIKE/RLIKE - Add support for scalar functions on the field of EQL's match/matchLite Closes: #55058 (cherry picked from commit 51c14e2dbb7fb29004a23369c449d425b3ac8fe2)	2020-05-13 13:40:24 +02:00
Luca Cavanna	30e9a1b8c7	Improve error handling when decoding async execution ids (#56285 ) When decoding async execution ids, exceptions thrown from the decode method itself were not caught, leading to cryptic errors like "Input byte array has incorrect ending byte at 68" being returned. With this commit we return "invalid id: [abcdef]". Added tests coverage for a couple of these scenarios and also added tests for equals/hashcode methods.	2020-05-13 12:26:17 +02:00
Jason Tedor	4394235c63	Allow removing index.number_of_replicas setting (#56656 ) Today a user can create an index without setting the index.number_of_replicas setting even though the index metadata requires that the setting has a value. We do this when creating an index by explicitly settings index.number_of_replicas to a default value if one is not provided. However, if a user updates the number of replicas, and then let wants to return to the default value, they are naturally inclined to try setting this setting to null, as the agreed upon way to return a setting to its default. Since the index metadata requires that this setting has a non-null value, we blow up when a user attempts to make this change. This is because we are not taking the same action when updating a setting on an index that we take when create an index. Namely, we are not explicitly setting index.number_of_replicas if the request does not carry a value for this setting. This would happen when nulling the setting, which we want to support. This commit addresses this by setting index.number_of_replicas to the default if the value for this setting is null when updating the settings for an index.	2020-05-13 06:25:43 -04:00
Marios Trivyzas	e781193cf9	SQL: Fix JDBC url pattern in docs and error message (#56612 ) The docs pattern url was using `*` which means zero or many instead of `?` which means zero or one. The pattern url returned in error messages was not in sync with the one in the docs. Fixes: #56476 (cherry picked from commit 1a5945c3962cdda21482f4b0b3e0ca508534c2c4)	2020-05-13 12:13:58 +02:00
David Turner	c10b4ae15a	Support cloning of searchable snapshot indices (#56595 ) Today you can convert a searchable snapshot index back into a regular index by restoring the underlying snapshot, but this is somewhat wasteful if the shards are already in cache since it copies the whole index from the repository again. Instead, we can make use of the locally-cached data by using the clone API to copy the contents of the cache into the layout expected by a regular shard. This commit marks the searchable snapshot's private index settings as `NotCopyableOnResize` so that they are removed by resize operations such as cloning. Cloning a regular index typically hard-links the underlying files rather than copying them, but this is tricky to support in the case of a searchable snapshot so this commit takes the simpler approach of always copying the underlying files.	2020-05-13 11:05:14 +01:00
Gabriel Petrovay	ca586f2a8d	[Docs] Correct formatting in datehistogram-aggregation.asciidoc (#56664 )	2020-05-13 12:01:42 +02:00
Christoph Büscher	73b64908b2	Fix `time_zone` on `query_string` and date fields (#55881 ) (#56668 ) Currently the `time_zone` parameter in `query_string` queries gets applied correctly only when using the range syntax, e.g "date:[2020-01-02 TO 2020-01-05]. When a date field gets searched without explicit range syntax, e.g. "date:"2020-01-01" we internally create a range query than uses the specified date as start date and rounds up to the next underspecified units for the end date (e.g. here 2020-01-01T23:59:59) without considering the `time_zone` settings. This change adds a check in QueryStringQueryParser to detect this scenario early where we have access to the time zone information and directly create a range query using it. Closes #55813	2020-05-13 11:20:25 +02:00
Ioannis Kakavas	cc119c3853	Expose idp.metadata.http.refresh for SAML realm (#56354 ) (#56593 ) This setting was not returned in the SamlRealmSettings#getSettings so it was not possible for users to set this in the realm config in our configuration.	2020-05-13 11:51:18 +03:00
Martijn van Groningen	d3dace903b	Fix allowed warning in data stream rest test. (#56630 ) (#56634 )	2020-05-13 09:44:19 +02:00
Henning Andersen	48a8c7eb88	Ensure search contexts are removed on index delete (#56335 ) (#56617 ) In a race condition, a search context could remain enlisted in SearchService when an index is deleted, potentially causing the index folder to not be cleaned up (for either lengthy searches or scrolls with timeouts > 30 minutes or if the scroll is kept active).	2020-05-13 09:41:02 +02:00
debadair	6de6ec68f2	[DOCS] Extract the cron docs from Watcher docs and add to the API conventions. (#56313 ) (#56651 ) * [DOCS] Promote cron expressions info from Watcher to a separate topic. * Fix table error * Fixed xref * Apply suggestions from code review Co-authored-by: James Rodewig <james.rodewig@elastic.co> * Incorporated review feedback Co-authored-by: James Rodewig <james.rodewig@elastic.co> Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-05-12 16:36:18 -07:00
CJ Cenizal	2704428a9c	[Docs] Clarify that _ccr/info omits parameters from the response when the follower index is paused. (#55961 )	2020-05-12 15:25:08 -07:00
debadair	cabc963135	[DOCS] Add request body param descriptions for move to step (#56560 ) (#56644 ) * [DOCS] Add request body param descriptions for move to step (#56560) * [DOCS] Clarify definition of max_size (#56561)	2020-05-12 15:00:48 -07:00
Jake Landis	a010f4f624	[7.x] Watcher dont add watches post index if stopped (#56556 ) (#56629 ) Watcher adds watches to the trigger service on the postIndex action for the .watches index. This has the (intentional) side effect of also adding the watches to the stats. The tests rely on these stats for their assertions. The tests also start and stop Watcher between each test for a clean slate. When Watcher executes it updates the .watches index and upon this update it will go through the postIndex method and end up added that watch to the trigger service (and stats). Functionally this is not a problem, if Watcher is stopping or stopped since Watcher is also paused and will not execute the watch. However, with specific timing and expectations of a clean slate can cause issues the test assertions against the stats. This commit ensures that the postIndex action only adds to the trigger service if the Watcher state is not stopping or stopped. When started back up it will re-read index .watches. This commit also un-mutes the tests related to #53177 and #56534	2020-05-12 16:30:27 -05:00
Jake Landis	a56fb6192e	[7.x] Fix ingest simulate verbose on failure with conditional (#56478 ) (#56635 ) If a conditional is added to a processor, and that processor fails, and that processor has an on_failure handler, the full trace of all of the executed processors may not be displayed in simulate verbose. The information is correct, but misses displaying some of the steps used to get there. This happens because a processor that is conditional processor is a wrapper around the real processor and a processor with an on_failure handler is also a wrapper around the processor(s). When decorating for simulation we treat compound processor specially, but if a compound processor is wrapped by a conditional processor that compound processor's processors can be missed for decoration resulting in the missing displayed steps. The fix to this is to treat the conditional processor specially and explicitly seperate it from the processor it is wrapping. This requires us to keep track of 2 processors a possible conditional processor and the actual processor it may be wrapping. related: #56004	2020-05-12 15:41:05 -05:00
James Rodewig	cf76a932fb	[DOCS] Correct watcher event data example (#56469 ) * Swaps outdated index patterns for the default `logstash` index alias. Adds some related information about Logstash ILM defaults to the callout. * Swaps `.raw` fields for `.keyword` fields. The Logstash template uses `keyword` fields by default since 6.x. * Swaps instances of `ctx.payload.hits.total.value` with `ctx.payload.hits.total`	2020-05-12 16:33:33 -04:00
James Rodewig	a5154cc190	[DOCS] Correct setting type for `indices.query.bool.max_clause_count` (#56640 ) #56449 incorrectly labelled this as a dynamic setting. This corrects that error.	2020-05-12 16:26:18 -04:00
Jake Landis	9c76ee47c4	[7.x] json spec: allow null for documentation url (#55749 ) (#56625 ) This commit allows the JSON schema's documentation.url property to have a null value. This can useful for cases where a feature is under development, and does not have documentation published yet. This commit also adds a documentation.url for two ml resources.	2020-05-12 14:49:02 -05:00
Jake Landis	5f5a648b9a	[7.x] remove the term 'system' from indicies doc (#56367 ) (#56375 ) 'system' indices will carry special meaning in the future this commit removes the system from the name to avoid confusion. (technically these indices will be hidden not system)	2020-05-12 14:44:50 -05:00
James Rodewig	54088a21d7	[DOCS] Collapse 7.8 breaking changes for security (#56622 )	2020-05-12 15:30:28 -04:00
Armin Braun	0a879b95d1	Save Bounds Checks in BytesReference (#56577 ) (#56621 ) Two spots that allow for some optimization: * We are often creating a composite reference of just a single item in the transport layer => special cased via static constructor to make sure we never do that * Also removed the pointless case of an empty composite bytes ref * `ByteBufferReference` is practically always created from a heap buffer these days so there is no point of dealing with all the bounds checks and extra references to sliced buffers from that and we can just use the underlying array directly	2020-05-12 20:33:45 +02:00
Jason Tedor	f7b8f0b2f4	Adjust warning for heap size bootstrap check (#56565 ) Today the heap size check warns the user about two issues why they might care about the heap size check: resize pauses, and if memory locking is enabled. Yet, we unconditionally make mention of the memory locking reason, even if memory locking is not enabled. This can confuse some users, so we adjust the warning about memory locking to only display if memory locking is enabled.	2020-05-12 14:31:21 -04:00
James Rodewig	d247e8f7a6	[DOCS] Sort EQL search API params alphabetically	2020-05-12 13:52:18 -04:00
Armin Braun	c104c9a11b	Fix Missing IgnoredUnavailable Flag in 7.x SLM Retention Task (#56616 ) Without the flag we run into the situation where a broken repository (broken by some old 6.x version of ES that is missing some snap-${uuid}.dat blobs fails to run the SLM retention task since it always errors out).	2020-05-12 18:07:58 +02:00
Marios Trivyzas	4240b97d0e	SQL: [Test] Fix JdbcPreparedStatement date test Use `ORDER BY` to ensure order of the rows since more than are returned in the testDate(). Follows: #56492 (cherry picked from commit 0053a1cb515b4db160d7b0bed5cf3f13c1050687)	2020-05-12 17:08:16 +02:00
Martijn van Groningen	0c61bc63e4	Backport: auto create data streams using index templates v2 (#56596 ) Backport: #55377 This commit adds the ability to auto create data streams using index templates v2. Index templates (v2) now have a data_steam field that includes a timestamp field, if provided and index name matches with that template then a data stream (plus first backing index) is auto created. Relates to #53100	2020-05-12 17:01:15 +02:00
Armin Braun	b449661b8f	Remove Unused ByteBufStreamInput (#56567 ) (#56601 ) We're not using this one any more.	2020-05-12 16:04:58 +02:00
Andrei Stefan	f0074e93a0	QL: case sensitive support in EQL (#56404 ) (#56597 ) * QL: case sensitive support in EQL (#56404) * adds a generic startsWith function to QL * modifies the existent EQL startsWith function to be case sensitive aware * improves the existent EQL startsWith function to use a prefix query when the function is used in a case sensitive context. Same improvement is used in SQL's newly added STARTS_WITH function. * adds case sensitivity to EQL configuration through a case_sensitive parameter in the eql request, as established in #54411. The case_sensitive parameter can be specified when running queries (default is case insensitive) (cherry picked from commit ee5a09ea840167566e34c28c8225dc38bc6a7ae8)	2020-05-12 16:56:18 +03:00
James Rodewig	8c457c884a	[DOCS] Add clean up snapshot repository API docs (#56519 )	2020-05-12 09:54:49 -04:00
Rory Hunter	bf97679854	Push arch-specific logic from Gradle into Docker (#56503 ) Docker informed us that for official multi-arch Docker builds, there needs to be a single Dockerfile and build context that can be used for each supported architecture. Therefore, rework the build to move the relevant architecture logic into the Dockerfile, and merge the aarch64 / x64 docker context builds.	2020-05-12 14:31:45 +01:00
Dan Hermann	dfdd7e4fce	Report used memory as zero when total memory cannot be obtained (#56412 )	2020-05-12 07:43:51 -05:00
Hendrik Muhs	a9425a0240	[7.x][Transform] fix count when matching exact ids(#56544 ) (#56582 ) fix count in get and get stats if explicit ids are given and ids might be duplicated when configuration are stored in different index (versions). fixes #56196	2020-05-12 14:23:13 +02:00
Marios Trivyzas	575cafb8da	SQL: Fix serialization of JDBC prep statement date/time params (#56492 ) (#56579 ) The Date/Time related query params of a JDBC prepared statement serialized using java.util.Date. The rules for serializing `java.util.Date` objects though reside in `XContentElasticsearchExtension` which is not available in the jdbc jar as this class is in `server` module. Therefore, a custom extension of the `XContentBuilderExtension` iface has been added to the jdbc module/jar. Moreover the sql's `qa` project had as dependency the `sql-action` module which depends on `server` so the `XContentBuilderExtension` was available for the integ tests hiding the real problem. Previously, when a user was setting a `java.sql.Time` to the prepStmt, the DataType used was `DATETIME` instead of `TIME` and therefore prevented from filtering with a `TIME` casted field: ``` SELECT * FROM test WHERE date::TIME = ? ``` Fixes: #56084 (cherry picked from commit f8d8e971bd2c85fa4aea44b5b3ba0cdcc950a4ed)	2020-05-12 13:25:02 +02:00

1 2 3 4 5 ...

51642 Commits