OpenSearch

Commit Graph

Author	SHA1	Message	Date
Luca Cavanna	dbbf2772d8	Mute newly added ml data streams tests (#58492 ) Relates to #58491	2020-06-24 15:11:40 +02:00
Luca Cavanna	7e2bb8d6a2	Mute Netty4HttpServerTransportTests#testCorsRequest (#58480 ) Relates to #58433	2020-06-24 14:31:38 +02:00
Jim Ferenczi	f6d5f452cd	Fix MultiClusterSearchYamlTestSuiteIT test failures (#58359 ) Restore number of shards for the field_caps_empty_index	2020-06-24 13:39:30 +02:00
markharwood	d5ac3bb87f	Field capabilities - make `keyword` a family of field types (#58315 ) (#58483 ) Introduces a new method on `MappedFieldType` to return a family type name which defaults to the field type. Changes `wildcard` and `constant_keyword` field types to return `keyword` for field capabilities. Relates to #53175	2020-06-24 12:32:14 +01:00
Jim Ferenczi	ec8d5ec79c	Fix handling of terminate_after when size is 0 (#58212 ) `terminate_after` is ignored on search requests that don't return top hits (`size` set to 0) and do not track the number of hits accurately (`track_total_hits`). We use early termination when the number of hits to track is reached during collection but this breaks the hard termination of `terminate_after` if it happens before we reach the `terminate_after` value. This change ensures that we continue to check `terminate_after` even if the tracking of total hits has reached the provided value. Closes #57624	2020-06-24 13:16:11 +02:00
David Turner	796cb9e9ca	Reword INDEX_READ_ONLY_ALLOW_DELETE_BLOCK message (#58410 ) Users are perennially confused by the message they get when writing to an index is blocked due to excessive disk usage: TOO_MANY_REQUESTS/12/index read-only / allow delete (api) Of course this is technically accurate but it is hard to join the dots from this message to "your disk was too full" without some searching of forums and documentation. Additionally in #50166 we changed the status code to today's `429` from the previous `403` which changed the message from the one that's widely documented elsewhere: FORBIDDEN/12/index read-only / allow delete (api) Since #42559 we've considered this block to be under the sole control of the disk-based shard allocator, and we have seen no evidence to suggest that anyone is applying this block manually. Therefore this commit adjusts this block's message to indicate that it's caused by a lack of disk space.	2020-06-24 10:22:11 +01:00
Alan Woodward	d251a482e9	Move MappedFieldType.similarity() to TextSearchInfo (#58439 ) Similarities only apply to a few text-based field types, but are currently set directly on the base MappedFieldType class. This commit moves similarity information into TextSearchInfo, and removes any mentions of it from MappedFieldType or FieldMapper. It was previously possible to include a similarity parameter on a number of field types that would then ignore this information. To make it obvious that this has no effect, setting this parameter on non-text field types now issues a deprecation warning.	2020-06-24 10:00:32 +01:00
Jim Ferenczi	fcd8a432d9	Submit _async search task should cancel children on cancellation (#58332 ) This change allows the submit async search task to cancel children and removes the manual indirection that cancels the search task when the submit task is cancelled. This is now handled by the task cancellation, which can cancel grand-children since #54757.	2020-06-24 09:10:26 +02:00
Ryan Ernst	88f1dab8b5	Fix long/int precision for test baseport calculation	2020-06-23 16:02:13 -07:00
Ryan Ernst	6285b87b97	Adjust gradle base port by one (#58368 ) When assigning ports for internal cluster tests, we use the gradle worker id as an adjustment on the base port of 10300. In order to not go outside the max port range, we modulo the worker id by 223. Since gradle worker ids start at 1, we expect to never actually get the base port of 10300. However, as the gradle daemon lasts for longer, the modulo can result in a value of 0, which causes the test to fail. This commit adjusts the modulo to ensure the value is never 0. closes #58279	2020-06-23 15:42:26 -07:00
Ryan Ernst	89c03e593c	Create utility for custom config setup in packaging tests (#58352 ) This commit creates a shared withCustomConfig method that may be used by any packaging test. The method will copy the config directory and override the conf path appropriately depending on the distribution type.	2020-06-23 15:12:22 -07:00
Larry Gregory	2ca09cddaf	[DOCS] Rename kibana user to kibana_system (#58423 )	2020-06-23 14:25:09 -07:00
Przemysław Witek	4e4ca6ac25	Extract ClientHelper.filterSecurityHeaders method and use it in ML code (#58447 ) (#58459 )	2020-06-23 22:18:39 +02:00
Dan Hermann	b40c27698f	Fix incorrect stats warning when swap is disabled	2020-06-23 14:34:27 -05:00
Benjamin Trent	a9b868b7a9	[7.x] [ML] allow data streams to be expanded for analytics and transforms (#58280 ) (#58455 ) This commit allows data streams to be a valid source for analytics and transforms. Data streams are fairly transparent and our `_search` and `_reindex` actions work without error. For `_transforms` the check-pointing works as desired as well. Data streams are effectively treated as an `alias` and the backing index values are stored within checkpointing information.	2020-06-23 14:40:35 -04:00
Benjamin Trent	0cc84d3caf	[ML] wait for yellow state for stats index in tests (#58436 ) (#58456 ) GET inference stats now reads from the .ml-stats index. Our tests should wait for yellow state before attempting to query the index for stat information.	2020-06-23 13:32:24 -04:00
James Rodewig	affc3954e6	[DOCS] Fix typo in RoutingNode comment (#58079 ) (#58454 ) Co-authored-by: Howard <danielhuang@tencent.com>	2020-06-23 13:07:08 -04:00
Dimitris Athanasiou	f67fee387b	[7.x][ML] Make regression training set predictable in size (#58331 ) (#58453 ) Unlike `classification`, which is using a cross validation splitter that produces training sets whose size is predictable and equal to `training_percent * class_cardinality`, for regression we have been using a random splitter that takes an independent decision for each document. This means we cannot predict the exact size of the training set. This poses a problem as we move towards performing test inference on the java side as we need to be able to provide an accurate upper bound of the training set size to the c++ process. This commit replaces the random splitter we use for regression with the same streaming-reservoir approach we do for `classification`. Backport of #58331	2020-06-23 19:49:03 +03:00
Marios Trivyzas	e7c40d973e	SQL: Relax parsing of date/time escaped literals (#58336 ) (#58450 ) Improve the usability of the MS-SQL server/ODBC escaped date/time/timestamp literals, by allowing timezone/offset ids in the parsed string, e.g.: ``` {ts '2000-01-01T11:11:11Z'} ``` Closes: #58262 (cherry picked from commit 0af1f2fef805324e802d97d2fd9b4660abb403f0)	2020-06-23 18:05:54 +02:00
Christoph Büscher	642b05a511	Fix test failure in RangeQueryBuilderTests.testToQuery (#58449 ) Very rarely this test can fail if we draw a random TimeZone id that we cannot parse with the legacy joda DateMathParser and get an IllegalArgumentException. In addition to a "SystemV/*" time zone we also need an index "versionCreated" before V_7_0_0 and no "format" setting in the query builder. Given how unlikely this combination is, we should simply disallow those time zone ids when generating the random query builder for RangeQueryBuilderTests. Closes #58431	2020-06-23 17:44:18 +02:00
David Roberts	0d6bfd0ac3	[7.x][ML] Fix wire serialization for flush acknowledgements (#58443 ) There was a discrepancy in the implementation of flush acknowledgements: most of the class was designed on the basis that the "last finalized bucket time" could be null but the wire serialization assumed that it was never null. This works because the C++ sends zero "last finalized bucket time" when it is not known or not relevant. But then the Java code will print that to XContent as it is assuming null represents not known or not relevant. This change corrects the discrepancies. Internally within the class null represents not known or not relevant, but this is translated from/to 0 for communications from the C++ and old nodes that have the bug. Additionally I switched from Date to Instant for this class and made the member variables final to modernise it a bit. Backport of #58413	2020-06-23 16:42:06 +01:00
Mark Tozzi	52806a8f89	Small VS config cleanup (#58294 ) (#58442 )	2020-06-23 10:53:06 -04:00
Benjamin Trent	61142a3005	[ML] only log if forecasts are set to failed (#58421 ) (#58437 ) This adjusts the logging level for setting forecasts to failed to WARN. And it will only log if 1 or more forecasts were adjusted to failed.	2020-06-23 10:24:03 -04:00
James Rodewig	afbf3bd33b	[DOCS] Add data streams to bulk, delete, and index API docs (#58340 ) (#58434 ) Updates existing docs for the bulk, delete and index APIs to make them aware of data streams.	2020-06-23 09:40:25 -04:00
Alan Woodward	8ebd341710	Add text search information to MappedFieldType (#58230 ) (#58432 ) Now that MappedFieldType no longer extends lucene's FieldType, we need to have a way of getting the index information about a field necessary for building text queries, building term vectors, highlighting, etc. This commit introduces a new TextSearchInfo abstraction that holds this information, and a getTextSearchInfo() method to MappedFieldType to make it available. Field types that do not support text search can just return null here. This allows us to remove the MapperService.getLuceneFieldType() shim method.	2020-06-23 14:37:26 +01:00
Nik Everett	519f41950a	Save memory when significant_text is not on top (#58145 ) (#58364 ) This merges the aggregator for `significant_text` into `significant_terms`, applying the optimization built in #55873 to save memory when the aggregation is not on top. The `significant_text` aggregation is pretty memory intensive all on its own and this doesn't particularly help with that, but it'll help with the memory usage of any sub-aggregations.	2020-06-23 09:19:05 -04:00
James Rodewig	9d03204308	[DOCS] Prohibit deletion of composable template in use by data stream (#58347 ) (#58430 ) Notes that you cannot delete a composable template currently in use by a data stream. Relates to #57957.	2020-06-23 09:01:17 -04:00
James Rodewig	b213f0222c	[DOCS] Reword tip in data streams overview	2020-06-23 08:57:59 -04:00
Dan Hermann	41e8f584c1	[7.x] Minimum node version check before creating data stream (#58424 )	2020-06-23 07:45:27 -05:00
Armin Braun	943efb78fd	Save Shard ID Serializations in Bulk Requests (#56209 ) (#58414 ) Just like #56094 but for the request side. Removes a lot of redundant `ShardId` instances from bulk shard requests as well as stops serializing index names when they're not needed because they're not different from what is in the shard id. Even ignoring the index name serialization savings here, this change saves one `ShardId` instance per bulk shard request at least. This means it saves approximately: * 8 bytes for the `ShardId` object (itself + one field) * + another 4 bytes for the `int` in the `ShardId` * 16 bytes (two fields + the instance itself + the padding) for the `Index` object * + 30 bytes for the `Index` uuid string * + all the bytes in the index name string => 60+ bytes per bulk request item saved on heap and over the wire	2020-06-23 12:35:52 +02:00
David Turner	256b660f0a	Remove anonymous PublicationContext implementation (#58412 ) Today the `PublicationContext` interface has a single anonymous implementation, and `PublicationTransportHandler` has various methods that take the variables that this anonymous class captures. This commit refactors this into a proper class with proper fields and moves the relevant methods onto this class. Backport of #58405 to 7.x.	2020-06-23 11:13:23 +01:00
Alan Woodward	519d1278e2	Make FieldTypeLookup immutable (#58162 ) (#58411 ) FieldTypeLookup maps field names to their MappedFieldTypes. In the past, due to the presence of multiple mapping types within a single index, this had to be updated in-place because a mapping update might only affect one type. However, now that we only have a single type per index, we can completely rebuild the FieldTypeLookup on each update, removing lots of concurrency worries.	2020-06-23 10:51:32 +01:00
David Roberts	f97b37190b	[ML] Add a new annotation type for categorization status changes (#58394 ) Adds a new value to the "event" enum of ML annotations, namely "categorization_status_change". This will allow users to see when categorization was found to be performing poorly. Once per-partition categorization is available, it will allow users to see when categorization is performing poorly for a specific partition. It does not make sense to reuse the "model_change" event that annotations already have, because categorizer state is separate to model state ("model" state is really anomaly detector state), and is not reverted by the revert model snapshot API. Therefore annotations related to categorization need to be treated differently to annotations related to anomaly detection.	2020-06-23 09:16:27 +01:00
Rene Groeschke	fc60cf6179	Introduce EnforceDeprecationFailuresPlugin (#58263 ) (#58309 ) - extract fail on deprecated usage into its own plugin - apply on all projects - ensures we don't miss any project (missed xpack/plugin/eql/qa/security before)	2020-06-23 09:14:12 +02:00
Rene Groeschke	bd2dd81bc6	Fix deprecated property usage in archive tasks (#58269 ) (#58308 )	2020-06-23 09:11:46 +02:00
István Zoltán Szabó	3169e4c70e	[DOCS] Updates screenshots in ML population analysis (#58318 )	2020-06-23 09:05:08 +02:00
Martijn van Groningen	7dda9934f9	Keep track of timestamp_field mapping as part of a data stream (#58400 ) Backporting #58096 to 7.x branch. Relates to #53100 * use mapping source directly instead of using mapper service to extract the relevant mapping details * moved assertion to TimestampField class and added helper method for tests * Improved logic that inserts timestamp field mapping into a mapping. If the timestamp field path consisted of object fields and if the final mapping did not contain the parent field then an error occurred, because the prior logic assumed that the object field existed.	2020-06-22 17:46:38 +02:00
Costin Leau	765f1b5775	SQL: Fix bug in resolving aliases against filters (#58399 ) When doing aliasing with the same name over non-existing fields, the analyzer gets stuck in a loop trying to resolve the alias over and over, leading to a stack overflow (SO). This PR breaks the cycle by checking the relationship between the alias and the child it tries to replace, as an alias should never replace its child. Fix #57270 Close #57417 Co-authored-by: Hailei <zhh5919@163.com> (cherry picked from commit 46786ff2e1ed5951006ff4bdd2b6ac6a1ebcf17b)	2020-06-22 16:05:42 +03:00
Dan Hermann	c5f5cc4cf8	[DOCS] Prohibit cloning, splitting, and shrinking a data stream's write index (#58105 ) (#58401 )	2020-06-22 07:29:26 -05:00
Przemko Robakowski	a44dad9fbb	[7.x] Add support for snapshot and restore to data streams (#57675 ) (#58371 ) * Add support for snapshot and restore to data streams (#57675) This change adds support for including data streams in snapshots. Names are provided in indices field (the same way as in other APIs), wildcards are supported. If rename pattern is specified it renames both data streams and backing indices. It also adds test to make sure SLM works correctly. Closes #57127 Relates to #53100 * version fix * compilation fix * compilation fix * remove unused changes * compilation fix * test fix	2020-06-19 22:41:51 +02:00
Benjamin Trent	bf8641aa15	[7.x] [ML] calculate cache misses for inference and return in stats (#58252 ) (#58363 ) When a local model is constructed, the cache hit miss count is incremented. When a user calls _stats, we will include the sum cache hit miss count across ALL nodes. This statistic is important when comparing against the inference_count. If the cache hit miss count is near the inference_count it indicates that the cache is overburdened, or inappropriately configured.	2020-06-19 09:46:51 -04:00
James Rodewig	d8dc638a67	[DOCS] Document get data stream API response body (#58344 ) (#58360 )	2020-06-18 16:42:05 -04:00
James Rodewig	b8fa90198b	[DOCS] Prohibit deletion of a data stream's write index (#58341 ) (#58358 )	2020-06-18 16:00:10 -04:00
Lisa Cawley	6680271691	[DOCS] Updates pull and issue release attributes (#58348 )	2020-06-18 12:55:02 -07:00
Nik Everett	49684463dd	Mute ESTestCaseTests#testBasePortGradle Tracked by #58279. Failed a few times a day since June 13th.	2020-06-18 15:41:25 -04:00
Tal Levy	11086d5c7d	add geo_shape documentation for supported aggregations (#58284 ) (#58354 ) This commit adds documentation for geo_shape fields in aggregations Closes #55495.	2020-06-18 12:36:24 -07:00
William Brafford	b3c99f06d6	Mute flaky test (#58356 )	2020-06-18 15:30:11 -04:00
Andrei Dan	30e777856f	[7.x] Validate alias operations don't target data streams (#58327 ) (#58337 ) This adds validation to make sure alias operations (add, remove, remove index) don't target data streams or the backing indices. (cherry picked from commit 816448990e464a02f3960f12f6f6644a8cce36a4) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-06-18 20:23:07 +01:00
William Brafford	4836236446	Mute flaky multicluster tests (#58350 )	2020-06-18 14:39:59 -04:00
Ryan Ernst	d702cb0ad9	Consolidate temp dir handling in packaging tests (#58292 ) The packaging tests currently have a couple different ways of deciding where temp files should be placed, and then sometimes used fixed file or directory names within that dir. This commit consolidates some of that temp dir handling by making it more compatible with the handling that exists within the bats tests, where /tmp is not always appropriate due to how systemd interacts with it. This commit also adds a utility method for creating temp dirs, so as to ensure the new directory is created as if a umask of 022 were used, which is not the case when using Files.createTempDirectory without a set of permissions (it assumes 077).	2020-06-18 11:35:11 -07:00
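
The base-port adjustment described in commit 6285b87b97 above can be illustrated with a short sketch. This is not the project's code: the function names are hypothetical, and only the base port (10300), the modulus (223) and the fact that worker ids start at 1 come from the commit message. Shifting by one on both sides of the modulo is one plausible way to keep the result from ever being 0. How the offset is then turned into an actual port is not covered here.

```python
BASE_PORT = 10300       # base port named in the commit message
WORKER_MODULUS = 223    # modulus named in the commit message


def naive_offset(worker_id: int) -> int:
    """Plain modulo: returns 0 whenever worker_id is a multiple of 223."""
    return worker_id % WORKER_MODULUS


def adjusted_offset(worker_id: int) -> int:
    """Shift by one before and after the modulo (an assumed fix).

    For worker ids >= 1 the result always lies in 1..223, so the
    unadjusted base port itself is never selected.
    """
    if worker_id < 1:
        raise ValueError("gradle worker ids are expected to start at 1")
    return (worker_id - 1) % WORKER_MODULUS + 1


if __name__ == "__main__":
    for wid in (1, 222, 223, 224, 446):
        print(wid, naive_offset(wid), adjusted_offset(wid))
```

With a long-lived daemon the worker id eventually reaches 223, where the plain modulo wraps to 0 while the shifted version stays within 1..223.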

52405 Commits, All Branches