OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-09 22:45:04 +00:00

Author	SHA1	Message	Date
Hendrik Muhs	68f9102550	[ML-DataFrame] audit changes in the source index (#45282 ) add audits when the set of source indexes changes and in a special case runs empty	2019-08-08 23:31:55 +02:00
Andrei Stefan	740d58fd46	SQL: Uniquely named inner_hits sections for each nested field condition (#45341 ) * Name each inner_hits section of nested queries differently and extract and combine the multiple values it generates into a single list. This also introduces a limitation (its origin it's with Elasticsearch though) on the sorting capabilities when the sorting is based on the nested fields filtered: only one of the conditions applied to nested documents will be used in the nested sorting. (cherry picked from commit cfc5cf68f6e83b07bb9006986d0903d6be418ec6)	2019-08-09 00:22:49 +03:00
Tim Brooks	af908efa41	Disable netty direct buffer pooling by default (#44837 ) Elasticsearch does not grant Netty reflection access to get Unsafe. The only mechanism that currently exists to free direct buffers in a timely manner is to use Unsafe. This leads to the occasional scenario, under heavy network load, that direct byte buffers can slowly build up without being freed. This commit disables Netty direct buffer pooling and moves to a strategy of using a single thread-local direct buffer for interfacing with sockets. This will reduce the memory usage from networking. Elasticsearch currently derives very little value from direct buffer usage (TLS, compression, Lucene, Elasticsearch handling, etc all use heap bytes). So this seems like the correct trade-off until that changes.	2019-08-08 15:10:31 -06:00
Armin Braun	b19de55095	Add missing wait to testAutomaticReleaseOfIndexBlock (#45342 ) (#45351 ) Today the test waits for one of the shards to be blocked, but this does not mean that the block has been applied on all nodes, so a subsequent indexing operation may still go through. Fixes #45338	2019-08-08 22:39:22 +02:00
Henning Andersen	d139896b66	Reindex share retry between hit sources (#44203 ) (#45348 ) The client and remote hit sources had each their own retry mechanism, which would do the same. Supporting resiliency we would have to expand on the retry mechanisms and as a preparation for that, the retry mechanism is now shared such that each sub class is only responsible for sending requests and converting responses/failures to common format. Part of #42612	2019-08-08 22:01:29 +02:00
Mark Vieira	214cbb28df	Fix for build runtime classpath instability (#45347 ) (cherry picked from commit dee4ee2f0d4190ab54d0a4f0aa251d8c03e9db6d)	2019-08-08 12:41:17 -07:00
Christoph Büscher	a552b33276	Fix occasional SuggestSearchIT failure (#45330 ) Refreshes happening during indexing can result differen segment counts and slightly skewed term statistics, which in turn has the potential to change suggestion output slightly. In order to prevent this, disable refresh for the affected tests. Closes #43261	2019-08-08 21:06:32 +02:00
Mark Vieira	0ae103c40f	Avoid unnecessary eager creation of Gradle tasks (#45098 ) (#45310 )	2019-08-08 10:50:09 -07:00
Jack Conradson	b716b840d3	Remove loop counter from Reserved in Painless AST. (#45298 ) This change adds a compiler pass to give each node the chance to store settings necessary for analysis and writing. This removes the need to pass this in a somewhat convoluted way through an additional class called Reserved, and also removes the need to have the Walker set values for settings on reserved. This is next step in decoupling the Painless grammar from the Painless AST.	2019-08-08 09:34:51 -07:00
David Roberts	14545f8958	[ML-DataFrame] Combine task_state and indexer_state in _stats (#45324 ) This commit replaces task_state and indexer_state in the data frame _stats output with a single top level state that combines the two. It is defined as: - failed if what's currently reported as task_state is failed - stopped if there is no persistent task - Otherwise what's currently reported as indexer_state Backport of #45276	2019-08-08 16:24:26 +01:00
Dimitris Athanasiou	e53bb050db	Mute testAutomaticReleaseOfIndexBlock Relates #45338	2019-08-08 17:56:41 +03:00
Andrey Ershov	07c656fba9	Mute testCustomDataPaths on Windows See #45333 (cherry picked from commit 671e1ad1068aee4b593ad0c8ab13ff60b4f125b8)	2019-08-08 16:26:56 +02:00
Mayya Sharipova	f0f2294695	Add filters in examples of vector functions (#45327 )	2019-08-08 09:44:59 -04:00
Zachary Tong	86d6597890	Use newIndexSearcher() instead of newSearcher() (#45248 ) `newSearcher()` from lucene can randomly choose index readers which are not compatible with our tests, like ParallelCompositeReader. The `newIndexSearcher()` method on AggregatorTestCase is a wrapper similar to newSearcher but compatible with our tests	2019-08-08 09:34:38 -04:00
István Zoltán Szabó	4e32470827	[DOCS] Reformats cluster reroute API. (#45328 )	2019-08-08 15:27:54 +02:00
István Zoltán Szabó	4d96c83854	[DOCS] Reformats cluster pending tasks API (#45280 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-08-08 14:49:32 +02:00
James Rodewig	e72bda1703	[DOCS] Reformats cat nodes API (#45285 )	2019-08-08 08:38:02 -04:00
James Rodewig	b806e6edde	[DOCS] Reformats cat pending tasks API (#45287 )	2019-08-08 08:32:04 -04:00
István Zoltán Szabó	276e9c6697	[DOCS] Adds supported time units ref to the ML and DF API params. (#45322 )	2019-08-08 14:26:19 +02:00
Martijn van Groningen	e066133016	Change the ingest simulate api to not include dropped documents (#44161 ) If documents are dropped by the `drop` processor then these documents are returned as a `null` value in the response. === Example Create pipeline: ``` PUT _ingest/pipeline/droppipeline { "processors": [ { "set": { "field": "bla", "value": "val" } }, { "drop": {} } ] } ``` Simulate request: POST _ingest/pipeline/droppipeline/_simulate { "docs": [ { "_source": { "message": "text" } } ] } Response: ``` { "docs": [ null ] } ``` Response if verbose is enabled: ``` { "docs": [ { "processor_results": [ { "doc": { "_index": "_index", "_type": "_doc", "_id": "_id", "_source": { "message": "text", "bla": "val" }, "_ingest": { "timestamp": "2019-07-10T11:07:10.758315Z" } } }, null ] } ] } ``` Closes #36150 * Abort pipeline simulation in verbose mode when document has been dropped by drop processor	2019-08-08 13:04:33 +02:00
Ioannis Kakavas	99ddb8b3d8	Allow empty token endpoint for implicit flow (#45038 ) When using the implicit flow in OpenID Connect, the op.token_endpoint_url should not be mandatory as there is no need to contact the token endpoint of the OP.	2019-08-08 12:50:53 +03:00
David Turner	ddcc38cf1c	More read-only-allow-delete docs (#45320 ) Adds to the `index.blocks.read_only_allow_delete` docs the information that this block may be added or removed automatically, and rewords the breaking-changes docs to mention the blocks explicitly and to recommend using a different block. Relates #42559	2019-08-08 09:58:23 +01:00
István Zoltán Szabó	9f62c04637	[DOCS] Reformats cluster health and cluster state APIs (#45206 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-08-08 10:25:05 +02:00
Martijn van Groningen	fb959d188c	Backport: Add description to force-merge tasks (#41365 ) (#45191 ) * Add description to force-merge tasks (#41365) This is static information that is part of the force merge request. Relates to #15975	2019-08-08 08:15:09 +02:00
Mark Vieira	fca458f1c8	Use system properties for build cache configuration (#45295 )	2019-08-07 13:14:54 -07:00
Gordon Brown	e3599fded7	Add warning about versions to Deprecation API docs (#36624 ) Add a note that the Deprecation API may not be up to date with all breaking changes until the last minor version in a major version series.	2019-08-07 14:11:05 -06:00
Michael Basnight	89861d0884	Add ingest processor existence helper method (#45156 ) This commit adds a helper method to the ingest service allowing it to inspect a pipeline by id and verify the existence of a processor in the pipeline. This work exposed a potential bug in that some processors contain inner processors that are passed in at instantiation. These processors needed a common way to expose their inner processors, so the WrappingProcessor was created in order to expose the inner processor.	2019-08-07 11:19:04 -05:00
Mark Vieira	341ab48ec0	Improve SCM info in build scans (#45264 )	2019-08-07 09:06:11 -07:00
Benjamin Trent	5db9982f71	[7.x] [ML][Data Frame] Add update transform api endpoint (#45154 ) (#45279 ) * [ML][Data Frame] Add update transform api endpoint (#45154) This adds the ability to `_update` stored data frame transforms. All mutable fields are applied when the next checkpoint starts. The exception being `description`. This PR contains all that is necessary for this addition: * HLRC * Docs * Server side	2019-08-07 10:37:35 -05:00
Benjamin Trent	3a71b91dca	[ML][Data Frame] add support for geo_bounds aggregation (#44441 ) (#45281 ) This adds support for `geo_bounds` aggregation inside the `pivot.aggregations` configuration. The two points returned from the `geo_bounds` aggregation are transformed into `geo_shape` whose types are dynamic given the point's similarity. * `point` if the two points are identical * `linestring` if the two points share either a latitude or longitude * `polygon` if the two points are completely different The automatically deduced mapping for the resulting field is a `geo_shape`.	2019-08-07 10:37:09 -05:00
István Zoltán Szabó	95d3a8e8ad	[DOCS] Reformats cluster stats API and expands common params (#45270 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-08-07 16:49:58 +02:00
Lee Hinman	c7ec0b8431	Include in-progress snapshot for a policy with get SLM policy… (#45245 ) This commit adds the "in_progress" key to the SLM get policy API, returning a policy that looks like: ```json { "daily-snapshots" : { "version" : 1, "modified_date" : "2019-08-05T18:41:48.778Z", "modified_date_millis" : 1565030508778, "policy" : { "name" : "<production-snap-{now/d}>", "schedule" : "0 30 1 * * ?", "repository" : "repo", "config" : { "indices" : [ "foo-*", "important" ], "ignore_unavailable" : true, "include_global_state" : false }, "retention" : { "expire_after" : "10m" } }, "last_success" : { "snapshot_name" : "production-snap-2019.08.05-oxctmnobqye3luim4uejhg", "time_string" : "2019-08-05T18:42:23.257Z", "time" : 1565030543257 }, "next_execution" : "2019-08-06T01:30:00.000Z", "next_execution_millis" : 1565055000000, "in_progress" : { "name" : "production-snap-2019.08.05-oxctmnobqye3luim4uejhg", "uuid" : "t8Idqt6JQxiZrzp0Vt7z6g", "state" : "STARTED", "start_time" : "2019-08-05T18:42:22.998Z", "start_time_millis" : 1565030542998 } } } ``` These are only visible while the snapshot is being taken (or failed), since it reads from the cluster state rather than from the repository itself.	2019-08-07 08:29:49 -06:00
István Zoltán Szabó	9384774b4c	[DOCS] Adds supported time units ref to the frequency and delay params. (#45283 )	2019-08-07 16:19:59 +02:00
Alpar Torok	0ea00e4861	Change how we pick bwc versions to check out (#45189 ) Prior to this PR we always checked out the latest bwc branches and had an external mechanism to store the bwc versions used for every CI run so we could both reproduce those builds and run additional tests using the same combination. This adds complexities in setting up and maintaining CI and makes it difficult to set up multi jobs. This change replaces that mechanism with a time based approach that looks at the commit date of the current revision and picks the newest on the bwc branch that's still older than that. It also makes sure there are no merge commits in this interval. This new behavior will is ment to be enabled in CI only, for everything except PR checks that will still use last available bwc revision.	2019-08-07 16:44:38 +03:00
James Rodewig	46fc989ca2	[DOCS] Reformats cat nodeattrs API (#45255 )	2019-08-07 09:31:37 -04:00
James Rodewig	5ade756275	[DOCS] Reformats cat indices API (#45239 )	2019-08-07 09:08:35 -04:00
James Rodewig	d8423f0a80	[DOCS] Reformat cat master API (#45240 )	2019-08-07 09:04:04 -04:00
Bas ten Berge	15d41a7508	[DOCS] Remove outdated 5.x community plugins (#40306 ) Our docs previously included several community plugins that are only supported for versions 5.x and earlier. This removes those plugins for our 6.6+ docs.	2019-08-07 08:45:05 -04:00
Benjamin Trent	be911e6a53	[ML][Data Frames] Fix null aggregation handling in indexer (#45061 ) (#45257 ) * [ML][Data Frames] Fix null aggregation handling in indexer * addressing PR comments * adjusting error messages	2019-08-07 07:01:13 -05:00
Bukhtawar	cd304c4def	Auto-release flood-stage write block (#42559 ) If a node exceeds the flood-stage disk watermark then we add a block to all of its indices to prevent further writes as a last-ditch attempt to prevent the node completely exhausting its disk space. However today this block remains in place until manually removed, and this block is a source of confusion for users who current have ample disk space and did not even realise they nearly ran out at some point in the past. This commit changes our behaviour to automatically remove this block when a node drops below the high watermark again. The expectation is that the high watermark is some distance below the flood-stage watermark and therefore the disk space problem is truly resolved. Fixes #39334	2019-08-07 11:03:53 +01:00
Tanguy Leroux	a869342910	Restore DefaultShardOperationFailedException's reason after deserialization (#45203 ) The reason field of DefaultShardOperationFailedException is lost during serialization. This is sad because this field is checked for nullity during xcontent generation and it means that the cause won't be included in the generated xcontent and won't be printed in two REST API responses (Close Index API and Indices Shard Stores API). This commit simply restores the reason from the cause during deserialization.	2019-08-07 10:37:15 +02:00
Jason Tedor	bd59ee6c72	Fix clock used in update requests (#45262 ) We accidentally switched to using the relative time provider here. This commit fixes this by switching to the appropriate absolute clock.	2019-08-06 21:15:21 -04:00
Julie Tibshirani	9318192578	Correct a code snippet in removal_of_types. (#45118 ) Previously, the reindex examples did not include `_doc` as the destination type. This would result in the reindex failing with the error "Rejecting mapping update to [users] as the final mapping would have more than 1 type: [_doc, user]". Relates to #43100.	2019-08-06 14:09:21 -07:00
Tom Callahan	a7a419bee8	Change Ldap SDK License to LGPL-2.1 (#45116 ) We currently use the unboundid ldap SDK, which is triply licensed under GPL-2.0, LGPL-2.1, and the "UnboundID LDAP SDK Free Use License". We currently identify the license as the latter, but LGPL-2.1 is the one we should be using per our policy.	2019-08-06 16:48:09 -04:00
Lisa Cawley	7c9c9a9cc4	[DOCS] Reformats ML update APIs (#45253 )	2019-08-06 11:16:29 -07:00
James Rodewig	51c1abc112	[DOCS] Reformat match phrase prefix query (#45209 )	2019-08-06 14:02:07 -04:00
Jack Conradson	fc8a6fc9d0	Decouple Painless AST Lambda generation from the grammar (#45111 ) This is the first step in decoupling the Painless AST from the grammar. The Painless AST should be able to generate classes independently of how the AST is generated from a grammar. (If I were to build a Painless AST by hand in code this should be all that's necessary.) This change removes Lambda name generation from the ANTLR grammar tree walker. It also removes unnecessary node generation of new array function references from the tree walker as well.	2019-08-06 10:08:19 -07:00
Jason Tedor	9a142ff25c	Introduce formal node ML role (#45174 ) This commit builds on the ability for plugins to introduce new roles to add a formal node ML role.	2019-08-06 13:00:05 -04:00
Lisa Cawley	bb22b2e45f	[DOCS] Reformats get rollup jobs API (#45114 )	2019-08-06 09:40:18 -07:00
David Turner	f5d1381e01	Remove always-true param from IndicesService#stats (#45231 ) Parameter `includePrevious` is always true, so this commit inlines it.	2019-08-06 17:22:11 +01:00

1 2 3 4 5 ...

47267 Commits