OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-18 19:05:06 +00:00

Author	SHA1	Message	Date
Alan Woodward	1a2e931d6e	Reduce the max depth of randomly generated interval queries (#50317 ) We randomly generate intervals sources to test serialization and query generation in IntervalQueryBuilderTests. However, rarely we can generate a query that has too many nested disjunctions, resulting in query rewrites running afoul of the maximum boolean clause limit. This commit reduces the maximum depth of the randomly generated intervals source to make running into this limit much more unlikely.	2019-12-19 15:12:12 +00:00
Andrei Dan	1e11d23051	Extract a create index method that only manipulates the ClusterState (#50240 ) (#50328 ) * Extract IndexCreationTask execute into applyCreateIndexRequest This is the first step in preparation for separating the index creation into a few steps that only deal with the cluster state mutation and removing the IndexCreationTask altogether. * Split applyCreateIndexRequest This breaks down the logic in applyCreateIndexRequest into multiple steps that will hopefully make the service more readable and unit testable. The service creation process now goes through a few well defined steps, namely find the templates that possibly match the new index, parse the requested and template matching mappings, process the index and template matching settings, validate the wait for active shards request and create the `IndexService`, update the mappings in the `MapperService` (which is grouped together with creating the sort order for validation purposes), validate the requested and templated matching aliases and finally update the `ClusterState` to reflect the requested changes. This also removes the `IndexCreationTask` as it was a shallow indirection and migrates the tests from `IndexCreationTaskTests` to `MetaDataCreateIndexServiceTests` (making them "real" unit tests operating on the `ClusterState` rather than mocks). * Add more unit tests. * Add IT to verify we cleanup in case of failure (cherry picked from commit 57e6269f750471f05a1a79539ca45361b9e3c2b5) Signed-off-by: Andrei Dan <andrei.dan@elastic.co> # Conflicts: # server/src/main/java/org/elasticsearch/cluster/metadata/MetaDataCreateIndexService.java # server/src/test/java/org/elasticsearch/action/admin/indices/create/CreateIndexIT.java # server/src/test/java/org/elasticsearch/cluster/metadata/IndexCreationTaskTests.java	2019-12-19 12:37:07 +00:00
Igor Motov	c77ca98928	Geo: Switch generated WKT to upper case (#50285 ) Switches generated WKT to upper case to conform to the standard recommendation. Relates #49568	2019-12-18 17:29:08 -05:00
Stuart Tettemer	9cdbcbd121	[TEST] Exclude name on ScriptContextInfo mutate (#50332 ) (#50337 ) ScriptContextInfoSerializingTests:testEqualsAndHashcode was failing because the mutation was generating the same name. Backport Fixes: #50331	2019-12-18 14:23:21 -07:00
Stuart Tettemer	06a24f09cf	Scripting: Cache script results if deterministic (#50106 ) (#50329 ) Cache results from queries that use scripts if they use only deterministic API calls. Nondeterministic API calls are marked in the whitelist with the `@nondeterministic` annotation. Examples are `Math.random()` and `new Date()`. Refs: #49466	2019-12-18 13:00:42 -07:00
Adrien Grand	35a88a5dbb	Add 7.5.2 version.	2019-12-18 19:50:00 +01:00
Ryan Ernst	8439b2779b	Add version 6.8.7 constant	2019-12-18 09:38:07 -08:00
Nikita Glashenko	ef54a9c23c	Add tests for IntervalsSourceProvider.Wildcard and Prefix (#50306 ) This PR adds unit tests for wire and xContent serialization of `IntervalsSourceProvider.Wildcard` and `IntervalsSourceProvider.Prefix`. Relates #50150	2019-12-18 17:41:48 +01:00
Yannick Welsch	37b8c139b3	Omit loading IndexMetaData when inspecting shards (#50214 ) Loading shard state information during shard allocation sometimes runs into a situation where a data node does not know yet how to look up the shard on disk if custom data paths are used. The current implementation loads the index metadata from disk to determine what the custom data path looks like. This PR removes this dependency, simplifying the lookup. Relates #48701	2019-12-17 14:33:02 +01:00
Martijn van Groningen	2079f1cbeb	Backport: Fix ingest simulate response document order if processor executes async (#50269 ) Backport #50244 to 7.x branch. If a processor executes asynchronously and the ingest simulate api simulates with multiple documents then the order of the documents in the response may not match the order of the documents in the request. Alexander Reelsen discovered this issue with the enrich processor with the following reproduction: ``` PUT cities/_doc/munich {"zip":"80331","city":"Munich"} PUT cities/_doc/berlin {"zip":"10965","city":"Berlin"} PUT /_enrich/policy/zip-policy { "match": { "indices": "cities", "match_field": "zip", "enrich_fields": [ "city" ] } } POST /_enrich/policy/zip-policy/_execute GET _cat/indices/.enrich-* POST /_ingest/pipeline/_simulate { "pipeline": { "processors" : [ { "enrich" : { "policy_name": "zip-policy", "field" : "zip", "target_field": "city", "max_matches": "1" } } ] }, "docs": [ { "_id": "first", "_source" : { "zip" : "80331" } } , { "_id": "second", "_source" : { "zip" : "50667" } } ] } ``` * fixed test compile error	2019-12-17 12:27:07 +01:00
Armin Braun	4f24739fbe	Fix Index Deletion During Partial Snapshot Create (#50234 ) (#50266 ) We can simply filter out shard generation updates for indices that were removed from the cluster state concurrently to fix index deletes during partial snapshots as that completely removes any reference to those shards from the snapshot. Follow up to #50202 Closes #50200	2019-12-17 10:58:15 +01:00
Armin Braun	2e7b1ab375	Use ClusterState as Consistency Source for Snapshot Repositories (#49060 ) (#50267 ) Follow up to #49729 This change removes falling back to listing out the repository contents to find the latest `index-N` in write-mounted blob store repositories. This saves 2-3 list operations on each snapshot create and delete operation. Also it makes all the snapshot status APIs cheaper (and faster) by saving one list operation there as well in many cases. This removes the resiliency to concurrent modifications of the repository as a result and puts a repository in a `corrupted` state in case loading `RepositoryData` failed from the assumed generation.	2019-12-17 10:55:13 +01:00
Henning Andersen	8391b974c5	Recovery buffer size 16B smaller (#50100 ) G1GC will use humongous allocations when an allocation exceeds half the chosen region size, which is minimum 1MB. By reducing the recovery buffer size by 16 bytes we ensure that the recovery buffer is never allocated as a humongous allocation.	2019-12-16 22:00:22 +01:00
Nhat Nguyen	731bfa6614	Account trimAboveSeqNo in committed translog generation (#50205 ) Today we do not consider trimAboveSeqNo when calculating the translog generation of an index commit. If there is no new indexing after the primary promotion, then we won't be able to clean up the translog.	2019-12-16 11:40:16 -05:00
Zachary Tong	be78d5cc74	Migrate MinAggregator integration tests to AggregatorTestCase (#50053 ) Also renames MinTests to MinAggregationBuilderTests	2019-12-16 11:15:50 -05:00
Rory Hunter	2bd3a05892	Refactor environment variable processing for Docker (#50221 ) Backport of #49612. The current Docker entrypoint script picks up environment variables and translates them into -E command line arguments. However, since any tool executes via `docker exec` doesn't run the entrypoint, it results in a poorer user experience. Therefore, refactor the env var handling so that the -E options are generated in `elasticsearch-env`. These have to be appended to any existing command arguments, since some CLI tools have subcommands and -E arguments must come after the subcommand. Also extract the support for `_FILE` env vars into a separate script, so that it can be called from more than once place (the behaviour is idempotent). Finally, add noop -E handling to CronEvalTool for parity, and support `-E` in MultiCommand before subcommands.	2019-12-16 15:39:28 +00:00
Armin Braun	afcdc27c02	Fix Index Deletion during Snapshot Finalization (#50202 ) (#50227 ) With #45689 making it so that index metadata is written after all shards have been snapshotted we can't delete indices that are part of the upcoming snapshot finalization any longer and it is not sufficient to check if all shards of an index have been snapshotted before deciding that it is safe to delete it. This change forbids deleting any index that is in the process of being snapshot to avoid issues during snapshot finalization. Relates #50200 (doesn't fully fix yet because we're not fixing the `partial=true` snapshot case here	2019-12-16 13:30:05 +01:00
Henning Andersen	4ced237a7f	Disk threshold decider is enabled by default (#50222 ) An old comment had survived after the default was flipped. Relates #6204	2019-12-16 12:43:34 +01:00
Armin Braun	761d6e8e4b	Remove BlobContainer Tests against Mocks (#50194 ) (#50220 ) * Remove BlobContainer Tests against Mocks Removing all these weird mocks as asked for by #30424. All these tests are now part of real repository ITs and otherwise left unchanged if they had independent tests that didn't call the `createBlobStore` method previously. The HDFS tests also get added coverage as a side-effect because they did not have an implementation of the abstract repository ITs. Closes #30424	2019-12-16 11:37:09 +01:00
Ignacio Vera	3717c733ff	"CONTAINS" support for BKD-backed geo_shape and shape fields (#50141 ) (#50213 ) Lucene 8.4 added support for "CONTAINS", therefore in this commit those changes are integrated in Elasticsearch. This commit contains as well a bug fix when querying with a geometry collection with "DISJOINT" relation.	2019-12-16 09:17:51 +01:00
Nhat Nguyen	6f1098cceb	Fix version in testTurnOffTranslogRetentionAfterAllShardStarted Soft-deletes requires 6.5 or later.	2019-12-15 12:58:28 -05:00
Nhat Nguyen	df46848fb0	Migrate peer recovery from translog to retention lease (#49448 ) Since 7.4, we switch from translog to Lucene as the source of history for peer recoveries. However, we reduce the likelihood of operation-based recoveries when performing a full cluster restart from pre-7.4 because existing copies do not have PPRL. To remedy this issue, we fallback using translog in peer recoveries if the recovering replica does not have a peer recovery retention lease, and the replication group hasn't fully migrated to PRRL. Relates #45136	2019-12-15 10:24:39 -05:00
Nhat Nguyen	c151a75dfe	Use retention lease in peer recovery of closed indices (#48430 ) Today we do not use retention leases in peer recovery for closed indices because we can't sync retention leases on closed indices. This change allows that ability and adjusts peer recovery to use retention leases for all indices with soft-deletes enabled. Relates #45136 Co-authored-by: David Turner <david.turner@elastic.co>	2019-12-15 10:24:34 -05:00
Christoph Büscher	c0216f9a06	Improve DateFieldMapper `ignore_malformed` handling (#50090 ) A recent change around date parsing (#46675) made it stricter, so we should now also catch DateTimeExceptions in DateFieldMapper and ignore those when the `ignore_malformed` option is set. Closes #50081	2019-12-13 10:00:10 +01:00
Zachary Tong	521933aa11	SingleBucket aggs need to reduce their bucket's pipelines first (#50103 ) When decoupling the pipeline reduction from regular agg reduction, MultiBucket aggs were modified to reduce their bucket's pipeline aggs first before reducing the sibling aggs. This modification was missed on SingleBucket aggs, meaning any SingleBucket would fail to reduce any pipeline sub-aggs	2019-12-12 09:07:33 -05:00
Ignacio Vera	b5ec227de8	upgrade to lucene 8.4.0-snapshot-08b8d116f8f (#50129 ) (#50132 )	2019-12-12 13:13:37 +01:00
Adrien Grand	0bba7ccedd	Remove information about the latest PostingsFormat/DocValuesFormat. (#50118 ) (#50127 ) This information is outdated and unused.	2019-12-12 11:46:37 +01:00
Armin Braun	6eee41e253	Remove Unused Single Delete in BlobStoreRepository (#50024 ) (#50123 ) * Remove Unused Single Delete in BlobStoreRepository There are no more production uses of the non-bulk delete or the delete that throws on missing so this commit removes both these methods. Only the bulk delete logic remains. Where the bulk delete was derived from single deletes, the single delete code was inlined into the bulk delete method. Where single delete was used in tests it was replaced by bulk deleting.	2019-12-12 11:17:46 +01:00
Armin Braun	0fae4065ef	Better Logging GCS Blobstore Mock (#50102 ) (#50124 ) * Better Logging GCS Blobstore Mock Two things: 1. We should just throw a descriptive assertion error and figure out why we're not reading a multi-part instead of returning a `400` and failing the tests that way here since we can't reproduce these 400s locally. 2. We were missing logging the exception on a cleanup delete failure that coincides with the `400` issue in tests. Relates #49429	2019-12-12 11:17:22 +01:00
Ryan Ernst	cbff63685a	Ensure meta and document field maps are never null in GetResult (#50112 ) This commit ensures deseriable a GetResult from StreamInput does not leave metaFields and documentFields null. This could cause an NPE in situations where upsert response for a document that did not exist is passed back to a node that forwarded the upsert request. closes #48215	2019-12-11 22:21:55 -08:00
Tim Brooks	38b67f719e	Add int indicating size of transport header (#50085 ) Currently we do not know the size of the transport header (map of request response headers, features array, and action name). This means that we must read the entire transport message to dependably act on the headers. This commit adds an int indicating the size of the transport headers. With this addition we can act upon the headers prior to reading the entire message.	2019-12-11 16:24:19 -07:00
Adrien Grand	adf5c92f8c	Address UUIDTests#testCompression failures. (#50101 ) Those were due to codec randomization. Closes #50048	2019-12-11 22:13:58 +01:00
David Turner	285eacd267	Use more specific loggers in subclasses of TMNA (#50076 ) Adjusts the subclasses of `TransportMasterNodeAction` to use their own loggers instead of the one for the base class. Relates #50056. Partial backport of #46431 to 7.x.	2019-12-11 15:07:47 +00:00
Adrien Grand	87e72156ce	Upgrade to lucene 8.4.0-snapshot-662c455. (#50016 ) (#50039 ) Lucene 8.4 is about to be released so we should check it doesn't cause problems with Elasticsearch.	2019-12-10 18:04:58 +01:00
James Rodewig	3f5678ca79	[DOCS] Remove shadow replica reference (#50029 ) Removes a reference to shadow replicas from the cat shards API docs and a comment in cluster/routing/UnassignedInfo.java. Shadow replicas were removed with #23906.	2019-12-10 09:30:51 -05:00
Armin Braun	ee4a8a08dd	Improve Snapshot Finalization Ex. Handling (#49995 ) (#50017 ) * Improve Snapshot Finalization Ex. Handling Like in #49989 we can get into a situation where the setting of the repository generation (during snapshot finalization) in the cluster state fails due to master failing over. In this case we should not try to execute the next cluster state update that will remove the snapshot from the cluster state. Closes #49989	2019-12-10 13:01:51 +01:00
Yannick Welsch	a16abf921f	Make elasticsearch-node tools custom metadata-aware (#48390 ) The elasticsearch-node tools allow manipulating the on-disk cluster state. The tool is currently unaware of plugins and will therefore drop custom metadata from the cluster state once the state is written out again (as it skips over the custom metadata that it can't read). This commit preserves unknown customs when editing on-disk metadata through the elasticsearch-node command-line tools.	2019-12-10 09:58:11 +01:00
shiwenjie12	dd441962bb	Modify notes (#48331 ) Modify notes	2019-12-09 13:03:40 -05:00
Jason Tedor	bfb2dc1353	Enable dependent settings values to be validated (#49942 ) Today settings can declare dependencies on another setting. This declaration is implemented so that if the declared setting is not set when the declaring setting is, settings validation fails. Yet, in some cases we want not only that the setting is set, but that it also has a specific value. For example, with the monitoring exporter settings, if xpack.monitoring.exporters.my_exporter.host is set, we not only want that xpack.monitoring.exporters.my_exporter.type is set, but that it is also set to local. This commit extends the settings infrastructure so that this declaration is possible. The use of this in the monitoring exporter settings will be implemented in a follow-up.	2019-12-09 12:45:50 -05:00
Vishnu Chilamakuru	056c698540	Add Validation for maxQueryTerms to be greater than 0 for MoreLikeThisQuery (#49966 ) Adds validation for maxQueryTerms to be greater than 0 for MoreLikeThisQuery and MoreLikeThisQueryBuilder. Closes #49927	2019-12-09 15:01:10 +01:00
Armin Braun	62e128f02d	Cleanup Old index-N Blobs in Repository Cleanup (#49862 ) (#49902 ) * Cleanup Old index-N Blobs in Repository Cleanup Repository cleanup didn't deal with old index-N, this change adds cleaning up all old index-N found in the repository.	2019-12-09 12:05:55 +01:00
Armin Braun	ac2774c9fa	Use Cluster State to Track Repository Generation (#49729 ) (#49976 ) Step on the road to #49060. This commit adds the logic to keep track of a repository's generation across repository operations. See changes to package level Javadoc for the concrete changes in the distributed state machine. It updates the write side of new repository generations to be fully consistent via the cluster state. With this change, no `index-N` will be overwritten for the same repository ever. So eventual consistency issues around conflicting updates to the same `index-N` are not a possibility any longer. With this change the read side will still use listing of repository contents instead of relying solely on the cluster state contents. The logic for that will be introduced in #49060. This retains the ability to externally delete the contents of a repository and continue using it afterwards for the time being. In #49060 the use of listing to determine the repository generation will be removed in all cases (except for full-cluster restart) as the last step in this effort.	2019-12-09 09:02:57 +01:00
Yannick Welsch	7a2e35caa0	Properly fake corrupted translog (#49918 ) The fake translog corruption in the test sometimes generates invalid translog files where some assertions do not hold (e.g. minSeqNo <= maxSeqNo or minTranslogGen <= translogGen) Closes #49909	2019-12-09 08:33:40 +01:00
Yannick Welsch	01d36afa4b	Randomly run CCR tests with _source disabled (#49922 ) Makes sure that CCR also properly works with _source disabled. Changes one exception in LuceneChangesSnapshot as the case of missing _recovery_source because of a missing lease was not properly properly bubbled up to CCR (testIndexFallBehind was failing).	2019-12-09 08:33:40 +01:00
Armin Braun	f768f8ddab	Fix TimedRunnable Executing onAfter Twice (#49910 ) (#49930 ) If we have a nested `AbstractRunnable` inside of `TimedRunnable` it's executed twice on `run` (once when its own `run` method is invoked and once when the `onAfter` in the `TimedRunnable` is executed). Simply removing the `onAfter` override in `TimedRunnable` makes sure that the `onAfter` is only called once by the `run` on the nested `AbstractRunnable` itself. Same was done for `onFailure` as it was double-triggering as well on exceptions in the inner `onFailure`.	2019-12-08 17:36:05 +01:00
Armin Braun	8ae11e176a	Cleanup some in o.e.transport (#49901 ) (#49971 ) Cleaning up some obvious compile warnings and dead code.	2019-12-08 16:14:20 +01:00
Stuart Tettemer	17cda5b2c0	Scripting: Groundwork for caching script results (#49895 ) (#49944 ) In order to cache script results in the query shard cache, we need to check if scripts are deterministic. This change adds a default method to the script factories, `isResultDeterministic() -> false` which is used by the `QueryShardContext`. Script results were never cached and that does not change here. Future changes will implement this method based on whether the results of the scripts are deterministic or not and therefore cacheable. Refs: #49466 Backport	2019-12-06 15:08:05 -07:00
David Roberts	17fa9d5844	[TEST] Mute ConnectionManagerTests.testConcurrentConnectsAndDisconnects Due to https://github.com/elastic/elasticsearch/issues/49903	2019-12-06 17:06:34 +00:00
Alexander Reelsen	d299bf5760	Add tests for ingesting CBOR data attachments (#49715 ) Our docs specifically mention that CBOR is supported when ingesting attachments. However this is not tested anywhere. This adds a test, that uses specifically CBOR format in its IndexRequest and another one that behaves like CBOR in the ingest attachment unit tests.	2019-12-06 14:33:39 +01:00
Orhan Toy	0f02e02d77	Consistent case in CLI option descriptions (#49635 ) This commit improves the casing of messages in the CLI help descriptions.	2019-12-05 13:36:11 -08:00

... 2 3 4 5 6 ...

4138 Commits