OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-08 05:58:44 +00:00

Author	SHA1	Message	Date
Lee Hinman	b4cc3cd35d	Remove FORCE version_type This was an error-prone version type that allowed overriding previous version semantics. It could cause primaries and replicas to be out of sync however, so it has been removed. Resolves #19769	2016-09-07 13:05:18 -06:00
Chris Earle	6a7309c09a	Add "version" field to Pipelines This adds a version field to Pipelines, which is itself is unused by Elasticsearch, but exists for users to better manage their own pipelines.	2016-09-07 10:27:40 -04:00
Luca Cavanna	faa03ad9fa	Merge pull request #20255 from javanna/enhancement/cluster_stats_available_memory Add mem section back to cluster stats	2016-09-02 10:19:51 +02:00
Adrien Grand	5bfab76c96	Source filtering should keep working when the source contains numbers greater than `Long.MAX_VALUE`. #20278 Currently it does not because our parsers do not support big integers/decimals (on purpose) but we do not have to ask our parser for the number type, we can just ask the jackson parser for a number representation of the value with the right type. Note that I did not add similar tests for big decimals because Jackson seems to never return big decimals, even for decimal values that are out of the range of values that can be represented by doubles. Closes #11508	2016-09-02 08:56:04 +02:00
javanna	5f299ff46f	add mem section back to cluster stats The mem section was buggy in cluster stats and removed. It is now added back with the same structure as in node stats, containing total memory, available memory, used memory and percentages. All the values are the sum of all the nodes across the cluster (or at least the ones that we were able to get the values from).	2016-09-01 11:26:03 +02:00
Simon Willnauer	a0becd26b1	Optimize indexing for the autogenerated ID append-only case (#20211 ) If elasticsearch controls the ID values as well as the documents version we can optimize the code that adds / appends the documents to the index. Essentially we an skip the version lookup for all documents unless the same document is delivered more than once. On the lucene level we can simply call IndexWriter#addDocument instead of #updateDocument but on the Engine level we need to ensure that we deoptimize the case once we see the same document more than once. This is done as follows: 1. Mark every request with a timestamp. This is done once on the first node that receives a request and is fixed for this request. This can be even the machine local time (see why later). The important part is that retry requests will have the same value as the original one. 2. In the engine we make sure we keep the highest seen time stamp of "retry" requests. This is updated while the retry request has its doc id lock. Call this `maxUnsafeAutoIdTimestamp` 3. When the engine runs an "optimized" request comes, it compares it's timestamp with the current `maxUnsafeAutoIdTimestamp` (but doesn't update it). If the the request timestamp is higher it is safe to execute it as optimized (no retry request with the same timestamp has been run before). If not we fall back to "non-optimzed" mode and run the request as a retry one and update the `maxUnsafeAutoIdTimestamp` unless it's been updated already to a higher value Relates to #19813	2016-09-01 10:39:40 +02:00
Ali Beyad	4641254ea6	Parameter improvements to Cluster Health API wait for shards (#20223 ) * Params improvements to Cluster Health API wait for shards Previously, the cluster health API used a strictly numeric value for `wait_for_active_shards`. However, with the introduction of ActiveShardCount and the removal of write consistency level for replication operations, `wait_for_active_shards` is used for write operations to represent values for ActiveShardCount. This commit moves the cluster health API's usage of `wait_for_active_shards` to be consistent with its usage in the write operation APIs. This commit also changes `wait_for_relocating_shards` from a numeric value to a simple boolean value `wait_for_no_relocating_shards` to set whether the cluster health operation should wait for all relocating shards to complete relocation. * Addresses code review comments * Don't be lenient if `wait_for_relocating_shards` is set	2016-08-31 11:58:19 -04:00
Nik Everett	df73292256	Add an alias action to delete an index While removing an index isn't actually an alias action, if we add an alias action that deletes an index then we can delete and index and add an alias with the same name as the index atomically, in the same cluster state update. Closes #20064	2016-08-30 10:15:21 -04:00
Tanguy Leroux	b4245c7ad9	Add exclusion filters support to filter_path This commit adds the support for exclusion filter to the response filtering (filter_path) feature. It changes the XContentBuilder APIs so that it now accepts two types of filters: inclusive and exclusive. Filters are no more String arrays but sets of String instead.	2016-08-30 09:08:30 +02:00
Greg Marzouka	2363c7dcdd	Merge pull request #20186 from gmarz/spec/wait_for_active_shards [SPEC] Fix type for wait_for_active_shards (string => number)	2016-08-29 10:19:33 -04:00
Greg Marzouka	84f05cd7d5	[SPEC] Change type of wait_for_active_shards from number to string	2016-08-29 09:31:01 -04:00
Jun Ohtani	2a00c9dc46	Merge pull request #19860 from johtani/fix/validate_empty_field_name Validate blank field name	2016-08-29 11:52:18 +09:00
Yannick Welsch	1b75cb63a2	Add recovery source to ShardRouting (#19516 ) Adds an explicit recoverySource field to ShardRouting that characterizes the type of recovery to perform: - fresh empty shard copy - existing local shard copy - recover from peer (primary) - recover from snapshot - recover from other local shards on same node (shrink index action)	2016-08-27 16:11:10 +02:00
Jun Ohtani	450f47d5b5	Validate blank field name add validation and validate only 5.0+ Add tests before 5.0 Closes #19251	2016-08-26 20:10:33 +09:00
Adrien Grand	3ed0da5a58	GET operations should not extract fields from `_source`. #20158 This makes GET operations more consistent with `_search` operations which expect `(stored_)fields` to work on stored fields and source filtering to work on the `_source` field. This is now possible thanks to the fact that GET operations do not read from the translog anymore (#20102) and also allows to get rid of `FieldMapper#isGenerated`. The `_termvectors` API (and thus more_like_this too) was relying on the fact that GET operations would extract fields from either stored fields or the source so the logic to do this that used to exist in `ShardGetService` has been moved to `TermVectorsService`. It would be nice that term vectors do not rely on this, but this does not seem to be a low hanging fruit.	2016-08-26 10:35:23 +02:00
Jason Tedor	bc136a90d5	Add network types to cluster stats The network types in use on a cluster can be useful information to have, so this commit adds aggregate metrics for the network types in use in a cluster to the cluster stats. Relates #20144	2016-08-25 21:08:05 -04:00
Jim Ferenczi	4682fc34ae	Add the ability to disable the retrieval of the stored fields entirely This change adds a special field named _none_ that allows to disable the retrieval of the stored fields in a search request or in a TopHitsAggregation. To completely disable stored fields retrieval (including disabling metadata fields retrieval such as _id or _type) use _none_ like this: ```` POST _search { "stored_fields": "_none_" } ````	2016-08-24 16:40:08 +02:00
Simon Willnauer	c499427166	Use _refresh instead of reading from Translog in the RT GET case (#20102 ) Today we do a lot of accounting inside the engine to maintain locations of documents inside the transaction log. This is only needed to ensure we can return the documents source from the engine if it hasn't been refreshed. Aside of the added complexity to be able to read from the currently writing translog, maintainance of pointers into the translog this also caused inconsistencies like different values of the `_ttl` field if it was read from the tlog or not. TermVectors are totally different if the document is fetched from the tranlog since copy fields are ignored etc. This chance will simply call `refresh` if the documents latest version is not in the index. This streamlines the semantics of the `_get` API and allows for more optimizations inside the engine and on the transaction log. Note: `_refresh` is only called iff the requested document is not refreshed yet but has recently been updated or added. #Relates to #19787	2016-08-24 15:30:08 +02:00
Clinton Gormley	336ec0ac9a	Renamed REST spec reindex.rethrottle to reindex_rethrottle There is already a reindex method, so reindex can't also be used as a namespace.	2016-08-24 13:30:48 +02:00
Ali Beyad	1c9b64e09a	Adds ignoreUnavailable option to the snapshot status API (#20066 ) Adds ignoreUnavailable to the snapshot status API to be consistent with the get snapshots API which has a similar parameter. If ignoreUnavailable is set to true, then the snapshot status request will ignore any snapshots that were not found in the repository, instead of throwing a SnapshotMissingException. Closes #18522	2016-08-19 16:19:56 -04:00
Adrien Grand	a4ea7e7223	Switch indices.exists_type from `{index}/{type}` to `{index}/_mapping/{type}`. #20055 This will help remove types as we will need `{index}/{id}` to tell whether a document exists. Relates #15613	2016-08-19 09:18:24 +02:00
Clinton Gormley	7da9d826ff	Update ingest.get_pipeline.json The `id` parameter is not required Closes #20010	2016-08-17 14:40:59 +02:00
Adrien Grand	d894db1590	Only use `PUT` for index creation, not POST. #20001 Currently both `PUT` and `POST` can be used to create indices. This commit removes support for `POST index_name` so that we can use it to index documents with auto-generated ids once types are removed. Relates #15613	2016-08-17 10:15:42 +02:00
Ali Beyad	5ba06b6487	Removes support for adding aliases to analyzers. Indices created pre 5.x (#19994 ) that have analyzer aliases in their analysis settings will still work, but any attempts to create an alias for analyzers in newly created indices will result in an IllegalArgumentException. As a result, the setting `index.analysis.analyzer.{analyzerName}.alias` is no longer supported. Closes #18244	2016-08-15 16:17:58 -04:00
Ryan Ernst	d89540be9c	Add rest put mapping test with dot in fieldname	2016-08-10 15:35:53 -07:00
Areek Zillur	d107141bf6	Remove payload option from completion suggester The payload option was introduced with the new completion suggester implementation in v5, as a stop gap solution to return additional metadata with suggestions. Now we can return associated documents with suggestions (#19536) through fetch phase using stored field (_source). The additional fetch phase ensures that we only fetch the _source for the global top-N suggestions instead of fetching _source of top results for each shard.	2016-08-08 16:04:06 -04:00
Nik Everett	1e587406d8	Fail yaml tests and docs snippets that get unexpected warnings Adds `warnings` syntax to the yaml test that allows you to expect a `Warning` header that looks like: ``` - do: warnings: - '[index] is deprecated' - quotes are not required because yaml - but this argument is always a list, never a single string - no matter how many warnings you expect get: index: test type: test id: 1 ``` These are accessible from the docs with: ``` // TEST[warning:some warning] ``` This should help to force you to update the docs if you deprecate something. You must add the warnings marker to the docs or the build will fail. While you are there you should update the docs to add deprecation warnings visible in the rendered results.	2016-08-04 15:23:05 -04:00
Jason Tedor	533412e36f	Improve cat thread pool API Today, when listing thread pools via the cat thread pool API, thread pools are listed in a column-delimited format. This is unfriendly to command-line tools, and inconsistent with other cat APIs. Instead, thread pools should be listed in a row-delimited format. Additionally, the cat thread pool API is limited to a fixed list of thread pools that excludes certain built-in thread pools as well as all custom thread pools. These thread pools should be available via the cat thread pool API. This commit improves the cat thread pool API by listing all thread pools (built-in or custom), and by listing them in a row-delimited format. Finally, for each node, the output thread pools are sorted by thread pool name. Relates #19721	2016-08-03 23:02:13 -04:00
Igor Motov	22e63b4783	Fixes cat tasks operation in detailed mode Currently the cat tasks operation fails in the detailed mode. Closes #19755	2016-08-02 15:21:31 -04:00
Ali Beyad	d93f7d6085	Refactors ActiveShardCount	2016-08-01 13:35:29 -04:00
Ali Beyad	25d8eca62d	Removes the notion of write consistency level across all APIs in favor of waiting for active shard copy count (wait_for_active_shards).	2016-08-01 13:35:29 -04:00
Alexander Lin	9ac6389e43	Rename operation to result and reworking responses * Rename operation to result and reworking responses * Rename DocWriteResponse.Operation enum to DocWriteResponse.Result These are just easier to interpret names. Closes #19664	2016-08-01 10:42:58 -04:00
Tanguy Leroux	64ca834722	Remove unnecessary indices refresh and wait for green status in REST tests	2016-08-01 11:09:56 +02:00
Tanguy Leroux	737db98bd7	/_cat/shards should support wilcards for indices closes #19634	2016-08-01 11:09:48 +02:00
Tanguy Leroux	e642187642	Remove unnecessary indices refresh from cluster.state/20_filtering.yaml file	2016-08-01 09:22:12 +02:00
Tanguy Leroux	7d4f557aa3	Allow routing table to be filtered by index pattern Before this commit when an index pattern is used to filter the cluster state, only indices metadata are populated and routing table is just empty. This commit aligns the behavior of the filtering of cluster state's routing table with the filtering of cluster state's metadata so that coherent data are returned for both routing table & metadata when index pattern is requested.	2016-08-01 09:22:12 +02:00
Martijn van Groningen	a91bb29585	ingest: Made the response format of the get pipeline api match with the response format of the index template api Closes #19585	2016-07-29 17:58:30 +02:00
Areek Zillur	4e3602a790	Add zero-padding to auto-generated rollover index name increment closes #19484	2016-07-27 10:50:47 -04:00
Chris Earle	0553ba9151	[Ingest] Add REST _ingest/pipeline to get all pipelines This adds an extra REST handler for "_ingest/pipeline" so that users do not need to supply "_ingest/pipeline/*" to get all of them. - Also adds a teardown section to related REST-tests for ingest.	2016-07-26 13:48:15 -04:00
Christoph Büscher	e1415d6519	Merge pull request #19595 from cbuescher/fix-19422 Allow empty json object in request body in `_count` API.	2016-07-26 18:17:52 +02:00
Alexander Lin	8f2882a442	Add _operation field to index, update, delete responses Performing the bulk request shown in #19267 now results in the following: ``` {"_index":"test","_type":"test","_id":"1","_version":1,"_operation":"create","forced_refresh":false,"_shards":{"total":2,"successful":1,"failed":0},"status":201} {"_index":"test","_type":"test","_id":"1","_version":1,"_operation":"noop","forced_refresh":false,"_shards":{"total":2,"successful":1,"failed":0},"status":200} ```	2016-07-26 11:16:19 -04:00
Christoph Büscher	b861ec1cc0	Allow empty json object in request body in `_count` API When the request body is missing, all documents in the target index are counted. As mentioned in #19422, the same should happen when the request body is an empty json object. This is also the behaviour for the `_search` endpoint and the two APIs should behave in the same way.	2016-07-26 09:54:05 +02:00
Boaz Leskes	03fbc91816	allow for a `-` in a node name	2016-07-24 09:02:30 +02:00
Boaz Leskes	cd596772ee	Persistent Node Names (#19456 ) With #19140 we started persisting the node ID across node restarts. Now that we have a "stable" anchor, we can use it to generate a stable default node name and make it easier to track nodes over a restarts. Sadly, this means we will not have those random fun Marvel characters but we feel this is the right tradeoff. On the implementation side, this requires a bit of juggling because we now need to read the node id from disk before we can log as the node node is part of each log message. The PR move the initialization of NodeEnvironment as high up in the starting sequence as possible, with only one logging message before it to indicate we are initializing. Things look now like this: ``` [2016-07-15 19:38:39,742][INFO ][node ] [_unset_] initializing ... [2016-07-15 19:38:39,826][INFO ][node ] [aAmiW40] node name set to [aAmiW40] by default. set the [node.name] settings to change it [2016-07-15 19:38:39,829][INFO ][env ] [aAmiW40] using [1] data paths, mounts [[ /(/dev/disk1)]], net usable_space [5.5gb], net total_space [232.6gb], spins? [unknown], types [hfs] [2016-07-15 19:38:39,830][INFO ][env ] [aAmiW40] heap size [1.9gb], compressed ordinary object pointers [true] [2016-07-15 19:38:39,837][INFO ][node ] [aAmiW40] version[5.0.0-alpha5-SNAPSHOT], pid[46048], build[473d3c0/2016-07-15T17:38:06.771Z], OS[Mac OS X/10.11.5/x86_64], JVM[Oracle Corporation/Java HotSpot(TM) 64-Bit Server VM/1.8.0_51/25.51-b03] [2016-07-15 19:38:40,980][INFO ][plugins ] [aAmiW40] modules [percolator, lang-mustache, lang-painless, reindex, aggs-matrix-stats, lang-expression, ingest-common, lang-groovy, transport-netty], plugins [] [2016-07-15 19:38:43,218][INFO ][node ] [aAmiW40] initialized ``` Needless to say, settings `node.name` explicitly still works as before. The commit also contains some clean ups to the relationship between Environment, Settings and Plugins. The previous code suggested the path related settings could be changed after the initial Environment was changed. This did not have any effect as the security manager already locked things down.	2016-07-23 22:46:48 +02:00
Ali Beyad	83a137b25c	Fixes REST test that is designed to timeout on index creation by making the test wait until all urgent requests are completed before finishing, so that tear down can properly delete the created index and cleanup. Without this wait, it was possible that the test would finish and cleanup the deleted indices would happen before the index creation even processed, causing the test to leave a created index behind.	2016-07-21 09:14:41 -04:00
Karel Minarik	8c721b10af	Test: Fixed incorrect YAML indentation in the `indices.put_template/10_basic.yaml` test The Ruby YAML parser ignores the `do` actions when they are not indented, making the test suite fail. Related: #19506 Closes #19529	2016-07-21 14:17:17 +02:00
Simon Willnauer	302c7a521a	Fix analyzer alias processing (#19506 ) In the lack of tests the analyzer.alias feature was pretty much not working at all on current master. Issues like #19163 showed some serious problems for users using this feature upgrading to an alpha version. This change fixes the processing order and allows aliases to be set for existing analyzers like `default`. This change also ensures that if `default` is aliased the correct analyzer is used for `default_search` etc. Closes #19163	2016-07-21 09:32:47 +02:00
Jun Ohtani	cebad703fe	Analyze: Specify anonymous char_filters/tokenizer/token_filters in the analyze API Add parser for anonymous char_filters/tokenizer/token_filters Using Settings in AnalyzeRequest for anonymous definition Add breaking changes document Closed #8878	2016-07-21 11:06:36 +09:00
Karel Minarik	5bab65d886	Fixed incorrect YAML indentation in the "Rollover" tests again As part of changes in d78f40fb1e88c78ce4466fe145d365c205441e43, a fix to the YAML indentation has been reverted, see location: `d78f40fb1e (diff-eaf129528b571da2cafdfd5490c12453)` This patch fixes the YAML notation back. /cc @abeyad Closes #19482	2016-07-18 19:33:28 +02:00
Nik Everett	d573541f66	Support requests_per_second=-1 to mean no throttling in reindex This is entirely on the REST level, Float.POSITIVE_INFINITY is still how you get no throttling over the transport api. Closes #19089	2016-07-18 13:05:06 -04:00

1 2 3 4 5 ...

1119 Commits