This changes the parameter name `ignore_like` to the more user-friendly name
`unlike`. The latter generates a query from the terms in `A` but not
from the terms in `B`, which translates to a result set that is like `A` but
unlike `B`. We could have further negatively boosted any documents that have
some `B`, but these documents already receive no contribution from
having `B` and are therefore already at a disadvantage relative to documents having `A`.
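For example, a query using the renamed parameter might look like this (index, field and text values are illustrative):
```
curl -XGET 'localhost:9200/test/_search' -d '{
  "query": {
    "more_like_this": {
      "fields": ["title"],
      "like": "Apache Lucene scoring internals",
      "unlike": "release notes and marketing material"
    }
  }
}'
```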
Closes #11117
Unassigned meta includes additional information as to why a shard is unassigned; this is especially handy when a shard moves to unassigned because a node left the cluster or a shard failed.
The additional data is provided as part of the cluster state and as part of the `_cat/shards` API.
The additional meta includes the timestamp at which the shard moved to unassigned, allowing us in the future to build functionality such as delaying allocation after a node leaves until a copy of the shard is found.
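A sketch of how this surfaces in `_cat/shards` (column names and sample output are illustrative):
```
curl -XGET 'localhost:9200/_cat/shards?h=index,shard,prirep,state,unassigned.reason,unassigned.at'
test 0 p STARTED
test 0 r UNASSIGNED NODE_LEFT 2015-06-15T10:23:51.211Z
```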
Closes #11653
We have to make sure all shards are started so that we know the synced flush will hit them all. Shards that are still initializing during the synced flush may be missed and confuse the stats call.
Some of our meta fields (such as `_id`, `_version`, ...) are returned as top-level
properties of the json document, while other properties (`_timestamp`, `_routing`,
...) are returned under `fields`. This commit makes all meta fields returned
as top-level properties.
So e.g. `GET test/test/1?fields=_timestamp,foo` would now return
```json
{
  "_index": "test",
  "_type": "test",
  "_id": "1",
  "_version": 1,
  "_timestamp": 10000000,
  "found": true,
  "fields": {
    "foo": [ "bar" ]
  }
}
```
while it used to return
```json
{
  "_index": "test",
  "_type": "test",
  "_id": "1",
  "_version": 1,
  "found": true,
  "fields": {
    "_timestamp": 10000000,
    "foo": [ "bar" ]
  }
}
```
The wildcard cat API REST tests relied on `bulk.max` and `bulk.min` in
the thread_pool response. However, due to the thread pool types being
randomized in InternalTestCluster, the min/max values were not guaranteed
to exist (the cached thread pool type is unbounded and thus does not have a
max value).
In order to prevent this, that test has been removed and the cat
nodes test is now used for wildcard testing, as it always returns stats
about the heap.
The tests for the recently added wildcard feature were
relying on the iteration order of the underlying hashmap, which could
differ between runs.
The implementation now ensures that the header fields are
parsed in the order in which they were added.
This change adds a new `filter_path` parameter that can be used to filter and reduce the responses returned by the REST API of elasticsearch.
For example, returning only the shards that failed to be optimized:
```
curl -XPOST 'localhost:9200/beer/_optimize?filter_path=_shards.failed'
{"_shards":{"failed":0}}%
```
It supports multiple filters (separated by a comma):
```
curl -XGET 'localhost:9200/_mapping?pretty&filter_path=*.mappings.*.properties.name,*.mappings.*.properties.title'
```
It also supports the YAML response format. Here it returns only the `_id` field of a newly indexed document:
```
curl -XPOST 'localhost:9200/library/book?filter_path=_id' -d '---\nhello:\n world: 1\n'
---
_id: "AU0j64-b-stVfkvus5-A"
```
It also supports wildcards. Here it returns only the host name of every node in the cluster:
```
curl -XGET 'http://localhost:9200/_nodes/stats?filter_path=nodes.*.host*'
{"nodes":{"lvJHed8uQQu4brS-SXKsNA":{"host":"portable"}}}
```
And "**" can be used to include sub fields without knowing the exact path. Here it returns only the Lucene version of every segment:
```
curl 'http://localhost:9200/_segments?pretty&filter_path=indices.**.version'
{
  "indices" : {
    "beer" : {
      "shards" : {
        "0" : [ {
          "segments" : {
            "_0" : {
              "version" : "5.2.0"
            },
            "_1" : {
              "version" : "5.2.0"
            }
          }
        } ]
      }
    }
  }
}
```
Note that elasticsearch sometimes returns the raw value of a field directly, like the `_source` field. If you want to filter `_source` fields, you should consider combining the already existing `_source` parameter (see the Get API for more details) with the `filter_path` parameter, like this:
```
curl -XGET 'localhost:9200/_search?pretty&filter_path=hits.hits._source&_source=title'
{
  "hits" : {
    "hits" : [ {
      "_source":{"title":"Book #2"}
    }, {
      "_source":{"title":"Book #1"}
    }, {
      "_source":{"title":"Book #3"}
    } ]
  }
}
```
#10032 introduced the notion of sealing an index by marking it with a special read-only marker, allowing a couple of optimizations to happen. The most important one was to speed up recoveries of shards where we know nothing has changed since they were online, by skipping the file-based sync phase. During the implementation we came up with a lighter notion which achieves the same recovery benefits but without the read-only aspects, which we dubbed synced flush. The fact that it was lightweight and didn't put the index in read-only mode allowed us to do it automatically in the background, which is a great advantage. However, we also felt the need to allow users to manually trigger this operation.
The implementation at #11179 added the synced flush internal logic and the manual (REST) API. The name of the API was modeled after the sealing terminology, which may end up being confusing. This commit changes the API name to match the internal synced flush naming, namely `{index}/_flush/synced`.
On top of that it contains a couple of other changes:
- Remove all java client API. This feature is not supposed to be called programmatically by applications but rather by admins.
- Improve REST responses, making the structure similar to other (flush) APIs.
- Change IndexShard#getOperationsCount to exclude the internal +1 on an open shard. It's confusing to get 1 when there are actually no ongoing operations.
- Some other minor cleanups
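For example, the renamed endpoint can be invoked as follows (index name and numbers are illustrative; the full response also contains per-index sections):
```
curl -XPOST 'localhost:9200/twitter/_flush/synced?pretty'
{
  "_shards" : {
    "total" : 10,
    "successful" : 10,
    "failed" : 0
  },
  ...
}
```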
Mapping conflicts should not be ignored. If I read the history correctly, this
option was added when a mapping update to an existing field was considered a
conflict, even if the new mapping was exactly the same. Now that mapping updates
are smart enough to detect conflicting options, we don't need an option to
ignore conflicts.
There are currently small differences between the search api and the count, exists, validate query and explain apis when it comes to reading query_string parameters: `analyze_wildcard`, `lowercase_expanded_terms` and `lenient` are only read by the search api and ignored by all the other mentioned apis. Unified the code to fix this and make sure it doesn't happen again. Also shared some code when it comes to printing out the query as part of the SearchSourceBuilder conversion to ToXContent.
Extended the REST spec to include all the supported params (some that were already supported weren't listed), and added REST tests (including some basic tests for count and search_exists, which weren't tested at all).
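For example, the count api now honours the same query_string parameters as the search api (index and values are illustrative):
```
curl -XGET 'localhost:9200/test/_count?q=titl*&analyze_wildcard=true&lenient=true'
{"count":12,"_shards":{"total":5,"successful":5,"failed":0}}
```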
Closes #11057
Removes the More Like This API; users should now use the More Like This query.
The MLT API tests were converted to their query equivalents. Also some clean
ups in MLT tests.
Closes #10736
Closes #11003
This removes Elasticsearch's filter cache and uses Lucene's instead. It has some
implications:
- custom cache keys (`_cache_key`) are unsupported
- decisions are made internally and can't be overridden by users (`_cache`)
- not only filters can be cached but also all queries that do not need scores
- parent/child queries can now be cached, however cached entries are only
valid for the current top-level reader so in practice it will likely only
be used on read-only indices
- the cache deduplicates filters, which plays nicer with large keys (eg. `terms`)
- better stats: we already had ram usage and evictions, but now also hit count,
miss count, lookup count, number of cached doc id sets and current number of
doc id sets in the cache (see the sketch after this list)
- dynamically changing the filter cache size is not supported anymore
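The new statistics can be fetched through the indices stats API; a minimal sketch, assuming the stats group is exposed as `query_cache` and the field names follow the usual stats conventions (both are assumptions):
```
curl -XGET 'localhost:9200/_stats/query_cache?pretty'
{
  ...
  "query_cache" : {
    "memory_size_in_bytes" : 17890,
    "evictions" : 0,
    "hit_count" : 42,
    "miss_count" : 7
  },
  ...
}
```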
Internally, an important change is that it removes the NoCacheFilter infrastructure
in favour of having Query.rewrite specialize the query for the current reader, so
that it will only be cached on this reader (look for IndexCacheableQuery).
Note that consuming filters with the query API (createWeight/scorer) instead of
the filter API (getDocIdSet) is important for parent/child queries because
otherwise a QueryWrapperFilter(ParentQuery) would run the wrapped query per
segment while relations might span segments.
Remove the ability to specify the search types `query_and_fetch` and
`dfs_query_and_fetch` from the REST API.
- Adds REST tests
- Updates REST API spec to remove `query_and_fetch` and
`dfs_query_and_fetch` as options
- Removes documentation for these options
- Removes documentation for these options
Closes #9606
The current implementation is dangerous: it unexpectedly refreshes,
which can quickly cause an unhealthy index (segment explosion). It
can also delete different documents on primary vs replicas, causing
inconsistent replicas.
For 2.0 we will replace this with an optional plugin that does a
scan/scroll search and then issues bulk delete requests.
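A minimal sketch of that scan/scroll + bulk delete pattern (index, type, query and ids are illustrative):
```
curl -XGET 'localhost:9200/test/_search?search_type=scan&scroll=1m' -d '{"query":{"term":{"user":"kimchy"}}}'
# then, for each page of hits fetched via the _search/scroll api, issue bulk deletes:
curl -XPOST 'localhost:9200/_bulk' -d '
{"delete":{"_index":"test","_type":"test","_id":"1"}}
{"delete":{"_index":"test","_type":"test","_id":"2"}}
'
```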
Closes #10859
This commit adds support for structured errors / failures / exceptions
on the elasticsearch REST layer. Exceptions are rendered with at least
a `type` and a `reason` corresponding to the exception name and the message.
Some exceptions, like the ones associated with an index or a shard, will have
additional information about the index the exception was triggered on or the
shard respectively.
Each rendered response will also contain a list of root causes, which is a list
of distinct shard-level errors returned for the request. Root causes are the lowest
level elasticsearch exceptions found per shard response and are intended to be displayed
to the user to indicate the source of the exception.
Shard-level responses are by default grouped by their type and reason to reduce the amount
of duplicates returned. However, the same exception returned from different indices will not be
grouped.
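A sketch of what a rendered exception might look like (exact fields vary by exception; values are illustrative):
```json
{
  "error" : {
    "root_cause" : [ {
      "type" : "index_not_found_exception",
      "reason" : "no such index",
      "index" : "test"
    } ],
    "type" : "index_not_found_exception",
    "reason" : "no such index",
    "index" : "test"
  },
  "status" : 404
}
```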
Closes #3303
This commit splits the current ClusterBlockLevel.METADATA into two distinct blocks, ClusterBlockLevel.METADATA_READ and ClusterBlockLevel.METADATA_WRITE. It allows us to make a distinction between
an operation that modifies the index or cluster metadata and an operation that does not change any metadata.
Before this commit, many operations were blocked when the cluster was read-only: Cluster Stats, Get Mappings, Get Snapshot, Get Index Settings, etc. Now those operations are allowed even when
the cluster or the index is read-only.
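For example, metadata reads now succeed on a read-only cluster (index name is illustrative):
```
curl -XPUT 'localhost:9200/_cluster/settings' -d '{"transient":{"cluster.blocks.read_only":true}}'
# Get Mappings only reads metadata and is therefore no longer blocked:
curl -XGET 'localhost:9200/test/_mapping'
```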
Related to #8102, #2833
Closes #3703
Closes #5855
Closes #10521
Closes #10522
The field stats api returns field-level statistics such as the lowest and highest values and the number of documents that have at least one value for a field.
An api like this can be useful to explore a data set you don't know much about. For example, you can figure out what the lowest and highest response times are, so that you can create a histogram or range aggregation with sane settings.
This api doesn't run a search to figure these statistics out, but rather uses the Lucene index to look them up (using Lucene's Terms class). So finding out these stats for fields is cheap and quick.
The min/max values are based on the type of the field: for a numeric field min/max are numbers, for a date field they are dates, and for other fields they are term based.
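A sketch of the api (field name and values are illustrative; the response also contains document counts and other term statistics):
```
curl -XGET 'localhost:9200/_field_stats?fields=response_time&pretty'
{
  "indices" : {
    "_all" : {
      "fields" : {
        "response_time" : {
          "min_value" : 2,
          "max_value" : 984,
          ...
        }
      }
    }
  }
}
```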
Closes #10523
Also changed the stash logger to not log all stashed values under debug (it does trace now) but to dump the stash content upon failure (under info, as an XContent).
Extends ShardStats with commit-specific information. We currently expose the commit id, generation and the user data map.
The information is also retrievable via the REST API by using `GET _stats?level=shards`.
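A sketch of the new per-shard section (exact shape and values are illustrative):
```
curl -XGET 'localhost:9200/_stats?level=shards&pretty'
...
"commit" : {
  "id" : "zNf0EYqoSLS2c9dsNJ7fCQ",
  "generation" : 7,
  "user_data" : {
    "translog_id" : "1432156342458"
  }
}
...
```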
Closes #10687
In Lucene 5.1 lots of filters got deprecated in favour of equivalent queries.
Additionally, random-access to filters is now replaced with approximations on
scorers. This commit
- replaces the deprecated NumericRangeFilter, PrefixFilter, TermFilter and
TermsFilter with NumericRangeQuery, PrefixQuery, TermQuery and TermsQuery,
wrapped in a QueryWrapperFilter
- replaces XBooleanFilter, AndFilter and OrFilter with a BooleanQuery in a
QueryWrapperFilter
- removes DocIdSets.isBroken: the new two-phase iteration API will now help
execute slow filters efficiently
- replaces FilterCachingPolicy with QueryCachingPolicy
Closes #8960