OpenSearch

Commit Graph

Author	SHA1	Message	Date
Robert Muir	4b3672b7df	Add migration note for hunspell dictionaries	2015-05-04 10:00:05 -04:00
Adrien Grand	b72f27a410	Core: Cut over to the Lucene filter cache. This removes Elasticsearch's filter cache and uses Lucene's instead. It has some implications: - custom cache keys (`_cache_key`) are unsupported - decisions are made internally and can't be overridden by users (`_cache`) - not only filters can be cached but also all queries that do not need scores - parent/child queries can now be cached, however cached entries are only valid for the current top-level reader so in practice it will likely only be used on read-only indices - the cache deduplicates filters, which plays nicer with large keys (eg. `terms`) - better stats: we already had ram usage and evictions, but now also hit count, miss count, lookup count, number of cached doc id sets and current number of doc id sets in the cache - dynamically changing the filter cache size is not supported anymore Internally, an important change is that it removes the NoCacheFilter infrastructure in favour of making Query.rewrite specialize the query for the current reader so that it will only be cached on this reader (look for IndexCacheableQuery). Note that consuming filters with the query API (createWeight/scorer) instead of the filter API (getDocIdSet) is important for parent/child queries because otherwise a QueryWrapperFilter(ParentQuery) would run the wrapped query per segment while relations might be cross segments.	2015-05-04 09:02:15 +02:00
Ryan Ernst	4ef9f3ca63	Mappings: Remove file based default mappings Using files that must be specified on each node is an anti-pattern from the API based goal of ES. This change removes the ability to specify the default mapping with a file on each node. closes #10620	2015-04-30 13:50:35 -07:00
Adrien Grand	e5be85d586	Aggs: Change the default `min_doc_count` to 0 on histograms. The assumption is that gaps in histogram are generally undesirable, for instance if you want to build a visualization from it. Additionally, we are building new aggregations that require that there are no gaps to work correctly (eg. derivatives).	2015-04-30 15:48:23 +02:00
Simon Willnauer	94d8b20611	Add multi data.path to migration guide this commit removes the obsolete settings for distributors and updates the documentation on multiple data.path. It also adds an explain to the migration guide. Relates to #9498 Closes #10770	2015-04-29 11:51:37 +02:00
Ryan Ernst	bf09e58cb3	Mappings: Remove includes and excludes from _source Regardless of the outcome of #8142, we should at least enforce that when _source is enabled, it is sufficient to reindex. This change removes the excludes and includes settings, since these modify the source, causing us to lose the ability to reindex some fields. closes #10814	2015-04-28 15:03:51 -07:00
javanna	c914134355	Scripting: remove groovy sandbox Groovy sandboxing was disabled by default from 1.4.3 on though since we found out that it could be worked around, so it makes little sense to keep it and maintain it. Closes #10156 Closes #10480	2015-04-28 11:27:50 +02:00
Jun Ohtani	933edf7bcc	Analysis: Fix wrong position number by analyze API Add breaking changes comment to migrate docs Fix the stopword included text using stopword filter	2015-04-28 17:44:41 +09:00
Simon Willnauer	d164526d27	Remove `_shutdown` API This commit removes the `_shutdown` API entirely without any replacement. Nodes should be managed from the operating system, not via REST APIs	2015-04-27 17:19:36 +02:00
markharwood	1b8b993912	Query enhancement: Enable Lucene ranking behaviour for queries on numeric fields. This changes the default ranking behaviour of single-term queries on numeric fields to use the usual Lucene TermQuery scoring logic rather than a constant-scoring wrapper. Closes #10628	2015-04-27 09:42:55 +01:00
Ryan Ernst	1f5bdca8cc	Mappings: Restrict murmur3 field type to sane options Disabling doc values or trying to index hash values are not correct uses of the murmur3 field type, and just cause problems. This disallows changing doc values or index options for 2.0+. closes #10465	2015-04-23 21:48:42 -07:00
Igor Motov	60721b2a17	Snapshot/Restore: remove obsolete expand_wildcards_open and expand_wildcards_close options In #6097 we made snapshot/restore index option consistent with other API. Now we can remove old style options from master. Closes #10743	2015-04-23 13:29:24 -04:00
Adrien Grand	d7abb12100	Replace deprecated filters with equivalent queries. In Lucene 5.1 lots of filters got deprecated in favour of equivalent queries. Additionally, random-access to filters is now replaced with approximations on scorers. This commit - replaces the deprecated NumericRangeFilter, PrefixFilter, TermFilter and TermsFilter with NumericRangeQuery, PrefixQuery, TermQuery and TermsQuery, wrapped in a QueryWrapperFilter - replaces XBooleanFilter, AndFilter and OrFilter with a BooleanQuery in a QueryWrapperFilter - removes DocIdSets.isBroken: the new two-phase iteration API will now help execute slow filters efficiently - replaces FilterCachingPolicy with QueryCachingPolicy Close #8960	2015-04-21 15:32:43 +02:00
Clinton Gormley	ab3fa78ae0	Docs: Revert migration docs mentioning parent removal from update request Relates to #9612	2015-04-13 16:35:21 +02:00
Adrien Grand	5b3cc2f07c	Search: deprecate the limit filter. This is really a Collector instead of a filter. This commit deprecates the `limit` filter, makes it a no-op and recommends to use the `terminate_after` parameter instead that we introduced in the meantime.	2015-04-10 17:18:50 +02:00
Adrien Grand	919589b908	Queries: Remove fuzzy-like-this support. The fuzzy-like-this query builds very expensive queries and only serves esoteric use-cases.	2015-04-10 17:16:02 +02:00
Adrien Grand	aecd9ac515	Aggregations: Speed up include/exclude in terms aggregations with regexps. Today we check every regular expression eagerly against every possible term. This can be very slow if you have lots of unique terms, and even the bottleneck if your query is selective. This commit switches to Lucene regular expressions instead of Java (not exactly the same syntax yet most existing regular expressions should keep working) and uses the same logic as RegExpQuery to intersect the regular expression with the terms dictionary. I wrote a quick benchmark (in the PR) to make sure it made things faster and the same request that took 750ms on master now takes 74ms with this change. Close #7526	2015-04-09 12:12:56 +02:00
javanna	acabf2d55a	Cluster state REST api: print routing_nodes out only when requested through specific flag For backwards compatibility reasons routing_nodes were previously printed out when routing_table was requested, together with the actual routing_table. Now they are printed out only when requested through the `routing_nodes` flag. Relates to #10412 Closes #10486	2015-04-08 16:10:36 +02:00
javanna	d9aebf4906	Scripting: remove deprecated methods from ScriptService Removed the following methods from `ScriptService`, which don't require the `ScriptContext` argument: ``` public CompiledScript compile(String lang, String script, ScriptType scriptType) public ExecutableScript executable(String lang, String script, ScriptType scriptType, Map<String, Object> vars) public SearchScript search(SearchLookup lookup, String lang, String script, ScriptType scriptType, @Nullable Map<String, Object> vars) ``` Also removed the ScriptContext.Standard.GENERIC_PLUGIN enum value, as it was used only for backwards compatibility. Plugins that make use of scripts should declare their own script contexts through `ScriptModule#registerScriptContext` and use them when compiling/executing scripts. Closes #10476	2015-04-08 12:20:03 +02:00
Adrien Grand	08f93cf33f	Add doc values support to boolean fields. This pull request makes boolean handled like dates and ipv4 addresses: things are stored as numerics under the hood and aggregations add some special formatting logic in order to return true/false in addition to 1/0. For example, here is an output of a terms aggregation on a boolean field: ``` "aggregations": { "top_f": { "doc_count_error_upper_bound": 0, "buckets": [ { "key": 0, "key_as_string": "false", "doc_count": 2 }, { "key": 1, "key_as_string": "true", "doc_count": 1 } ] } } ``` Sorted numeric doc values are used under the hood. Close #4678 Close #7851	2015-04-02 15:40:46 +02:00
Reuben Sutton	85c221e9b1	Remove jsonp support and associated tests, closes #9108	2015-04-01 16:06:09 +01:00
javanna	83fb0a10e5	Scripting: remove support for script.disable_dynamic setting Now that fine-grained script settings are supported (#10116) we can remove support for the script.disable_dynamic setting. Same result as `script.disable_dynamic: false` can be obtained as follows: ``` script.inline: on script.indexed: on ``` An exception is thrown at startup when the old setting is set, so we make sure we tell users they have to change it rather than ignoring the setting. Closes #10286	2015-03-31 13:24:52 +02:00
Adrien Grand	a608db122d	Search: Remove the `count` search type. This commit brings the benefits of the `count` search type to search requests that have a `size` of 0: - a single round-trip to shards (no fetch phase) - ability to use the query cache Since `count` now provides no benefits over `query_then_fetch`, it has been deprecated. Close #7630	2015-03-31 11:31:49 +02:00
Martijn van Groningen	75713f4190	Reverted commit: `20f7be3`	2015-03-28 08:53:11 +01:00
Ryan Ernst	693d91e41c	Mappings: Remove delete mapping API Deleting a type from an index is inherently dangerous because the type can be recreated with new mappings which may conflict with existing segments still using the old mappings. This removes the ability to delete a type (similar to how deleting fields within a type is not allowed, for the same reason). closes #8877 closes #10231	2015-03-24 09:46:02 -07:00
javanna	4348959f9d	Delete api: remove broadcast delete if routing is missing when required This commit changes the behaviour of the delete api when processing a delete request that refers to a type that has routing set to required in the mapping, and the routing is missing in the request. Up until now the delete api sent a broadcast delete request to all of the shards that belong to the index, making sure that the document could be found although the routing value wasn't specified. This was probably not the best choice: if the routing is set to required, an error should be thrown instead. A `RoutingMissingException` gets now thrown instead, like it happens in the same situation with every other api (index, update, get etc.). Last but not least, this change allows to get rid of a couple of `TransportAction`s, `Request`s and `Response`s and simplify the codebase. Closes #9123 Closes #10136	2015-03-20 09:19:43 +01:00
Clinton Gormley	aa94ced0ae	Remove references to the thrift and memcached transport plugins as they are no longer supported Closes #10166	2015-03-19 20:49:58 +01:00
Clinton Gormley	25369f0727	Remove async replication from the docs and REST spec Relates to #10114	2015-03-19 15:34:12 +01:00
Ryan Ernst	9d708e20a0	Mappings: Lock down _size field This also changes the stored setting for _size to true (for indexes created in 2.x). see #8143 closes #9913	2015-02-27 11:09:52 -08:00
Ryan Ernst	3b7928d568	Mappings: Lock down _field_names field Now that we have an explicit `enabled` flag, we can lock down the field type so it is not mungeable. see #8143 closes #9912	2015-02-26 15:15:59 -08:00
Ryan Ernst	7181bbde26	Mappings: Remove _boost field This has been deprecated since 1.0.0.RC1. It is finally removed here. closes #8875	2015-02-26 15:07:07 -08:00
Ryan Ernst	78df69e6a0	Mappings: Lock down _routing field `required` is now the only changeable setting (on indexes created after 1.x). see #8143 closes #9895	2015-02-26 13:09:41 -08:00
Ryan Ernst	32e042f1c4	Mappings: Lock down _index field see #8143 closes #9870	2015-02-25 12:24:55 -08:00
Clinton Gormley	5a53ff6f1b	Update migrate_2_0.asciidoc More code formatting in breaking changes	2015-02-25 14:13:25 +01:00
Clinton Gormley	e805fe71cc	Update migrate_2_0.asciidoc Code formatting in breaking changes	2015-02-25 14:11:57 +01:00
Clinton Gormley	5146cf6256	Update migrate_2_0.asciidoc Fixed bad heading levels in breaking changes	2015-02-25 14:10:17 +01:00
Clinton Gormley	0c61ea803d	Update migrate_2_0.asciidoc Fixed bad asciidoc in breaking changes	2015-02-25 14:07:19 +01:00
Ryan Ernst	be0cef0c43	Mappings: Lock down _type field see #8143 closes #9869	2015-02-24 22:37:41 -08:00
Ryan Ernst	b96bd201c1	Mappings: Lock down _id field There are two implications to this change. First, percolator now uses _uid internally, extracting the id portion when needed. Second, sorting on _id is no longer possible, since you can no longer index _id. However, _uid can still be used to sort, and is better anyways as indexing _id just to make it available to fielddata for sorting is wasteful. see #8143 closes #9842	2015-02-24 14:26:22 -08:00
Colin Goodheart-Smithe	2753db4685	Scripting: Removed deprecated script parameter names This change removes the deprecated script parameter names ('file', 'id', and 'scriptField'). It also removes the ability to load file scripts using the 'script' parameter. File scripts should be loaded using the 'script_file' parameter only.	2015-02-23 13:49:21 +00:00
Adrien Grand	4708227ecf	Codecs: Remove the ability to have custom per-field postings and doc values formats. This commit makes the `postings_format` and `doc_values_format` options of mappings illegal on 2.0 and ignored on 1.x (meaning that the default postings and doc values formats from the codec will be used in such a case). This removes a fair amount of code. Close #8746 #9741	2015-02-19 15:47:25 +01:00
Christoph Büscher	30fd70f07b	Aggregations: Simplify time zone option in `date_histogram` Removed the existing `pre_zone` and `post_zone` option in `date_histogram` in favor of the simpler `time_zone` option. Previously, specifying different values for these could lead to confusing scenarios where ES would return bucket keys that are not UTC. Now `time_zone` is the only option; it makes the calculation of date buckets take place in the preferred time zone, then converts the bucket key values back to UTC after rounding. Closes #9062 Closes #9637	2015-02-16 16:54:06 +01:00
Ryan Ernst	533fdbdf75	Mappings: Remove support for field access by short name When multiple fields under object fields share the same name, accessing by short name is ambiguous. This removes support for short names, always requiring the full name when used in queries. closes #8872	2015-02-12 09:58:37 -08:00
Christoph Büscher	d2f852a274	Aggregations: Add 'offset' option to date_histogram, replacing 'pre_offset' and 'post_offset' Add offset option to 'date_histogram' replacing and simplifying the previous 'pre_offset' and 'post_offset' options. This change is part of a larger clean up task for `date_histogram` from issue #9062.	2015-02-09 14:03:28 +01:00
Ryan Ernst	6079d88d43	Mappings: Remove type prefix support from field names in queries This is the first part of #8872.	2015-02-02 13:10:56 -08:00
Colin Goodheart-Smithe	285ef0f06d	Aggregations: Clean up response API for Aggregations This change makes the response API object for Histogram Aggregations the same for all types of Histogram, and does the same for all types of Ranges. The change removes getBucketByKey() from all aggregations except filters and terms. It also reduces the methods on the Bucket class to just getKey() and getKeyAsString(). The getKey() method returns Object and the actual type it returns will be appropriate for the type of aggregation being run. e.g. date_histogram will return a DateTime for this method and Histogram will return a Number.	2015-01-27 10:53:44 +00:00
David Pilato	052645903a	Rest: remove status code from main action Today we give the HTTP status back within the HTTP response itself and within the JSON response as well: ```sh curl localhost:9200/ ``` ```js { "status" : 200, "name" : "Red Wolf", "version" : { "number" : "2.0.0", "build_hash" : "6837a61d8a646a2ac7dc8da1ab3c4ab85d60882d", "build_timestamp" : "2014-08-19T13:55:56Z", "build_snapshot" : true, "lucene_version" : "4.9" }, "tagline" : "You Know, for Search" } ```	2015-01-12 12:37:46 +01:00
Ryan Ernst	060f963a8e	Mappings: Remove allow_type_wrapper setting Before Elasticsearch 1.0, the type was allowed to be passed as the root element when uploading a document. However, this was ambiguous if the mappings also contained a field with the same name as the type. The behavior was changed in 1.0 to not allow this, but a setting was added for backwards compatibility. This change removes the setting for 2.0.	2015-01-08 09:13:40 -08:00
Martijn van Groningen	ca4f27f40e	Core: Added `_shards` header to all write responses. The header indicates how many shard copies (primary and replica shards) a write was supposed to go to, on how many shard copies the write succeeded, and potentially captures shard failures if writing into a replica shard fails. For async writes it also includes the number of shards on which a write is still pending. Closes #7994	2015-01-08 18:10:08 +01:00
Martijn van Groningen	20f7be378b	Removed parent parameter from update request, because it just sets the routing. The routing option should be used instead. The parent a child document points to can't be updated. Closes #4538	2015-01-07 10:26:20 +01:00
Adrien Grand	bc86796592	Core: Remove terms filter cache. This is our only cache which is not 'exact' and might allow for stale results. Additionally, a similar cache that we have and needs to perform lookups in other indices in order to run queries is the script index, and for this index we rely on the filesystem cache, so we should probably do the same with terms filters lookups. Close #9056	2015-01-06 17:21:20 +01:00
David Pilato	d2a2d1bb53	java: QueryBuilders cleanup: remove deprecated Related to #8667: Some QueryBuilders have been deprecated in 1.x branches. We removed them in 2.0. Removed ------- * `textPhrase(...)` * `textPhrasePrefix(...)` * `textPhrasePrefixQuery(...)` * `filtered(...)` * `inQuery(...)` * `commonTerms(...)` * `queryString(...)` * `simpleQueryString(...)` Closes #8721.	2014-12-03 16:07:34 +01:00
Adrien Grand	8346e92ebb	Core: Fix script fields to be returned as a multivalued field when they produce a list. This change is essentially the same as #3015 but on script fields. Close #8592	2014-11-24 09:41:16 +01:00
Alex Ksikes	1959275622	Term Vectors: More consistent naming for term vector[s] We speak of the term vectors of a document, where each field has an associated stored term vector. Since by default we are requesting all the term vectors of a document, the HTTP request endpoint should rather be called `_termvectors` instead of `_termvector`. The usage of `_termvector` is now deprecated, as well as the transport client call to termVector and prepareTermVector. Closes #8484	2014-11-21 14:06:44 +01:00
Simon Willnauer	0fcb466555	[STORE] Remove `memory`/ `ram` store The RAM store is discouraged for production usage anyway and we don't test it in our randomized infrastructure. This commit removes it for `2.0`	2014-11-20 14:47:19 +01:00
Colin Goodheart-Smithe	353574d6af	Indices API: Fix GET index API always running all features Previous to this change all features (_alias,_mapping,_settings,_warmer) are run regardless of which features are actually requested. This change fixes the request object to resolve this bug	2014-11-13 13:22:46 +00:00
Adrien Grand	9ea25df649	Switch to murmurhash3 to route documents to shards. We currently use the djb2 hash function in order to compute the shard a document should go to. Unfortunately this hash function is not very sophisticated and you can sometimes hit adversarial cases, such as numeric ids on 33 shards. Murmur3 generates hashes with a better distribution, which should avoid the adversarial cases. Here are some examples of how 100000 incremental ids are distributed to shards using either djb2 or murmur3. 5 shards: Murmur3: [19933, 19964, 19940, 20030, 20133] DJB: [20000, 20000, 20000, 20000, 20000] 3 shards: Murmur3: [33185, 33347, 33468] DJB: [30100, 30000, 39900] 33 shards: Murmur3: [2999, 3096, 2930, 2986, 3070, 3093, 3023, 3052, 3112, 2940, 3036, 2985, 3031, 3048, 3127, 2961, 2901, 3105, 3041, 3130, 3013, 3035, 3031, 3019, 3008, 3022, 3111, 3086, 3016, 2996, 3075, 2945, 2977] DJB: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 900, 900, 900, 900, 1000, 1000, 10000, 10000, 10000, 10000, 9100, 9100, 9100, 9100, 9000, 9000, 0, 0, 0, 0, 0, 0] Even if djb2 looks ideal in some cases (5 shards), the fact that the distribution of its hashes has some patterns can raise issues with some shard counts (eg. 3, or even worse 33). Some tests have been modified because they relied on implementation details of the routing hash function. Close #7954	2014-11-04 16:32:42 +01:00
Alex Ksikes	35f55608cc	MLT Field Query: remove it from master The MLT field query is simply replaced by a MLT query set to a specific field. To simplify code maintenance we should deprecate it in 1.4 and remove it in 2.0. Closes #8238	2014-10-29 10:19:00 +01:00
Adrien Grand	230c6684a9	Search: Remove partial fields. Partial fields have been deprecated since 1.0.0Beta1 in favor of _source filtering. They will be removed in 2.0.	2014-10-20 12:29:30 +02:00
Colin Goodheart-Smithe	9ce7ca21e4	Docs: Added breaking changes docs for Indices APIs Adds the breaking changes defaults for the change of default indices options for the GET Aliases API	2014-10-08 15:08:25 +01:00
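
The routing change in commit 9ea25df649 compares how djb2 and murmur3 spread 100000 incremental ids across shards. The following is a minimal Python sketch of the djb2 side only. It assumes the textbook djb2 hash (seed 5381, multiplier 33) over the characters of the id, 32-bit signed wraparound, ids starting at 0, and a non-negative modulo by the shard count. The names `djb2_32` and `shard_counts` are hypothetical, and this is not the Elasticsearch implementation, so the counts it prints may differ from the figures quoted in the commit message.

```python
def djb2_32(key):
    """Textbook djb2 string hash with 32-bit signed wraparound.

    Assumption: the wraparound mimics Java `int` arithmetic; the real
    routing code may differ in details.
    """
    h = 5381
    for ch in key:
        # h * 33 + ch, truncated to 32 bits on every step
        h = (h * 33 + ord(ch)) & 0xFFFFFFFF
    # Reinterpret the unsigned 32-bit value as a signed one
    return h - 0x100000000 if h >= 0x80000000 else h


def shard_counts(num_docs, num_shards):
    """Count how many of the ids "0".."num_docs-1" land on each shard."""
    counts = [0] * num_shards
    for doc_id in range(num_docs):
        # Python's % is already non-negative for a positive divisor
        counts[djb2_32(str(doc_id)) % num_shards] += 1
    return counts


if __name__ == "__main__":
    # Same shard counts as the examples in the commit message
    for shards in (5, 3, 33):
        print(shards, "shards:", shard_counts(100000, shards))
```

Because the multiplier is 33, a shard count that shares a factor with 33 lets the last character of the id dominate the result, which is consistent with the skew the commit message reports for 3 and 33 shards.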

110 Commits