OpenSearch

Commit Graph

Author	SHA1	Message	Date
Igor Motov	dd41c68741	Snapshot/Restore: fix FSRepository location configuration Closes #11068	2015-05-20 22:14:31 -04:00
Adrien Grand	2c241e8a36	Mappings: Remove the `ignore_conflicts` option. Mappings conflicts should not be ignored. If I read the history correctly, this option was added when a mapping update to an existing field was considered a conflict, even if the new mapping was exactly the same. Now that mapping updates are smart enough to detect conflicting options, we don't need an option to ignore conflicts.	2015-05-18 15:28:23 +02:00
javanna	a843008b17	Highlighting: require_field_match set to true by default The default `false` for `require_field_match` is a bit odd and confusing for users, given that field names get ignored by default and every field gets highlighted if it contains terms extracted out of the query, regardless of which fields were queries. Changed the default to `true`, it can always be changed per request. Closes #10627 Closes #11067	2015-05-15 21:38:45 +02:00
javanna	46c521f7ec	Highlighting: nuke XPostingsHighlighter Our own fork of the lucene PostingsHighlighter is not easy to maintain and doesn't give us any added value at this point. In particular, it was introduced to support the require_field_match option and discrete per value highlighting, used in case one wants to highlight the whole content of a field, but get back one snippet per value. These two features won't make it into lucene as they slow things down and shouldn't have been supported from day one on our end probably. One other customization we had was support for a wider range of queries via custom rewrite etc. (yet another way to slow things down), which got added to lucene and works much much better than what we used to do (instead of or rewrite, term s are pulled out of the automata for multi term queries). Removing our fork means the following in terms of features: - dropped support for require_field_match: the postings highlighter will only highlight fields that were queried - some custom es queries won't be supported anymore, meaning they won't be highlighted. The only one I found up until now is the phrase_prefix. Postings highlighter rewrites against an empty reader to avoid slow operations (like the ones that we were performing with the fork that we are removing here), thus the prefix will not be expanded to any term. What the postings highlighter does instead is pulling the automata out of multi term queries, but this is not supported at the moment with our MultiPhrasePrefixQuery. Closes #10625 Closes #11077	2015-05-15 20:41:33 +02:00
Jun Ohtani	597c53a0bb	Add migrationi note for AnalyzeRequest	2015-05-16 00:25:53 +09:00
Martijn van Groningen	ece18f162e	Removed `id_cache` from stats and cat apis. Also removed the `id_cache` option from the clear cache api. Closes #5269	2015-05-15 14:06:18 +02:00
Adrien Grand	630757906a	Query DSL: Add `filter` clauses to `bool` queries. These clauses filter the document space without affecting scoring and map to Lucene's BooleanClause.Occur.FILTER. The `filtered` query is now deprecated and ```json { "filtered": { "query": { //query }, "filter": { //filter } } } ``` should be replaced with ```json { "bool": { "must": { //query }, "filter": { //filter } } } ```	2015-05-13 12:04:56 +02:00
Ryan Ernst	f766b260ba	Add tests for includeInObject backcompat	2015-05-12 23:11:15 -07:00
Ryan Ernst	565ffb16f1	Mappings: Remove ability to set meta fields inside documents A few meta fields can currently be set within a document's source. However, the recommended way to set meta fields like this is through the api, and setting within the document can be a performance trap (e.g. needing to find _id in order to route the document). This change removes the ability to set meta fields within a document source for 2.0+ indexes. closes #11051 closes #11074	2015-05-12 23:09:03 -07:00
Ryan Ernst	e7618b8528	Settings: Remove file based index templates As a follow up to #10870, this removes support for index templates on disk. It also removes a missed place still allowing disk based mappings. closes #11052	2015-05-11 12:51:22 -07:00
Martijn van Groningen	acdd9a5dd9	parent/child: Removed the `top_children` query.	2015-05-10 16:30:19 +02:00
Lee Hinman	c6747ded16	Truncate log messages at 10,000 characters	2015-05-08 10:10:44 -06:00
Adrien Grand	a0af88e996	Query DSL: Remove filter parsers. This commit makes queries and filters parsed the same way using the QueryParser abstraction. This allowed to remove duplicate code that we had for similar queries/filters such as `range`, `prefix` or `term`.	2015-05-07 20:14:34 +02:00
Alex Ksikes	4787cf701f	More Like This: remove percent_terms_to_match Users should use minimum_should_match instead. Closes #11030	2015-05-07 14:21:29 +02:00
Martijn van Groningen	f7c29457d0	parent/child: Deprecated the `top_children` in favour of the `has_child` query.	2015-05-07 09:27:54 +02:00
Alex Ksikes	ec4f12f9ef	More Like This: removal of the MLT API Removes the More Like This API, users should now use the More Like This query. The MLT API tests were converted to their query equivalent. Also some clean ups in MLT tests. Closes #10736 Closes #11003	2015-05-06 18:11:11 +02:00
Ryan Ernst	7a7bd6086a	Mappings: Remove ability to disable _source field Current features (eg. update API) and future features (eg. reindex API) depend on _source. This change locks down the field so that it can no longer be disabled. It also removes legacy settings compress/compress_threshold. closes #8142 closes #10915	2015-05-05 22:04:18 -07:00
Clinton Gormley	e28ad853c7	Docs: Fixed bad asciidoc in migrate_2_0	2015-05-05 11:17:21 +02:00
Pascal Borreli	af6d890ad5	Docs: Fixed typos Closes #10973	2015-05-05 10:38:05 +02:00
Shay Banon	187d79b6df	Centralize admin implementations and action execution This change removes the multiple implementations of different admin interfaces and centralizes it with AbstractClient. It also makes sure all executions of actions now go through a single AbstractClient#execute method, taking care of copying headers and wrapping listener. This also has the side benefit of removing all the code around differnet possible clients, and removes quite a bit of code (most of the + code is actually removal of generics and such). This change also changes how TransportClient is constructed, requiring a Builder to create it, its a breaking change and its noted in the migration guide. Yea another step towards simplifying the action infra and making it simpler...	2015-05-04 23:40:17 +02:00
Robert Muir	4b3672b7df	Add migration note for hunspell dictionaries	2015-05-04 10:00:05 -04:00
Adrien Grand	b72f27a410	Core: Cut over to the Lucene filter cache. This removes Elasticsearch's filter cache and uses Lucene's instead. It has some implications: - custom cache keys (`_cache_key`) are unsupported - decisions are made internally and can't be overridden by users ('_cache`) - not only filters can be cached but also all queries that do not need scores - parent/child queries can now be cached, however cached entries are only valid for the current top-level reader so in practice it will likely only be used on read-only indices - the cache deduplicates filters, which plays nicer with large keys (eg. `terms`) - better stats: we already had ram usage and evictions, but now also hit count, miss count, lookup count, number of cached doc id sets and current number of doc id sets in the cache - dynamically changing the filter cache size is not supported anymore Internally, an important change is that it removes the NoCacheFilter infrastructure in favour of making Query.rewrite specializing the query for the current reader so that it will only be cached on this reader (look for IndexCacheableQuery). Note that consuming filters with the query API (createWeight/scorer) instead of the filter API (getDocIdSet) is important for parent/child queries because otherwise a QueryWrapperFilter(ParentQuery) would run the wrapped query per segment while relations might be cross segments.	2015-05-04 09:02:15 +02:00
Clinton Gormley	c28bf3bb3f	Docs: Updated elasticsearch.org links to elastic.co	2015-05-01 20:46:12 +02:00
Ryan Ernst	4ef9f3ca63	Mappings: Remove file based default mappings Using files that must be specified on each node is an anti-pattern from the API based goal of ES. This change removes the ability to specify the default mapping with a file on each node. closes #10620	2015-04-30 13:50:35 -07:00
Adrien Grand	e5be85d586	Aggs: Change the default `min_doc_count` to 0 on histograms. The assumption is that gaps in histogram are generally undesirable, for instance if you want to build a visualization from it. Additionally, we are building new aggregations that require that there are no gaps to work correctly (eg. derivatives).	2015-04-30 15:48:23 +02:00
Simon Willnauer	94d8b20611	Add multi data.path to migration guide this commit removes the obsolete settings for distributors and updates the documentation on multiple data.path. It also adds an explain to the migration guide. Relates to #9498 Closes #10770	2015-04-29 11:51:37 +02:00
Ryan Ernst	bf09e58cb3	Mappings: Remove includes and excludes from _source Regardless of the outcome of #8142, we should at least enforce that when _source is enabled, it is sufficient to reindex. This change removes the excludes and includes settings, since these modify the source, causing us to lose the ability to reindex some fields. closes #10814	2015-04-28 15:03:51 -07:00
javanna	c914134355	Scripting: remove groovy sandbox Groovy sandboxing was disabled by default from 1.4.3 on though since we found out that it could be worked around, so it makes little sense to keep it and maintain it. Closes #10156 Closes #10480	2015-04-28 11:27:50 +02:00
Jun Ohtani	933edf7bcc	Analysis: Fix wrong position number by analyze API Add breaking chages comment to migrate docs Fix the stopword included text using stopword filter	2015-04-28 17:44:41 +09:00
Simon Willnauer	d164526d27	Remove `_shutdown` API Thsi commit removes the `_shutdown` API entirely without any replacement. Nodes should be managed from the operating system not via REST APIs	2015-04-27 17:19:36 +02:00
markharwood	1b8b993912	Query enhancement: Enable Lucene ranking behaviour for queries on numeric fields. This changes the default ranking behaviour of single-term queries on numeric fields to use the usual Lucene TermQuery scoring logic rather than a constant-scoring wrapper. Closes #10628	2015-04-27 09:42:55 +01:00
Ryan Ernst	1f5bdca8cc	Mappings: Restrict murmur3 field type to sane options Disabling doc values or trying to index hash values are not correct uses of this the murmur3 field type, and just cause problems. This disallows changing doc values or index options for 2.0+. closes #10465	2015-04-23 21:48:42 -07:00
Igor Motov	60721b2a17	Snapshot/Restore: remove obsolete expand_wildcards_open and expand_wildcards_close options In #6097 we made snapshot/restore index option consistent with other API. Now we can remove old style options from master. Closes #10743	2015-04-23 13:29:24 -04:00
Adrien Grand	d7abb12100	Replace deprecated filters with equivalent queries. In Lucene 5.1 lots of filters got deprecated in favour of equivalent queries. Additionally, random-access to filters is now replaced with approximations on scorers. This commit - replaces the deprecated NumericRangeFilter, PrefixFilter, TermFilter and TermsFilter with NumericRangeQuery, PrefixQuery, TermQuery and TermsQuery, wrapped in a QueryWrapperFilter - replaces XBooleanFilter, AndFilter and OrFilter with a BooleanQuery in a QueryWrapperFilter - removes DocIdSets.isBroken: the new two-phase iteration API will now help execute slow filters efficiently - replaces FilterCachingPolicy with QueryCachingPolicy Close #8960	2015-04-21 15:32:43 +02:00
Clinton Gormley	ab3fa78ae0	Docs: Reverte migration docs mentioning parent removal from update request Relates to #9612	2015-04-13 16:35:21 +02:00
Adrien Grand	5b3cc2f07c	Search: deprecate the limit filter. This is really a Collector instead of a filter. This commit deprecates the `limit` filter, makes it a no-op and recommends to use the `terminate_after` parameter instead that we introduced in the meantime.	2015-04-10 17:18:50 +02:00
Adrien Grand	919589b908	Queries: Remove fuzzy-like-this support. The fuzzy-like-this query builds very expensive queries and only serves esoteric use-cases.	2015-04-10 17:16:02 +02:00
Adrien Grand	aecd9ac515	Aggregations: Speed up include/exclude in terms aggregations with regexps. Today we check every regular expression eagerly against every possible term. This can be very slow if you have lots of unique terms, and even the bottleneck if your query is selective. This commit switches to Lucene regular expressions instead of Java (not exactly the same syntax yet most existing regular expressions should keep working) and uses the same logic as RegExpQuery to intersect the regular expression with the terms dictionary. I wrote a quick benchmark (in the PR) to make sure it made things faster and the same request that took 750ms on master now takes 74ms with this change. Close #7526	2015-04-09 12:12:56 +02:00
javanna	acabf2d55a	Cluster state REST api: print routing_nodes out only when requested through specific flag For bacwards compatibility reasons routing_nodes were previously printed out when routing_table was requested, together with the actual routing_table. Now they are printed out only when requests through `routing_nodes` flag. Relates to #10412 Closes #10486	2015-04-08 16:10:36 +02:00
javanna	d9aebf4906	Scripting: remove deprecated methods from ScriptService Removed the following methods from `ScriptService`, which don't require the `ScriptContext` argument: ``` public CompiledScript compile(String lang, String script, ScriptType scriptType) public ExecutableScript executable(String lang, String script, ScriptType scriptType, Map<String, Object> vars) public SearchScript search(SearchLookup lookup, String lang, String script, ScriptType scriptType, @Nullable Map<String, Object> vars) ``` Also removed the ScriptContext.Standard.GENERIC_PLUGIN enum value, as it was used only for backwards compatibility. Plugins that make use of scripts should declare their own script contexts through `ScriptModule#registerScriptContext` and use them when compiling/executing scripts. Closes #10476	2015-04-08 12:20:03 +02:00
Adrien Grand	08f93cf33f	Add doc values support to boolean fields. This pull request makes boolean handled like dates and ipv4 addresses: things are stored as as numerics under the hood and aggregations add some special formatting logic in order to return true/false in addition to 1/0. For example, here is an output of a terms aggregation on a boolean field: ``` "aggregations": { "top_f": { "doc_count_error_upper_bound": 0, "buckets": [ { "key": 0, "key_as_string": "false", "doc_count": 2 }, { "key": 1, "key_as_string": "true", "doc_count": 1 } ] } } ``` Sorted numeric doc values are used under the hood. Close #4678 Close #7851	2015-04-02 15:40:46 +02:00
Reuben Sutton	85c221e9b1	Remove jsonp support and associated tests, closes #9108	2015-04-01 16:06:09 +01:00
javanna	83fb0a10e5	Scripting: remove support for script.disable_dynamic setting Now that fine-grained script settings are supported (#10116) we can remove support for the script.disable_dynamic setting. Same result as `script.disable_dynamic: false` can be obtained as follows: ``` script.inline: on script.indexed: on ``` An exception is thrown at startup when the old setting is set, so we make sure we tell users they have to change it rather than ignoring the setting. Closes #10286	2015-03-31 13:24:52 +02:00
Adrien Grand	a608db122d	Search: Remove the `count` search type. This commit brings the benefits of the `count` search type to search requests that have a `size` of 0: - a single round-trip to shards (no fetch phase) - ability to use the query cache Since `count` now provides no benefits over `query_then_fetch`, it has been deprecated. Close #7630	2015-03-31 11:31:49 +02:00
Martijn van Groningen	75713f4190	Reverted commit: `20f7be3`	2015-03-28 08:53:11 +01:00
Ryan Ernst	90dfd78267	Remove missed references to delete mapping API See #10231	2015-03-24 10:13:19 -07:00
Ryan Ernst	693d91e41c	Mappings: Remove delete mapping API Deleting a type from an index is inherently dangerous because the type can be recreated with new mappings which may conflict with existing segments still using the old mappings. This removes the ability to delete a type (similar to how deleting fields within a type is not allowed, for the same reason). closes #8877 closes #10231	2015-03-24 09:46:02 -07:00
javanna	4348959f9d	Delete api: remove broadcast delete if routing is missing when required This commit changes the behaviour of the delete api when processing a delete request that refers to a type that has routing set to required in the mapping, and the routing is missing in the request. Up until now the delete api sent a broadcast delete request to all of the shards that belong to the index, making sure that the document could be found although the routing value wasn't specified. This was probably not the best choice: if the routing is set to required, an error should be thrown instead. A `RoutingMissingException` gets now thrown instead, like it happens in the same situation with every other api (index, update, get etc.). Last but not least, this change allows to get rid of a couple of `TransportAction`s, `Request`s and `Response`s and simplify the codebase. Closes #9123 Closes #10136	2015-03-20 09:19:43 +01:00
Clinton Gormley	aa94ced0ae	Remove references to the thrift and memcached transport plugins as they are no longer supported Closes #10166	2015-03-19 20:49:58 +01:00
Clinton Gormley	25369f0727	Remove async replication from the docs and REST spec Relates to #10114	2015-03-19 15:34:12 +01:00
Ryan Ernst	9d708e20a0	Mappings: Lock down _size field This also changes the stored setting for _size to true (for indexes created in 2.x). see #8143 closes #9913	2015-02-27 11:09:52 -08:00
Ryan Ernst	3b7928d568	Mappings: Lock down _field_names field Now that we have an explicit `enabled` flag, we can lock down the field type so it is not mungeable. see #8143 closes #9912	2015-02-26 15:15:59 -08:00
Ryan Ernst	7181bbde26	Mappings: Remove _boost field This has been deprecated since 1.0.0.RC1. It is finally removed here. closes #8875	2015-02-26 15:07:07 -08:00
Ryan Ernst	78df69e6a0	Mappings: Lock down _routing field `required` is now the only changeable settings (on indexes created after 1.x). see #8143 closes #9895	2015-02-26 13:09:41 -08:00
Ryan Ernst	32e042f1c4	Mappings: Lock down _index field see #8143 closes #9870	2015-02-25 12:24:55 -08:00
Clinton Gormley	5a53ff6f1b	Update migrate_2_0.asciidoc More code formatting in breaking changes	2015-02-25 14:13:25 +01:00
Clinton Gormley	e805fe71cc	Update migrate_2_0.asciidoc Code formatting in breaking changes	2015-02-25 14:11:57 +01:00
Clinton Gormley	5146cf6256	Update migrate_2_0.asciidoc Fixed bad heading levels in breaking changes	2015-02-25 14:10:17 +01:00
Clinton Gormley	0c61ea803d	Update migrate_2_0.asciidoc Fixed bad asciidoc in breaking changes	2015-02-25 14:07:19 +01:00
Ryan Ernst	be0cef0c43	Mappings: Lock down _type field see #8143 closes #9869	2015-02-24 22:37:41 -08:00
Ryan Ernst	b96bd201c1	Mappings: Lock down _id field There are two implications to this change. First, percolator now uses _uid internally, extracting the id portion when needed. Second, sorting on _id is no longer possible, since you can no longer index _id. However, _uid can still be used to sort, and is better anyways as indexing _id just to make it available to fielddata for sorting is wasteful. see #8143 closes #9842	2015-02-24 14:26:22 -08:00
Colin Goodheart-Smithe	2753db4685	Scripting: Removed deprecated script parameter names This change removes the deprecated script parameter names ('file', 'id', and 'scriptField'). It also removes the ability to load file scripts using the 'script' parameter. File scripts should be loaded using the 'script_file' parameter only.	2015-02-23 13:49:21 +00:00
Adrien Grand	4708227ecf	Codecs: Remove the ability to have custom per-field postings and doc values formats. This commit makes the `postings_format` and `doc_values_format` options of mappings illegal on 2.0 and ignored on 1.x (meaning that the default postings and doc values formats from the codec will be used in such a case). This removes a fair amount of code. Close #8746 #9741	2015-02-19 15:47:25 +01:00
Christoph Büscher	30fd70f07b	Aggregations: Simplify time zone option in `date_histogram` Removed the existing `pre_zone` and `post_zone` option in `date_histogram` in favor of the simpler `time_zone` option. Previously, specifying different values for these could lead to confusing scenarios where ES would return bucket keys that are not UTC. Now `time_zone` is the only option setting, the calculation of date buckets to take place in the preferred time zone, but after rounding converting the bucket key values back to UTC. Closes #9062 Closes #9637	2015-02-16 16:54:06 +01:00
Ryan Ernst	533fdbdf75	Mappings: Remove support for field access by short name When multiple fields under object fields share the same name, accessing by short name is ambiguous. This removes support for short names, always requiring the full name when used in queries. closes #8872	2015-02-12 09:58:37 -08:00
Christoph Büscher	d2f852a274	Aggregations: Add 'offset' option to date_histogram, replacing 'pre_offset' and 'post_offset' Add offset option to 'date_histogram' replacing and simplifying the previous 'pre_offset' and 'post_offset' options. This change is part of a larger clean up task for `date_histogram` from issue #9062.	2015-02-09 14:03:28 +01:00
Ryan Ernst	6079d88d43	Mappings: Remove type prefix support from field names in queries This is the first part of #8872.	2015-02-02 13:10:56 -08:00
Colin Goodheart-Smithe	285ef0f06d	Aggregations: Clean up response API for Aggregations This change makes the response API object for Histogram Aggregations the same for all types of Histogram, and does the same for all types of Ranges. The change removes getBucketByKey() from all aggregations except filters and terms. It also reduces the methods on the Bucket class to just getKey() and getKeyAsString(). The getKey() method returns Object and the actual Type is returns will be appropriate for the type of aggregation being run. e.g. date_histogram will return a DateTime for this method and Histogram will return a Number.	2015-01-27 10:53:44 +00:00
David Pilato	052645903a	Rest: remove status code from main action Today we give the HTTP status back within the HTTP response itself and within the JSON response as well: ```sh curl localhost:9200/ ``` ```js { "status" : 200, "name" : "Red Wolf", "version" : { "number" : "2.0.0", "build_hash" : "6837a61d8a646a2ac7dc8da1ab3c4ab85d60882d", "build_timestamp" : "2014-08-19T13:55:56Z", "build_snapshot" : true, "lucene_version" : "4.9" }, "tagline" : "You Know, for Search" } ```	2015-01-12 12:37:46 +01:00
Ryan Ernst	060f963a8e	Mappings: Remove allow_type_wrapper setting Before Elasticsearch 1.0, the type was allowed to be passed as the root element when uploading a document. However, this was ambiguous if the mappings also contained a field with the same name as the type. The behavior was changed in 1.0 to not allow this, but a setting was added for backwards compatibility. This change removes the setting for 2.0.	2015-01-08 09:13:40 -08:00
Martijn van Groningen	ca4f27f40e	Core: Added `_shards` header to all write responses. The header indicates to how many shard copies (primary and replicas shards) a write was supposed to go to, to how many shard copies to write succeeded and potentially captures shard failures if writing into a replica shard fails. For async writes it also includes the number of shards a write is still pending. Closes #7994	2015-01-08 18:10:08 +01:00
Martijn van Groningen	20f7be378b	Removed parent parameter from update request, because it is just sets the routing. The routing option should be used instead. The parent a child document points to can't be updated. Closes #4538	2015-01-07 10:26:20 +01:00
Adrien Grand	bc86796592	Core: Remove terms filter cache. This is our only cache which is not 'exact' and might allow for stalled results. Additionally, a similar cache that we have and needs to perform lookups in other indices in order to run queries is the script index, and for this index we rely on the filesystem cache, so we should probably do the same with terms filters lookups. Close #9056	2015-01-06 17:21:20 +01:00
David Pilato	d2a2d1bb53	java: QueryBuilders cleanup: remove deprecated Related to #8667: Some QueryBuilders have been deprecated in 1.x branches. We removed them in 2.0. Removed ------- * `textPhrase(...)` * `textPhrasePrefix(...)` * `textPhrasePrefixQuery(...)` * `filtered(...)` * `inQuery(...)` * `commonTerms(...)` * `queryString(...)` * `simpleQueryString(...)` Closes #8721.	2014-12-03 16:07:34 +01:00
Adrien Grand	8346e92ebb	Core: Fix script fields to be returned as a multivalued field when they produce a list. This change is essentially the same as #3015 but on script fields. Close #8592	2014-11-24 09:41:16 +01:00
Alex Ksikes	1959275622	Term Vectors: More consistent naming for term vector[s] We speak of the term vectors of a document, where each field has an associated stored term vector. Since by default we are requesting all the term vectors of a document, the HTTP request endpoint should rather be called `_termvectors` instead of `_termvector`. The usage of `_termvector` is now deprecated, as well as the transport client call to termVector and prepareTermVector. Closes #8484	2014-11-21 14:06:44 +01:00
Simon Willnauer	0fcb466555	[STORE] Remove `memory`/ `ram` store The RAM store is discuraged for production usage anyway and we don't test it in our randomized infrastructure. This commit removes it for `2.0`	2014-11-20 14:47:19 +01:00
Clinton Gormley	32fc657d71	Docs: Fixed a bad ref to docs-bulk-udp which no longer exists in master	2014-11-13 14:34:49 +01:00
Colin Goodheart-Smithe	353574d6af	Indices API: Fix GET index API always running all features Previous to this change all features (_alias,_mapping,_settings,_warmer) are run regardless of which features are actually requested. This change fixes the request object to resolve this bug	2014-11-13 13:22:46 +00:00
Clinton Gormley	6b05b229af	Docs: Changed breaking docs in master to correspond with 1.x for easier merging	2014-11-13 13:50:57 +01:00
Adrien Grand	9ea25df649	Switch to murmurhash3 to route documents to shards. We currently use the djb2 hash function in order to compute the shard a document should go to. Unfortunately this hash function is not very sophisticated and you can sometimes hit adversarial cases, such as numeric ids on 33 shards. Murmur3 generates hashes with a better distribution, which should avoid the adversarial cases. Here are some examples of how 100000 incremental ids are distributed to shards using either djb2 or murmur3. 5 shards: Murmur3: [19933, 19964, 19940, 20030, 20133] DJB: [20000, 20000, 20000, 20000, 20000] 3 shards: Murmur3: [33185, 33347, 33468] DJB: [30100, 30000, 39900] 33 shards: Murmur3: [2999, 3096, 2930, 2986, 3070, 3093, 3023, 3052, 3112, 2940, 3036, 2985, 3031, 3048, 3127, 2961, 2901, 3105, 3041, 3130, 3013, 3035, 3031, 3019, 3008, 3022, 3111, 3086, 3016, 2996, 3075, 2945, 2977] DJB: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 900, 900, 900, 900, 1000, 1000, 10000, 10000, 10000, 10000, 9100, 9100, 9100, 9100, 9000, 9000, 0, 0, 0, 0, 0, 0] Even if djb2 looks ideal in some cases (5 shards), the fact that the distribution of its hashes has some patterns can raise issues with some shard counts (eg. 3, or even worse 33). Some tests have been modified because they relied on implementation details of the routing hash function. Close #7954	2014-11-04 16:32:42 +01:00
Alex Ksikes	35f55608cc	MLT Field Query: remove it from master The MLT field query is simply replaced by a MLT query set to specififc field. To simplify code maintenance we should deprecate it in 1.4 and remove it in 2.0. Closes #8238	2014-10-29 10:19:00 +01:00
Adrien Grand	230c6684a9	Search: Remove partial fields. Partial fields have been deprecated since 1.0.0Beta1 in favor of _source filtering. They will be removed in 2.0.	2014-10-20 12:29:30 +02:00
Adrien Grand	87497c92ca	Docs: Add missing link to the 2.0 migration guide.	2014-10-12 17:54:21 +02:00
Colin Goodheart-Smithe	9ce7ca21e4	Docs: Added breaking changes docs for Indices APIs Adds the breaking changes defaults for the change of default indices options for the GET Aliases API	2014-10-08 15:08:25 +01:00
Martijn van Groningen	fca406415d	Docs: Document the most important changes to zen discovery. Closes #7746	2014-09-30 13:05:45 +02:00
Clinton Gormley	de0cca4cef	Docs: Fixed some deleted refs	2014-09-26 21:35:57 +02:00
Colin Goodheart-Smithe	f0e9b7b8ef	[DOC] Add GET Alias API note to breaking changes Note explains that GET Alias API now supports IndicesOptions and will error if a index is missing	2014-09-18 15:09:01 +01:00
Colin Goodheart-Smithe	8238388bad	[DOCS] clarification of breaking changes to 1.4 due to GET index API	2014-09-11 15:04:04 +01:00
Colin Goodheart-Smithe	5fe782b784	Indices API: Added GET Index API Returns information about settings, aliases, warmers, and mappings. Basically returns the IndexMetadata. This new endpoint replaces the /{index}/_alias\|_aliases\|_mapping\|_mappings\|_settings\|_warmer\|_warmers and /_alias\|_aliases\|_mapping\|_mappings\|_settings\|_warmer\|_warmers endpoints whilst maintaining the same response formats. The only exception to this is on the /_alias\|_aliases\|_warmer\|_warmers endpoint which will now return a section for 'aliases' or 'warmers' even if no aliases or warmers exist. This backwards compatibility change is documented in the reference docs. Closes #4069	2014-09-11 11:19:21 +01:00
Martijn van Groningen	52f1ab6e16	Core: Added the `index.query.parse.allow_unmapped_fields` setting to fail queries if they refer to unmapped fields. The percolator and filters in aliases by default enforce strict query parsing. Closes #7335	2014-09-09 15:00:47 +02:00
Adrien Grand	ea96359d82	Facets: Removal from master. Close #7337	2014-08-21 10:34:39 +02:00
Martijn van Groningen	a81a4a5efe	[DOCS] Included the `_percolator` index breaking change to migration docs.	2014-02-20 16:43:06 +01:00
Simon Willnauer	990ce658a4	[Docs] Remove `custom_score` from documentation and add a migration section.	2014-02-11 14:59:15 +01:00
Clinton Gormley	17e2ca5259	[DOCS] Updated migration docs for multi_field to point to copy_to	2014-02-06 14:34:07 +01:00
Clinton Gormley	d9bdfe3fec	[DOCS] Deprecated the path setting in favour of copy_to Relates to #4729	2014-02-05 14:47:48 +01:00
Clinton Gormley	1aa1e83e03	[DOCS] Updated the breaking changes for the fields param Closes #4888	2014-01-25 12:34:15 +01:00
Karel Minarik	241bb09db1	[DOCS] More assertive statement about requiring `query` in _count, etc	2014-01-23 20:35:44 +01:00
Luca Cavanna	bdb1992e85	Fixed typo	2014-01-20 19:32:50 +01:00
Clinton Gormley	8cb091e55d	[DOCS] Tidied up asciidoc for migration page	2014-01-16 12:22:05 +01:00
Clinton Gormley	3d4891321b	[DOCS] Minor changes to the breaking changes doc	2014-01-15 18:23:03 +01:00
Clinton Gormley	9e3f527721	[DOCS] Fixed asciidoc issue	2014-01-15 18:00:13 +01:00
Clinton Gormley	faddd66e87	[DOCS] Added breaking changes in 1.0	2014-01-15 17:50:24 +01:00

... 13 14 15 16 17

803 Commits