OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	4406d236de	Merge branch 'master' into feature/query-refactoring Conflicts: core/src/main/java/org/elasticsearch/transport/netty/MessageChannelHandler.java	2015-06-30 10:52:34 +02:00
Martijn van Groningen	47a43e4063	nested query: Added `min` score mode. This score mode was added with the Lucene 5.2 release, but the `nested` query parser hasn't been changed to use it.	2015-06-29 12:26:30 +02:00
Boaz Leskes	41f8c96fed	Docs: clarification of allocation awareness w.r.t. rack failures Closes #11908	2015-06-29 11:57:32 +02:00
Christoph Büscher	53f6bf0625	Merge branch 'master' into feature/query-refactoring	2015-06-29 10:54:45 +02:00
Adrien Grand	d2f86933cc	Merge pull request #11893 from jpountz/fix/rename_cache Rename caches.	2015-06-29 10:21:18 +02:00
Adrien Grand	38f5cc236a	Rename caches. In order to be more consistent with what they do, the query cache has been renamed to request cache and the filter cache has been renamed to query cache. A known issue is that package/logger names do no longer match settings names, please speak up if you think this is an issue. Here are the settings for which I kept backward compatibility. Note that they are a bit different from what was discussed on #11569 but putting `cache` before the name of what is cached has the benefit of making these settings consistent with the fielddata cache whose size is configured by `indices.fielddata.cache.size`: * index.cache.query.enable -> index.requests.cache.enable * indices.cache.query.size -> indices.requests.cache.size * indices.cache.filter.size -> indices.queries.cache.size Close #11569	2015-06-29 10:15:27 +02:00
Clinton Gormley	f19a748d3c	Docs: Move field highlight order to the highlight page	2015-06-26 17:36:48 +02:00
Clinton Gormley	765ac45168	Docs: Tidied up function score query docs Closes #5991	2015-06-26 17:31:32 +02:00
jaymode	6b086dc7db	change CORS allow origin default to allow no origins Today, we disable CORS by default, but if a user simply enables CORS their instance of elasticsearch will allow cross origin requests from anywhere, as the default value for allowed origins is ``. This changes the default to be `null` so that no origins are allowed and the user must explicitly specify the origins they wish to allow requests from. The documentation also mentions that there is a security risk in using `` as the value. Closes #11169	2015-06-26 10:59:15 -04:00
Christoph Büscher	6678acfe23	Merge branch 'master' into feature/query-refactoring Conflicts: core/src/main/java/org/elasticsearch/index/query/RangeQueryBuilder.java	2015-06-26 14:48:20 +02:00
Christoph Büscher	f5f73259e4	Docs: Update Joda URLs in documentation.	2015-06-26 10:23:02 +02:00
Christoph Büscher	ba9bbf7e66	Docs: Update date-format.asciidoc Joda documentation moved from http://joda-time.sourceforge.net/ to http://www.joda.org/joda-time/. Updated the links in the documentation accordingly.	2015-06-26 09:49:29 +02:00
Alexander Reelsen	23cf9af495	Dates: Be backwards compatible with pre 2.x indices In order to be backwards compatible, indices created before 2.x must support indexing of a unix timestamp and its configured date format. Indices created with 2.x must configure the `epoch_millis` date formatter in order to support this. Relates #10971	2015-06-25 17:21:29 +02:00
javanna	556e43aa84	Merge branch 'master' into feature/query-refactoring	2015-06-25 16:57:42 +02:00
Colin Goodheart-Smithe	f21924ae0d	Aggregations: Adds cumulative sum aggregation This adds a new pipeline aggregation, the cumulative sum aggregation. This is a parent aggregation which must be specified as a sub-aggregation to a histogram or date_histogram aggregation. It will add a new aggregation to each bucket containing the sum of a specified metrics over this and all previous buckets.	2015-06-25 14:27:57 +01:00
Isabel Drost-Fromm	4f7ed2132e	Remove duplicate operator enums As we now have an enum Operator that comes with many useful helper methods switching to use that instead of the enums defined separately. Also switches to using the new enum's helper methods where applicable removing duplicate parsing logic. This breaks backwards compatibility. Documenting the break in migrate_query_refactoring.asciidoc Relates to #10217	2015-06-25 10:47:39 +02:00
Clinton Gormley	3105b4edbe	Update core-types.asciidoc Added an anchor for multi-fields in mappinggs	2015-06-24 21:36:37 +02:00
Simon Willnauer	fcdcce3bba	Consolidate shard level abstractions This commit consolidates several abstractions on the shard level in ordinary classes not managed by the shard level guice injector. Several classes have been collapsed into IndexShard and IndexShardGatewayService was cleaned up to be more lightweight and self-contained. It has also been moved into the index.shard package and it's operation is renamed from recovery from "gateway" to recovery from "store" or "shard_store". Closes #11847	2015-06-24 15:18:04 +02:00
Christoph Büscher	a2122fdc2b	Merge branch 'master' into feature/query-refactoring	2015-06-24 11:29:59 +02:00
David Pursehouse	b49e66c3a1	Replace references to ImmutableSettings with Settings ImmutableSettings was merged into Settings in commit 4873070. Change-Id: I06bd0150381d131593920c2328c46beacf49661f	2015-06-24 14:54:53 +09:00
Clinton Gormley	e1aef43ee3	Update plugins.asciidoc Moved community scripting plugins to their own section	2015-06-23 21:53:25 +02:00
Clinton Gormley	37eae789a0	Merge pull request #11801 from golubev/patch-6 fix json syntax in filters-aggregation.asciidoc	2015-06-23 20:02:04 +02:00
Carol Willing	65cf8d1c46	Docs: Add Oracle doc link to getting started page Since there is a recommended version of JDK, it would be helpful to provide a link to the Oracle documentation. Since there are many versions of Java, those that are new or infrequent users of Java would find the link helpful. Thanks! Closes #11792	2015-06-23 19:50:54 +02:00
Martijn van Groningen	fe330b868a	percolator: Fail nicely if `nested` query with `inner_hits` is used in a percolator query. Closes #11672	2015-06-23 15:03:31 +02:00
Colin Goodheart-Smithe	f26311e88b	Aggregations: Rename `series_arithmetic` agg to `bucket_script`	2015-06-23 14:00:17 +01:00
javanna	99147228d7	Merge branch 'master' into feature/query-refactoring Conflicts: core/src/main/java/org/elasticsearch/index/query/GeoShapeQueryBuilder.java core/src/main/java/org/elasticsearch/index/query/TermsQueryBuilder.java	2015-06-23 10:16:21 +02:00
Igor Motov	d32443bfb5	Docs: add description of the analyze_wildcard parameter to the simple query string query docs	2015-06-22 18:26:31 -04:00
Clinton Gormley	f123a53d72	Docs: Refactored modules and index modules sections	2015-06-22 23:49:45 +02:00
Boaz Leskes	1df2d3015e	Add OS name to _nodes and _cluster/nodes we currently don't expose this. This adds the following to the OS section of `_nodes`: ``` "os": { "name": "Mac OS X", ... } ``` and the following to the OS section of `_cluster/stats`: ``` "os": { ... "names": [ { "name": "Mac OS X", "count": 1 } ], ... }, ``` Closes #11807	2015-06-22 20:36:29 +02:00
Ryan Ernst	12e7cbe92b	Mappings: Lockdown _timestamp This is a follow up to #8143 and #6730 for _timestamp. It removes support for `path`, as well as any field type settings, and enables docvalues for _timestamp, for 2.0. Users who need to adjust these settings can use a date field.	2015-06-22 10:21:03 -07:00
Christoph Büscher	b6cdc46a61	Query refactoring: QueryFilterBuilder and Parser Moving the query building functionality from the parser to the builders new toQuery() method analogous to other recent query refactorings. In this case this also includes FQueryFilterParser, since both queries are closely related. Relates to #10217 Closes #11729	2015-06-22 18:17:01 +02:00
Alexander Reelsen	38ddc8159c	Dates: Allow for negative unix timestamps This fixes an issue to allow for negative unix timestamps. An own printer for epochs instead of just having a parser has been added. Added docs that only 10/13 length unix timestamps are supported Added docs in upgrade documentation Fixes #11478	2015-06-22 11:56:31 +02:00
Clinton Gormley	f67ae63d88	Docs: Added cluster naming advice to setup and getting started docs	2015-06-19 18:34:00 +02:00
Clinton Gormley	e8d5b8ce4b	Convert curl examples to Sense for snapshot restore Closes #11537 Conflicts: docs/reference/modules/snapshots.asciidoc	2015-06-19 18:08:04 +02:00
Clinton Gormley	64581d66c9	Tidied up the update docs Closes #9459	2015-06-19 17:29:11 +02:00
Clinton Gormley	d6ba3226d6	Docs: Add missing quotes in phrase suggest	2015-06-19 16:56:25 +02:00
Clinton Gormley	cda1f37ead	Merge pull request #11773 from elastic/robin13-patch-1 Update stats.asciidoc	2015-06-19 16:48:12 +02:00
Clinton Gormley	1bfaac7098	Fixed bad asciidoc	2015-06-19 16:33:14 +02:00
Clinton Gormley	dd680669f5	Docs: Rewrote the upgrade section	2015-06-19 16:28:07 +02:00
caldwecr	1ac728d22b	Docs: Update filter-aggregation.asciidoc Replace the previous example which leveraged a range filter, which causes unnecessary confusion about when to use a range filter to create a single bucket or a range aggregation with exactly one member in ranges. Closes #11704	2015-06-19 12:24:42 +02:00
Alex Ksikes	3f6dae1a73	More Like This: renamed `ignore_like` to `unlike` This changes the parameter name `ignore_like` to the more user friendly name `unlike`. This later feature generates a query from the terms in `A` but not from the terms in `B`. This translates to a result set which is like `A` but unlike `B`. We could have further negatively boosted any documents that have some `B`, but these documents already do not receive any contribution from having `B`, and would therefore negatively compete with documents having `A`. Closes #11117	2015-06-17 17:18:50 -05:00
Simon Willnauer	0434ecfb03	Merge pull request #11464 from nirmalc/nodes-preference Search `preference` based on node specification	2015-06-17 12:33:51 +02:00
Boaz Leskes	f4a143d138	Clarify refresh parameter in the `_bulk` API See #11690 Closes #11691	2015-06-17 08:47:40 +02:00
Adrien Grand	17fac6dad5	Merge pull request #11568 from jpountz/remove/rivers Rivers removal.	2015-06-17 08:20:48 +02:00
Nirmal Chidambaram	72a9d34eb8	5925 - Allow node specification in preference -Allow node selector api's with new preference ONLY_NODES ( selector apis like https://www.elastic.co/guide/en/elasticsearch/reference/current/cluster.html) -Update documentation	2015-06-16 11:49:12 -05:00
Adrien Grand	14c9c239bc	Remove non-default fielddata formats. Now that doc values are the default for fielddata, specialized in-memory formats are becoming an esoteric option. This commit removes such formats: - `fst` on string fields, - `compressed` on geo points. I also removed documentation and tests that the fielddata cache is shared if you change the format, since this is only true for in-memory fielddata formats (given that for doc values, the caching is done directly in Lucene).	2015-06-15 14:05:23 +02:00
Clinton Gormley	64ec18afa0	Merge pull request #11661 from pjcard/patch-1 Make explicit the requirement for intervals to be integers Conflicts: docs/reference/search/aggregations/bucket/histogram-aggregation.asciidoc	2015-06-15 11:42:12 +02:00
Mark Walkom	c8f635d429	Docs: Updated groovy docs link Closes #11656	2015-06-15 11:15:57 +02:00
Clinton Gormley	2376cc500d	Merge pull request #11649 from adjust/jdk_docs Clarify Java requirements	2015-06-14 22:15:23 +02:00
Clinton Gormley	e88535a67e	Merge pull request #11614 from oyiadom/patch-1 Fix typo in upgrade docs	2015-06-13 11:34:56 +02:00
Clinton Gormley	4e94d097e7	Merge pull request #11556 from robin13/master Docs: More information about 'Copy field to'	2015-06-12 15:51:49 +02:00
Colin Goodheart-Smithe	a216062d88	Aggregations: allow users to perform simple arithmetic operations on histogram aggregations Closes #11029	2015-06-12 09:25:52 +01:00
Igor Motov	93beea1f67	Snapshot/Restore: Move in-progress snapshot and restore information from custom metadata to custom cluster state part Information about in-progress snapshot and restore processes is not really metadata and should be represented as a part of the cluster state similar to discovery nodes, routing table, and cluster blocks. Since in-progress snapshot and restore information is no longer part of metadata, this refactoring also enables us to handle cluster blocks in more consistent manner and allow creation of snapshots of a read-only cluster. Closes #8102	2015-06-11 15:21:18 -04:00
Clinton Gormley	0216dfd3b6	Docs: Removed left over table header from merge.asciidoc	2015-06-11 13:26:34 +02:00
Simon Willnauer	f77804dad3	Bake in TieredMergePolicy Today we provide the ability to plug in MergePolicy and we provide the once lucene ships with. We do not recommend to change the default and even only a small number of expert users would ever touch this. This commit removes the ancient log byte size and log doc count merge policy providers, simplifies the MergePolicy wiring and makes the tiered MP the one and only default. All notions of a merge policy has been removed from the docs and should be deprecated in the previous version. Closes #11588	2015-06-11 11:58:30 +02:00
Clinton Gormley	6e71f60b82	Update bool-query.asciidoc Emphasise section about using bool query in filter context	2015-06-10 21:46:23 +02:00
Simon Willnauer	657d6dd9cf	Remove MergeScheduler pluggability Nobody should really plug in a different merge scheduler for elasticsearch. This is too expert and might cause catastrophic failures.	2015-06-10 20:28:30 +02:00
Adrien Grand	ac7ce2b899	Rivers removal. While we had initially planned to keep rivers around in 2.0 to ease migration, keeping support for rivers is challenging as it conflicts with other important changes that we want to bring to 2.0 like synchronous dynamic mappings updates. Nothing impossible to fix, but it would increase the complexity of how we deal with dynamic mappings updates and manage rivers, while handling dynamic mappings updates correctly is important for resiliency and rivers are on the go. So removing rivers in 2.0 may well be a better trade-off.	2015-06-10 09:22:09 +02:00
Robin Clarke	f13c216aa2	More information about 'Copy field to'	2015-06-09 16:35:49 +02:00
Alexander Reelsen	3bda78e43b	ResourceWatcher: Rename settings to prevent watcher clash The ResourceWatcher used settings prefixed `watcher.`, which potentially could clash with the watcher plugin. In order to prevent confusion, the settings have been renamed to `resource.reload` prefixes. This also uses the deprecation logging infrastructure introduced in #11033 to log deprecated settings and their alternative at startup. Closes #11175	2015-06-09 10:02:49 +02:00
Chelsea Lura	3ac19e8f7f	Doc: Typo 'good' vs 'well' typo Closes #11549	2015-06-09 09:25:23 +02:00
Andreas Kohn	1c0ad8c724	Fix a typo in the documentation: six_hun -> "narrower" This was introduced in https://github.com/elastic/elasticsearch.github.com/commit/defaf4f0, probably as a search-and-replace mistake.	2015-06-08 18:07:52 +02:00
Clinton Gormley	60c7e0eb91	Update merge.asciidoc Corrected typo in merge docs	2015-06-08 16:45:59 +02:00
Nirmal Chidambaram	931b9f9c74	Filtered out non data-nodes in relevant cat api Closes #9214 Closes #9287	2015-06-08 16:05:42 +02:00
javanna	2ef0fcfd6a	Plugins: one single (global) way to register custom query parsers There are different ways to register custom query parsers through plugins, a couple of them work per index via index settings, which is probably even too flexible. There also three different ways to add a global custom query parser through either IndicesQueriesModule or IndicesQueriesRegistry. This commit consolidates the registration of custom query parsers via IndicesQueriesModule#addQuery(Class<? extends QueryParser>). The complexity of supporting parsers per index is not needed hence it got removed. Also the other ways of registering global custom parsers are dropped in favour of the one mentioned above. Closes #11481	2015-06-08 12:19:53 +02:00
jaymode	78630e03a2	make prompt placeholders consistent with existing placeholders In #10918, we introduced the prompt placeholders. These were had a different format than our existing placeholders. This changes the prompt placeholders to follow the format of the existing placeholders. Relates to #11455	2015-06-06 10:41:07 -04:00
Clinton Gormley	ecf53b167e	Docs: Added explanation of when to use the upgrade API Closes #9779	2015-06-05 17:50:10 +02:00
gmarz	9b230db095	[DOCS] Updated memory settings for Windows	2015-06-05 08:58:55 -04:00
Adrien Grand	7c698146f5	Rest: Add all meta fields to the top level json document. Some of our meta fields (such as _id, _version, ...) are returned as top-level properties of the json document, while other properties (_timestamp, _routing, ...) are returned under `fields`. This commit makes all meta fields returned as top-level properties. So eg. `GET test/test/1?fields=_timestamp,foo` would now return ```json { "_index": "test", "_type": "test", "_id": "1", "_version": 1, "_timestamp": 10000000, "found": true, "fields": { "foo": [ "bar" ] } } ``` while it used to return ```json { "_index": "test", "_type": "test", "_id": "1", "_version": 1, "found": true, "fields": { "_timestamp": 10000000, "foo": [ "bar" ] } } ```	2015-06-04 23:42:17 +02:00
Clinton Gormley	a138f627be	Docs: removed the unused query_dsl/index.asciidoc	2015-06-04 19:31:28 +02:00
Lee Hinman	65f43970da	Default to binding to loopback address Binds to the address returned by `InetAddress.getLoopbackAddress()`. Closes #11300	2015-06-04 10:25:49 -06:00
Boaz Leskes	708320446e	Doc: Minor typo fix in query_filter_context.asciidoc	2015-06-04 15:42:55 +02:00
Clinton Gormley	f85a17ff1a	Docs: Fixed heading level for in query DSL docs	2015-06-04 13:16:32 +02:00
Clinton Gormley	171687d207	Docs: Reorganised the Query DSL docs into families and explaing query vs filter context	2015-06-04 01:59:37 +02:00
Boaz Leskes	26d71fe00e	Reduce shard inactivity timeout to 5m To better distribute the memory allocating to indexing, the IndexingMemoryController periodically checks the different shard for their last indexing activity. If no activity has happened for a while, the controller marks the shards as in active and allocated it's memory buffer budget (but a small minimal budget) to other active shards. The recently added synced flush feature (#11179, #11336) uses this inactivity trigger to attempt as a trigger to attempt adding a sync id marker (which will speed up future recoveries). We wait for 30m before declaring a shard inactive. However, these days the operation just requires a refresh and is light. We can be stricter (and 5m) increase the chance a synced flush will be triggered. Closes #11479	2015-06-04 00:23:14 +02:00
Alexander Reelsen	01e8eaf181	Date Parsing: Add parsing for epoch and epoch in milliseconds This commit changes the date handling. First and foremost Elasticsearch does not try to convert every date to a unix timestamp first and then uses the configured date. This now allows for dates like `2015121212` to be parsed correctly. Instead it is now explicit by adding a `epoch_second` and `epoch_millis` date format. This also means, that the default date format now is `epoch_millis\|\|dateOptionalTime` to remain backwards compatible. Closes #5328 Relates #10971	2015-06-03 18:07:47 +02:00
Lee Hinman	5fd96d9371	[DOCS] Document the `index.shared_filesystem.recover_on_any_node` setting Relates to #10960 Closes #11047	2015-06-03 12:35:25 +02:00
Timur	6812ed0bb6	Docs: fix typo Closes #112220	2015-06-02 19:42:45 +02:00
jaymode	f6191d05de	add ability to prompt for selected settings on startup Some settings may be considered sensitive, such as passwords, and storing them in the configuration file on disk is not good from a security perspective. This change allows settings to have a special value, `${prompt::text}` or `${prompt::secret}`, to indicate that elasticsearch should prompt the user for the actual value on startup. This only works when started in the foreground. In cases where elasticsearch is started as a service or in the background, an exception will be thrown. Closes #10838	2015-06-02 09:38:07 -04:00
Martijn van Groningen	359d9ac0d0	docs: added missing ids	2015-05-29 22:45:01 +02:00
Martijn van Groningen	1cfb6a79f1	Parent/child: refactored _parent field mapper and parent/child queries * Cut the `has_child` and `has_parent` queries over to use Lucene's query time global ordinal join. The main benefit of this change is that parent/child queries can now efficiently execute if parent/child queries are wrapped in a bigger boolean query. If the rest of the query only hit a few documents both has_child and has_parent queries don't need to evaluate all parent or child documents any more. * Cut the `_parent` field over to use doc values. This significantly reduces the on heap memory footprint of parent/child, because the parent id values are never loaded into memory. Breaking changes: * The `type` option on the `_parent` field can only point to a parent type that doesn't exist yet, so this means that an existing type/mapping can't become a parent type any longer. * The `has_child` and `has_parent` queries can no longer be use in alias filters. All these changes, improvements and breaks in compatibility only apply for indices created with ES version 2.0 or higher. For indices creates with ES <= 2.0 the older implementation is used. It is highly recommended to re-index all your indices with parent and child documents to benefit from all the improvements that come with this refactoring. The easiest way to achieve this is by using the scan and bulk apis using a simple script. Closes #6107 Closes #8134	2015-05-29 21:44:17 +02:00
Areek Zillur	fb8cd53582	This commit removes the ability to use `filter` for PhraseSuggester collate. Only `query` can be used for collation. Internally, a collate query is executed as an exists query. So specifying a filter does not have any benefits.	2015-05-29 12:26:08 -04:00
Colin Goodheart-Smithe	35a58d874e	Scripting: Unify script and template requests across codebase This change unifies the way scripts and templates are specified for all instances in the codebase. It builds on the Script class added previously and adds request building and parsing support as well as the ability to transfer script objects between nodes. It also adds a Template class which aims to provide the same functionality for template APIs Closes #11091	2015-05-29 16:52:04 +01:00
Britta Weber	a031232c48	[doc] remove reference to seal, was removed in #11336	2015-05-29 11:40:34 +02:00
Britta Weber	87a0c76e9c	Merge remote-tracking branch 'boaz/index_seal_to_flush_sync'	2015-05-29 10:31:03 +02:00
Igor Motov	55fc3a727b	Core: refactor upgrade API to use transport and write minimum compatible version that the index was upgraded to In #11072 we are adding a check that will prevent opening of old indices. However, this check doesn't take into consideration the fact that indices can be made compatible with the current version through upgrade API. In order to make compatibility check aware of the upgrade, the upgrade API should write a new setting `index.version.minimum_compatible` that will indicate the minimum compatible version of lucene this index is compatible with and `index.version.upgraded` that will indicate the version of elasticsearch that performed the upgrade. Closes #11095	2015-05-28 05:23:49 -10:00
Zachary Tong	d32a80f37b	Docs: Fix misplaced images in moving_avg docs	2015-05-27 16:13:36 -04:00
Zachary Tong	491afbe01c	Aggregations: Add Holt-Winters model to `moving_avg` pipeline aggregation Closes #11043	2015-05-27 14:45:45 -04:00
Alexander Reelsen	fc224a0de8	Cat API: Add wildcard support for header names This adds wildcard support (simple regexes) for specifying header names. Aliases are supported as well. Closes #10811	2015-05-27 16:09:31 +02:00
Boaz Leskes	37bdbe074a	doc feedback	2015-05-27 15:40:02 +03:00
Tanguy Leroux	340b7ef6ef	Add common SystemD file for RPM/DEB package	2015-05-27 11:51:58 +02:00
javanna	fc28bc73f8	[DOCS] add kopf to site plugins	2015-05-27 10:28:53 +02:00
Ryan Schneider	8ec6bf7340	[DOCS] Update get.asciidoc Updated to not mislead the reader that the data is actually gone when a document is updated. For example if you have 100GB of docs and update each one you'll only be able to access 100GB of the data, but there would theoretically be 200GB of doc data. Closes #10375	2015-05-27 10:17:10 +02:00
Boaz Leskes	6d269cbf4d	feedback	2015-05-27 10:29:37 +03:00
javanna	6c81a8daf3	Internal: count api to become a shortcut to the search api The count api used to have its own execution path, although it would do the same (up to bugs!) of the search api. This commit makes it a shortcut to the search api with size set to 0. The change is made in a backwards compatible manner, by leaving all of the java api code around too, given that you may not want to get back a whole SearchResponse when asking only for number of hits matching a query, also cause migrating from countResponse.getCount() to searchResponse.getHits().totalHits() doesn't look great from a user perspective. We can always decide to drop more code around the count api if we want to break backwards compatibility on the java api, making it a shortcut on the rest layer only. Closes #9117 Closes #11198	2015-05-26 19:12:11 +02:00
Alexander Reelsen	1fa21a76cf	Documentation: Fix elasticsearch documentation build The commit for closing #11033 was not building the asciidoc documentation.	2015-05-26 18:16:12 +02:00
Alexander Reelsen	045f01c085	Infra for deprecation logging Add support for a specific deprecation logging that can be used to turn on in order to notify users of a specific feature, flag, setting, parameter, ... being deprecated. The deprecation logger logs with a "deprecation." prefix logge (or "org.elasticsearch.deprecation." if full name is used), and outputs the logging to a dedicated deprecation log file. Deprecation logging are logged under the DEBUG category. The idea is not to enabled them by default (under WARN or ERROR) when running embedded in another application. By default they are turned off (INFO), in order to turn it on, the "deprecation" category need to be set to DEBUG. This can be set in the logging file or using the cluster update settings API, see the documentation Closes #11033	2015-05-26 17:44:52 +02:00
Tanguy Leroux	ce63590bd6	API: Add response filtering with filter_path parameter This change adds a new "filter_path" parameter that can be used to filter and reduce the responses returned by the REST API of elasticsearch. For example, returning only the shards that failed to be optimized: ``` curl -XPOST 'localhost:9200/beer/_optimize?filter_path=_shards.failed' {"_shards":{"failed":0}}% ``` It supports multiple filters (separated by a comma): ``` curl -XGET 'localhost:9200/_mapping?pretty&filter_path=.mappings..properties.name,.mappings..properties.title' ``` It also supports the YAML response format. Here it returns only the `_id` field of a newly indexed document: ``` curl -XPOST 'localhost:9200/library/book?filter_path=_id' -d '---hello:\n world: 1\n' --- _id: "AU0j64-b-stVfkvus5-A" ``` It also supports wildcards. Here it returns only the host name of every nodes in the cluster: ``` curl -XGET 'http://localhost:9200/_nodes/stats?filter_path=nodes..host' {"nodes":{"lvJHed8uQQu4brS-SXKsNA":{"host":"portable"}}} ``` And "" can be used to include sub fields without knowing the exact path. Here it returns only the Lucene version of every segment: ``` curl 'http://localhost:9200/_segments?pretty&filter_path=indices..version' { "indices" : { "beer" : { "shards" : { "0" : [ { "segments" : { "_0" : { "version" : "5.2.0" }, "_1" : { "version" : "5.2.0" } } } ] } } } } ``` Note that elasticsearch sometimes returns directly the raw value of a field, like the _source field. If you want to filter _source fields, you should consider combining the already existing _source parameter (see Get API for more details) with the filter_path parameter like this: ``` curl -XGET 'localhost:9200/_search?pretty&filter_path=hits.hits._source&_source=title' { "hits" : { "hits" : [ { "_source":{"title":"Book #2"} }, { "_source":{"title":"Book #1"} }, { "_source":{"title":"Book #3"} } ] } } ```	2015-05-26 13:51:04 +02:00
Britta Weber	eeeb29f900	spell correct and add single quotes	2015-05-26 11:41:19 +02:00
Britta Weber	37782c1745	analyzers: custom analyzers names and aliases must not start with _ closes #9596	2015-05-26 11:38:15 +02:00
Boaz Leskes	b376a3fbfb	Move index sealing terminology to synced flush #10032 introduced the notion of sealing an index by marking it with a special read only marker, allowing for a couple of optimization to happen. The most important one was to speed up recoveries of shards where we know nothing has changed since they were online by skipping the file based sync phase. During the implementation we came up with a light notion which achieves the same recovery benefits but without the read only aspects which we dubbed synced flush. The fact that it was light weight and didn't put the index in read only mode, allowed us to do it automatically in the background which has great advantage. However we also felt the need to allow users to manually trigger this operation. The implementation at #11179 added the sync flush internal logic and the manual (rest) rest API. The name of the API was modeled after the sealing terminology which may end up being confusing. This commit changes the API name to match the internal synced flush naming, namely `{index}/_flush/synced'. On top of that it contains a couple other changes: - Remove all java client API. This feature is not supposed to be called programtically by applications but rather by admins. - Improve rest responses making structure similar to other (flush) API - Change IndexShard#getOperationsCount to exclude the internal +1 on open shard . it's confusing to get 1 while there are actually no ongoing operations - Some minor other clean ups	2015-05-25 22:32:32 +03:00
Alex Chan	e31049988b	[Docs] Fix minor spelling errors Closes #11320	2015-05-25 19:56:43 +02:00
Eduardo Gurgel	0f3b3c0787	Docs: Fix typo on percolate_format description Closes #11215	2015-05-25 13:17:59 +02:00
Clinton Gormley	4d27d751fb	Docs: Move the page on facets into redirects.asciidoc	2015-05-24 23:34:23 +02:00
Clinton Gormley	6171ae6cc4	Docs: Added stub entries for pages deleted from 1.x	2015-05-24 17:57:34 +02:00
Clinton Gormley	4b854d10bd	Docs: Tidied up the field statistics docs	2015-05-24 15:12:44 +02:00
Britta Weber	4d0b40ca52	Merge pull request #11235 from nik9000/seal_docs Rewrote some _seal documentation	2015-05-22 18:24:23 +02:00
Clinton Gormley	cde2c91b5a	Docs: Example blocks can't contain warnings	2015-05-22 17:37:58 +02:00
Clinton Gormley	631e03c872	Docs: Tidied up term vectors docs Moved annotations out of titles Made the example titles into example blocks	2015-05-22 17:19:12 +02:00
Nik Everett	6da1e858dc	Rewrote some _seal documentation The first two paragraphs were confusing to me so I tried to rewrite them. I removed some passive voice because it irks me.	2015-05-22 10:51:21 -04:00
Clinton Gormley	20279a2556	Docs: Rename reference docs to Elasticsearch Reference	2015-05-22 14:49:11 +02:00
Adrien Grand	42f9053817	Merge pull request #11280 from jpountz/fix/remove_binary_compress Mappings: Remove the `compress`/`compress_threshold` options of the BinaryFieldMapper.	2015-05-22 14:21:13 +02:00
Adrien Grand	461683ac58	Mappings: Remove the `compress`/`compress_threshold` options of the BinaryFieldMapper. This option is broken currently since it potentially interprets an incoming binary value as compressed while it just happens that the first bytes are the same as the LZF header.	2015-05-22 14:20:42 +02:00
Colin Goodheart-Smithe	35deb7efea	Aggregations: Renaming reducers to Pipeline Aggregators	2015-05-21 14:57:23 +01:00
Igor Motov	dd41c68741	Snapshot/Restore: fix FSRepository location configuration Closes #11068	2015-05-20 22:14:31 -04:00
Lee Hinman	0a6f7ef379	[DOCS] Mention Integer.MAX_VALUE limit for http.max_content_length Fixes #11244	2015-05-20 13:08:59 -06:00
Clinton Gormley	5e4d5e1c64	Docs: Included the index-seal docs in the indices section	2015-05-20 11:20:12 +02:00
Simon Willnauer	488be75d19	Add some words about the purpose of a seal etc.	2015-05-19 12:26:08 +02:00
Simon Willnauer	9d2852f0ab	Merge branch 'master' into feature/synced_flush Conflicts: src/main/java/org/elasticsearch/index/engine/InternalEngine.java src/main/java/org/elasticsearch/index/shard/IndexShard.java src/main/java/org/elasticsearch/indices/recovery/RecoverySourceHandler.java src/test/java/org/elasticsearch/index/engine/InternalEngineTests.java	2015-05-19 12:16:22 +02:00
Adrien Grand	2c241e8a36	Mappings: Remove the `ignore_conflicts` option. Mappings conflicts should not be ignored. If I read the history correctly, this option was added when a mapping update to an existing field was considered a conflict, even if the new mapping was exactly the same. Now that mapping updates are smart enough to detect conflicting options, we don't need an option to ignore conflicts.	2015-05-18 15:28:23 +02:00
javanna	a843008b17	Highlighting: require_field_match set to true by default The default `false` for `require_field_match` is a bit odd and confusing for users, given that field names get ignored by default and every field gets highlighted if it contains terms extracted out of the query, regardless of which fields were queries. Changed the default to `true`, it can always be changed per request. Closes #10627 Closes #11067	2015-05-15 21:38:45 +02:00
Clinton Gormley	9d71816cd2	Docs: Fixed explanation of AUTO fuzziness Closes #11186	2015-05-15 21:25:11 +02:00
javanna	46c521f7ec	Highlighting: nuke XPostingsHighlighter Our own fork of the lucene PostingsHighlighter is not easy to maintain and doesn't give us any added value at this point. In particular, it was introduced to support the require_field_match option and discrete per value highlighting, used in case one wants to highlight the whole content of a field, but get back one snippet per value. These two features won't make it into lucene as they slow things down and shouldn't have been supported from day one on our end probably. One other customization we had was support for a wider range of queries via custom rewrite etc. (yet another way to slow things down), which got added to lucene and works much much better than what we used to do (instead of or rewrite, term s are pulled out of the automata for multi term queries). Removing our fork means the following in terms of features: - dropped support for require_field_match: the postings highlighter will only highlight fields that were queried - some custom es queries won't be supported anymore, meaning they won't be highlighted. The only one I found up until now is the phrase_prefix. Postings highlighter rewrites against an empty reader to avoid slow operations (like the ones that we were performing with the fork that we are removing here), thus the prefix will not be expanded to any term. What the postings highlighter does instead is pulling the automata out of multi term queries, but this is not supported at the moment with our MultiPhrasePrefixQuery. Closes #10625 Closes #11077	2015-05-15 20:41:33 +02:00
Clinton Gormley	3a69b65e88	Docs: Fixed the backslash escaping on the pattern analyzer docs Closes #11099	2015-05-15 18:40:16 +02:00
Jun Ohtani	597c53a0bb	Add migrationi note for AnalyzeRequest	2015-05-16 00:25:53 +09:00
Adrien Grand	bf599d68dd	Merge pull request #11042 from jpountz/feature/aggs_missing Aggs: Make it possible to configure missing values.	2015-05-15 16:33:29 +02:00
Adrien Grand	32e23b9100	Aggs: Make it possible to configure missing values. Most aggregations (terms, histogram, stats, percentiles, geohash-grid) now support a new `missing` option which defines the value to consider when a field does not have a value. This can be handy if you eg. want a terms aggregation to handle the same way documents that have "N/A" or no value for a `tag` field. This works in a very similar way to the `missing` option on the `sort` element. One known issue is that this option sometimes cannot make the right decision in the unmapped case: it needs to replace all values with the `missing` value but might not know what kind of values source should be produced (numerics, strings, geo points?). For this reason, we might want to add an `unmapped_type` option in the future like we did for sorting. Related to #5324	2015-05-15 16:26:58 +02:00
Martijn van Groningen	719252a138	Merge pull request #11183 from martijnvg/parent-child/remove_id_cache_from_stats_and_clear_cache_apis Removed `id_cache` from stats and cat apis.	2015-05-15 14:39:35 +02:00
Martijn van Groningen	ece18f162e	Removed `id_cache` from stats and cat apis. Also removed the `id_cache` option from the clear cache api. Closes #5269	2015-05-15 14:06:18 +02:00
Jun Ohtani	3a1a4d3e89	Analysis: Add multi-valued text support Add support array text as a multi-valued for AnalyzeRequestBuilder Add support array text as a multi-valued for Analyze REST API Add docs Closes #3023	2015-05-15 20:01:10 +09:00
Britta Weber	7a8d08a4a3	Merge remote-tracking branch 'origin/master' into feature/synced_flush	2015-05-15 10:35:36 +02:00
Lee Hinman	179dad69b6	[DOCS] Add DNS SRV discovery plugin	2015-05-14 16:02:59 -06:00
Areek Zillur	7efc43db25	Re-structure collate option in PhraseSuggester to only collate on local shard. Previously, collate feature would be executed on all shards of an index using the client, this leads to a deadlock when concurrent collate requests are run from the _search API, due to the fact that both the external request and internal collate requests use the same search threadpool. As phrase suggestions are generated from the terms of the local shard, in most cases the generated suggestion, which does not yield a hit for the collate query on the local shard would not yield a hit for collate query on non-local shards. Instead of using the client for collating suggestions, collate query is executed against the ContextIndexSearcher. This PR removes the ability to specify a preference for a collate query, as the collate query is only run on the local shard. closes #9377	2015-05-14 17:21:53 -04:00
Jack Conradson	a5c0ac0d67	Scripting: Add Multi-Valued Field Methods to Expressions Add methods to operate on multi-valued fields in the expressions language. Note that users will still not be able to access individual values within a multi-valued field. The following methods will be included: * min * max * avg * median * count * sum Additionally, changes have been made to MultiValueMode to support the new median method. closes #11105	2015-05-14 08:27:24 -07:00
Britta Weber	2b03a03c0c	Merge remote-tracking branch 'origin/master' into feature/synced_flush	2015-05-13 18:00:18 +02:00
Britta Weber	f1948cf95c	doc for seal api and doc for syned flush in general	2015-05-13 15:43:05 +02:00
Adrien Grand	630757906a	Query DSL: Add `filter` clauses to `bool` queries. These clauses filter the document space without affecting scoring and map to Lucene's BooleanClause.Occur.FILTER. The `filtered` query is now deprecated and ```json { "filtered": { "query": { //query }, "filter": { //filter } } } ``` should be replaced with ```json { "bool": { "must": { //query }, "filter": { //filter } } } ```	2015-05-13 12:04:56 +02:00
Ryan Ernst	f766b260ba	Add tests for includeInObject backcompat	2015-05-12 23:11:15 -07:00
Ryan Ernst	565ffb16f1	Mappings: Remove ability to set meta fields inside documents A few meta fields can currently be set within a document's source. However, the recommended way to set meta fields like this is through the api, and setting within the document can be a performance trap (e.g. needing to find _id in order to route the document). This change removes the ability to set meta fields within a document source for 2.0+ indexes. closes #11051 closes #11074	2015-05-12 23:09:03 -07:00
Igor Motov	d6efe1e508	Docs: Add information about restoring to a different cluster	2015-05-12 20:59:24 -04:00
Ryan Ernst	e7618b8528	Settings: Remove file based index templates As a follow up to #10870, this removes support for index templates on disk. It also removes a missed place still allowing disk based mappings. closes #11052	2015-05-11 12:51:22 -07:00
javanna	36c373e615	[DOCS] documented missing query_string parameters for count, exists, search & validate_query relates to #11057	2015-05-11 12:58:30 +02:00
Martijn van Groningen	acdd9a5dd9	parent/child: Removed the `top_children` query.	2015-05-10 16:30:19 +02:00
Lee Hinman	459a05168c	Merge remote-tracking branch 'refs/remotes/dakrone/truncate-loglines'	2015-05-08 10:11:26 -06:00
Lee Hinman	c6747ded16	Truncate log messages at 10,000 characters	2015-05-08 10:10:44 -06:00
Clinton Gormley	a536bd5f81	Docs: Rewrote the term query docs to explain analyzed vs not_analyzed	2015-05-08 08:32:13 +02:00
Andrew Selden	c953e99324	Merge pull request #10864 from aleph-zero/issues/9606 Remove (dfs_)query_and_fetch from the REST API	2015-05-07 12:51:28 -07:00
josephwolnskipn	7f064c592f	Docs: Fix grammar and typos in percolate Added commas, capitalized "JSON" and "API", capitalized titles, etc. Closes #11023	2015-05-07 21:50:48 +02:00
Ryan Ernst	e29492ce94	Docs: Cleanup meta field docs Meta fields were locked down to not allow exotic options to the underlying field types in #8143. This change fixes the docs to no longer refer to the old settings. closes #10879	2015-05-07 11:26:49 -07:00
Adrien Grand	a0af88e996	Query DSL: Remove filter parsers. This commit makes queries and filters parsed the same way using the QueryParser abstraction. This allowed to remove duplicate code that we had for similar queries/filters such as `range`, `prefix` or `term`.	2015-05-07 20:14:34 +02:00
Alex Ksikes	4787cf701f	More Like This: remove percent_terms_to_match Users should use minimum_should_match instead. Closes #11030	2015-05-07 14:21:29 +02:00
Martijn van Groningen	f7c29457d0	parent/child: Deprecated the `top_children` in favour of the `has_child` query.	2015-05-07 09:27:54 +02:00
Alexander Reelsen	82c21ff5b3	Documentation: Mention RPM repo does not work with older distributions Getting this to work would be a lot of work (creating two different repositories, having another GPG key, integrating this into our build). Closes #6498	2015-05-07 08:20:06 +02:00
Alex Ksikes	ec4f12f9ef	More Like This: removal of the MLT API Removes the More Like This API, users should now use the More Like This query. The MLT API tests were converted to their query equivalent. Also some clean ups in MLT tests. Closes #10736 Closes #11003	2015-05-06 18:11:11 +02:00
Colin Goodheart-Smithe	cf1251796f	Aggregations: Adding Sum Bucket Aggregation Closes #11007	2015-05-06 14:44:56 +01:00
Zachary Tong	e70a8d4ee9	Merge pull request #10964 from polyfractal/feature/aggs_movavg_rename Rename Moving Average models to their "common" names	2015-05-06 09:07:23 -04:00
Zachary Tong	3eb9cb913d	Rename Moving Average models to their "common" names Previously, we were using the "statistical", technically accurate name. Instead, we should probably use the name that people are familiar with, e.g. "Holt Winters" instead of "triple exponential". To that end: - `single_exp` becomes `ewma` (exponentially weighted moving average) - `double_exp` becomes `holt` When the `triple_exp` is added, it will be called `holt_winters`.	2015-05-06 09:04:44 -04:00
Colin Goodheart-Smithe	72d99773dc	Aggregations: Adding Average Bucket Aggregation Also includes changes to the other bucket metric aggregations to share code Closes #11006	2015-05-06 13:53:57 +01:00
Colin Goodheart-Smithe	644fd00714	Aggregations: x-axis units normalisation for derivative aggregation	2015-05-06 10:31:16 +01:00
Ryan Ernst	7a7bd6086a	Mappings: Remove ability to disable _source field Current features (eg. update API) and future features (eg. reindex API) depend on _source. This change locks down the field so that it can no longer be disabled. It also removes legacy settings compress/compress_threshold. closes #8142 closes #10915	2015-05-05 22:04:18 -07:00
Clinton Gormley	603a0c193b	Docs: More translog doc improvements	2015-05-05 22:01:58 +02:00
Clinton Gormley	a60251068c	Docs: Improved the translog docs	2015-05-05 21:32:52 +02:00
Simon Willnauer	fe5a35b68e	Merge branch 'master' into pr-10624 Conflicts: src/main/java/org/elasticsearch/index/shard/IndexShard.java	2015-05-05 11:46:02 +02:00
Clinton Gormley	e28ad853c7	Docs: Fixed bad asciidoc in migrate_2_0	2015-05-05 11:17:21 +02:00
Pascal Borreli	af6d890ad5	Docs: Fixed typos Closes #10973	2015-05-05 10:38:05 +02:00
aleph-zero	2b483cc806	Removed reference to search type 'count' Removed reference to search type 'count' as this is now a deprecated search type.	2015-05-04 14:48:40 -07:00
Shay Banon	187d79b6df	Centralize admin implementations and action execution This change removes the multiple implementations of different admin interfaces and centralizes it with AbstractClient. It also makes sure all executions of actions now go through a single AbstractClient#execute method, taking care of copying headers and wrapping listener. This also has the side benefit of removing all the code around differnet possible clients, and removes quite a bit of code (most of the + code is actually removal of generics and such). This change also changes how TransportClient is constructed, requiring a Builder to create it, its a breaking change and its noted in the migration guide. Yea another step towards simplifying the action infra and making it simpler...	2015-05-04 23:40:17 +02:00
Zachary Tong	f6d5167d41	Merge pull request #10929 from polyfractal/docs/aggs Restructure Aggregation documentation	2015-05-04 13:28:47 -04:00
Ryan Ernst	ba68d354c4	Merge pull request #10934 from mattweber/custom_analyzer_pos_offset_gap document and test custom analyzer position offset gap	2015-05-04 08:56:50 -07:00
Matt Weber	63c4a214db	document and test custom analyzer position offset gap	2015-05-04 08:53:45 -07:00
Clément Salaün	c0659ce4d4	Docs: Update geo-distance-range-filter.asciidoc missing comma Closes #10957	2015-05-04 17:17:48 +02:00
Simon Willnauer	930eacd457	Merge branch 'master' into pr-10624	2015-05-04 17:06:05 +02:00
Clinton Gormley	bffcf5af58	Docs: Update rolling upgrade Added note about why replica shards may remain unassigned while there is only one node of the higher version in the cluster. Closes #10951	2015-05-04 16:52:35 +02:00
Robert Muir	4b3672b7df	Add migration note for hunspell dictionaries	2015-05-04 10:00:05 -04:00
Zachary Tong	967e05ea76	[DOCS] Fix section levels for Sampler agg	2015-05-04 09:18:24 -04:00
Simon Willnauer	7e5f9d5628	Merge branch 'master' into pr-10624 Conflicts: src/main/java/org/elasticsearch/index/engine/EngineConfig.java src/main/java/org/elasticsearch/index/shard/IndexShard.java src/test/java/org/elasticsearch/index/engine/InternalEngineTests.java src/test/java/org/elasticsearch/index/engine/ShadowEngineTests.java	2015-05-04 11:37:54 +02:00
Adrien Grand	b72f27a410	Core: Cut over to the Lucene filter cache. This removes Elasticsearch's filter cache and uses Lucene's instead. It has some implications: - custom cache keys (`_cache_key`) are unsupported - decisions are made internally and can't be overridden by users ('_cache`) - not only filters can be cached but also all queries that do not need scores - parent/child queries can now be cached, however cached entries are only valid for the current top-level reader so in practice it will likely only be used on read-only indices - the cache deduplicates filters, which plays nicer with large keys (eg. `terms`) - better stats: we already had ram usage and evictions, but now also hit count, miss count, lookup count, number of cached doc id sets and current number of doc id sets in the cache - dynamically changing the filter cache size is not supported anymore Internally, an important change is that it removes the NoCacheFilter infrastructure in favour of making Query.rewrite specializing the query for the current reader so that it will only be cached on this reader (look for IndexCacheableQuery). Note that consuming filters with the query API (createWeight/scorer) instead of the filter API (getDocIdSet) is important for parent/child queries because otherwise a QueryWrapperFilter(ParentQuery) would run the wrapped query per segment while relations might be cross segments.	2015-05-04 09:02:15 +02:00
Zachary Tong	e3ae1df6f0	[DOCS] Restructure Aggs documentation	2015-05-01 16:04:55 -04:00
Clinton Gormley	c28bf3bb3f	Docs: Updated elasticsearch.org links to elastic.co	2015-05-01 20:46:12 +02:00
Robert Muir	dfe1d1463c	fix doc typo	2015-04-30 23:46:37 -04:00
Robert Muir	aade6194b7	Add span within/containing queries. Expose new span queries from https://issues.apache.org/jira/browse/LUCENE-6083 Within returns matches from 'little' that are enclosed inside of a match from 'big'. Containing returns matches from 'big' that enclose matches from 'little'.	2015-04-30 23:31:31 -04:00
Jack Conradson	aa968f6b65	Scripting: Add Field Methods Added infrastructure to allow basic member methods in the expressions language to be called. The methods must have a signature with no arguments. Also added the following member methods for date fields (and it should be easy to add more) * getYear * getMonth * getDayOfMonth * getHourOfDay * getMinutes * getSeconds Allow fields to be accessed without using the member variable [value]. (Note that both ways can be used to access fields for back-compat.) closes #10890	2015-04-30 15:36:46 -07:00
Ryan Ernst	d2b12e4fc2	Mappings: Remove docs for type level analyzer defaults These settings were removed in #9430.	2015-04-30 13:57:55 -07:00
Ryan Ernst	4ef9f3ca63	Mappings: Remove file based default mappings Using files that must be specified on each node is an anti-pattern from the API based goal of ES. This change removes the ability to specify the default mapping with a file on each node. closes #10620	2015-04-30 13:50:35 -07:00
Boaz Leskes	d596f5cc45	Decouple recoveries from engine flush In order to safely complete recoveries / relocations we have to keep all operation done since the recovery start at available for replay. At the moment we do so by preventing the engine from flushing and thus making sure that the operations are kept in the translog. A side effect of this is that the translog keeps on growing until the recovery is done. This is not a problem as we do need these operations but if the another recovery starts concurrently it may have an unneededly long translog to replay. Also, if we shutdown the engine for some reason at this point (like when a node is restarted) we have to recover a long translog when we come back. To void this, the translog is changed to be based on multiple files instead of a single one. This allows recoveries to keep hold to the files they need while allowing the engine to flush and do a lucene commit (which will create a new translog files bellow the hood). Change highlights: - Refactor Translog file management to allow for multiple files. - Translog maintains a list of referenced files, both by outstanding recoveries and files containing operations not yet committed to Lucene. - A new Translog.View concept is introduced, allowing recoveries to get a reference to all currently uncommitted translog files plus all future translog files created until the view is closed. They can use this view to iterate over operations. - Recovery phase3 is removed. That phase was replaying operations while preventing new writes to the engine. This is unneeded as standard indexing also send all operations from the start of the recovery to the recovering shard. Replay all ops in the view acquired in recovery start is enough to guarantee no operation is lost. - IndexShard now creates the translog together with the engine. The translog is closed by the engine on close. ShadowIndexShards do not open the translog. - Moved the ownership of translog fsyncing to the translog it self, changing the responsible setting to `index.translog.sync_interval` (was `index.gateway.local.sync`) Closes #10624	2015-04-30 23:42:50 +03:00
Adrien Grand	e5be85d586	Aggs: Change the default `min_doc_count` to 0 on histograms. The assumption is that gaps in histogram are generally undesirable, for instance if you want to build a visualization from it. Additionally, we are building new aggregations that require that there are no gaps to work correctly (eg. derivatives).	2015-04-30 15:48:23 +02:00
Colin Goodheart-Smithe	969f53e399	fix typo in Min bucket aggregation docs	2015-04-30 14:41:01 +01:00
Colin Goodheart-Smithe	d16bf992a9	Aggregations: min_bucket aggregation An aggregation to calculate the minimum value in a set of buckets. Closes #9999	2015-04-30 13:34:21 +01:00
Zachary Tong	351a4d3315	[DOCS] Fix movavg images and naming	2015-04-29 13:33:54 -04:00
Colin Goodheart-Smithe	57a8885964	Merge branch 'master' into feature/aggs_2_0 # Conflicts: # src/main/java/org/elasticsearch/index/query/CommonTermsQueryBuilder.java # src/main/java/org/elasticsearch/search/aggregations/AggregationModule.java # src/main/java/org/elasticsearch/search/aggregations/AggregatorFactories.java # src/main/java/org/elasticsearch/search/aggregations/AggregatorParsers.java # src/main/java/org/elasticsearch/search/aggregations/InternalMultiBucketAggregation.java # src/main/java/org/elasticsearch/search/aggregations/bucket/nested/NestedAggregator.java # src/main/java/org/elasticsearch/search/aggregations/metrics/InternalNumericMetricsAggregation.java # src/test/java/org/elasticsearch/search/aggregations/bucket/nested/NestedAggregatorTest.java	2015-04-29 15:49:41 +01:00
Adrien Grand	6e076efdb9	Docs: Add documentation for the `doc_values` setting on the `boolean` field type. Close #10431	2015-04-29 15:59:24 +02:00
Clinton Gormley	7aa4c7e256	Docs: Removed a reference to index_name from the array mapping page	2015-04-29 15:12:31 +02:00
Antonio Bonuccelli	ab83eb036b	Docs: adding missing single quote on PUT index request Closes #10876	2015-04-29 14:45:25 +02:00
Simon Willnauer	94d8b20611	Add multi data.path to migration guide this commit removes the obsolete settings for distributors and updates the documentation on multiple data.path. It also adds an explain to the migration guide. Relates to #9498 Closes #10770	2015-04-29 11:51:37 +02:00
aleph-zero	1d60f34944	Remove all doc references to (dfs_)query_and_fetch Removes references to (dfs_)query_and_fetch as possible ‘search_type’ parameters for the REST API.	2015-04-28 15:57:46 -07:00
aleph-zero	89542facb3	Remove (dfs_)query_and_fetch from the REST API Remove the ability to specify search type ‘query_and_fetch’ and ‘df_query_and_fetch’ from the REST API. - Adds REST tests - Updates REST API spec to remove ‘query_and_fetch’ and ‘df_query_and_fetch’ as options - Removes documentation for these options Closes #9606	2015-04-28 15:27:59 -07:00
Ryan Ernst	bf09e58cb3	Mappings: Remove includes and excludes from _source Regardless of the outcome of #8142, we should at least enforce that when _source is enabled, it is sufficient to reindex. This change removes the excludes and includes settings, since these modify the source, causing us to lose the ability to reindex some fields. closes #10814	2015-04-28 15:03:51 -07:00
Lee Hinman	04f6067c66	Merge branch 'pr/10845'	2015-04-28 09:13:26 -06:00
Nik Everett	cb89a14010	Add default to field_value_factor field_value_factor now takes a default that is used if the document doesn't have a value for that field. It looks like: "field_value_factor": { "field": "popularity", "missing": 1 } Closes #10841	2015-04-28 11:06:24 -04:00
minde-eagleeye	a1289b4ad5	Docs: Update cluster.asciidoc added a missing comma in one of examples Closes #10834	2015-04-28 11:48:08 +02:00
javanna	c914134355	Scripting: remove groovy sandbox Groovy sandboxing was disabled by default from 1.4.3 on though since we found out that it could be worked around, so it makes little sense to keep it and maintain it. Closes #10156 Closes #10480	2015-04-28 11:27:50 +02:00
Jun Ohtani	933edf7bcc	Analysis: Fix wrong position number by analyze API Add breaking chages comment to migrate docs Fix the stopword included text using stopword filter	2015-04-28 17:44:41 +09:00
Zachary Tong	bf9739d0f0	[DOCS] review comment fixes	2015-04-27 14:40:04 -04:00
Simon Willnauer	d164526d27	Remove `_shutdown` API Thsi commit removes the `_shutdown` API entirely without any replacement. Nodes should be managed from the operating system not via REST APIs	2015-04-27 17:19:36 +02:00
Clinton Gormley	089914dede	Docs: Document `http.max_header_size` Closes #10752	2015-04-27 15:59:27 +02:00
Clinton Gormley	ba4ec6bca5	Docs: Updated current version	2015-04-27 13:45:35 +02:00
markharwood	1b8b993912	Query enhancement: Enable Lucene ranking behaviour for queries on numeric fields. This changes the default ranking behaviour of single-term queries on numeric fields to use the usual Lucene TermQuery scoring logic rather than a constant-scoring wrapper. Closes #10628	2015-04-27 09:42:55 +01:00
navins	84636557e1	Docs: correct three mis-match of brackets Closes #10806	2015-04-26 19:43:14 +02:00
Christine	9e81e4c09b	Docs: Update bool-filter.asciidoc from, to deprecated in favour of gt, lt Closes #10682	2015-04-26 19:23:11 +02:00
Clinton Gormley	37ed61807f	Docs: Updated the experimental annotations in the docs as follows: * Removed the docs for `index.compound_format` and `index.compound_on_flush` - these are expert settings which should probably be removed (see https://github.com/elastic/elasticsearch/issues/10778) * Removed the docs for `index.index_concurrency` - another expert setting * Labelled the segments verbose output as experimental * Marked the `compression`, `precision_threshold` and `rehash` options as experimental in the cardinality and percentile aggs * Improved the experimental text on `significant_terms`, `execution_hint` in the terms agg, and `terminate_after` param on count and search * Removed the experimental flag on the `geobounds` agg * Marked the settings in the `merge` and `store` modules as experimental, rather than the modules themselves Closes #10782	2015-04-26 18:49:15 +02:00
Clinton Gormley	f1a0e2216a	Docs: Mentioned script_id and script_file parameters across all aggs Closes #10760	2015-04-26 17:30:38 +02:00
Mark Mulder	690c16e81a	Docs: Fix minor spelling mistakes in Match Query doc Closes #10751	2015-04-26 16:29:41 +02:00
Clinton Gormley	7de8b7008e	Docs: Tidied docs for field-stats	2015-04-26 15:52:02 +02:00
Mehdi Mollaverdi	dce920b75f	Docs: The name of scroll ID attribute in the response is "_scroll_id" rather than "scroll_id" Closes #10691	2015-04-25 19:32:32 +02:00
Clinton Gormley	cf177c32d4	Docs: Fixed pattern-capture token filter example Closes #10690	2015-04-25 19:27:55 +02:00
Clinton Gormley	2579cc31b1	Docs: Note that include_in_parent/root does not apply to geo-shape fields Closes #10653	2015-04-25 16:49:49 +02:00
Tanguy Leroux	f7d4baacfb	Remove working directory This commit removes the working directory and its associated environment variable "WORK_DIR"	2015-04-25 13:08:36 +02:00
Oliver Eilhard	95e9b86505	Mustache tags syntax Hi there. I've been experimenting with the search templates recently and I'm a bit confused. Shouldn't the Mustache tags be written like `{{tagname}}` instead of `{tagname}`? Your using `{{...}}` [here](http://www.elastic.co/guide/en/elasticsearch/reference/current/search-template.html) BTW. Using the first example in that page seems to indicate that something's wrong, or am I missing something? ``` $ curl 'localhost:9200/test/_search' -d '{"query":{"template":{"query":{"match":{"text":"{keywords}"}},"params":{"keywords":"value1_foo"}}}}' {"took":1,"timed_out":false,"_shards":{"total":1,"successful":1,"failed":0},"hits":{"total":0,"max_score":null,"hits":[]}} $ curl 'localhost:9200/test/_search' -d '{"query":{"template":{"query":{"match":{"text":"{{keywords}}"}},"params":{"keywords":"value1_foo"}}}}' {"took":1,"timed_out":false,"_shards":{"total":1,"successful":1,"failed":0},"hits":{"total":1,"max_score":1.0,"hits":[{"_index":"test","_type":"testtype","_id":"1","_score":1.0,"_source":{"text":"value1_foo"}}]}} ```	2015-04-24 21:23:58 +02:00
Ryan Ernst	1f5bdca8cc	Mappings: Restrict murmur3 field type to sane options Disabling doc values or trying to index hash values are not correct uses of this the murmur3 field type, and just cause problems. This disallows changing doc values or index options for 2.0+. closes #10465	2015-04-23 21:48:42 -07:00
Benoit Delbosc	4a94e1f14b	Docs: Warning about the conflict with the Standard Tokenizer The examples given requires a specific Tokenizer to work. Closes: 10645	2015-04-23 21:16:30 +02:00
Igor Motov	60721b2a17	Snapshot/Restore: remove obsolete expand_wildcards_open and expand_wildcards_close options In #6097 we made snapshot/restore index option consistent with other API. Now we can remove old style options from master. Closes #10743	2015-04-23 13:29:24 -04:00
Mal Curtis	9eabcd7c0f	Docs: Fix missing comma in context suggester docs Closes #10623	2015-04-23 14:04:46 +02:00
Alexander	dbbfe39415	[Docs] fix typo in scripting module Closes #10622	2015-04-23 14:00:44 +02:00
Martijn van Groningen	dbeb4aaacf	docs: make sure that the options are rendered correctly	2015-04-23 10:50:01 +02:00
Martijn van Groningen	6a2f9c2682	docs: fixed title out of sequence	2015-04-23 09:57:31 +02:00
Martijn van Groningen	5705537ecf	Added field stats api The field stats api returns field level statistics such as lowest, highest values and number of documents that have at least one value for a field. An api like this can be useful to explore a data set you don't know much about. For example you can figure at with the lowest and highest response times are, so that you can create a histogram or range aggregation with sane settings. This api doesn't run a search to figure this statistics out, but rather use the Lucene index look these statics up (using Terms class in Lucene). So finding out these stats for fields is cheap and quick. The min/max values are based on the type of the field. So for a numeric field min/max are numbers and date field the min/max date and other fields the min/max are term based. Closes #10523	2015-04-23 08:52:34 +02:00
Zachary Tong	e08e45cee8	[DOCS] Add link to movavg page	2015-04-22 18:59:39 -04:00
Zachary Tong	a03cefcece	[DOCS] Add documentation for moving average	2015-04-22 18:59:39 -04:00
Lee Hinman	a4f98e7400	[DOCS] Add example of setting disk threshold decider settings Fixes #10686	2015-04-22 11:53:19 -06:00
Clinton Gormley	a60571c597	Docs: Removed some unused callout from the scroll docs	2015-04-22 12:49:06 +02:00
Jun Ohtani	0955c127c0	Rest: Add json in request body to scroll, clear scroll, and analyze API Change analyze.asciidoc and scroll.asciidoc Add json support to Analyze and Scroll, and clear scrollAPI Add rest-api-spec/test Closes #5866	2015-04-22 17:53:20 +09:00
Nicholas Knize	453217fd7a	[GEO] Prioritize tree_level and precision parameters over default distance_error_pct If a user explicitly defined the tree_level or precision parameter in a geo_shape mapping their specification was always overridden by the default_error_pct parameter (even though our docs say this parameter is a 'hint'). This lead to unexpected accuracy problems in the results of a geo_shape filter. (example provided in issue #9691) This simple patch fixes the unexpected behavior by setting the default distance_error_pct parameter to zero when the tree_level or precision parameters are provided by the user. Under the covers the quadtree will now use the tree level defined by the user. The docs will be updated to alert the user to exercise caution with these parameters. Specifying a precision of "1m" for an index using large complex shapes can quickly lead to OOM issues. closes #9691	2015-04-21 14:42:10 -05:00
Colin Goodheart-Smithe	bd28c9c44e	Documentation for the max_bucket reducer	2015-04-21 15:06:20 +01:00
Colin Goodheart-Smithe	be647a89d3	Documentation for the derivative reducer	2015-04-21 15:06:20 +01:00
Colin Goodheart-Smithe	0f4b7f3b5c	Added section for reducer aggregations in the main aggregation docs page	2015-04-21 15:06:19 +01:00
Adrien Grand	d7abb12100	Replace deprecated filters with equivalent queries. In Lucene 5.1 lots of filters got deprecated in favour of equivalent queries. Additionally, random-access to filters is now replaced with approximations on scorers. This commit - replaces the deprecated NumericRangeFilter, PrefixFilter, TermFilter and TermsFilter with NumericRangeQuery, PrefixQuery, TermQuery and TermsQuery, wrapped in a QueryWrapperFilter - replaces XBooleanFilter, AndFilter and OrFilter with a BooleanQuery in a QueryWrapperFilter - removes DocIdSets.isBroken: the new two-phase iteration API will now help execute slow filters efficiently - replaces FilterCachingPolicy with QueryCachingPolicy Close #8960	2015-04-21 15:32:43 +02:00
markharwood	63db34f649	New feature - Sampler aggregation used to limit any nested aggregations' processing to a sample of the top-scoring documents. Optionally, a “diversify” setting can limit the number of collected matches that share a common value such as an "author". Closes #8108	2015-04-21 10:22:05 +01:00
Adrien Grand	f4d5914511	Docs: Warn about the fact that min_doc_count=0 might return terms that only belong to different types.	2015-04-21 00:57:57 +02:00
Honza Král	e929c1560d	[DOCS] Be explicit about scan doing no scoring	2015-04-20 18:05:45 +02:00
Tanguy Leroux	b3d91b1cbb	Doc: Change the wording a bit for the HOSTNAME environment variable I should have done this while merging #9474.	2015-04-17 10:24:50 +02:00
Tanguy Leroux	a806314e2c	Merge pull request #9474 from AndreKR/export-hostname-for-config Export the hostname as environment variable	2015-04-17 10:17:55 +02:00
André Hänsel	c107f0bcb9	Export the hostname as environment variable and mention it in the docs	2015-04-17 09:17:02 +02:00
Michael McCandless	399f0ccce9	Core: add only_ancient_segments to upgrade API, so only segments with an old Lucene version are upgraded This option defaults to false, because it is also important to upgrade the "merely old" segments since many Lucene improvements happen within minor releases. But you can pass true to do the minimal work necessary to upgrade to the next major Elasticsearch release. The HTTP GET upgrade request now also breaks out how many bytes of ancient segments need upgrading. Closes #10213 Closes #10540 Conflicts: dev-tools/create_bwc_index.py rest-api-spec/api/indices.upgrade.json src/main/java/org/elasticsearch/action/admin/indices/optimize/OptimizeRequest.java src/main/java/org/elasticsearch/action/admin/indices/optimize/ShardOptimizeRequest.java src/main/java/org/elasticsearch/action/admin/indices/optimize/TransportOptimizeAction.java src/main/java/org/elasticsearch/index/engine/InternalEngine.java src/test/java/org/elasticsearch/bwcompat/StaticIndexBackwardCompatibilityTest.java src/test/java/org/elasticsearch/index/engine/InternalEngineTests.java src/test/java/org/elasticsearch/rest/action/admin/indices/upgrade/UpgradeReallyOldIndexTest.java	2015-04-16 05:24:33 -04:00
Alex Ksikes	d339ee4005	Term Vectors: terms filtering This adds a new feature to the Term Vectors API which allows for filtering of terms based on their tf-idf scores. With `dfs` option on, this could be useful for finding out a good characteric vector of a document or a set of documents. The parameters are similar to the ones used in the MLT Query. Closes #9561	2015-04-14 19:11:09 +02:00
Alex Ksikes	c347dfe91c	Validate API: support for verbose explanation of succesfully validated queries This commit adds a `rewrite` parameter to the validate API in order to shown how the given query is re-written into primitive queries. For example, an MLT query is re-written into a disjunction of the selected terms. Other use cases include `fuzzy`, `common_terms`, or `match` query especially with a `cutoff_frequency` parameter. Note that the explanation is only given for a single randomly chosen shard only, so the output may vary from one shard to another. Relates #1412 Closes #10147	2015-04-13 19:17:58 +02:00
Clinton Gormley	ab3fa78ae0	Docs: Reverte migration docs mentioning parent removal from update request Relates to #9612	2015-04-13 16:35:21 +02:00
Benoit Delbosc	1b35854768	Docs: Fix simple_query_string example The "&" is not part of the simple_query_string DSL Closes #10563	2015-04-13 14:46:47 +02:00
Adrien Grand	ab8926bc6a	Docs: fix build.	2015-04-10 17:38:36 +02:00
Adrien Grand	5b3cc2f07c	Search: deprecate the limit filter. This is really a Collector instead of a filter. This commit deprecates the `limit` filter, makes it a no-op and recommends to use the `terminate_after` parameter instead that we introduced in the meantime.	2015-04-10 17:18:50 +02:00
Adrien Grand	919589b908	Queries: Remove fuzzy-like-this support. The fuzzy-like-this query builds very expensive queries and only serves esoteric use-cases.	2015-04-10 17:16:02 +02:00
Clinton Gormley	abc7de96ae	Docs: Updated version annotations in master	2015-04-09 14:50:11 +02:00
David Pilato	88ee7a5dca	Deprecate rivers * In code, we mark `River`, `AbstractRiverComponent`, `RiverComponent` and `RiverName` classes as deprecated * We log that information when a cluster is still using it * We add this information in the plugins list as well	2015-04-09 14:29:16 +02:00
Adrien Grand	fae124103a	Merge pull request #10420 from jpountz/feature/numeric_resolution Mappings: Bring back numeric_resolution. Close #10420	2015-04-09 12:28:33 +02:00
Adrien Grand	aecd9ac515	Aggregations: Speed up include/exclude in terms aggregations with regexps. Today we check every regular expression eagerly against every possible term. This can be very slow if you have lots of unique terms, and even the bottleneck if your query is selective. This commit switches to Lucene regular expressions instead of Java (not exactly the same syntax yet most existing regular expressions should keep working) and uses the same logic as RegExpQuery to intersect the regular expression with the terms dictionary. I wrote a quick benchmark (in the PR) to make sure it made things faster and the same request that took 750ms on master now takes 74ms with this change. Close #7526	2015-04-09 12:12:56 +02:00
javanna	acabf2d55a	Cluster state REST api: print routing_nodes out only when requested through specific flag For bacwards compatibility reasons routing_nodes were previously printed out when routing_table was requested, together with the actual routing_table. Now they are printed out only when requests through `routing_nodes` flag. Relates to #10412 Closes #10486	2015-04-08 16:10:36 +02:00
marko asplund	5585175173	Docs: fix typos in example JSON data Closes #10479	2015-04-08 13:40:35 +02:00
javanna	d9aebf4906	Scripting: remove deprecated methods from ScriptService Removed the following methods from `ScriptService`, which don't require the `ScriptContext` argument: ``` public CompiledScript compile(String lang, String script, ScriptType scriptType) public ExecutableScript executable(String lang, String script, ScriptType scriptType, Map<String, Object> vars) public SearchScript search(SearchLookup lookup, String lang, String script, ScriptType scriptType, @Nullable Map<String, Object> vars) ``` Also removed the ScriptContext.Standard.GENERIC_PLUGIN enum value, as it was used only for backwards compatibility. Plugins that make use of scripts should declare their own script contexts through `ScriptModule#registerScriptContext` and use them when compiling/executing scripts. Closes #10476	2015-04-08 12:20:03 +02:00
javanna	7bd7ea8f13	Scripting: allow plugins to define custom operations that they use scripts for Plugins can now define multiple operations/contexts that they use scripts for. Fine-grained settings can then be used to enable/disable scripts based on each single registered context. Also added a new generic category called `plugin`, which will be used as a default when the context is not specified. This allows us to restore backwards compatibility for plugins on `ScriptService` by restoring the old methods that don't require the script context and making them internally use the `plugin` context, as they can only be called from plugins. Closes #10347 Closes #10419	2015-04-08 11:57:00 +02:00
Isabel Drost-Fromm	60bb65c4d9	Docs: Note on shard vs. index level doc frequencies. Relates to #10154 and #10150 Adds link to additional information on how document frequencies are treated across shards to the cutoff_frequency parameter documentation. Closes #10451	2015-04-07 14:28:01 +02:00
joelbourbon	3c52bc1098	Docs: Missing 1 escape character in example Closes #10446	2015-04-07 14:10:17 +02:00
Lee Hinman	eed7c8af6d	[DOCS] Document `indices.recovery.concurrent_small_file_streams`	2015-04-06 11:16:50 -06:00
Alexander Reelsen	6170c274ba	Documentation: Add note about not having sources in repositories Closes #10390	2015-04-05 18:08:57 +02:00
Dustin Shiver	ae60144123	Update for clarification Make it clear which nodes in the cluster should have `http.enabled` set to `false`. Closes #10305	2015-04-05 18:05:14 +02:00
Clinton Gormley	276dbc2925	Update repositories.asciidoc Added a warning explaining how `add-apt-repository` adds a `deb-src` entry, which can result in errors. Closes #10223	2015-04-05 11:43:49 +02:00
Clinton Gormley	9607f4c22d	Docs: Elasticsearch will refuse to start with a known bad JVM	2015-04-04 20:21:37 +02:00
Clinton Gormley	a95b11ca61	Document `doc_values` for field type `ip` Closes #9809	2015-04-04 17:51:28 +02:00
Clinton Gormley	c046398093	Update indices.asciidoc Fixed typo in cat indices Relates to #7936	2015-04-04 16:50:04 +02:00
Adrien Grand	c7115f8364	Mappings: Bring back numeric_resolution. We had an undocumented parameter called `numeric_resolution` which allows to configure how to deal with dates when provided as a number. The default is to handle them as milliseconds, but you can also opt-on for eg. seconds. Close #10072	2015-04-03 19:54:14 +02:00
Guillaume Dievart	adcb782423	Update core-types.asciidoc	2015-04-03 14:12:29 +02:00
Adrien Grand	08f93cf33f	Add doc values support to boolean fields. This pull request makes boolean handled like dates and ipv4 addresses: things are stored as as numerics under the hood and aggregations add some special formatting logic in order to return true/false in addition to 1/0. For example, here is an output of a terms aggregation on a boolean field: ``` "aggregations": { "top_f": { "doc_count_error_upper_bound": 0, "buckets": [ { "key": 0, "key_as_string": "false", "doc_count": 2 }, { "key": 1, "key_as_string": "true", "doc_count": 1 } ] } } ``` Sorted numeric doc values are used under the hood. Close #4678 Close #7851	2015-04-02 15:40:46 +02:00
Tanguy Leroux	eeec90be79	[DOCS] Add verify parameter to snapshot documentation Add verify parameter to snapshot documentation and remove 'verify' setting at FS repository level (not supported)	2015-04-02 14:23:31 +02:00
wittyameta	728f834716	[DOCS] add wait_for_active_shards option to health.asciidoc	2015-04-02 09:33:54 +02:00
Reuben Sutton	85c221e9b1	Remove jsonp support and associated tests, closes #9108	2015-04-01 16:06:09 +01:00
David Wittman	3acf4ccb33	Fix typos for gateway.recover_after_time There were a few references to the setting `gateway.recovery_after_time`, which should instead be `gateway.recover_after_time`.	2015-03-31 14:00:08 -05:00
javanna	83fb0a10e5	Scripting: remove support for script.disable_dynamic setting Now that fine-grained script settings are supported (#10116) we can remove support for the script.disable_dynamic setting. Same result as `script.disable_dynamic: false` can be obtained as follows: ``` script.inline: on script.indexed: on ``` An exception is thrown at startup when the old setting is set, so we make sure we tell users they have to change it rather than ignoring the setting. Closes #10286	2015-03-31 13:24:52 +02:00
Adrien Grand	0a6be2c111	Merge pull request #9296 from jpountz/enhancement/remove_count_search_type Search: Merge `search_type=count` and `size=0`. Close #9226	2015-03-31 11:36:38 +02:00
Adrien Grand	a608db122d	Search: Remove the `count` search type. This commit brings the benefits of the `count` search type to search requests that have a `size` of 0: - a single round-trip to shards (no fetch phase) - ability to use the query cache Since `count` now provides no benefits over `query_then_fetch`, it has been deprecated. Close #7630	2015-03-31 11:31:49 +02:00
Patrick Peschlow	be93884538	Update scripting.asciidoc change description to better fit the flag name	2015-03-31 09:26:40 +02:00
Patrick Peschlow	a9af488bb3	Update prefix-filter.asciidoc text said phrase instead of prefix, probably due to copy-paste	2015-03-31 09:25:15 +02:00
olivier bourgain	00a9db73ae	[DOCS] Fix multi percolate response sample in percolate.asciidoc	2015-03-30 11:32:41 +02:00
javanna	0beda40069	[DOCS] added table with supported scripting languages to scripting docs	2015-03-29 11:28:50 +02:00
Martijn van Groningen	6d1a1b328b	Make sure that the parent option on the update request only is delgated to upsert index request. Closes #4538	2015-03-28 08:53:11 +01:00
Martijn van Groningen	75713f4190	Reverted commit: `20f7be3`	2015-03-28 08:53:11 +01:00
Clinton Gormley	743758ce64	Updated version in docs to use 1.5.0	2015-03-27 08:58:13 +02:00
javanna	425ea5bca6	[DOCS] removed coming tags from scripting docs	2015-03-26 20:22:20 +01:00
javanna	d9d1e6a67a	Scripting: add support for fine-grained settings Allow to on/off scripting based on their source (where they get loaded from), the operation that executes them and their language. The settings cover the following combinations: - mode: on, off, sandbox - source: indexed, dynamic, file - engine: groovy, expressions, mustache, etc - operation: update, search, aggs, mapping The following settings are supported for every engine: script.engine.groovy.indexed.update: sandbox/on/off script.engine.groovy.indexed.search: sandbox/on/off script.engine.groovy.indexed.aggs: sandbox/on/off script.engine.groovy.indexed.mapping: sandbox/on/off script.engine.groovy.dynamic.update: sandbox/on/off script.engine.groovy.dynamic.search: sandbox/on/off script.engine.groovy.dynamic.aggs: sandbox/on/off script.engine.groovy.dynamic.mapping: sandbox/on/off script.engine.groovy.file.update: sandbox/on/off script.engine.groovy.file.search: sandbox/on/off script.engine.groovy.file.aggs: sandbox/on/off script.engine.groovy.file.mapping: sandbox/on/off For ease of use, the following more generic settings are supported too: script.indexed: sandbox/on/off script.dynamic: sandbox/on/off script.file: sandbox/on/off script.update: sandbox/on/off script.search: sandbox/on/off script.aggs: sandbox/on/off script.mapping: sandbox/on/off These will be used to calculate the more specific settings, using the stricter setting of each combination. Operation based settings have precedence over conflicting source based ones. Note that the `mustache` engine is affected by generic settings applied to any language, while native scripts aren't as they are static by definition. Also, the previous `script.disable_dynamic` setting can now be deprecated. Closes #6418 Closes #10116 Closes #10274	2015-03-26 19:56:55 +01:00
Glen Smith	5a475d21e5	[DOCS] Added explicit "lang" field to documentation of script score definition	2015-03-25 10:58:59 +01:00
Ryan Ernst	90dfd78267	Remove missed references to delete mapping API See #10231	2015-03-24 10:13:19 -07:00
Ryan Ernst	693d91e41c	Mappings: Remove delete mapping API Deleting a type from an index is inherently dangerous because the type can be recreated with new mappings which may conflict with existing segments still using the old mappings. This removes the ability to delete a type (similar to how deleting fields within a type is not allowed, for the same reason). closes #8877 closes #10231	2015-03-24 09:46:02 -07:00
Nicholas Knize	c2ec463cdb	[GEO] fix docs for geo_point "validate" option Documentation states false as the default for "validate", "validate_lon", and "validate_lat" leading to confusion as described in issue #9539. This simple fix corrects the documentation and communicates that these fields will be deprecated and removed in upcoming versions. closes #9539	2015-03-23 15:34:37 -05:00
Boaz Leskes	4970e3e225	Revert "Rest: Add json in request body to scroll, clear scroll, and analyze API" This reverts commit `16083d454c`.	2015-03-23 12:57:19 +01:00
Jun Ohtani	16083d454c	Rest: Add json in request body to scroll, clear scroll, and analyze API Add json support to scroll, clear scroll, and analyze Closes #5866	2015-03-23 15:35:38 +09:00
Simon Willnauer	7257345db9	Revert Benchmark API The benchmark api is being worked on feature/bench branch and will be merged from there when ready.	2015-03-21 10:36:04 +01:00
Asimov4	649e3aa4c5	[DOCS] Fix typos in percolate.asciidoc	2015-03-21 10:23:15 +01:00
javanna	88e506e58c	[DOCS] add -i flag to more curl HEAD calls	2015-03-21 08:56:20 +01:00
Corey Daley	366f01b4d2	[DOCS] add -i flag to curl HEAD call without -i you never see the status:200 or status:404 messages	2015-03-21 08:56:12 +01:00
Florian Hopf	865cbeb3d8	Filter indices stats for translog Added the missing call in the RestAction, closes #8262	2015-03-20 14:38:49 -07:00
Al Lefebvre	94f82368f0	Update templates.asciidoc I've been attempting to programatically verify that adding index templates via the `{path.conf}/templates/` directory works fine although I was never able to validate this via an API call to the `/_template/`. It seems that these templates do not appear in that API call, which I discovered in the following mail thread: http://elasticsearch-users.115913.n3.nabble.com/Loading-of-index-settings-template-from-file-in-config-templates-td4024923.html#d1366317284000-912 My question is why wouldn't the `/_template/*` method return these templates? This tends to complicate things for those that want to perform automated tests to verify that they are in fact being recognized and used by Elasticsearch.	2015-03-20 14:52:20 -06:00
Michael McCandless	34b397597c	Core: increase default rate limiting for snapshot, restore and recovery to 40 MB/sec This also fixes a possible issue that may cause over-throttling when there are many small files being copied, which should be rare. Closed #10185	2015-03-20 16:09:42 -04:00
javanna	9ca74c8217	[DOCS] clarify no-master-block docs Closes #9739	2015-03-20 15:58:16 +01:00
javanna	4348959f9d	Delete api: remove broadcast delete if routing is missing when required This commit changes the behaviour of the delete api when processing a delete request that refers to a type that has routing set to required in the mapping, and the routing is missing in the request. Up until now the delete api sent a broadcast delete request to all of the shards that belong to the index, making sure that the document could be found although the routing value wasn't specified. This was probably not the best choice: if the routing is set to required, an error should be thrown instead. A `RoutingMissingException` gets now thrown instead, like it happens in the same situation with every other api (index, update, get etc.). Last but not least, this change allows to get rid of a couple of `TransportAction`s, `Request`s and `Response`s and simplify the codebase. Closes #9123 Closes #10136	2015-03-20 09:19:43 +01:00
Simon Willnauer	1168347b9d	[REPLICATION] Remove `async` replication Closes #10114	2015-03-19 14:44:21 -07:00
Clinton Gormley	aa94ced0ae	Remove references to the thrift and memcached transport plugins as they are no longer supported Closes #10166	2015-03-19 20:49:58 +01:00
Clinton Gormley	25369f0727	Remove async replication from the docs and REST spec Relates to #10114	2015-03-19 15:34:12 +01:00
javanna	f4691458d5	[DOCS] added note about dynamic scriptings and updated links in getting started page Closes #10074	2015-03-19 15:22:39 +01:00
javanna	ddcecc2bc2	[DOCS] added instructions on how to write parameterized tests Closes #9423	2015-03-19 12:43:51 +01:00
jaymode	105bdd486a	[HTTP] add option to only return simple exception messages Adds a setting to disable detailed error messages and full exception stack traces in HTTP responses. When set to false, the error_trace request parameter will result in a HTTP 400 response. When the error_trace parameter is not present, the message of the first ElasticsearchException will be output and no nested exception messages will be output.	2015-03-18 17:49:05 -04:00
Joshua Rich	db2caa54cd	Small grammar fix.	2015-03-17 11:27:13 -07:00
Boaz Leskes	b605184471	Recovery: add total operations to the `_recovery` API This commit adds the current total number of translog operations to the recovery reporting API. We also expose the recovered / total percentage: ``` "translog": { "recovered": 536, "total": 986, "percent": "54.3%", "total_time": "2ms", "total_time_in_millis": 2 }, ``` Closes #9368 Closes #10042	2015-03-17 07:31:29 -07:00
Martijn van Groningen	4393939f5e	inner_hits: Nested parent field should be resolved based on the parent inner hit definition, instead of the nested parent field in the mapping. The behaviour is better in the case someone has multiple levels of nested object fields defined in the mapping and like to define a single inner_hits definition that is two or more levels deep. If someone wants inner hits on a nested field that is 2 levels deep the following would need to be defined: ``` { ... "inner_hits" : { "path" : { "level1" : { "inner_hits" : { "path" : { "level2" : { "query" : { .... } } } } } } } } ``` With this change the above can be defined as: ``` { ... "inner_hits" : { "path" : { "level1.level2" : { "query" : { .... } } } } } ``` Closes #9251	2015-03-16 16:31:03 -07:00
Michael McCandless	0683a66277	Core: remove index.fail_on_merge_failure Always fail the engine if an unexpected exception is hit during merge. Closes #10088	2015-03-14 09:53:42 -04:00
Lee Hinman	6aec68cd29	Revert "[QUERY] Remove lowercase_expanded_terms and locale options" This reverts commit `d1f7bd97cb`. Ryan pointed out that this needs to work with the multi term query, so additional analysis and tests should be added.	2015-03-13 13:51:44 -06:00
Lee Hinman	d1f7bd97cb	[QUERY] Remove lowercase_expanded_terms and locale options The analysis chain should be used instead of relying on this, as it is confusing when dealing with different per-field analysers. The `locale` option was only used for `lowercase_expanded_terms`, which, once removed, is no longer needed, so it was removed as well. Fixes #9978 Relates to #9973	2015-03-13 13:17:27 -06:00
Petr Bela	f27cb07eb9	[DOCS] fix typo in scripting docs	2015-03-12 15:28:50 -07:00
olivier bourgain	bcb4decca9	[DOCS] add missing comma in percentile_rank aggregation example	2015-03-10 08:21:06 -07:00
olivier bourgain	fb7cd2ea9a	[DOCS] Adjusted geo_distance aggregation example unit is not returned in the response, but we have key and an implicit from starting at 0 for the first bucket	2015-03-10 08:20:20 -07:00
olivier bourgain	eaeddc6bd4	[DOCS] missing curly brace in ip_range aggregation example	2015-03-10 08:19:57 -07:00
David Pilato	d9c19cd846	[doc] Cat API: show open and closed indices in _cat/indices Related to #7936	2015-03-09 15:45:26 -07:00
Britta Weber	580728dfd6	significant terms: add scriptable significance heuristic This commit adds scripting capability to significant_terms. Custom heuristics can be implemented with a script that provides parameters subset_freq, superset_freq,subset_size, superset_size. closes #7850	2015-03-06 17:06:04 +01:00
Matias Tealdi	cba6dff3ac	fixing typo in expDecayFunction and adding offset to all dacay functions closes #9887	2015-03-05 12:28:08 +01:00
Clinton Gormley	3f9d4f9635	Update query-string-syntax.asciidoc Closes #9965	2015-03-03 20:03:51 +01:00
Pius	430b091e7d	Docs: Added default value Added default value to `cluster.routing.allocation.node_initial_primaries_recoveries` Closes #9955	2015-03-03 10:15:11 +01:00
Clinton Gormley	c223ed0db4	Update search-type.asciidoc Changed search_type docs to reflect that the `(dfs_)query_and_fetch` modes are an internal optimization and should not be specified explicitly by the user. Relates to #9606	2015-03-02 10:55:22 +01:00
Pius	a90e5c03b7	Update getting-started.asciidoc Closes #9932	2015-03-01 21:07:09 +01:00
Pius	a182c7428d	Added note on max # of docs allowed in a shard	2015-03-01 21:07:07 +01:00
Geoff Bourne	0e09c02c56	Spelling out the sort order options Closes #9768	2015-03-01 21:05:52 +01:00
cgp	b1e6df3b6c	Update span-multi-term-query.asciidoc Added comma - there is no "term range" query Closes #9855	2015-02-28 03:05:05 +01:00
Clinton Gormley	e194fb3a07	Docs: Default distance unit in geo distance agg is metres, not km Closes #9812	2015-02-28 01:45:29 +01:00
David Pilato	0c8da6bb84	[doc] Link mapper-attachment type documentation to its repo As explained in elasticsearch/elasticsearch-mapper-attachments#101, we should have consistent documentation. The best option is to link the documentation in elasticsearch guide to the most recent README in the plugin repo. Closes #9756	2015-02-27 22:18:59 +01:00
IsaacHaze	ee163e570b	Docs: Update snapshots.asciidoc Adds more determiners. Closes #9673	2015-02-27 20:42:12 +01:00
Ryan Ernst	9d708e20a0	Mappings: Lock down _size field This also changes the stored setting for _size to true (for indexes created in 2.x). see #8143 closes #9913	2015-02-27 11:09:52 -08:00
Ryan Ernst	3b7928d568	Mappings: Lock down _field_names field Now that we have an explicit `enabled` flag, we can lock down the field type so it is not mungeable. see #8143 closes #9912	2015-02-26 15:15:59 -08:00
Ryan Ernst	7181bbde26	Mappings: Remove _boost field This has been deprecated since 1.0.0.RC1. It is finally removed here. closes #8875	2015-02-26 15:07:07 -08:00
Ryan Ernst	78df69e6a0	Mappings: Lock down _routing field `required` is now the only changeable settings (on indexes created after 1.x). see #8143 closes #9895	2015-02-26 13:09:41 -08:00
Boaz Leskes	e9dbfa9ee6	Transport: added a simple request tracer, logging incoming and outgoing requests The request tracer logs in TRACE level under the `transport.tracer` log and is dynamically configurable with include and exclude arrays to filter out unneeded info. By default all requests are logged with the exception of fault detection pings (fired every second). add the notion of tracers in the MockTransportService for testing purposes Closes #9286	2015-02-25 21:33:57 +01:00
Ryan Ernst	32e042f1c4	Mappings: Lock down _index field see #8143 closes #9870	2015-02-25 12:24:55 -08:00
Lee Hinman	2e9ea4abaf	Add support for `minimum_should_match` to `simple_query_string` This behaves similar to the way that `minimum_should_match` works for the `match` query (in fact it is implemented in the exact same way) Fixes #6449	2015-02-25 11:35:33 -07:00
Boaz Leskes	3e32dd985a	Recovery: RecoveryState clean up To support the `_recovery` API, the recovery process keeps track of current progress in a class called RecoveryState. This class currently have some issues, mostly around concurrency (see #6644 ). This PR cleans it up as well as other issues around it: - Make the Index subsection API cleaner: - remove redundant information - all calculation is done based on the underlying file map - clearer definition of what is what: total files, vs reused files (local files that match the source) vs recovered files (copied over). % based progress is reported based on recovered files only. - cleaned up json response to match other API (sadly this breaks the structure). We now properly report human values for dates and other units. - Add more robust unit testing - Detail flag was passed along as state (it's now a ToXContent param) - State lookup during reporting is now always done via the IndexShard , no more fall backs to many other classes. - Cleanup APIs around time and move the little computations to the state class as opposed to doing them out of the API I also improved error messages out of the REST testing infra for things I run into. Closes #6644 Closes #9811	2015-02-25 17:34:22 +01:00
Igor Motov	c5ebdf11bb	Snapshot/Restore: add ability to retrieve currently running snapshots Together with #8782 it should help in the situations simliar to #8887 by adding an ability to get information about currently running snapshot without accessing the repository itself. Closes #8887	2015-02-25 11:06:32 -05:00
Boaz Leskes	6953777c3a	API: add pending tasks count to cluster health The number of current pending tasks is useful to detect and overloaded master. This commit adds it to the cluster health API. The complete list can be retrieved from the dedicated pending tasks API. It also adds rest tests for the cluster health variants. Closes #9877	2015-02-25 14:58:44 +01:00
Clinton Gormley	5a53ff6f1b	Update migrate_2_0.asciidoc More code formatting in breaking changes	2015-02-25 14:13:25 +01:00
Clinton Gormley	e805fe71cc	Update migrate_2_0.asciidoc Code formatting in breaking changes	2015-02-25 14:11:57 +01:00
Clinton Gormley	5146cf6256	Update migrate_2_0.asciidoc Fixed bad heading levels in breaking changes	2015-02-25 14:10:17 +01:00
Clinton Gormley	0c61ea803d	Update migrate_2_0.asciidoc Fixed bad asciidoc in breaking changes	2015-02-25 14:07:19 +01:00
Colin Goodheart-Smithe	2520dc78ec	[DOCS] added a note for the default shard_size value	2015-02-25 11:00:55 +00:00
Ryan Ernst	be0cef0c43	Mappings: Lock down _type field see #8143 closes #9869	2015-02-24 22:37:41 -08:00
Ryan Ernst	b96bd201c1	Mappings: Lock down _id field There are two implications to this change. First, percolator now uses _uid internally, extracting the id portion when needed. Second, sorting on _id is no longer possible, since you can no longer index _id. However, _uid can still be used to sort, and is better anyways as indexing _id just to make it available to fielddata for sorting is wasteful. see #8143 closes #9842	2015-02-24 14:26:22 -08:00
Michael Sander	fd6c6058ce	Remove Triple Negative! Double negatives are confusing, but a triple negative (1 no, 2 non, 3 null)? It takes five minutes to understand this little sentence. Cleaned that up a bit. Closes #9789	2015-02-23 20:09:05 +01:00
Colin Goodheart-Smithe	2753db4685	Scripting: Removed deprecated script parameter names This change removes the deprecated script parameter names ('file', 'id', and 'scriptField'). It also removes the ability to load file scripts using the 'script' parameter. File scripts should be loaded using the 'script_file' parameter only.	2015-02-23 13:49:21 +00:00
Colin Goodheart-Smithe	7d3856c9d3	[DOCS] update script docs to use preferred script parameter names	2015-02-23 11:16:28 +00:00
Robert Muir	1e015e6e33	Tests: Remove global shared cluster This was previously attempted in #8854. I revived that branch and did some performance testing as was suggested in the comments there. I fixed all the errors, mostly just the rest tests, which needed to have http enabled on the node settings (the global cluster previously had this always enabled). I also addressed the comments from that issue. My performance tests involved running the entire test suite on my desktop which has 6 cores, 16GB of ram, and nothing else was being run on the box at the time. I ran each set of settings 3 times and took the average time. \| mode \| master \| patch \| diff \| \| ------- \| ------ \| ----- \| ---- \| \| local \| 409s \| 417s \| +2% \| \| network \| 368s \| 380s \| +3% \| This increase in average time is clearly worthwhile to pay to achieve isolation of tests. One caveat is the way I fixed the rest tests is still to have one cluster for the entire suite, so all the rest tests can still potentially affect each other, but this is an issue for another day. There were some oddities that I noticed while running these tests that I would like to point out, as they probably deserve some investigation (but orthogonal to this PR): * The total test run times are highly variable (more than a minute between the min and max) * Running in network mode is on average actually faster than local mode. How is this possible!?	2015-02-22 22:04:22 -08:00
Martijn van Groningen	daefb4c673	Docs: Document that the fielddata loading defaults to eager on the _parent field. Closes #9804	2015-02-22 23:15:59 +01:00
markharwood	29b1902cfb	New aggregations feature - “PercentageScore” heuristic for significant_terms aggregation provides simple “per-capita” type measures. Closes #9720	2015-02-20 13:22:08 +00:00
Adrien Grand	4708227ecf	Codecs: Remove the ability to have custom per-field postings and doc values formats. This commit makes the `postings_format` and `doc_values_format` options of mappings illegal on 2.0 and ignored on 1.x (meaning that the default postings and doc values formats from the codec will be used in such a case). This removes a fair amount of code. Close #8746 #9741	2015-02-19 15:47:25 +01:00
Lee Hinman	eb666f7f50	Add shadow replicas for shared filesystems Squashed commit of the following: commit 20835037c98e7d2fac4206c372717a05a27c4790 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 15:27:17 2015 -0700 Use Enum for "_primary" preference commit 325acbe4585179190a959ba3101ee63b99f1931a Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 14:32:41 2015 -0700 Use ?preference=_primary automatically for realtime GET operations commit edd49434af5de7e55928f27a1c9ed0fddb1fb133 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 14:32:06 2015 -0700 Move engine creation into protected createNewEngine method commit 67a797a9235d4aa376ff4af16f3944d907df4577 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 13:14:01 2015 -0700 Factor out AssertingSearcher so it can be used by mock Engines commit 62b0c28df8c23cc0b8205b33f7595c68ff940e2b Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 11:43:17 2015 -0700 Use IndexMetaData.isIndexUsingShadowReplicas helper commit 1a0d45629457578a60ae5bccbeba05acf5d79ddd Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 09:59:31 2015 -0700 Rename usesSharedFilesystem -> isOnSharedFilesystem commit 73c62df4fc7da8a5ed557620a83910d89b313aa1 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 09:58:02 2015 -0700 Add MockShadowEngine and hook it up to be used commit c8e8db473830fce1bdca3c4df80a685e782383bc Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 09:45:50 2015 -0700 Clarify comment about pre-defined mappings commit 60a4d5374af5262bd415f4ef40f635278ed12a03 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 09:18:22 2015 -0700 Add a test for shadow replicas that uses field data commit 7346f9f382f83a21cd2445b3386fe67472bc3184 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 08:37:14 2015 -0700 Revert changes to RecoveryTarget.java commit d90d6980c9b737bd8c0f4339613a5373b1645e95 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 08:35:44 2015 -0700 Rename `ownsShard` to `canDeleteShardContent` commit 23001af834d66278ac84d9a72c37b5d1f3a10a7b Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 08:35:25 2015 -0700 Remove ShadowEngineFactory, add .newReadOnlyEngine method in EngineFactory commit b64fef1d2c5e167713e869b22d388ff479252173 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 18 08:25:19 2015 -0700 Add warning that predefined mappings should be used commit a1b8b8cf0db49d1bd1aeb84e51491f7f0de43b59 Author: Lee Hinman <lee@writequit.org> Date: Tue Feb 17 14:31:50 2015 -0700 Remove unused import and fix index creation example in docs commit 0b1b852365ceafc0df86866ac3a4ffb6988b08e4 Merge: b9d1fed `a22bd49` Author: Lee Hinman <lee@writequit.org> Date: Tue Feb 17 10:56:02 2015 -0700 Merge remote-tracking branch 'refs/remotes/origin/master' into shadow-replicas commit b9d1fed25ae472a9dce1904eb806702fba4d9786 Merge: 4473e63 `41fd4d8` Author: Lee Hinman <lee@writequit.org> Date: Tue Feb 17 09:02:27 2015 -0700 Merge remote-tracking branch 'refs/remotes/origin/master' into shadow-replicas commit 4473e630460e2f0ca2a2e2478f3712f39a64c919 Author: Lee Hinman <lee@writequit.org> Date: Tue Feb 17 09:00:39 2015 -0700 Add asciidoc documentation for shadow replicas commit eb699c19f04965952ae45e2caf107124837c4654 Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 16:15:39 2015 +0100 remove last nocommit commit c5ece6d16d423fbdd36f5d789bd8daa5724d77b0 Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 16:13:12 2015 +0100 simplify shadow engine commit 45cd34a12a442080477da3ef14ab2fe7947ea97e Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 11:32:57 2015 +0100 fix tests commit 744f228c192602a6737051571e040731d413ba8b Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 11:28:12 2015 +0100 revert changes to IndexShardGateway - these are leftovers from previous iterations commit 11886b7653dabc23655ec76d112f291301f98f4a Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 11:26:48 2015 +0100 Back out non-shared FS code. this will go in in a second iteration commit 77fba571f150a0ca7fb340603669522c3ed65363 Merge: e8ad614 `2e3c6a9` Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 11:16:46 2015 +0100 Merge branch 'master' into shadow-replicas Conflicts: src/main/java/org/elasticsearch/index/engine/Engine.java commit e8ad61467304e6d175257e389b8406d2a6cf8dba Merge: 48a700d `1b8d8da` Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 10:54:20 2015 +0100 Merge branch 'master' into shadow-replicas commit 48a700d23cff117b8e4851d4008364f92b8272a0 Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 10:50:59 2015 +0100 add test for failing shadow engine / remove nocommit commit d77414c5e7b2cde830a8e3f70fe463ccc904d4d0 Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 17 10:27:56 2015 +0100 remove nocommits in IndexMetaData commit abb696563a9e418d3f842a790fcb832f91150be2 Author: Simon Willnauer <simonw@apache.org> Date: Mon Feb 16 17:05:02 2015 +0100 remove nocommit and simplify delete logic commit 82b9f0449108cd4741568d9b4495bf6c10a5b019 Author: Simon Willnauer <simonw@apache.org> Date: Mon Feb 16 16:45:27 2015 +0100 reduce the changes compared to master commit 28f069b6d99a65e285ac8c821e6a332a1d8eb315 Author: Simon Willnauer <simonw@apache.org> Date: Mon Feb 16 16:43:46 2015 +0100 fix primary relocation commit c4c999dd61a44a7a0db9798275a622f2b85b1039 Merge: 2ae80f9 `455a85d` Author: Simon Willnauer <simonw@apache.org> Date: Mon Feb 16 15:04:26 2015 +0100 Merge branch 'master' into shadow-replicas commit 2ae80f9689346f8fd346a0d3775a6341874d8bef Author: Lee Hinman <lee@writequit.org> Date: Fri Feb 13 16:25:34 2015 -0700 throw UnsupportedOperationException on write operations in ShadowEngine commit 740c28dd9ef987bf56b670fa1a8bcc6de2845819 Merge: e5bc047 `305ba33` Author: Lee Hinman <lee@writequit.org> Date: Fri Feb 13 15:38:39 2015 -0700 Merge branch 'master' into shadow-replicas commit e5bc047d7c872ae960d397b1ae7b4b78d6a1ea10 Author: Lee Hinman <lee@writequit.org> Date: Fri Feb 13 11:38:09 2015 -0700 Don't replicate document request when using shadow replicas commit 213292e0679d8ae1492ea11861178236f4abd8ea Author: Simon Willnauer <simonw@apache.org> Date: Fri Feb 13 13:58:05 2015 +0100 add one more nocommit commit 83d171cf632f9b77cca9de58505f7db8fcda5599 Merge: aea9692 `09eb8d1` Author: Simon Willnauer <simonw@apache.org> Date: Fri Feb 13 13:52:29 2015 +0100 Merge branch 'master' into shadow-replicas commit aea96920d995dacef294e48e719ba18f1ecf5860 Author: Simon Willnauer <simonw@apache.org> Date: Fri Feb 13 09:56:41 2015 +0100 revert unneeded changes on Store commit ea4e3e58dc6959a92c06d5990276268d586735f3 Author: Lee Hinman <lee@writequit.org> Date: Thu Feb 12 14:26:30 2015 -0700 Add documentation to ShadowIndexShard, remove nocommit commit 4f71c8d9f706a0c1c39aa3a370efb1604559d928 Author: Lee Hinman <lee@writequit.org> Date: Thu Feb 12 14:17:22 2015 -0700 Add documentation to ShadowEngine commit 28a9d1842722acba7ea69e0fa65200444532a30c Author: Lee Hinman <lee@writequit.org> Date: Thu Feb 12 14:08:25 2015 -0700 Remove nocommit, document canDeleteIndexContents commit d8d59dbf6d0525cd823d97268d035820e5727ac9 Author: Lee Hinman <lee@writequit.org> Date: Thu Feb 12 10:34:32 2015 -0700 Refactor more shared methods into the abstract Engine commit a7eb53c1e8b8fbfd9281b43ae39eacbe3cd1a0a6 Author: Simon Willnauer <simonw@apache.org> Date: Thu Feb 12 17:38:59 2015 +0100 Simplify shared filesystem recovery by using a dedicated recovery handler that skip most phases and enforces shard closing on the soruce before the target opens it's engine commit a62b9a70adad87d7492c526f4daf868cb05018d9 Author: Simon Willnauer <simonw@apache.org> Date: Thu Feb 12 15:59:54 2015 +0100 fix compile error after upstream changes commit abda7807bc3328a89fd783ca7ad8c6deac35f16f Merge: f229719 `35f6496` Author: Simon Willnauer <simonw@apache.org> Date: Thu Feb 12 15:57:28 2015 +0100 Merge branch 'master' into shadow-replicas Conflicts: src/main/java/org/elasticsearch/index/engine/Engine.java commit f2297199b7dd5d3f9f1f109d0ddf3dd83390b0d1 Author: Simon Willnauer <simonw@apache.org> Date: Thu Feb 12 12:41:32 2015 +0100 first cut at catchup from primary make flush to a refresh factor our ShadowIndexShard to have IndexShard be idential to the master and least intrusive cleanup abstractions commit 4a367c07505b84b452807a58890f1cbe21711f27 Author: Simon Willnauer <simonw@apache.org> Date: Thu Feb 12 09:50:36 2015 +0100 fix primary promotion commit cf2fb807e7e243f1ad603a79bc9d5f31a499b769 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 16:45:41 2015 -0700 Make assertPathHasBeenCleared recursive commit 5689b7d2f84ca1c41e4459030af56cb9c0151eff Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 15:58:19 2015 -0700 Add testShadowReplicaNaturalRelocation commit fdbe4133537eaeb768747c2200cfc91878afeb97 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 15:28:57 2015 -0700 Use check for shared filesystem in primary -> primary relocation Also adds a nocommit commit 06e2eb4496762130af87ce68a47d360962091697 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 15:21:32 2015 -0700 Add a test checking that indices with shadow replicas clean up after themselves commit e4dbfb09a689b449f0edf6ee24222d7eaba2a215 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 15:08:18 2015 -0700 Fix segment info for ShadowEngine, remove test nocommit commit 80cf0e884c66eda7d59ac5d59235e1ce215af8f5 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 14:30:13 2015 -0700 Remove nocommit in ShadowEngineTests#testFailStart() commit 5e33eeaca971807b342f9be51a6a566eee005251 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 14:22:59 2015 -0700 Remove overly-complex test commit 2378fbb917b467e79c0262d7a41c23321bbeb147 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 13:45:44 2015 -0700 Fix missing import commit 52e9cd1b8334a5dd228d5d68bd03fd0040e9c8e9 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 13:45:05 2015 -0700 Add a test for replica -> primary promotion commit a95adbeded426d7f69f6ddc4cbd6712b6f6380b4 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 12:54:14 2015 -0700 Remove tests that don't apply to ShadowEngine commit 1896feda9de69e4f9cf774ef6748a5c50e953946 Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 10:29:12 2015 -0700 Add testShadowEngineIgnoresWriteOperations and testSearchResultRelease commit 67d7df41eac5e10a1dd63ddb31de74e326e9d38b Author: Lee Hinman <lee@writequit.org> Date: Wed Feb 11 10:06:05 2015 -0700 Add start of ShadowEngine unit tests commit ca9beb2d93d9b5af9aa6c75dbc0ead4ef57e220d Merge: 2d42736 `57a4646` Author: Simon Willnauer <simonw@apache.org> Date: Wed Feb 11 18:03:53 2015 +0100 Merge branch 'master' into shadow-replicas commit 2d42736fed3ed8afda7e4aff10b65d292e1c6f92 Author: Simon Willnauer <simonw@apache.org> Date: Wed Feb 11 17:51:22 2015 +0100 shortcut recovery if we are on a shared FS - no need to compare files etc. commit 24d36c92dd82adce650e7ac8e9f0b43c83b2dc53 Author: Simon Willnauer <simonw@apache.org> Date: Wed Feb 11 17:08:08 2015 +0100 utilize the new delete code commit 2a2eed10f58825aae29ffe4cf01aefa5743a97c7 Merge: 343dc0b `173cfc1` Author: Simon Willnauer <simonw@apache.org> Date: Wed Feb 11 16:07:41 2015 +0100 Merge branch 'master' into shadow-replicas Conflicts: src/main/java/org/elasticsearch/gateway/GatewayMetaState.java commit 343dc0b527a7052acdc783ac5abcaad1ef78dbda Author: Simon Willnauer <simonw@apache.org> Date: Wed Feb 11 16:05:28 2015 +0100 long adder is not available in java7 commit be02cabfeebaea74b51b212957a2a466cfbfb716 Author: Lee Hinman <lee@writequit.org> Date: Tue Feb 10 22:04:24 2015 -0700 Add test that restarts nodes to ensure shadow replicas recover commit 7fcb373f0617050ca1a5a577b8cf32e32dc612b0 Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 10 23:19:21 2015 +0100 make test more evil commit 38135af0c1991b88f168ece0efb72ffe9498ff59 Author: Simon Willnauer <simonw@apache.org> Date: Tue Feb 10 22:25:11 2015 +0100 make tests pass commit 05975af69e6db63cb95f3e40d25bfa7174e006ea Author: Lee Hinman <lee@writequit.org> Date: Mon Jan 12 18:44:29 2015 +0100 Add ShadowEngine	2015-02-18 15:34:06 -07:00
Christoph Büscher	30fd70f07b	Aggregations: Simplify time zone option in `date_histogram` Removed the existing `pre_zone` and `post_zone` option in `date_histogram` in favor of the simpler `time_zone` option. Previously, specifying different values for these could lead to confusing scenarios where ES would return bucket keys that are not UTC. Now `time_zone` is the only option setting, the calculation of date buckets to take place in the preferred time zone, but after rounding converting the bucket key values back to UTC. Closes #9062 Closes #9637	2015-02-16 16:54:06 +01:00
Blake Niemyjski	8cba6c3abb	Fixed an invalid query Closes #9682	2015-02-13 21:11:42 +01:00
Ryan Ernst	533fdbdf75	Mappings: Remove support for field access by short name When multiple fields under object fields share the same name, accessing by short name is ambiguous. This removes support for short names, always requiring the full name when used in queries. closes #8872	2015-02-12 09:58:37 -08:00
Andreas Kohn	01b8479179	Allow configuration of the GC log file via an environment variable Enabling GC logging works now by setting the environment variable ES_GC_LOG_FILE to the full path to the GC log file. Missing directories will be created as needed. The ES_USE_GC_LOGGING environment variable is no longer used. Closes #8471 Closes #8479	2015-02-12 17:07:57 +01:00
gseng	d1deb6bd1e	Update update-settings.asciidoc Updating to the fields as mentioned on http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/index-modules-fielddata.html Closes #9657	2015-02-12 13:09:16 +01:00
Clinton Gormley	856b0fa1a0	Docs: Fixed explanation of how the query string query is rewritten	2015-02-12 12:46:44 +01:00
Clinton Gormley	20ece4acb5	Update core-types.asciidoc Provide an example of how to disable norms Closes #9641	2015-02-12 12:10:11 +01:00
Ryan Ernst	f735baf306	Core: Remove ability to run optimize and upgrade async This has been very trappy. Rather than continue to allow buggy behavior of having upgrade/optimize requests sidestep the single shard per node limits optimize is supposed to be subject to, this removes the ability to run the upgrade/optimize async. closes #9638	2015-02-11 11:30:27 -08:00
Clinton Gormley	faae98c5d8	Updated latest version in docs	2015-02-11 19:25:10 +01:00
Clinton Gormley	57a4646776	Docs: Added note about groovy sandbox vulnerability to modules/scripting	2015-02-11 17:54:53 +01:00
Clinton Gormley	6fadeeca56	Updated doc annotations for 1.4.3	2015-02-11 17:54:53 +01:00
Ryan Ernst	b3474f6b25	Mappings: Remove ability to set path for _id and _routing on 2.0+ indexes _id and _routing now no longer support the 'path' setting on indexes created with 2.0. Indexes created before 2.0 still support this setting for backcompat. closes #6730	2015-02-10 10:53:44 -08:00
Alfredo Serafini	e607e53591	Update span-multi-term-query.asciidoc added wildcard to the list of possible nested queries Closes #9586	2015-02-09 16:01:46 +01:00
Christoph Büscher	d2f852a274	Aggregations: Add 'offset' option to date_histogram, replacing 'pre_offset' and 'post_offset' Add offset option to 'date_histogram' replacing and simplifying the previous 'pre_offset' and 'post_offset' options. This change is part of a larger clean up task for `date_histogram` from issue #9062.	2015-02-09 14:03:28 +01:00
Christoph Büscher	dfc0496fc0	Add warning to settings documentation about setting number_of_replicas on a closed index Issue #9566 raises the point that setting the number of shards on a closed index can lead to this index not beeing able to open again. This change in documentation is ment to warn the user about this issue.	2015-02-06 12:09:24 +01:00
Ryan Ernst	c6968883a7	Mappings: Remove support for new indexes using path setting in object/nested fields or index_name in any field Backcompat is still here for indexes created before 2.0. closes #6677	2015-02-05 12:44:43 -08:00
Adrien Grand	95f46f1212	Docs: Use the new experimental annotation. We now have a very useful annotation to mark features or parameters as experimental. Let's use it! This commit replaces some custom text warnings with this annotation and adds this annotation to some existing features/parameters: - inner_hits (unreleased yet) - terminate_after (released in 1.4) - per-bucket doc count errors in the terms agg (released in 1.4) I also tagged with this annotation settings which should either be not needed (like the ability to evict entries from the filter cache based on time) or that are too deep into the way that Elasticsearch works like the Directory implementation or merge settings. Close #9563	2015-02-05 15:29:45 +01:00
Adrien Grand	3a486066fd	Docs: Remove the experimental status of the cardinality and percentiles(-ranks) aggregations These aggregations are not experimental anymore but some of their parameters still are: - `precision_threshold` and `rehash` on `cardinality` - `compression` on percentiles(-ranks) Close #9560	2015-02-05 15:18:40 +01:00
Masaru Hasegawa	b4f7d26723	Fielddata: Change threshold value of fielddata.filter.frequency.max/min Make it consider 1.0 as 100% instead of aboslute count 1. Closes: #9327	2015-02-05 13:27:42 +09:00
Adam	928ea82188	Docs: Updated documentation for query-string-syntax to include '>' '<' and '=' as reserved characters Closes #9518	2015-02-04 17:55:15 +01:00
Simon Willnauer	0c5599e1d1	[ENGINE] Remove full flush / FlushType.NEW_WRITER The `full` option and `FlushType.NEW_WRITER` only exists to allow realtime changes to two settings (`index.codec` and `index.concurrency`). Those settings are very expert and don't really need to be updateable in realtime.	2015-02-04 17:38:05 +01:00
Robert Muir	027730006b	core: add 'checksum' option for index.shard.check_on_startup The current "checkindex" on startup is very very expensive. This is like running one of the old school hard drive diagnostic checkers and usually not a good idea. But we can do a CRC32 verification of files. We don't even need to open an indexreader to do this, its much more lightweight. This option (as well as the existing true/false) are randomized in tests to find problems. Also fix bug where use of the current option would always leak an indexwriter lock. Closes #9183	2015-02-03 00:10:08 -05:00
Ryan Ernst	6079d88d43	Mappings: Remove type prefix support from field names in queries This is the first part of #8872.	2015-02-02 13:10:56 -08:00
Christoph Büscher	44193e7ba5	Aggregations: Add 'offset' option to histogram aggregation Histogram aggregation supports an 'offset' option to move bucket boundaries. In a histogram with buckets of size X these can be moved from 0, X, 2X, 3X,... by an offset value of Y to Y, X+Y, 2X+Y, 3X+Y... by using the 'offset' option. The previous 'pre_offset' and 'post_offset' options are removed in favour of the simplified 'offset' option. Closes #9417 Closes #9505	2015-02-02 18:23:01 +01:00
Clinton Gormley	eea22d7731	Docs: Fixed asciidoc error in snapshots.asciidoc	2015-01-29 20:57:12 +01:00
J Charitopoulos	be8d8d658c	Docs: minor syntax Closes #9481	2015-01-29 20:27:20 +01:00
Glen Smith	3d5fbfb997	Docs: Update pattern-replace-charfilter.asciidoc Remove invalid trailing comma from json Closes #9477	2015-01-29 20:24:08 +01:00
David Pilato	878e46d7f9	[Docs] fix missing space	2015-01-29 19:17:41 +01:00
Oliver	e412dab63a	Docs: Fix sample query Closes #9472	2015-01-29 15:56:24 +01:00
Ryan Ernst	afcedb94ed	Mappings: Remove `index_analyzer` setting to simplify analyzer logic The `analyzer` setting is now the base setting, and `search_analyzer` is simply an override of the search time analyzer. When setting `search_analyzer`, `analyzer` must be set. closes #9371	2015-01-28 13:43:15 -08:00
Zachary Tong	a4eb1d5505	Aggregations: Add standard deviation bounds to extended_stats Extended_stats now displays the upper and lower bounds on standard deviations (e.g. avg +/- std). Default is to show 2 std above/below, but can be changed using the `sigma` parameter. Accepts non-negative doubles Closes #9356	2015-01-28 11:47:20 -05:00
J Charitopoulos	b359520849	Docs: Update snapshots.asciidoc minor syntax Closes #9457	2015-01-28 15:54:13 +01:00
Clinton Gormley	8978aa5465	Docs: Improved the template query docs Added the `file` and `id` parameters. Closes #9458	2015-01-28 14:19:59 +01:00
Lee Hinman	2f6527f491	[DOCS] Update documentation for `max_token_length` In 1.4 the behavior is different due to https://issues.apache.org/jira/browse/LUCENE-5897	2015-01-27 13:52:14 -07:00
Colin Goodheart-Smithe	285ef0f06d	Aggregations: Clean up response API for Aggregations This change makes the response API object for Histogram Aggregations the same for all types of Histogram, and does the same for all types of Ranges. The change removes getBucketByKey() from all aggregations except filters and terms. It also reduces the methods on the Bucket class to just getKey() and getKeyAsString(). The getKey() method returns Object and the actual Type is returns will be appropriate for the type of aggregation being run. e.g. date_histogram will return a DateTime for this method and Histogram will return a Number.	2015-01-27 10:53:44 +00:00
Christian Verkerk	5b31189498	Docs: Update cluster.asciidoc Clarify the preferencing. Closes #9434	2015-01-27 10:48:40 +01:00
Ryan Ernst	385c43c141	Mappings: Remove _analyzer closes #9279	2015-01-26 09:14:17 -08:00
jhtimmins	4aba382358	Docs: Change "There are few concepts" to "There are a few concepts" Closes #8888	2015-01-21 10:33:33 +01:00
Igor Motov	c0da353ef5	Snapshot/Restore: add support for changing index settings during restore process Closes #7887	2015-01-20 15:49:47 -05:00
Alex Ksikes	615513ee9b	Docs: clearer MLT documentation Closes #9351	2015-01-20 16:42:39 +01:00
David Pilato	fb10346953	[Mapper] Add `ignore_missing` option to `timestamp` Related to #9049. By default, the default value for `timestamp` is `now` which means the date the document was processed by the indexing chain. You can now reject documents which not provide a `timestamp` value by setting `ignore_missing` to false (default to `true`): ```js { "tweet" : { "_timestamp" : { "enabled" : true, "ignore_missing" : false } } } ``` When you update the cluster to 1.5 or master, this index created with 1.4 we automatically migrate an index created with 1.4 to the 1.5 syntax. Let say you have defined this in elasticsearch 1.4.x: ```js DELETE test PUT test { "settings": { "number_of_shards": 1, "number_of_replicas": 0 } } PUT test/type/_mapping { "type" : { "_timestamp" : { "enabled" : true, "default" : null } } } ``` After migration, the mapping become: ```js { "test": { "mappings": { "type": { "_timestamp": { "enabled": true, "store": false, "ignore_missing": false }, "properties": {} } } } } ``` Closes #8882.	2015-01-20 13:20:05 +01:00
Michael McCandless	3c0d2081cf	Core: change default xlog size from 200 MB to 512 MB Closes #9341	2015-01-19 15:52:29 -05:00
eBuildy	85ef44fd73	Docs: Fix missing comma and boolean true Closes #9350	2015-01-19 21:31:29 +01:00
Martijn van Groningen	8e0292b1aa	docs: fix inner hits snippet	2015-01-19 18:56:45 +01:00
sweetest	eaa1674d6d	Introduce index option named 'index.percolator.map_unmapped_fields_as_string', that handles unmapped fields in percolator queries as type string. Closes #9053 Closes #9054	2015-01-19 09:51:10 +01:00
Michael McCandless	b9358ccca8	Core: switch to auto IO throttle for merges This adds a new boolean (index.merge.scheduler.auto_throttle) dynamic setting, default true (matching Lucene), to adaptively set the IO rate limit for merges over time. This is more flexible than the previous fixed rate throttling because it responds depending on the incoming merge rate, so search-heavy applications that are not doing much indexing will see merges heavily throttled while indexing-heavy cases will lighten the throttle so merges can keep up within incoming indexing. The fixed rate throttling is still available as a fallback if things go horribly wrong. Closes #9243 Closes #9133	2015-01-16 13:00:08 -05:00
Clinton Gormley	c644c377ab	Update api-conventions.asciidoc Corrected explanation of fuzzy AUTO Related to #9278	2015-01-16 14:26:50 +01:00
Clinton Gormley	f5b91c374a	Update upgrade.asciidoc Upgrade request needs pretty and human for the demonstrated output. Closes #9313	2015-01-16 13:55:22 +01:00
David Haney	395960feef	Docs: Updated standard token filter docs to indicate true behavior: doing nothing Closes #9300	2015-01-15 21:33:29 +01:00
Michael McCandless	def2d34f80	don't mention fixed throttling in the docs	2015-01-14 10:13:10 -05:00
Michael McCandless	107099affa	put back fixed throttling, but off by default	2015-01-14 05:35:09 -05:00
Paul Echeverri	4f938ad37e	Updates the command to add the repo to not use add-apt-repository, which automatically adds a non-working deb-src line to sources.list. Command now uses echo to write the correct line to sources.list instead. Fixes #9261	2015-01-12 21:18:00 +00:00
Tomoya Hirano	15d46988dc	Fix typo in sample json Fixes #9253	2015-01-12 15:58:16 +00:00
David Pilato	052645903a	Rest: remove status code from main action Today we give the HTTP status back within the HTTP response itself and within the JSON response as well: ```sh curl localhost:9200/ ``` ```js { "status" : 200, "name" : "Red Wolf", "version" : { "number" : "2.0.0", "build_hash" : "6837a61d8a646a2ac7dc8da1ab3c4ab85d60882d", "build_timestamp" : "2014-08-19T13:55:56Z", "build_snapshot" : true, "lucene_version" : "4.9" }, "tagline" : "You Know, for Search" } ```	2015-01-12 12:37:46 +01:00
David Pilato	fc7a0d3a4a	[Docs] fix three to four	2015-01-12 12:13:23 +01:00
Michael McCandless	1aad275c55	expose current CMS throttle in merge stats; fix tests, docs; also log per-merge stop/throttle/rate	2015-01-11 05:52:43 -05:00
Michael McCandless	31e6acf3f2	first cut	2015-01-10 16:38:56 -05:00
Christoph Büscher	04cb09f44c	[TEST] Add missing docs and tests for '_cat/segments' The '_cat/segments' api was missing docs and a rest test which are added here. Closes #5856	2015-01-09 12:29:11 +01:00
Ryan Ernst	060f963a8e	Mappings: Remove allow_type_wrapper setting Before Elasticsearch 1.0, the type was allowed to be passed as the root element when uploading a document. However, this was ambiguous if the mappings also contained a field with the same name as the type. The behavior was changed in 1.0 to not allow this, but a setting was added for backwards compatibility. This change removes the setting for 2.0.	2015-01-08 09:13:40 -08:00
Martijn van Groningen	ca4f27f40e	Core: Added `_shards` header to all write responses. The header indicates to how many shard copies (primary and replicas shards) a write was supposed to go to, to how many shard copies to write succeeded and potentially captures shard failures if writing into a replica shard fails. For async writes it also includes the number of shards a write is still pending. Closes #7994	2015-01-08 18:10:08 +01:00
Martijn van Groningen	dedaf9387e	Core: Also check if indices resolved via aliases resolution aren't closed and deal with this according to IndicesOptions. Closes #9057	2015-01-08 16:45:34 +01:00
Martijn van Groningen	20f7be378b	Removed parent parameter from update request, because it is just sets the routing. The routing option should be used instead. The parent a child document points to can't be updated. Closes #4538	2015-01-07 10:26:20 +01:00
Ryan Ernst	f7f99b8dbf	Stats: Added verbose option to segments api, with full ram tree as first additional element per segment. This commit adds a verbose flag to the _segments api. Currently the only additional information returned when set to true is the full ram tree from lucene for each segment.	2015-01-06 10:04:52 -08:00
Adrien Grand	bc86796592	Core: Remove terms filter cache. This is our only cache which is not 'exact' and might allow for stalled results. Additionally, a similar cache that we have and needs to perform lookups in other indices in order to run queries is the script index, and for this index we rely on the filesystem cache, so we should probably do the same with terms filters lookups. Close #9056	2015-01-06 17:21:20 +01:00
Simon Willnauer	236e2491b4	[ALLOCATION] Remove primary balance factor The `cluster.routing.allocation.balance.primary` setting has caused a lot of confusion in the past while it has very little benefit form a shard allocatioon point of view. Users tend to modify this value to evently distribute primaries across the nodes which is dangerous since a prmiary flag on it's own can trigger relocations. The primary flag for a shard is should not have any impact on cluster performance unless the high level feature suffereing from primary hotspots is buggy. Yet, this setting was intended to be a tie-breaker which is not necessary anymore since the algorithm is deterministic. This commit removes this setting entriely.	2015-01-06 16:43:39 +01:00
Simon Willnauer	4900f52619	[ALLOCATION] Weight deltas must be absolute deltas In some situations the shard balanceing weight delta becomes negative. Yet, a negative delta is always treated as `well balanced` which is wrong. I wasn't able to reproduce the issue in any way other than useing the real world data from issue #9023. This commit adds a fix for absolute deltas as well as a base test class that allows to build tests or simulations from the cat API output. Closes #9023	2015-01-06 15:48:44 +01:00
Clinton Gormley	75cc7077c7	Update plugins.asciidoc Added entity resolution plugin for duplication detection Related to #9131	2015-01-05 12:53:37 +01:00
Mikhail Korobov	707025fb7a	[Docs] fix curl examples in Nodes Stats docs Closes #9118	2014-12-31 14:01:37 +01:00
Clinton Gormley	f83909f7ae	Docs: The regexp query defaults to the `ALL` flag, and removed the `AUTOMATON` flag which is not used in Elasticsearch. Closes #6180	2014-12-30 19:53:31 +01:00
Clinton Gormley	904f20a41b	Update setup.asciidoc Add a note about using the same JVM version on all nodes and clients	2014-12-30 17:40:51 +01:00
dtpeacock	582d5e8d3c	Doc has store "false" not store "true" Came from `3465e69e83` due to changing "yes" to "false". Closes #9075	2014-12-29 11:59:22 +01:00
Martijn van Groningen	d8054ec299	inner_hits: Added another more compact syntax for inner hits. Closes #8770	2014-12-24 17:41:35 +01:00
Ryan Ernst	39b3613420	Fix date histogram docs grammar.	2014-12-23 10:19:55 -08:00
Nicholas Knize	77a7ef28b3	[GEO] Add optional left/right parameter to GeoJSON This feature adds an optional orientation parameter to the GeoJSON document and geo_shape mapping enabling users to explicitly define how they want Elasticsearch to interpret vertex ordering. The default uses the right-hand rule (counterclockwise for outer ring, clockwise for inner ring) complying with OGC Simple Feature Access standards. The parameter can be explicitly specified for an entire index using the geo_shape mapping by adding "orientation":{"left"\|"right"\|"cw"\|"ccw"\|"clockwise"\|"counterclockwise"} and/or overridden on each insert by adding the same parameter to the GeoJSON document. closes #8764	2014-12-22 12:09:45 -06:00
Adrien Grand	fb6c3b7c29	[Docs] Improve documentation of the new caching policy for filters.	2014-12-22 17:14:47 +01:00
Adrien Grand	ce11e0ee6d	Filter cache: add a `_cache: auto` option and make it the default. Up to now, all filters could be cached using the `_cache` flag that could be set to `true` or `false` and the default was set depending on the type of the `filter`. For instance, `script` filters are not cached by default while `terms` are. For some filters, the default is more complicated and eg. date range filters are cached unless they use `now` in a non-rounded fashion. This commit adds a 3rd option called `auto`, which becomes the default for all filters. So for all filters a cache wrapper will be returned, and the decision will be made at caching time, per-segment. Here is the default logic: - if there is already a cache entry for this filter in the current segment, then return the cache entry. - else if the doc id set cannot iterate (eg. script filter) then do not cache. - else if the doc id set is already cacheable and it has been used twice or more in the last 1000 filters then cache it. - else if the filter is costly (eg. multi-term) and has been used twice or more in the last 1000 filters then cache it. - else if the doc id set is not cacheable and it has been used 5 times or more in the last 1000 filters, then load it into a cacheable set and cache it. - else return the uncached set. So for instance geo-distance filters and script filters are going to use this new default and are not going to be cached because of their iterators. Similarly, date range filters are going to use this default all the time, but it is very unlikely that those that use `now` in a not rounded fashion will get reused so in practice they won't be cached. `terms`, `range`, ... filters produce cacheable doc id sets with good iterators so they will be cached as soon as they have been used twice. Filters that don't produce cacheable doc id sets such as the `term` filter will need to be used 5 times before being cached. This ensures that we don't spend CPU iterating over all documents matching such filters unless we have good evidence of reuse. One last interesting point about this change is that it also applies to compound filters. So if you keep on repeating the same `bool` filter with the same underlying clauses, it will be cached on its own while up to now it used to never be cached by default. `_cache: true` has been changed to only cache on large segments, in order to not pollute the cache since small segments should not be the bottleneck anyway. However `_cache: false` still has the same semantics. Close #8449	2014-12-18 15:51:36 +01:00
Michael McCandless	242e631e95	Core: ignore known idle threads by default in /_nodes/hot_threads Add a new ignore_idle_threads boolean option (default true) to /_nodes/hot_threads, to filter out threads in known idle places like waiting on a socket select or on pulling the next task from an empty queue. Closes #8985 Closes #8908	2014-12-17 11:59:31 -05:00
Yasir Bamarni	5059d6fe1c	Update percolate.asciidoc wrong type used in the -GET request Closes #8942	2014-12-17 14:05:27 +01:00
Pablo Díaz-López	adb1a5b43b	Update getting-started.asciidoc Missing -X flag at the curl template Closes #8977	2014-12-17 14:03:38 +01:00
Peter Johnson a.k.a. insertcoffee	4b5e6b2de0	[docs] pedantry Closes #8982	2014-12-17 13:46:39 +01:00
Nicholas Knize	ac0e37449e	Adding unit test for self intersecting polygons. Relevant to #7751 even/odd discussion Updating documentation to describe polygon ambiguity and vertex ordering.	2014-12-16 10:54:39 -06:00
Ryan Ernst	37287284e6	Settings: Remove `mapping.date.round_ceil` setting for date math parsing The setting `mapping.date.round_ceil` (and the undocumented setting `index.mapping.date.parse_upper_inclusive`) affect how date ranges using `lte` are parsed. In #8556 the semantics of date rounding were solidified, eliminating the need to have different parsing functions whether the date is inclusive or exclusive. This change removes these legacy settings and improves the tests for the date math parser (now at 100% coverage!). It also removes the unnecessary function `DateMathParser.parseTimeZone` for which the existing `DateTimeZone.forID` handles all use cases. Any user previously using these settings can refer to the changed semantics and change their query accordingly. This is a breaking change because even dates without datemath previously used the different parsing functions depending on context. closes #8598 closes #8889	2014-12-15 13:13:45 -08:00
Timothy Perisho	ceafde41e9	Docs: typo on "frequent" I replaced "high frequent terms" with "high frequency terms" and "low frequent terms" with "low frequency terms". Alternatively, we could write, "highly frequent terms" and "minimally frequent terms" (or just "rare terms"). Closes #8962	2014-12-15 19:59:50 +01:00
Clinton Gormley	fcb83055de	Update repositories.asciidoc Update formatting of PGP key	2014-12-15 18:04:17 +01:00
Simon Willnauer	1247774ff1	Remove Gateway abstraction We only have a single gatweway since es 1.3. There is no need to keep all these abstractsion and nested packages. We can fold most of it into simpler structures.	2014-12-15 15:53:02 +01:00
spapin	ad747ba67f	Docs: fix a typo in cluster stats documentation example Closes #8898	2014-12-15 14:14:38 +01:00
Ayush	23dbecf3e7	Update percolate.asciidoc Updating the `associated` spelling Closes #8907	2014-12-15 14:12:03 +01:00
Alexander Reelsen	544ef8cb17	Packaging: Add java7/8 java-package paths to debian init script If you use the java-package tool to create java packages, those paths also should be added to the debian init script. Also updated the docs, that it is ok to install java8. Closes #7383	2014-12-11 16:15:00 +01:00
Peter Fabian Mitchell	b2bab05c29	HTTP: Add 'http.publish_port' setting to the HTTP module This change adds a 'http.publish_port' setting to the HTTP module to configure the port which HTTP clients should use when communicating with the node. This is useful when running on a bridged network interface or when running behind a proxy or firewall. Closes #8807 Closes #8137	2014-12-11 16:10:07 +01:00
Robert Muir	a2ffe494ae	[core] add best_compression option for Lucene 5.0 Upgrades lucene to latest, and supports the BEST_COMPRESSION parameter now supported (with backwards compatibility, etc) in Lucene. This option uses deflate, tuned for highly compressible data. index.codec:: The default value compresses stored data with LZ4 compression, but this can be set to best_compression for a higher compression ratio, at the expense of slower stored fields performance. IMO its safest to implement as a named codec here, because ES already has logic to handle this correctly, and because its unrealistic to have a plethora of options to Lucene's default codec... we are practically limited in Lucene to what we can support with back compat, so I don't think we should overengineer this and add additional unnecessary plumbing. See also: https://issues.apache.org/jira/browse/LUCENE-5914 https://issues.apache.org/jira/browse/LUCENE-6089 https://issues.apache.org/jira/browse/LUCENE-6090 https://issues.apache.org/jira/browse/LUCENE-6100 Closes #8863	2014-12-10 22:13:09 -05:00
Alexander Clausen	633905161a	Docs: use https to download the gpg public key Closes #8818	2014-12-10 18:14:07 +01:00
Adam Menges	3a3030e217	Docs: Fix the wording for inner hits a bit Closes #8747	2014-12-09 13:36:26 +01:00
Ashraf Sarhan	24f8807cb5	Docs: Update repositories.asciidoc 1. Enable the repository using "add-apt-repository" to avoid this error "No command 'deb' found". 2. Adding "sudo" to update and install command. Closes #8691	2014-12-09 13:23:16 +01:00
Kevin Kluge	63ac4614f4	docs: add pgp key to repositories page	2014-12-08 15:41:09 +01:00
Jun Ohtani	d78d2ff93d	Docs: add randomizedtesting-runner to testing-framework.asciidoc Close #8450	2014-12-07 01:30:58 +09:00
Adrien Grand	344bbf2ced	Docs: Add instructions to start elasticsearch on bootup on RHEL/Fedora.	2014-12-05 11:14:13 +01:00
tristanbob	0a09f1ea13	Docs: Added a command to start elasticsearch on bootup on Debian. Close #8600	2014-12-05 11:03:32 +01:00
David Pilato	d2a2d1bb53	java: QueryBuilders cleanup: remove deprecated Related to #8667: Some QueryBuilders have been deprecated in 1.x branches. We removed them in 2.0. Removed ------- * `textPhrase(...)` * `textPhrasePrefix(...)` * `textPhrasePrefixQuery(...)` * `filtered(...)` * `inQuery(...)` * `commonTerms(...)` * `queryString(...)` * `simpleQueryString(...)` Closes #8721.	2014-12-03 16:07:34 +01:00
Peter Johnson a.k.a. insertcoffee	ac71f1b70a	[docs] formatting and general pedantry I'm not sure if the `distance-units` section is totally clear, when using the 'Geohash Cell Filter' and omitting a unit, the default is to interpret the integer as the 'length of the geohash prefix', not to default it to 'meter'. Maybe I'm being pedantic. Closes #8744	2014-12-02 19:23:48 +01:00
John Michael Luy	01ef80a33d	Update range-filter.asciidoc Closes #8741	2014-12-02 18:00:38 +01:00
John Michael Luy	f20f6ffe22	Docs: Update range-query.asciidoc Closes #8740	2014-12-02 12:55:44 +01:00
Martijn van Groningen	d7e224da04	Added `inner_hits` feature that allows to include nested hits. Inner hits allows to embed nested inner objects, children documents or the parent document that contributed to the matching of the returned search hit as inner hits, which would otherwise be hidden. Closes #8153 Closes #3022 Closes #3152	2014-12-02 12:01:01 +01:00
Itamar Syn-Hershko	cb042cd662	Fixing typo Closes #8713	2014-12-01 10:52:00 +01:00
Dan Tuffery	3b5fa9075a	Docs: Grammar correction Closes #8702	2014-11-29 14:06:04 +01:00
Clinton Gormley	88e06cba80	Update daterange-aggregation.asciidoc Clarified the date-math expressions on date range aggregations Closes #8703	2014-11-28 16:53:33 +01:00
Alex Ksikes	256712640f	MLT Query: Support for ignore docs Adds a `ignore_like` parameter to the MLT Query, which simply tells the algorithm to skip all the terms from the given documents. This could be useful in order to better guide nearest neighbor search by telling the algorithm to never explore the space spanned by the given `ignore_like` docs. In essence we are interested about the characteristic of a given item, but not of the ones provided by `ignore_like`, thereby forcing the algorithm to go deeper in its selection of terms. Note that this is different than simply performing a must not boolean query on the unliked items. The syntax is exactly the same as the `like` parameter. Closes #8674	2014-11-28 14:48:43 +01:00
pmamat	9e2eaeece4	Docs: Additional info about _score calculation Description taken from http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/multi-query-strings.html / 110_Multi_Field_Search/05_Multiple_query_strings.asciidoc Closes #8635	2014-11-28 13:54:45 +01:00
Britta Weber	59507cf793	function_score: match only document with score above custom score threshold functon_score matched each document regardless of the computed score. This commit adds a query parameter `min_score` (-Float.MAX_VALUE default). Documents that have a score lower than this threshold will not be mached. closes #6952	2014-11-28 12:35:26 +01:00
David Pilato	43a1435d3b	[Docs] fix consistency between examples	2014-11-27 20:29:34 +01:00
David Pilato	40f0e07db3	[Docs] Fix missing new line	2014-11-27 19:39:12 +01:00
Britta Weber	f00b431c18	[docs] explain default settings for parameters of decay functions relates to #8624	2014-11-27 19:18:55 +01:00
David Pilato	da27c2104a	[Docs] Fix missing comma in mapping	2014-11-27 11:03:19 +01:00
Clinton Gormley	818b9b7563	Updated docs to use v1.4.1 as current	2014-11-26 17:18:37 +01:00
Sebastian Ziebell	3a6c6f4b26	Docs: Adds documentation for indices.exists_template Closes: #8657	2014-11-25 19:36:01 +01:00
tristanbob	807f363d6d	Added note that ES packages automatically change vm.max_map_count Closes #8601	2014-11-25 18:25:46 +01:00
Matt Hughes	afba977e80	Docs: Added swift openstack repository Closes #8583	2014-11-25 13:49:15 +01:00
David Haney	2c429452e9	Typo: changed "5% or the real words" to "5% of the real words" Closes #8582	2014-11-25 13:15:33 +01:00
Michael McCandless	856b294441	Core: let Lucene kick off merges Today, Elasticsearch has a separate merge thread pool checking once per second (by default) if any merges are necessary, but this is no longer necessary since we can and do now tell Lucene's ConcurrentMergeScheduler never to "hard pause" threads when merges fall behind, since we do our own index throttling. This change goes back to letting Lucene launch merges as needed, and removes these two expert settings: index.merge.force_async_merge index.merge.async_interval Now merges kick off immediately instead of waiting up to 1 second before running. Closes #8643	2014-11-25 04:13:57 -05:00
Martijn van Groningen	1d7cdd7d22	Applied PR, changed the way defaults are handled and updated the docs. Closes #4452	2014-11-24 13:32:41 +01:00
Lee Hinman	45408844e7	Remove NoneGateway, NoneGatewayAllocator, & NoneGatewayModule Always use the LocalGateway* equivalents We already check in the LocalGateway whether a node is a client node, or is not master-eligible, and skip writing the state there. This allows us to remove this code that was previously used only for tribe nodes (which are not master eligible anyway and wouldn't write state) and in tests (which can shake more bugs out)	2014-11-24 12:22:05 +01:00
dw	ad408eee85	Docs: Reword note regarding _source for accuracy Previously it suggested _source was always present, when that is not the case. Closes #8491	2014-11-24 12:19:44 +01:00
Laurent Broudoux	feb465f26f	Docs: Update plugins.asciidoc on river plugins section Adding links to Amazon S3 and Google Drive river plugins Closes #8544	2014-11-24 12:15:12 +01:00
Michael McCandless	dfb6d6081c	Core: upgrade to current Lucene 5.0.0 snapshot Elasticsearch no longer unlocks the Lucene index on startup (this was dangerous, and could possibly lead to corruption). Added the new serbian_normalization TokenFilter from Lucene. NoLockFactory is no longer supported (index.store.fs.fs_lock = none), and if you have a typo in your fs_lock you'll now hit a StoreException instead of silently using NoLockFactory. Closes #8588	2014-11-24 05:08:42 -05:00
Adrien Grand	8346e92ebb	Core: Fix script fields to be returned as a multivalued field when they produce a list. This change is essentially the same as #3015 but on script fields. Close #8592	2014-11-24 09:41:16 +01:00
mdzor	bc52ccfd33	Docs: Update update-settings.asciidoc Inconsistent indentation Closes #8525	2014-11-23 14:45:56 +01:00
barbasa	fd6c41bfbf	Missing quote in the example	2014-11-23 14:03:58 +01:00
Alban Perillat-Merceroz	54466938da	Fix error in documentation Indexation does not fail if no timestamp provided when there is a default value defined in mapping.	2014-11-23 14:02:51 +01:00
dw	bb81055c33	Docs: Remove reference to imaginary "no_docs_query" No reference to it in the source code except this file. Closes #8566	2014-11-23 13:56:33 +01:00
Mariam Hakobyan	4a1ab6543c	Docs: Added new elasticsearch-river-kafka plugin to the documentation (uses latest version of Kafka, EL Bulk API, and supports concurrent requests) Closes #8518	2014-11-23 12:28:45 +01:00
Alex Ksikes	1959275622	Term Vectors: More consistent naming for term vector[s] We speak of the term vectors of a document, where each field has an associated stored term vector. Since by default we are requesting all the term vectors of a document, the HTTP request endpoint should rather be called `_termvectors` instead of `_termvector`. The usage of `_termvector` is now deprecated, as well as the transport client call to termVector and prepareTermVector. Closes #8484	2014-11-21 14:06:44 +01:00
Robert Muir	9ef69f9f36	Disable bloom filters. make the "es090" postings format read-only, just to support old segments. There is a test version that subclasses it with write-capability for testing. Closes #8571	2014-11-20 21:03:23 -05:00
Simon Willnauer	0fcb466555	[STORE] Remove `memory`/ `ram` store The RAM store is discuraged for production usage anyway and we don't test it in our randomized infrastructure. This commit removes it for `2.0`	2014-11-20 14:47:19 +01:00
javanna	06fafa3ed9	[DOCS] document that we support loading multiple logging conf files	2014-11-19 11:34:57 +01:00
Boaz Leskes	1e16375d04	Docs: Update execution hint docs for Significant terms agg copied over the relevant pieces from the terms agg Closes #8532	2014-11-18 20:54:26 +01:00
Olivier Favre	4d68d3d053	Provide more context variables in update scripts In addition to `_source`, the following variables are available through the `ctx` map: `_index`, `_type`, `_id`, `_version`, `_routing`, `_parent`, `_timestamp`, `_ttl`. Some of these fields are more useful still within the context of an Update By Query, see #1607, #2230, #2231.	2014-11-14 10:14:39 +01:00
Clinton Gormley	32fc657d71	Docs: Fixed a bad ref to docs-bulk-udp which no longer exists in master	2014-11-13 14:34:49 +01:00
Colin Goodheart-Smithe	353574d6af	Indices API: Fix GET index API always running all features Previous to this change all features (_alias,_mapping,_settings,_warmer) are run regardless of which features are actually requested. This change fixes the request object to resolve this bug	2014-11-13 13:22:46 +00:00
Clinton Gormley	6b05b229af	Docs: Changed breaking docs in master to correspond with 1.x for easier merging	2014-11-13 13:50:57 +01:00
Colin Goodheart-Smithe	34b37ab7f0	[DOCS] Added documentation for log4j-extras dependency	2014-11-13 12:40:14 +00:00
javanna	c1428b5964	[DOCS] Expand logging documentation Updated log4j link so it doesn't point to log4j 2.0 but version 1.2. Clarified which formats are supported and briefly explained what loggers and appenders are, plus added a link to the log4j docs. Closes #5305 Closes #8455	2014-11-13 11:08:10 +01:00
Joel Taddei	7e72800c83	[DOCS] Corrected syntax error in search curl cmd Closes #8447	2014-11-12 17:21:19 +01:00
Mark Walkom	bfd1bcd30a	Updated threadpool documentation to elaborate/clarify what the pools are for and their values Closes #8446	2014-11-12 22:33:38 +11:00
Israel Tsadok	7590629531	Docs: note about confusing disk threshold settings	2014-11-12 09:24:03 +01:00
Martijn van Groningen	94c1a7dabe	Docs: Fix incorrect documentation for the `index.query.parse.allow_unmapped_fields` setting. The `index.query.parse.allow_unmapped_fields` setting can't influence whether unmapped fields are allowed in alias filters and percolator queries.	2014-11-11 15:13:55 +00:00
Michael McCandless	8aebb9656b	Core: add max_determinized_states to query_string and regexp query/filter This prevents too-difficult regular expressions from consuming excessive RAM/CPU; the default max_determinized_states is 10,000 (same as Lucene) but query_string and regepx query/filter can override per-request. The also upgrades to a new Lucene 5.0.0 snapshot. Closes #8386 Closes #8357	2014-11-10 13:43:48 -05:00
Clinton Gormley	cff544dcc2	Docs: Removed old coming/added tags	2014-11-10 14:41:24 +01:00
Britta Weber	c5a4c1d6b4	[docs] add 2d vis for decay functions and parameters closes #8420	2014-11-10 10:56:41 +01:00
Veres Lajos	4059e4ac86	typo fixes - https://github.com/vlajos/misspell_fixer Closes #8323	2014-11-08 18:55:57 +01:00
Clinton Gormley	08aa715d2e	Update datehistogram-aggregation.asciidoc Clarified use of fractional time units in the date histo agg. Closes #7957	2014-11-08 17:49:34 +01:00
Clinton Gormley	b9149f836b	Docs: Improve the exists/missing filters documentation Closes #7274	2014-11-08 16:57:41 +01:00
Clinton Gormley	f5ad699284	Update multi-get.asciidoc Documented that the fields parameter can be passed in the query string. Closes #4006	2014-11-08 13:55:23 +01:00
Kevin Kluge	c473976e31	[docs] fix typo in getting-started Closes #8354	2014-11-06 10:57:56 +01:00
Robert Muir	610ce078fb	Upgrade master to lucene 5.0 snapshot This has a lot of improvements in lucene, particularly around memory usage, merging, safety, compressed bitsets, etc. On the elasticsearch side, summary of the larger changes: API changes: postings API became a "pull" rather than "push", collector API became per-segment, etc. packaging changes: add lucene-backwards-codecs.jar as a dependency. improvements to boolean filtering: especially ensuring it will not be slow for SparseBitSet. use generic BitSet api in plumbing so that concrete bitset type is an implementation detail. use generic BitDocIdSetFilter api for dedicated bitset cache, so there is type safety. changes to support atomic commits implement Accountable.getChildResources (detailed memory usage API) for fielddata, etc change handling of IndexFormatTooOld/New, since they no longer extends CorruptIndexException Closes #8347. Squashed commit of the following: commit d90d53f5f21b876efc1e09cbd6d63c538a16cd89 Author: Simon Willnauer <simonw@apache.org> Date: Wed Nov 5 21:35:28 2014 +0100 Make default codec/postings/docvalues format constants commit cb66c22c71cd304a36e7371b199a8c279908ae37 Merge: d4e2f6d `ad4ff43` Author: Robert Muir <rmuir@apache.org> Date: Wed Nov 5 11:41:13 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit d4e2f6dfe767a5128c9b9ae9e75036378de08f47 Merge: 4e5445c `4111d93` Author: Robert Muir <rmuir@apache.org> Date: Wed Nov 5 06:26:32 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 4e5445c775f580730eb01360244e9330c0dc3958 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 16:19:19 2014 -0500 FixedBitSet -> BitSet commit 9887ea73e8b857eeda7f851ef3722ef580c92acf Merge: 1bf8894 `fc84666` Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 15:26:25 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 1bf8894430de3e566d0dc5623b0cc28b0d674ebb Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 15:22:51 2014 -0500 remove nocommit commit a9c2a2259ff79c69bae7806b64e92d5f472c18c8 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:48:43 2014 -0500 turn jenkins red again commit 067baaaa4d52fce772c81654dcdb5051ea79139f Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:18:21 2014 -0500 unzip from stream commit 82b6fba33d362aca2313cc0ca495f28f5ebb9260 Merge: b2214bb `6523cd9` Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:10:59 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit b2214bb093ec2f759003c488c3c403c8931db914 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:09:53 2014 -0500 go back to my URL until we can figure out what is up with jenkins commit e7d614172240175a51f580aeaefb6460d21cede9 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 10:52:54 2014 -0500 try this jenkins commit 337a3c7704efa7c9809bf373152d711ee55f876c Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 16:17:49 2014 +0100 Rename temp-files under lock to prevent metadata reads while renaming commit 77d5ba80d0a76efa549dd753b9f114b2f2d2d29c Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 10:07:11 2014 -0500 continue to treat too-old/too-new as corruption for now commit 98d0fd2f4851bc50e505a94ca592a694d502c51c Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 09:24:21 2014 -0500 fix last nocommit commit 643fceed66c8caf22b97fc489d67b4a2a90a1a1c Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 14:46:17 2014 +0100 remove NoSuchDirectoryException commit 2e43c4feba05cfaf451df70f946c0930cbcc4557 Merge: 93826e4 `8163107` Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 14:38:00 2014 +0100 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 93826e4d56a6a97c2074669014af77ff519bde63 Merge: 7f10129 `44e24d3` Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 12:54:27 2014 +0100 Merge branch 'master' into enhancement/lucene_5_0_upgrade Conflicts: src/main/java/org/elasticsearch/index/store/DistributorDirectory.java src/main/java/org/elasticsearch/index/store/Store.java src/main/java/org/elasticsearch/indices/recovery/RecoveryStatus.java src/test/java/org/elasticsearch/index/store/DistributorDirectoryTest.java src/test/java/org/elasticsearch/index/store/StoreTest.java src/test/java/org/elasticsearch/indices/recovery/RecoveryStatusTests.java commit 7f10129364623620575c109df725cf54488b3abb Author: Adrien Grand <jpountz@gmail.com> Date: Tue Nov 4 11:32:24 2014 +0100 Fix TopHitsAggregator to not ignore the top-level/leaf collector split. commit 042fadc8603b997bdfdc45ca44fec70dc86774a6 Author: Adrien Grand <jpountz@gmail.com> Date: Tue Nov 4 11:31:20 2014 +0100 Remove MatchDocIdSet in favor of DocValuesDocIdSet. commit 7d877581ff5db585a674c95ac391ac78a0282826 Author: Adrien Grand <jpountz@gmail.com> Date: Tue Nov 4 11:10:08 2014 +0100 Make the and filter use the cost API. Lucene 5 ensured that cost() can safely be used, and this will have the benefit that the order in which filters are specified is not important anymore (only for slow random-access filters in practice). commit 78f1718aa2cd82184db7c3a8393e6215f43eb4a8 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 23:55:17 2014 -0500 fix previous eclipse import braindamage commit 186c40e9258ce32f22a9a714ab442a310b6376e0 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 22:32:34 2014 -0500 allow child queries to exhaust iterators again commit b0b1271305e1b6d0c4c4da51a3c54df1aa5c0605 Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 14:50:44 2014 -0800 Fix nocommit for mapping output. index_options will not be printed if the field is not indexed. commit ba223eb85e399c9620a347a983e29bf703953e7a Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 14:07:26 2014 -0800 Remove no commit for chinese analyzer provider. We should have a separate issue to address not using this provider on new indexes. commit ca554b03c4471797682b2fb724f25205cf040c4a Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 13:41:59 2014 -0800 Fix stop tests commit de67c4653ec47dee9c671390536110749d2bb05f Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 12:51:17 2014 -0800 Remove analysis nocommits, switching over to Lucene43*Filters for backcompat commit 50cae9bec72c25c33a1ab8a8931bccb3355171e2 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 15:32:25 2014 -0500 add ram accounting and TODO lazy-loading (its no worse than master, can be a followup improvement) for suggesters commit 7a7f0122f138684b312d0f0b03dc2a9c16c15f9c Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 15:11:26 2014 -0500 bump lucene version commit cd0cae5c35e7a9e049f49ae45431f658fb86676b Merge: 446bc09 `3c72073` Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 14:49:05 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 446bc09b4e8bf4602d3c252b53ddaa0da65cce2f Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 14:46:30 2014 -0500 remove hack commit a19d85a968d82e6d00292b49630ef6ff2dbf2f32 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 12:53:11 2014 -0500 dont create exceptions with circular references on corruption (will open a PR for this) commit 0beefb9e821d97c37e90ec556d81ac7b00369b8a Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 11:47:14 2014 -0500 temporarily add craptastic detector for this horrible bug commit e9f2d298bff75f3d1591f8622441e459c3ce7ac3 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 10:56:01 2014 -0500 add nocommit commit e97f1d50a91a7129650b8effc7a9ecf74ca0569a Merge: c57a3c8 `f1f50ac` Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 10:12:12 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit c57a3c8341ed61dca62eaf77fad6b8b48aeb6940 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 10:11:46 2014 -0500 fix nocommit commit dd0e77e4ec07c7011ab5f6b60b2ead33dc2333d2 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 09:54:09 2014 -0500 nocommit -> TODO, this is in much more places in the codebase, bigger issue commit 3cc3bf56d72d642059f8fe220d6f2fed608363e9 Author: Ryan Ernst <ryan@iernst.net> Date: Sat Nov 1 23:59:17 2014 -0700 Remove nocommit and awaitsfix for edge ngram filter test. commit 89f115245155511c0fbc0d5ee62e63141c3700c1 Author: Ryan Ernst <ryan@iernst.net> Date: Sat Nov 1 23:57:44 2014 -0700 Fix EdgeNGramTokenFilter logic for version <= 4.3, and fixed instanceof checks in corresponding tests to correctly check for reverse filter when applicable. commit 112df869cd199e36aab0e1a7a288bb1fdb2ebf1c Author: Robert Muir <rmuir@apache.org> Date: Sun Nov 2 00:08:30 2014 -0400 execute geo disjoint query/filter as intersects commit e5061273cc685f1252e9a3a9ae4877ec9bce7752 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:58:59 2014 -0400 remove chinese analyzer from docs commit ea1af11b8978fcc551f198e24fe21d52806993ef Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:29:00 2014 -0400 fix ram accounting bug commit 53c0a42c6aa81aa6bf81d3aa77b95efd513e0f81 Merge: e3bcd3c `6011a18` Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:16:29 2014 -0400 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit e3bcd3cc07a4957e12c7b3affc462c31290a9186 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:15:01 2014 -0400 fix url-email back compat (thanks ryan) commit 91d6b096a96c357755abee167098607223be1aad Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:11:26 2014 -0400 bump lucene version commit d2bb9568df72b37ec7050d25940160b8517394bc Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 20:33:07 2014 -0400 remove nocommit commit 1d049c471e19e5c457262c7399c5bad9e023b2e3 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 20:28:58 2014 -0400 fix eclipse to group org/com imports together: without this, its madness commit 09d8c1585ee99b6e63be032732c04ef6fed84ed2 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 14:27:41 2014 -0400 remove nocommit, if you dont liek it, print assembly and tell me how it can be better commit 8a6a294313fdf33b50c7126ec20c07867ecd637c Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 20:01:55 2014 +0100 Remove deprecated usage of DocIdSets.newDocIDSet. commit 601bee60543610558403298124a84b1b3bbd1045 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 14:13:18 2014 -0400 maybe one of these zillions of annotations will stop thread leaks commit 9d3f69abc7267c5e455aefa26db95cb554b02d62 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 14:05:39 2014 -0400 fix some analysis nocommits commit 312e3a29c77214b8142d21c33a6b2c2b151acf9a Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 18:28:45 2014 +0100 Remove XConstantScoreQuery/XFilteredQuery/ApplyAcceptedDocsFilter. commit 5a0cb9f8e167215df7f1b1fad11eec6e6c74940f Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 17:06:45 2014 +0100 Fix misleading documentation of DocIdSets.toCacheable. commit 8b4ef2b5b476fff4c79c0c2a0e4769ead26cf82b Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 17:05:59 2014 +0100 Fix CustomRandomAccessFilterStrategy to override the right method. commit d7a9a407a615987cfffc651f724fbd8795c9c671 Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 16:21:35 2014 +0100 Better handle the special case when there is a single SHOULD clause. commit 648ad389f07e92dfc451f345549c9841ba5e4c9a Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 15:53:38 2014 +0100 Cut over XBooleanFilter to BitDocIdSet.Builder. The idea is similar to what happened to Lucene's BooleanFilter. Yet XBooleanFilter is a bit more sophisticated and I had to slightly change the way it is implemented in order to make it work. The main difference with before is that slow filters are now applied lazily, so eg. if you have 3 MUST clauses, two with a fast iterator and the third with a slow iterator, the previous implementation used to apply the fast iterators first and then only check the slow filter for bits which were set in the bit set. Now we are computing a bit set based on the fast must clauses and then basically returning a BitsFilteredDocIdSet.wrap(bitset, slowClause). Other than that, BooleanFilter still uses the bitset optimizations when or-ing and and-ind filters. Another improvement is that BooleanFilter is now aware of the cost API. commit b2dad312b4bc9f931dc3a25415dd81c0d9deee08 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 10:18:53 2014 -0400 clear nocommit commit 4851d2091e744294336dfade33906c75fbe695cd Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 15:15:16 2014 +0100 cut over to RoaringDocIdSet commit ca6aec24a901073e65ce4dd6b70964fd3612409e Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:57:30 2014 +0100 make nocommit more explicit commit d0742ee2cb7a6c48b0bbb31580b7fbcebdb6ec40 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 09:55:24 2014 -0400 fix standardtokenizer nocommit commit 7d6faccafff22a86af62af0384838391d46695ca Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:54:08 2014 +0100 fix compilation commit a038a405c1ff6458ad294e6b5bc469e622f699d0 Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:53:43 2014 +0100 fix compilation commit 30c9e307b1f5d80e2deca3392c0298682241207f Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:52:35 2014 +0100 fix compilation commit e5139bc5a0a9abd2bdc6ba0dfbcb7e3c2e7b8481 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 09:52:16 2014 -0400 clear nocommit here commit 85dd2cedf7a7994bed871ac421cfda06aaf5c0a5 Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:46:17 2014 +0100 fix CompletionPostingsFormatTest commit c0f3781f616c9b0ee3b5c4d0998810f595868649 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 09:38:00 2014 -0400 add tests for these analyzers commit 51f9999b4ad079c283ae762c862fd0e22d00445f Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:10:26 2014 +0100 remove nocommit - this is not an issue commit fd1388fa03e622b0738601c8aeb2dbf7949a6dd2 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Fri Oct 31 14:07:01 2014 +0100 Remove redundant null check commit 3d6dd51b0927337ba941a235446b22e8cd500dc3 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Fri Oct 31 14:01:37 2014 +0100 Removed the work around to prevent p/c error when invoking #iterator() twice, because the custom query filter wrapper now doesn't transform the result to a cache doc id set any more. I think the transforming to a cachable doc id set in CustomQueryWrappingFilter isn't needed at all, because we use the DocIdSet only once and because of that is just slowed things down. commit 821832a537e00cd1216064b379df3e01d2911d3a Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:54:33 2014 +0100 one more nocommit commit 77eb9ea4c4ea50afb2680c29682ddcb3851a9d4f Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Fri Oct 31 13:52:29 2014 +0100 Remove cast commit a400573c034ed602221f801b20a58a9186a06eae Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:49:24 2014 +0100 fix stop filter commit 51746087cf8ec34c4d20aa05ba8dbff7b3b43eec Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:21:36 2014 +0100 fix changed semantics of FBS.nextSetBit to check for NO_MORE_DOCS commit 8d0a4e2511310f1293860823fe3ba80ac771bbe3 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 08:13:44 2014 -0400 do the bogus cast differently commit 46a5cc5732dea096c0c80ae5ce42911c9c51e44e Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:00:16 2014 +0100 I hate it but P/C now passes commit 580c0c2f82bbeacf217e594f22312b11d1bdb839 Merge: a9d3c00 `1645434` Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 06:54:31 2014 -0400 fix nocommit/classcast commit a9d3c004d62fe04989f49a897e6ff84973c06eb9 Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 08:49:31 2014 +0100 Update TODO. commit aa75af0b407792aeef32017f03a6f442ed970baa Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 19:18:25 2014 -0400 clear obselete nocommits from lucene bump commit d438534cf41fcbe2d88070e2f27c994625e082c2 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 18:53:20 2014 -0400 throw classcastexception when ES abuses regular filtercache for nested docs commit 2c751f3a8feda43ec127c34769b069de21f3d16f Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 18:31:34 2014 -0400 bump lucene revision, fix tests commit d6ef7f6304ae262bf6228a7d661b2a452df332be Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 22:37:58 2014 +0100 fix merge problems commit de9d361f88a9ce6bb3fba85285de41f223c95767 Merge: 41f6aab `f6b37a3` Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 22:28:59 2014 +0100 Merge branch 'master' into enhancement/lucene_5_0_upgrade Conflicts: pom.xml src/main/java/org/elasticsearch/Version.java src/main/java/org/elasticsearch/gateway/local/state/meta/MetaDataStateFormat.java commit 41f6aab388aa80c40b08a2facab2617576203a0d Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 17:48:46 2014 +0100 fix potiential NPE commit c4428b12e1ae838b91e847df8b4a8be7f49e10f4 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 17:38:46 2014 +0100 don't advance iterator in a match(doc) method commit 28ab948e99e3ea4497c9b1e468384806ba7e1790 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 17:34:58 2014 +0100 don't advance iterator in a match(doc) method commit eb0f33f6634fadfcf4b2bf7327400e568f0427bb Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 16:55:54 2014 +0100 fix GeoUtilsTest commit 7f711fe3eaf73b6c2268cf42d5a41132a61ad831 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 16:43:16 2014 +0100 Use a dedicated default index option if field type is not indexed by default commit 78e3f37ab779e3e1b25b45a742cc86ab5f975149 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 10:56:14 2014 -0400 disable this test with AwaitsFix to reduce noise commit 9a590f563c8e03a99ecf0505c92d12d7ab20d11d Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 09:38:49 2014 +0100 fix lucene version commit abe3ca1d8bb6b5101b545198f59aec44bacfa741 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 09:35:05 2014 +0100 fix AnalyzingCompletionLookupProvider to wrok with new codec API commit 464293b245852d60bde050c6d3feb5907dcfbf5f Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 00:26:00 2014 -0400 don't try to write stuff to tests class directory commit 031cc6c19f4fe4423a034b515f77e5a0e282a124 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 00:12:36 2014 -0400 AwaitsFix these known issues to reduce noise commit 4600d51891e35847f2d344247d6f915a0605c0d1 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 00:06:53 2014 -0400 openbitset lives on commit 8492bae056249e2555d24acd55f1046b66a667c4 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 23:42:54 2014 -0400 fixes for filter tests commit 31f24ce4efeda31f97eafdb122346c7047a53bf2 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 23:12:38 2014 -0400 don't use fieldcache commit 8480789942fdff14a6d2b2cd8134502fe62f20c8 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 23:04:29 2014 -0400 ancient index no longer supported commit 02e78dc7ebdd827533009f542582e8db44309c57 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 23:37:02 2014 +0100 fix more tests commit ff746c6df23c50b3f3ec24922413b962c8983080 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 23:08:19 2014 +0100 fix all mapper commit e4fb84b517107b25cb064c66f83c9aa814a311b2 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 22:55:54 2014 +0100 fix distributor tests and cut over to FileStore API commit 20c850e2cfe3210cd1fb9e232afed8d4ac045857 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 22:42:18 2014 +0100 use DOCS_ONLY if index=true and current options == null commit 44169c108418413cfe51f5ce23ab82047463e4c2 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 22:33:36 2014 +0100 Fix index=yes\|no settings in mappers commit a3c5f77987461a18121156ed345d42ded301c566 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 21:51:41 2014 +0100 fix several field mappers conversion from setIndexed to indexOptions commit df84d736908e88a031d710f98e222be68ae96af1 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 21:33:35 2014 +0100 fix SourceFieldMapper to be not indexed commit b2bf01d12a8271a31fb2df601162d0e89924c8f5 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 21:23:08 2014 +0100 Cut over to .liv files in store and corruption tests commit 619004df436f9ef05d24bef1b6a7f084c6b0ad75 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 17:05:52 2014 +0100 fix more tests commit b7ed653a8b464de446e00456bce0a89e47627c38 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 16:19:08 2014 +0100 [STORE] Add dedicated method to write temporary files Recovery writes temporary files which might not end up in the right distributor directories today. This commit adds a dedicated API that allows specifying the target file name in order to create the tempoary file in the correct directory. commit 7d574659f6ae04adc2b857146ad0d8d56ca66f12 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 10:28:49 2014 -0400 add some leniency to temporary bogus method commit f97022ea7c2259f7a5cf97d924c59ed75ab65b32 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 10:24:17 2014 -0400 fix MultiCollector bug commit b760533128c2b4eb10ad76e9689ef714293dd819 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:56:08 2014 +0100 CheckIndex is now closeable we need to close it commit 9dae9fb6d63546a6c2427be2a2d5c8358f5b1934 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:45:11 2014 +0100 s/Lucene51/Lucene50 commit 7aea9b86856a8c1b06a08e7c312ede1168af1287 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:42:30 2014 +0100 fix BloomFilterPostingsFormat commit 16fea6fe842e88665d59cc091e8224e8dc6ce08c Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:41:16 2014 +0100 fix some codec format issues commit 3d77aa97dd2c4012b63befef3f2ba2525965e8a6 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:30:43 2014 +0100 fix CodecTests commit 6ef823b1fde25657438ace1aabd9d552d6ae215e Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:26:47 2014 +0100 make it compile commit 9991eee1fe99435118d4dd42b297ffc83fce5ec5 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 09:12:43 2014 -0400 add an ugly hack for TopHitsAggregator for now commit 03e768a01fcae6b1f4cb50bcceec7d42977ac3e6 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:01:02 2014 +0100 cut over ES090PostingsFormat commit 463d281faadb794fdde3b469326bdaada25af048 Merge: 0f8740a `8eac79c` Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 08:30:36 2014 -0400 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 0f8740a782455a63524a5a82169f6bbbfc613518 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 01:00:15 2014 -0400 fix/hack remaining filter and analysis issues commit df534488569da13b31d66e581456dfd4b55156b9 Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 23:11:47 2014 -0400 fix ngrams / openbitset usage commit 11f5dc3b9887f4da80a0fa1818e1350b30599329 Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 22:42:44 2014 -0400 hack over sort comparators commit 4ebdc754350f512596f6a02770d223e9f5f7975a Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 21:27:07 2014 -0400 compiler errors < 100 commit 2d60c9e29de48ccb0347dd87f7201f47b67b83a0 Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 03:13:08 2014 -0400 clear some nocommits around ram usage commit aaf47fe6c0aabcfb2581dd456fc50edf871da758 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 12:27:34 2014 -0400 migrate fieldinfo handling commit ef6ed6d15d8def71cd880d97249678136cd29fe3 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 12:07:13 2014 -0400 more simple fixes commit f475e1048ae697dd9da5bd9da445102b0b7bc5b3 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 11:58:21 2014 -0400 more fielddata ram accounting fixes commit 16b4239eaa9b4262df258257df4f31d39f28a3a2 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 16:47:32 2014 +0100 add missing file commit 5b542fa2a6da81e36a0c35b8e891a1d8bc58f663 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 16:43:29 2014 +0100 cut over completion posting formats - still some nocommits commit ecdea49404c4ec4e1b78fb54575825f21b4e096e Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 11:21:09 2014 -0400 fielddata accountable fixes commit d43da265718917e20c8264abd43342069198fe9c Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 16:19:53 2014 +0100 cut over BloomFilterPostings to new API commit 29b192ba621c14820175775d01242162b88bd364 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 10:22:51 2014 -0400 fix more analyzers commit 74b4a0c5283e323a7d02490df469497c722780d2 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 09:54:25 2014 -0400 fix tests commit 554084ccb4779dd6b1c65fa7212ad1f64f3a6968 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:51:48 2014 +0100 maintain supressed exceptions on CorruptIndexException commit cf882d9112c5e8ef1e9f2b0f800f7aa59001a4f2 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:47:17 2014 +0100 commitOnClose=false commit ebb2a9189ab2f459b7c6c9985be610fd90dfe410 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:46:06 2014 +0100 cut over indexwriter closeing in InternalEngine commit cd21b3d4706f0b562bd37792d077d60832aff65f Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:38:10 2014 +0100 fix constant commit f93f900c4a1c90af3a21a4af5735a7536423fe28 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 09:50:49 2014 -0400 fix test commit a9a752940b1ab4699a6a08ba8b34afca82b843fe Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Mon Oct 27 09:26:18 2014 +0100 Be explicit about the index options commit d9ee815babd030fa2ceaec9f467c105ee755bf6b Author: Simon Willnauer <simonw@apache.org> Date: Sun Oct 26 20:03:44 2014 +0100 cut over store and directory commit b3f5c8e39039dd8f5caac0c4dd1fc3b1116e64ca Author: Robert Muir <rmuir@apache.org> Date: Sun Oct 26 13:08:39 2014 -0400 more test fixes commit 8842f2684e3606aae0860c27f7a4c53e273d47fb Author: Robert Muir <rmuir@apache.org> Date: Sun Oct 26 12:14:52 2014 -0400 tests manual labor commit c43de5aec337919a3fdc3638406dff17fc80bc98 Author: Robert Muir <rmuir@apache.org> Date: Sun Oct 26 11:04:13 2014 -0400 BytesRef -> BytesRefBuilder commit 020c0d087a2f37566a1db390b0e044ebab030138 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 15:53:37 2014 +0100 Moved over to BitSetFilter commit 48dd1b909e6c52cef733961c9ecebfe4f67109fe Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 15:53:11 2014 +0100 Left over Collector api change in ScanContext commit 6ec248ef63f262bcda400181b838fd9244752625 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 15:47:40 2014 +0100 Moved indexed() over to indexOptions != null or indexOptions == null commit 9937aebfd8546ae4bb652cd976b3b43ac5ab7a63 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 13:26:31 2014 +0100 Fixed many compile errors. Mainly around the breaking Collector api change in 5.0. commit fec32c4abc0e3309cf34260c8816305a6f820c9e Author: Robert Muir <rmuir@apache.org> Date: Sat Oct 25 11:22:17 2014 -0400 more easy fixes commit dab22531d801800d17a65dc7c9464148ce8ebffd Author: Robert Muir <rmuir@apache.org> Date: Sat Oct 25 09:33:41 2014 -0400 more progress commit 414767e9a955010076b0497cc4f6d0c1850b48d3 Author: Robert Muir <rmuir@apache.org> Date: Sat Oct 25 06:33:17 2014 -0400 more progress commit ad9d969fddf139a8830254d3eb36a908ba87cc12 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 24 14:28:01 2014 -0400 current state of fun commit 464475eecb0be15d7d084135ed16051f76a7e521 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 24 11:42:41 2014 -0400 bump to 5.0 snapshot	2014-11-05 15:48:51 -05:00
Clinton Gormley	4d3842311f	Docs: Updated ES/JVM versions	2014-11-05 12:41:22 +01:00
Adrien Grand	9ea25df649	Switch to murmurhash3 to route documents to shards. We currently use the djb2 hash function in order to compute the shard a document should go to. Unfortunately this hash function is not very sophisticated and you can sometimes hit adversarial cases, such as numeric ids on 33 shards. Murmur3 generates hashes with a better distribution, which should avoid the adversarial cases. Here are some examples of how 100000 incremental ids are distributed to shards using either djb2 or murmur3. 5 shards: Murmur3: [19933, 19964, 19940, 20030, 20133] DJB: [20000, 20000, 20000, 20000, 20000] 3 shards: Murmur3: [33185, 33347, 33468] DJB: [30100, 30000, 39900] 33 shards: Murmur3: [2999, 3096, 2930, 2986, 3070, 3093, 3023, 3052, 3112, 2940, 3036, 2985, 3031, 3048, 3127, 2961, 2901, 3105, 3041, 3130, 3013, 3035, 3031, 3019, 3008, 3022, 3111, 3086, 3016, 2996, 3075, 2945, 2977] DJB: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 900, 900, 900, 900, 1000, 1000, 10000, 10000, 10000, 10000, 9100, 9100, 9100, 9100, 9000, 9000, 0, 0, 0, 0, 0, 0] Even if djb2 looks ideal in some cases (5 shards), the fact that the distribution of its hashes has some patterns can raise issues with some shard counts (eg. 3, or even worse 33). Some tests have been modified because they relied on implementation details of the routing hash function. Close #7954	2014-11-04 16:32:42 +01:00
Clinton Gormley	5797682bd0	Update cluster.asciidoc - fix invalid asciidoc	2014-11-04 15:22:36 +01:00
Clinton Gormley	60eaeb5052	Update cluster.asciidoc Fixed asciidoc on cluster module page	2014-11-04 14:32:05 +01:00
Clinton Gormley	b0e5fb7823	Update zen.asciidoc Tidied up the "No master block" asciidoc	2014-11-04 14:27:22 +01:00
Martijn Laarman	82278bb7bc	[Aggregations] Meta data support This commit adds the ability to associate a bit of state with each individual aggregation. The aggregation response can be hard to stitch back together without having a reference to the aggregation request. In many cases this is not available, many json serializer frameworks cache types globally or have a static deserialisation override mechanism. In these cases making the original request available, if at all possible, would be a hack. The old facets returned `_type` which was just enough metadata to know what the originating facet type in the request was. This PR takes `_type` one step further by introducing ANY arbitrary meta data. This could be further <strike>ab</strike>used for instance by generic/automated aggregations that include UI state (color information, thresholds, user input states, etc) per aggregation.	2014-11-03 22:32:23 +01:00
Ryan Ernst	7ec31abbb7	Fix missing word in upgrade docs.	2014-11-03 11:44:41 -08:00
Alexander Reelsen	c04fa43587	Docs: Convert markdown to asciidoc in transport profile docs	2014-11-02 08:25:45 +01:00
Aarni Koskela	6011a18381	Docs: Add mention of `hyphenation_patterns_path` Refs ElasticSearch's HyphenationCompoundWordTokenFilterFactory.java. Closes #8305	2014-11-01 15:47:53 +01:00
Alexander Reelsen	5eeac2fdf6	Netty: Add HTTP pipelining support This adds HTTP pipelining support to netty. Previously pipelining was not supported due to the asynchronous nature of elasticsearch. The first request that was returned by Elasticsearch, was returned as first response, regardless of the correct order. The solution to this problem is to add a handler to the netty pipeline that maintains an ordered list and thus orders the responses before returning them to the client. This means, we will always have some state on the server side and also requires some memory in order to keep the responses there. Pipelining is enabled by default, but can be configured by setting the http.pipelining property to true\|false. In addition the maximum size of the event queue can be configured. The initial netty handler is copied from this repo https://github.com/typesafehub/netty-http-pipelining Closes #2665	2014-10-31 16:30:11 +01:00
Clinton Gormley	e56d85439c	Update search-template.asciidoc Clarified using the conditional clause template example as a string	2014-10-31 15:32:14 +01:00
Clinton Gormley	2569188d25	Update search-template.asciidoc Fixed asciidoc typo Closes #8308	2014-10-31 14:40:32 +01:00
astefan	4049154dbc	Docs: Document action.replication_type setting Document action.replication_type setting Closes #8290	2014-10-31 13:53:34 +01:00
cmpich	e57c8b0673	Docs: Update getting-started.asciidoc Closes #8195	2014-10-29 15:04:13 +01:00
cmpich	36462c0305	Docs: Update getting-started.asciidoc Closes #8194	2014-10-29 15:01:18 +01:00
Clinton Gormley	8f02c451b8	Update source-field.asciidoc very minor typofix Closes #8066	2014-10-29 14:51:05 +01:00
Alex Ksikes	35f55608cc	MLT Field Query: remove it from master The MLT field query is simply replaced by a MLT query set to specififc field. To simplify code maintenance we should deprecate it in 1.4 and remove it in 2.0. Closes #8238	2014-10-29 10:19:00 +01:00
Areek Zillur	96f1606cdc	Completion Suggester: Fix CompletionFieldMapper to correctly parse weight - Allows weight to be defined as a string representation of a positive integer closes #8090	2014-10-28 18:39:02 -04:00
Dmitriy Khvatov	71a90ab4fe	Docs: Update multi-get.asciidoc Duplicate word Closes #8228	2014-10-28 10:58:47 +01:00
tlrx	8c864cf3f6	Cat Recovery API: Reverting changes introduced with commit `e1c75bae87` Adding these 2 headers to the CAT Recovery made the CI tests hanging for a loooong time. Related to #8041	2014-10-27 20:49:58 +01:00
Zachary Tong	f5b2dfd052	Aliases: Throw exception if index is null or missing when creating an alias Fixes a bug where alias creation would allow `null` for index name, which thereby applied the alias to _all_ indices. This patch makes the validator throw an exception if the index is null. ```bash POST /_aliases { "actions": [ { "add": { "alias": "empty-alias", "index": null } } ] } ``` ```json { "error": "ActionRequestValidationException[Validation Failed: 1: Alias action [add]: [index] may not be null;]", "status": 400 } ``` The reason this bug wasn't caught by the existing tests is because the old test for nullness only validated against a cluster which had zero indices. The null index is translated into "_all", and since there are no indices, this fails because the index doesn't exist. So the test passes. However, as soon as you add an index, "_all" resolves and you get the situation described in the original bug report: null index is accepted by the alias, resolves to "_all" and gets applied to everything. The REST tests, otoh, explicitly tested this bug as a real feature and therefore passed. The REST tests were modified to change this behavior. Fixes #7863	2014-10-27 14:39:01 -04:00
Alex Ksikes	0be5c60bce	MLT Query: use ParseField#withAllDeprecated for percent_terms_to_match Also the parameter was deprecated but not removed so we keep it in the doc and mark it as deprecated ... Closes #8241	2014-10-27 17:35:06 +01:00
Alex Ksikes	991f3e2cd3	Docs: fix tags for dfs and new like parameter	2014-10-27 15:42:44 +01:00
Clinton Gormley	fbd0403a6f	Documented that HTTP pipelining is not supported	2014-10-27 14:49:48 +01:00
Adrien Grand	7ea490dfd1	Aggregations: Return the sum of the doc counts of other buckets. This commit adds a new field to the response of the terms aggregation called `sum_other_doc_count` which is equal to the sum of the doc counts of the buckets that did not make it to the list of top buckets. It is typically useful to have a sector called eg. `other` when using terms aggregations to build pie charts. Example query and response: ```json GET test/_search?search_type=count { "aggs": { "colors": { "terms": { "field": "color", "size": 3 } } } } ``` ```json { [...], "aggregations": { "colors": { "doc_count_error_upper_bound": 0, "sum_other_doc_count": 4, "buckets": [ { "key": "blue", "doc_count": 65 }, { "key": "red", "doc_count": 14 }, { "key": "brown", "doc_count": 3 } ] } } } ``` Close #8213	2014-10-27 12:11:26 +01:00
tlrx	e1c75bae87	Cat API: Add node name to _cat/recovery Add source_node and target_node fields to the recovery cat API. Also fixed and updated the documentation which was not complete concerning fields names. Closes #8041	2014-10-27 09:47:26 +01:00
Alex Ksikes	4da407a869	MLT Query: versatile 'like' parameter The MLT query has a lot of parameters. For example, a set of documents is specified with either `like_text`, `ids` or `docs`, with at least one parameter required. This commit groups all the document specification parameters under one called `like`. The syntax is described below and could easily be extended to allow for new means of specifying document input. The `like_text`, `ids` and `docs` parameters are deprecated. As a single piece text: { "query": { "more_like_this": { "like": "some text here" } } } As a single item: { "query": { "more_like_this": { "like": { "_index": "imdb", "_type": "movies", "_id": "88247" } } } } Or as a mixture of all: { "query": { "more_like_this": { "like": [ "Some random text ...", { "_index": "imdb", "_type": "movies", "_id": "88247" }, { "_index": "imdb", "_type": "movies", "doc": { "title": "Document with an artificial title!" } } ] } } } Closes #8039	2014-10-25 11:04:51 +02:00
David Pilato	62d8b7ab97	Docs: rolling upgrade process seems incorrect When reading the [rolling upgrade process](http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/setup-upgrade.html#rolling-upgrades), you can see that we wrote: * disable allocation * upgrade node1 * upgrade node2 * upgrade node3 * ... * enable allocation That won't work as after a node has been removed and restarted, no shard will be allocated anymore. So closing node2 and remaining nodes, won't help to serve index and search request anymore. We should write: * disable allocation * upgrade node1 * enable allocation * wait for shards being recovered on node1 * disable allocation * upgrade node2 * enable allocation * wait for shards being recovered on node2 * disable allocation * upgrade node3 * enable allocation * wait for shards being recovered on node3 * disable allocation * ... * enable allocation I think this documentation update should go in 1.3, 1.4, 1.x and master branches. Closes #8218 Closes #7973.	2014-10-24 16:45:42 +02:00
Marcin Mikosik	ed86d925cd	Docs: fixed typo in documentation Closes #8205	2014-10-24 15:27:31 +02:00
Simon Willnauer	d5c0a49620	[ROUTING] Add rebalance enabled allocation decider This commit adds the ability to enable / disable relocations on an entire cluster or on individual indices for either: * `primaries` - only primaries can rebalance * `replica` - only replicas can rebalance * `all` - everything can rebalance (default) * `none` - all rebalances are disabled similar to the allocation enable / disable functionality. Relates to #7288	2014-10-23 14:07:13 +02:00
Alex Ksikes	c13f5f21de	Term Vectors: support for distributed frequencies Adds distributed frequencies support for the Term Vectors API. A new parameter called `dfs` is introduced which defaults to `false`. Closes #8144	2014-10-23 13:59:59 +02:00
Clinton Gormley	a8b21f2cd5	Update update-settings.asciidoc Removed deprecated `cluster.routing.allocation.disable` settings	2014-10-22 12:46:33 +02:00
Clinton Gormley	2d0c440b09	Update cluster.asciidoc Fixed asciidoc syntax	2014-10-22 12:45:10 +02:00
Brian Kim	58086dd08b	Docs: missing quote fix missing quote Closes #8176	2014-10-21 12:52:12 +02:00
Peter Dyson	b984cb771f	Docs: Provide example of deleting a repository Example of deleting a repository with explanation that snapshots themselves are left untouched. Closes #8172	2014-10-21 10:10:03 +02:00
Andrei Kolosok	c31a783930	Docs: Update filtered-query.asciidoc Fix mistyping Closes #8167	2014-10-21 09:45:19 +02:00
Andrei Kolosok	92abfc8e24	Docs: Update minimum-should-match.asciidoc Add %-sign to examle in the last section Closes #8157	2014-10-21 09:43:55 +02:00
David Pilato	0ff61e1d6f	Add time_zone setting for query_string Query String query now supports a new `time_zone` option based on JODA time zones. When using a range on date field, the time zone is applied. ```json { "query": { "query_string": { "text": "date:[2012 TO 2014]", "timezone": "Europe/Paris" } } } ``` Closes #7880.	2014-10-20 19:09:45 +02:00
Adrien Grand	230c6684a9	Search: Remove partial fields. Partial fields have been deprecated since 1.0.0Beta1 in favor of _source filtering. They will be removed in 2.0.	2014-10-20 12:29:30 +02:00
Adrien Grand	f4ee3f25e4	Mappings: Store _timestamp by default. Storing `_timestamp` by default means that under the default configuration, you would have all the information you need in order to reindex into a different index. Close #8139	2014-10-20 12:17:26 +02:00
Clinton Gormley	4da62a33b5	Update plugins.asciidoc Added Vietnamese Analyser to plugins page Closes #6647	2014-10-20 11:56:01 +02:00
Ryan Grimm	74586e2867	Docs: Added 'd' to the list of supported units. Day was missing from the list of supported units in the date math section. Closes #8151	2014-10-19 21:24:28 +02:00

... 9 10 11 12 13 ...

1923 Commits