OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	b4048b4e7f	Use CoveringQuery to select percolate candidate matches and extract all clauses from a conjunction query. When clauses from a conjunction are extracted the number of clauses is also stored in an internal doc values field (minimum_should_match field). This field is used by the CoveringQuery and allows the percolator to reduce the number of false positives when selecting candidate matches and in certain cases be absolutely sure that a conjunction candidate match will match and then skip MemoryIndex validation. This can greatly improve performance. Before this change only a single clause was extracted from a conjunction query. The percolator tried to extract the clause that was rarest (based on term length) in order to have fewer candidate queries selected in the first place. However, with this method there is still a very high chance that candidate query matches are false positives. This change also removes the influencing query extraction added via #26081 as this is no longer needed because now all conjunction clauses are extracted. https://www.elastic.co/guide/en/elasticsearch/reference/6.x/percolator.html#_influencing_query_extraction Closes #26307	2017-11-10 07:44:42 +01:00
Nicholas Knize	06ff92d237	Add ignore_malformed to geo_shape fields This commit adds ignore_malformed support to geo_shape field types to skip malformed geoJson fields. closes #23747	2017-11-09 17:59:05 -06:00
Dimitris Athanasiou	66bef26495	Aggregations: bucket_sort pipeline aggregation (#27152 ) This commit adds a parent pipeline aggregation that allows sorting the buckets of a parent multi-bucket aggregation. The aggregation also offers [from] and [size] parameters in order to truncate the result as desired. Closes #14928	2017-11-09 17:59:57 +00:00
Tal Levy	d22fd4ea58	Introduce templating support to timezone/locale in DateProcessor (#27089 ) Sometimes systems like Beats would want to extract the date's timezone and/or locale from a value in a field of the document. This PR adds support for mustache templating to extract these values. Closes #24024.	2017-11-09 09:45:32 -08:00
Tanguy Leroux	184dda9eb0	Update to AWS SDK 1.11.223 (#27278 )	2017-11-09 13:25:51 +01:00
Mayya Sharipova	abbe853f1e	Add limits for ngram and shingle settings (#27211 ) (#27318 ) Relates to #25887	2017-11-08 10:12:57 -05:00
Jay Greenberg	df5c8bb3bf	Update discovery-ec2.asciidoc Changed the recommendation to use Tribe Node to Cross Cluster Search.	2017-11-07 10:18:38 -05:00
Mayya Sharipova	148376c2c5	Add limits for ngram and shingle settings (#27211 ) * Add limits for ngram and shingle settings (#27211) Create index-level settings: max_ngram_diff - maximum allowed difference between max_gram and min_gram in NGramTokenFilter/NGramTokenizer. Default is 1. max_shingle_diff - maximum allowed difference between max_shingle_size and min_shingle_size in ShingleTokenFilter. Default is 3. Throw an IllegalArgumentException when trying to create NGramTokenFilter, NGramTokenizer, ShingleTokenFilter where difference between max_size and min_size exceeds the settings value. Closes #25887	2017-11-07 08:14:55 -05:00
Zachary Tong	6e9e07d6f8	Fix profiling naming issues (#27133 ) Some code-paths use anonymous classes (such as NonCollectingAggregator in terms agg), which messes up the display name of the profiler. If we encounter an anonymous class, we need to grab the super's name. Another naming issue was that ProfileAggs were not delegating to the wrapped agg's name for toString(), leading to ugly display. This PR also fixes up the profile documentation. Some of the examples were executing against empty indices, which shows different profile results than a populated index (and made for confusing examples). Finally, I switched the agg display names from the fully qualified name to the simple name, so that it's similar to how the query profiles work. Closes #26405	2017-11-06 16:37:33 -05:00
Shubham Aggarwal	5a925cd40c	Fixed references to Multi Index Syntax (#27283 )	2017-11-06 19:15:36 +01:00
Patrice Bourgougnon	4b7b1e2706	Add an active Elasticsearch WordPress plugin link (#27279 )	2017-11-06 18:13:27 +01:00
Boris Tyukin	8e9b30417c	Update to support bulk updates by query (#27172 ) Getting started doc stated that bulk updates by query are not supported but they are now	2017-11-06 17:32:20 +01:00
Boaz Leskes	a8ff4960f3	add split index reference in indices.asciidoc Relates to #26931	2017-11-06 12:55:41 +01:00
Simon Willnauer	bd7efa908a	Add ability to split shards (#26931 ) This change adds a new `_split` API that allows splitting indices into a new index with a power of two more shards than the source index. This API works alongside the `_shrink` API but doesn't require any shard relocation before indices can be split. The split operation is conceptually an inverse `_shrink` operation since we initialize the index with a _synthetic_ number of routing shards that are used for the consistent hashing at index time. Compared to indices created with earlier versions this might produce slightly different shard distributions but has no impact on the per-index backwards compatibility. For now, the user is required to prepare an index to be splittable by setting the `index.number_of_routing_shards` at index creation time. The setting allows the user to prepare the index to be splittable in factors of `index.number_of_routing_shards`, i.e. if the index is created with `index.number_of_routing_shards: 16` and `index.number_of_shards: 2` it can be split into `4, 8, 16` shards. This is an intermediate step until we can make this the default. This also allows us to safely backport this change to 6.x. The `_split` operation is implemented internally as a DeleteByQuery on the lucene level that is executed while the primary shards execute their initial recovery. Subsequent merges that are triggered due to this operation will not be executed immediately. All merges will be deferred until the shards are started and will then be throttled accordingly. This change is intended for the 6.1 feature release but will not support pre-6.1 indices to be split unless these indices have been shrunk before. In that case these indices can be split backwards into their original number of shards.	2017-11-06 11:37:55 +01:00
Pablo Musa	7b03d68f9f	[Docs] Fix minor paragraph indentation error for multiple Indices params (#25535 )	2017-11-06 10:20:20 +01:00
Nhat	c7ce5a07f2	Add size-based condition to the index rollover API (#27160 ) This is to add a max_size condition to the index rollover API. We use a totalSizeInBytes from DocsStats to evaluate this condition. Closes #27004	2017-11-04 19:51:48 -04:00
Loek van Gool	67e677f443	Add an example of dynamic field names (#27255 )	2017-11-03 23:20:58 +01:00
David Turner	fbf8c3ee83	Reinstate recommendation for ≥ 3 master-eligible nodes. (#27204 ) In the docs for 1.7 ([doc][doc-1.7], [src][src-1.7]) there was a recommendation for at least 3 master-eligible nodes "in critical clusters" but this was lost when that page was updated in 2.0 ([doc][doc-2.0], [src][src-2.0]). I'd like to reinstate this. [doc-1.7]: https://www.elastic.co/guide/en/elasticsearch/reference/1.7/modules-node.html [src-1.7]: `2cbaccb2f2/docs/reference/modules/node.asciidoc` [doc-2.0]: https://www.elastic.co/guide/en/elasticsearch/reference/2.0/modules-node.html#split-brain [src-2.0]: `4799009ad7/docs/reference/modules/node.asciidoc`	2017-11-03 08:48:48 +00:00
Yannick Welsch	7791e72626	Add additional explanations around discovery.zen.ping_timeout (#27231 ) Makes it clearer that this setting should only be changed with extra care.	2017-11-02 16:52:10 +01:00
Colin Goodheart-Smithe	c1b8140c83	Upgrade to Lucene 7.1 (#27225 )	2017-11-02 13:25:33 +00:00
Martijn van Groningen	d805c41b28	Added new terms_set query This query returns documents that match one or more of the provided terms. The number of terms that must match varies per document and is either controlled by a minimum should match field or computed per document in a minimum should match script. Closes #26915	2017-11-01 10:55:18 +01:00
Toby McLaughlin	b71f7d3559	Update Docker docs for 6.0.0-rc2 (#27166 ) * Update Docker docs for 6.0.0-rc2 * Update the docs to match the new Docker "image flavours" of "basic", "platinum", and "oss". * Clarifications for Openshift and bind-mounts * Bump docker-compose 2.x format to 2.2 * Combine Docker Toolbox instructions for setting vm.max_map_count for both macOS + Windows * devicemapper is not the default storage driver any more on RHEL	2017-11-01 14:24:30 +11:00
Igor Motov	d14486bce6	Docs: restore now fails if it encounters incompatible settings (#26933 ) This change was introduced in 5.0.0, but the documentation wasn't updated to reflect it. Closes #26453	2017-10-31 20:04:00 -04:00
javanna	506a2c276d	[DOCS] Link remote info API in Cross Cluster Search docs page Closes #26327	2017-10-31 15:24:46 +01:00
Shai Erera	bd0261916c	Fix Laplace scorer to multiply by alpha (and not add) (#27125 )	2017-10-31 13:08:44 +01:00
javanna	34666844b3	[DOCS] Clarify migrate guide and search request validation Relates to #26811	2017-10-31 12:36:00 +01:00
kel	c3e2bdf20c	Raise IllegalArgumentException if query validation failed (#26811 ) Closes #26799	2017-10-31 12:17:27 +01:00
Jim Ferenczi	792641a6e3	[Docs] #26541 : add warning regarding the limit on the number of fields that can be queried at once in the multi_match query.	2017-10-30 18:03:56 +01:00
Dimitrios Athanasiou	3796471ac4	[Docs] Fix note in bucket_selector	2017-10-30 15:20:46 +00:00
Clarkie	b1ce5cf836	[Docs] Fix indentation of examples (#27168 )	2017-10-30 11:56:38 +01:00
Jim Ferenczi	a4105c6b4a	[Docs] Clarify `span_not` query behavior for non-overlapping matches (#27150 ) Closes #27134	2017-10-30 11:29:40 +01:00
Christoph Büscher	8e62314ce4	[Docs] Remove first person "I" from getting started (#27155 ) Avoid first person style and consistently switch to an impersonal style in the getting started docs.	2017-10-30 10:45:50 +01:00
Holger Bartnick	aa03fb72b7	[Docs] Correct link target for datatype murmur3 (#27143 )	2017-10-30 09:31:55 +01:00
Jun Ohtani	77e11f6969	[Doc] Add Ingest CSV Processor Plugin to plugin as a community plugin (#27105 ) * [Doc] Add Ingest CSV Processor Plugin to plugin as a community plugin	2017-10-27 16:16:02 +09:00
Clinton Gormley	0499dc0873	Removed the beta tag from cross-cluster search	2017-10-27 08:51:36 +02:00
Martijn van Groningen	f1e944a675	docs: describe parent/child performances	2017-10-26 11:49:13 +02:00
Catalin Ursachi	8bf33241ed	Add Delete Index API support to high-level REST client (#27019 ) Relates to #25847	2017-10-26 09:52:46 +02:00
Loading Zhang	149e558dd5	Docs: Fix ingest geoip config location (#27110 )	2017-10-25 07:16:42 -07:00
markwalkom	2b864156ca	[Docs] Clarify mapping `index` option default (#27104 )	2017-10-25 12:42:29 +02:00
Luca Cavanna	8caf7d4ff8	Decouple BulkProcessor from ThreadPool (#26727 ) Introduce minimal thread scheduler as a base class for `ThreadPool`. Such a class can be used from the `BulkProcessor` to schedule retries and the flush task. This allows to remove the `ThreadPool` dependency from `BulkProcessor`, which requires to provide settings that contain `node.name` and also needed log4j for logging. Instead, it needs now a `Scheduler` that is much lighter and gets automatically created and shut down on close. Closes #26028	2017-10-25 10:30:23 +02:00
David Turner	559fc5a4de	Update numbers to reflect 4-byte UTF-8-encoded characters (#27083 ) You need 4 bytes for characters outside the BMP, which includes many emoji and a bunch of less-common writing characters too.	2017-10-24 09:50:47 +01:00
Martijn van Groningen	87c9b79b10	Return the _source of inner hit nested as is without wrapping it into its full path context Due to a change made via #26102 to make the nested source consistent with or without source filtering, the _source of a nested inner hit was always wrapped in the parent path. This turned out to be not ideal for users relying on the nested source, as it would require additional parsing on the client side. This change fixes this: the _source of nested inner hits is no longer wrapped by parent json objects, regardless of whether the _source is included as is or source filtering is used. Internally source filtering and highlighting rely on the fact that the _source of nested inner hits is accessible by its full field path, so in order to not break this, the conversion of the _source into its binary form is performed in FetchSourceSubPhase, after any potential source filtering is performed, to make sure the structure of the _source of the nested inner hit is consistent regardless of whether source filtering is performed. PR for #26944 Closes #26944	2017-10-19 12:04:56 +02:00
İsmail Arılık	71f5e2ce6b	Fix a typo. (#27043 ) `=== Instalation with Homebrew` should be `=== Installation with Homebrew`.	2017-10-18 09:46:53 -04:00
Divyum Rastogi	984731f36b	[DOCS] better formatting of ES cluster status (#26838 ) * better formatting of ES cluster status * change phrase missing data	2017-10-18 01:40:21 -06:00
Pius	400480e3b0	action.auto_create_index can be set as a dynamic cluster setting (#27026 ) Per https://github.com/elastic/elasticsearch/pull/20274, action.auto_create_index can be set as a dynamic cluster setting.	2017-10-17 20:44:18 +00:00
Anton Pozhidaev	70668dddf3	Update docs about `script` parameter (#27010 ) Added a description of short script form. Also removed references to the obsolete `script.default_lang`.	2017-10-16 05:04:43 -07:00
Simon Willnauer	8dda827ff4	Don't refresh on `_flush` `_force_merge` and `_upgrade` (#27000 ) Today all these API calls have a side effect of making documents visible to search requests. While this is sometimes desired it's an unnecessary side effect and now that we have an internal (engine-private) index reader (#26972) we artificially add a refresh call for bwc. This change removes this side effect in 7.0.	2017-10-16 10:16:35 +02:00
Jason Tedor	8eba1fa17c	Add docs on full_id parameter in cat nodes API This commit adds a note to the docs on the full_id parameter in the cat nodes API. This is a useful parameter but was not previously documented anywhere. Relates #27009	2017-10-13 13:49:25 -04:00
Jason Tedor	a7895839a0	Reformat paragraph in template docs to 80 columns This commit reformats a paragraph in the template docs to fit in 80 columns as for the rest of the doc, and as-is a standard that we loosely adhere to.	2017-10-12 17:52:43 -04:00
Pius	1125bc635c	Clarify settings and template on create index This commit clarifies the interaction between settings specified in a create index request, and those that would come from any templates that apply to the create index request. Relates #26994	2017-10-12 17:48:57 -04:00
agent5566	93a47cf860	Fix a typo in the similarity docs (#26970 )	2017-10-12 09:29:25 -07:00
Alexander Kazakov	592ab043dd	Change default value to true for transpositions parameter of fuzzy query (#26901 )	2017-10-11 15:31:48 +02:00
Deb Adair	875e582cc9	[DOCS] Fixed indentation of the definition list.	2017-10-10 12:08:21 -07:00
Nicolas Sierra	d6fc4affae	Clarify systemd overrides This commit clarifies how to apply an override to the systemd unit file for Elasticsearch. Relates #26950	2017-10-10 13:06:34 -04:00
Chris Earle	dcc6b426ec	[DOCS] Plugin Installation for Windows (#21671 ) This shows an example of how to install a plugin on Windows, which is not as obvious as I would have expected.	2017-10-10 09:31:44 -06:00
vurple	b3e9aa89dc	Add Homebrew instructions to getting started This commit adds instructions for installing Elasticsearch via Homebrew to the Getting Started guide. Relates #26847	2017-10-10 06:21:33 -04:00
Nhat	bf4c3642b2	remove _primary and _replica shard preferences (#26791 ) The shard preference _primary, _replica and its variants were useful for the asynchronous replication. However, with the current impl, they are no longer useful and should be removed. Closes #26335	2017-10-08 11:03:06 -04:00
shaulzorea	9db21cd23f	fixing typo in datehistogram-aggregation.asciidoc (#26924 )	2017-10-08 15:12:43 +02:00
Deb Adair	b57cb83567	[DOCS] Added info about snapshotting your data before an upgrade.	2017-10-06 12:14:26 -07:00
Adrien Grand	4e1ff8d086	Add documentation about disabling `_field_names`. (#26813 ) This field has significant index-time overhead. Closes #26779	2017-10-06 16:49:15 +02:00
Clinton Gormley	eb3ead6561	Update type-field.asciidoc Fixed asciidoc syntax on deprecated annotation	2017-10-06 11:57:27 +02:00
Jason Tedor	3176ba418e	Remove deploying in JBoss documentation This documentation is not maintained and we do not intend to maintain it. Thus, this commit removes this documentation. Relates #26902	2017-10-05 15:41:32 -04:00
Steve Kotsopoulos	dd95849b62	Document JVM option MaxFDLimit for macOS () This commit documents a JVM option that is needed on macOS when raising file descriptor limits there. Relates #26900	2017-10-05 14:56:15 -04:00
Md. Abdulla-Al-Sun	a40c474e10	Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238)	2017-10-05 13:25:05 +02:00
Martijn van Groningen	dca787ed8a	upgrade to Lucene 7.1.0 snapshot version	2017-10-05 09:06:56 +02:00
Alexander Kazakov	9c95e91471	Expose `fuzzy_transpositions` parameter in fuzzy queries (#26870 ) Add fuzzy_transpositions parameter to multi_match and query_string queries. Add fuzzy_transpositions, fuzzy_prefix_length and fuzzy_max_expansions parameters to simple_query_string query.	2017-10-05 09:01:09 +02:00
Jim Ferenczi	17b9baf5fd	Clarify pure wildcard matching with `query_string` (#26814 ) In 5.x pure wildcard queries `` in `query_string` are rewritten to `exists` query for efficiency. Though this introduced a change in the documents that match such queries because `exists` query also returns documents with an empty value for the field. This change clarifies this behavior for 5.x and beyond. Closes #26801 review	2017-10-04 09:55:26 +02:00
Shane Connelly	b33c444db5	Shows how to disable CCS from dedicated master/data (#26860 ) This is really just the last bit of the OSS component of https://github.com/elastic/elasticsearch/issues/25210	2017-10-03 06:15:30 -07:00
David Roberts	a292740b9e	Add cgroup memory usage/limit to OS stats on Linux (#26166 ) This change adds cgroup memory usage/limit to the OS stats section of the node stats on Linux. This information is useful because in Docker containers the standard node stats report the host memory limit, not taking account of extra restrictions that may have been applied to the container. The original idea was to store these values as Long, truncating any values outside the range of long. However, this meant that in the relatively common case of no limit being applied, users would not see the same value in the OS stats as they see by querying Linux directly. So instead the values are stored as String. This change places a burden on consumers of the strings to convert the strings to numbers and decide what to do about extremely large values, but there will be very few consumers and they would need to have a policy for dealing with "no limit" in any case.	2017-10-03 12:08:36 +01:00
markwalkom	dbea83a1d0	[Docs] Update length-tokenfilter.asciidoc (#26849 ) Made it clear what the numeric value of `Integer.MAX_VALUE` is.	2017-10-02 11:01:43 +02:00
Amine Daï	3cb99aeec1	Fix references to vm.max_map_count in Docker docs This commit fixes some references to vm.max_map_count in the Docker docs. Relates #26798	2017-09-29 15:56:18 -04:00
David Turner	8fe9a20982	Forbid negative values for index.unassigned.node_left.delayed_timeout (#26828 ) Change delayed_timeout to be a positiveTimeSetting, and add note that this is a breaking change	2017-09-29 14:44:43 +01:00
Jason Tedor	cfd6f35fc3	Add note to docs on /etc/elasticsearch ownership This commit adds a note to the docs for the RPM and Debian installation regarding the expected permissions for /etc/elasticsearch. Relates #26795	2017-09-27 09:22:52 -04:00
olcbean	6952f7b560	Validate top-level keys for create index request (#23755 ) (#23869 ) This commit ensures create index requests do not ignore unknown keys passed to the request. closes #23755	2017-09-26 09:49:20 -07:00
Jim Ferenczi	74473c1c3d	Early termination with index sorting should not set terminated_early in the response (#26597 ) Early termination with index sorting always returns the best top N in the response but sets the flag `terminated_early` in the response. This can be confusing because we use the same flag for `terminate_after` which on the contrary returns partial results. This change removes the flag when results are not partial (early termination due to index sorting) and keeps it only when `terminate_after` is used. Closes #26408	2017-09-26 11:37:11 +02:00
Christoph Büscher	6189c54c84	Reject the `index_options` parameter for numeric fields (#26668 ) Numeric fields no longer support the index_options parameter. This changes the parameter to be rejected in numeric field types after it was deprecated in 6.0. Closes #21475	2017-09-25 23:43:14 +02:00
Christoph Büscher	3827918417	Add configurable `maxTokenLength` parameter to whitespace tokenizer (#26749 ) Other tokenizers like the standard tokenizer allow overriding the default maximum token length of 255 using the `max_token_length` parameter. This change enables using this parameter also with the whitespace tokenizer. The range that is currently allowed is from 0 to StandardTokenizer.MAX_TOKEN_LENGTH_LIMIT, which is 1024 * 1024 = 1048576 characters. Closes #26643	2017-09-25 17:21:19 +02:00
javanna	dee2ae1023	[DOCS] Replace mention of string field type with text and keyword Closes #25713	2017-09-25 11:12:06 +02:00
Jason Tedor	d8bb413b1b	Configure heap dump path out of the box The JVM defaults to dumping the heap to the working directory of Elasticsearch. For the RPM and Debian packages, this location is /usr/share/elasticsearch. This directory is not writable by the elasticsearch user, so by default heap dumps in this situation are lost. This commit modifies the packaging for the RPM and Debian packages to set the heap dump path to /var/lib/elasticsearch as the default location for dumping the heap. This location is writable by the elasticsearch user by default. We add documentation of this important setting if /var/lib/elasticsearch is not suitable for receiving heap dumps. Relates #26755	2017-09-22 14:22:03 -04:00
Jason Tedor	2e63a13c0a	Upgrade to Log4j 2.9.1 This commit upgrades the Log4j dependency, picking up a fix for an issue with handling stack traces on JDK 9. Relates #26750	2017-09-22 11:57:06 -04:00
Yannick Welsch	df5c450e89	Add v6.1 BWC layer for adding wait_for_active_shards to index open command This commit disables BWC tests while adding a v6.1 BWC layer for the PR #26682	2017-09-22 16:30:07 +02:00
Alexander Kazakov	ff737a880c	Add wait_for_active_shards parameter to index open command (#26682 ) Adds the wait_for_active_shards parameter to the index open command. Similar to the index creation command, the index open command will now, by default, wait until the primaries have been allocated. Closes #20937	2017-09-22 11:15:03 +02:00
Jason Tedor	e0db89bc35	Upgrade to Lucene 7.0.0 This commit upgrades to the GA release of Lucene 7! Relates #26744	2017-09-21 19:19:33 -04:00
wasserman	67845134de	[Docs] Fixed typo of configuration (#25058 )	2017-09-21 16:49:00 +02:00
kel	601be4f83e	Add azure storage endpoint suffix #26432 (#26568 ) Allow specifying azure storage endpoint suffix for an azure client.	2017-09-20 22:26:19 -07:00
lcawley	06551a8549	[DOCS] Added index-shared4 and index-shared5.asciidoc	2017-09-20 10:54:26 -07:00
Tahmim Ahmed Shibli	34662c9e6d	[Docs] Fix name of character filter in example. (#26724 )	2017-09-20 17:08:43 +02:00
Christoph Büscher	86b00b84bc	Remove parse field deprecations in query builders (#26711 ) The `fielddata` field and the use of the `_name` field in the short syntax of the range query have been deprecated in 5.0 and can be removed. The same goes for the deprecated `score_mode` field in HasParentQueryBuilder, the deprecated `like_text`, `ids` and `docs` parameter in the `more_like_this` query, the deprecated query name in the short version of the `regexp` query, and several deprecated alternative field names in other query builders.	2017-09-20 16:22:21 +02:00
Tanguy Leroux	c16c653c3e	[Test] Fix reference/cat/allocation/line_8 test failure In this test, 260b is replaced by the regexp \d+b but the test sometimes produces results like 1.1kb so this commit adapts the regexp to match values with decimals	2017-09-18 10:46:19 +02:00
Peter Dyson	1f9e0fd0dd	[Docs] improved description for fs.total.available_in_bytes (#26657 )	2017-09-18 16:56:19 +10:00
Dimitrios Liappis	b789ce737b	Docs: Use single-node discovery.type for dev example For the single node, dev example, the `discovery.type=single-node`[1],[2] is a perfect fit and makes the example shorter and more self explanatory. Also expose the transport port, to help with dev use-cases using the transport client. [1] https://github.com/elastic/elasticsearch/pull/23595 [2] https://github.com/elastic/elasticsearch/pull/23598 Relates #26289	2017-09-15 16:14:47 +03:00
Tanguy Leroux	7f74a620a1	[Docs] Add description for missing fields in Reindex/Update/Delete By Query (#26618 ) This commit adds some missing description for some fields in the Reindex/UBQ/DBQ responses.	2017-09-15 11:23:57 +02:00
markwalkom	3d5f70790a	[Docs] Update ingest.asciidoc (#26599 ) Added a brief note to clarify where configured pipelines are stored (cluster state).	2017-09-15 11:15:31 +02:00
lcawley	120ddd99c3	[DOCS] Remove edit link from ML node	2017-09-14 16:18:29 -07:00
Michael Basnight	f385e0cf26	Add bad_request to the rest-api-spec catch params (#26539 ) This adds another request to the catch params. It also makes sure that the generic request param does not allow 400 either.	2017-09-14 14:24:03 -05:00
Boaz Leskes	1ca0b5e9e4	Introduce a History UUID as a requirement for ops based recovery (#26577 ) The new ops based recovery, introduced as part of #10708, is based on the assumption that all operations below the global checkpoint known to the replica do not need to be synced with the primary. This is based on the guarantee that all ops below it are available on primary and they are equal. Under normal operations this guarantee holds. Sadly, it can be violated when a primary is restored from an old snapshot. At that point the restored primary can miss operations below the replica's global checkpoint, or even worse may have totally different operations at the same spot. This PR introduces the notion of a history uuid to be able to capture the difference with the restored primary (in a follow up PR). The History UUID is generated by a primary when it is first created and is synced to the replicas which are recovered via a file based recovery. The PR adds a requirement to ops based recovery to make sure that the history uuid of the source and the target are equal. Under normal operations, all shard copies will stay with that history uuid for the rest of the index lifetime and thus this is a noop. However, it gives us a place to guarantee we fall back to file based syncing in special events like a restore from snapshot (to be done as a follow up) and when someone calls the truncate translog command which can go wrong when combined with primary recovery (this is done in this PR). We considered in the past to use the translog uuid for this function (i.e., sync it across copies) and thus avoid adding an extra identifier. This idea was rejected as it removes the ability to verify that a specific translog really belongs to a specific lucene index. We also feel that having a history uuid will serve us well in the future.	2017-09-14 21:25:02 +03:00
Bernd	59600dfe2d	[Docs] Correct typo in removal_of_types.asciidoc (#26646 )	2017-09-14 15:34:07 +02:00
Christoph Büscher	c7c6443b10	[Docs] "The the" is a great band, but ... (#26644 ) Removing several occurrences of this typo in the docs and javadocs, seems to be a common mistake. Corrections turn up once in a while in PRs, better to correct some of this in one sweep.	2017-09-14 15:08:20 +02:00
Daniel A. Ochoa	914416e9f4	[Docs] Update link in removal_of_types.asciidoc (#26614 ) Fix link to [parent-child relationship].	2017-09-14 10:11:03 +02:00
Jim Ferenczi	401f4ba2ce	Fix percolator highlight sub fetch phase to not highlight query twice (#26622 ) * Fix percolator highlight sub fetch phase to not highlight query twice The PercolatorHighlightSubFetchPhase does not override hitExecute and since it extends HighlightPhase the search hits are highlighted twice (by the highlight phase and then by the percolator). This does not alter the results, the second highlighting just overrides the first one, but this slows down the request because it duplicates the work.	2017-09-14 09:31:14 +02:00
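
The shard arithmetic described in commit bd7efa908a (`_split`) can be checked with a small sketch. This is not Elasticsearch code: the function name `valid_split_targets` is hypothetical, and the rule it encodes (repeatedly double the shard count while the result still divides `index.number_of_routing_shards`) is an assumption read from the commit message's "power of two more shards" wording and its `16` / `2` example.

```python
from typing import List


def valid_split_targets(number_of_routing_shards: int, number_of_shards: int) -> List[int]:
    """Return the shard counts an index could be split into.

    Assumption (from the commit message, not from Elasticsearch source):
    each split doubles the shard count, and a target is valid only if it
    evenly divides the number of routing shards.
    """
    if number_of_shards <= 0 or number_of_routing_shards <= 0:
        raise ValueError("shard counts must be positive")
    if number_of_routing_shards % number_of_shards != 0:
        raise ValueError("number_of_shards must be a factor of number_of_routing_shards")

    targets = []
    candidate = number_of_shards * 2
    while candidate <= number_of_routing_shards:
        # Keep only targets that still divide the routing shard count.
        if number_of_routing_shards % candidate == 0:
            targets.append(candidate)
        candidate *= 2
    return targets


# The example from the commit message: 16 routing shards, 2 primary shards.
print(valid_split_targets(16, 2))  # [4, 8, 16]
```

An index created without headroom (for example, routing shards equal to the shard count) yields an empty list under this sketch, which matches the commit's note that the index must be prepared at creation time to be splittable.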

4714 Commits