OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-09 14:34:43 +00:00

Author	SHA1	Message	Date
Boaz Leskes	1e1f8e6376	Validate a joining node's version with version of existing cluster nodes (#25770 ) When a node tries to join a cluster, it goes through a validation step to make sure the node is compatible with the cluster. Currently we validation that the node can read the cluster state and that it is compatible with the indexes of the cluster. This PR adds validation that the joining node's version is compatible with the versions of existing nodes. Concretely we check that: 1) The node's min compatible version is higher or equal to any node in the cluster (this prevents a too-new node from joining) 2) The node's version is higher or equal to the min compat version of all cluster nodes (this prevents a too old join where, for example, the master is on 5.6, there's another 6.0 node in the cluster and a 5.4 node tries to join). 3) The node's major version is at least as higher as the lowest node in the cluster. This is important as we use the minimum version in the cluster to stop executing bwc code for operations that require multiple nodes. If the nodes are already operating in "new cluster mode", we should prevent nodes from the previous major to join (even if they are wire level compatible). This does mean that if you have a very unlucky partition during the upgrade which partitions all old nodes which are also a minority / data nodes only, the may not be able to re-join the cluster. We feel this edge case risk is well worth the simplification it brings to BWC layers only going one way. Also, the node join validation can now selectively fail specific nodes (previously the entire batch was failed). This is an important preparation for a follow up PR where we plan to have a rejected joining node die with dignity.	2017-07-19 12:57:29 +02:00
Simon Willnauer	9882d2b9d3	Reduce the scope of `QueryRewriteContext` (#25787 ) Today we provide a lot of functionality on the `QueryRewriteContext` that we potentially don't have ie. if we rewrite on a coordinating node or when we percolating. This change moves most of the unnecessary shard level or index level services and dependencies to `QueryShardContext` instead.	2017-07-19 12:30:38 +02:00
Jason Tedor	4b18800df9	Fix handling of invalid error trace parameter If a request contains an invalid error trace parameter, we send a error on the channel. This should immediately abort any additional processing of the request but instead we march on, dispatch the request and subsequently send another message on the channel. The problem here is this means two writes on the channel which leads to the request being released twice ultimately raising in illegal reference count exception. This commit addresses this by performing an early return in the case that the request contained an invalid error trace parameter. Relates #25785	2017-07-19 18:07:11 +09:00
Jason Tedor	82f52b17e1	Remove timed latch await in listeners test This commit removes a timed latch await in a transport client listeners test. The problem with a timed wait here is that on an overloaded machine, the test can fail because the waiting thread was not unlatched quickly enough. This makes the test unnecessarily flaky. Instead, we should wait indefinitely and simply let the test fail by the test timeout if the latch is not counted down for some reason. Closes #25760	2017-07-19 16:51:27 +09:00
Jason Tedor	3d3d99557d	Expand migration note regarding default paths This commit expands on the migration note regarding the removal of default.path.data and default.path.logs to include a note that users that were relying on the defaults (the common case for path.logs), and they carry over their previous elasticsearch.yml configruation file, then they must add explicit values for path.data and path.logs.	2017-07-19 13:40:42 +09:00
Deb Adair	23c810b334	[DOCS] Changes xrefs to cross doc links to enable building GS "mini-docs"	2017-07-18 13:52:38 -07:00
Deb Adair	d9e55179f1	[DOCS] Adding index file for GS "mini book".	2017-07-18 13:44:08 -07:00
Jim Ferenczi	4cd9728f55	[Test] Make sure that QueryPhaseTests#testIndexSortScrollOptimization creates segments that can be early terminated	2017-07-18 19:30:15 +02:00
Christoph Büscher	e24af64de2	Add strict parsing of aggregation ranges (#25769 ) Currently we ignore unknown field names when parsing RangeAggregator.Range and GeoDistanceAggregationBuilder.Range from `range`, `date_range` or `geo_distance` aggregations. This can hide subtle errors in the query. This change makes parsing `ranges` stricter.	2017-07-18 18:31:04 +02:00
Boaz Leskes	c0e6dafcab	CombinedDeletionPolicy can't assert it has no commits when creating an index This is an appealing assertion, but there scenarios where it can happen under normal operations. For example, when an index is created it may run into an exception when the lucene files have already been created. The master will try to assign the shard to another node (it's empty, so no need to look for data) but if there is no other node, it will reassign it to the same node. At that point the deletion will get a list of existing commits (which it will typically delete).	2017-07-18 17:23:54 +02:00
Christoph Büscher	43bfe06759	[Docs] Add sorting and source filtering section to client docs (#25767 )	2017-07-18 16:58:46 +02:00
Luca Cavanna	5c5d723b86	Improve error message when aliases are not supported (#25728 ) With #23997 and #25268 we have changed put alias, delete alias, update aliases and delete index to not accept aliases. Instead concrete indices should be provided as their index parameter. This commit improves the error message in case aliases are provided, from an IndexNotFoundException (404 status code) with "no such index" message, to an IllegalArgumentException (400 status code) with "The provided expression [alias] matches an alias, specify the corresponding concrete indices instead." message. Note that there is no specific error message for the case where wildcard expressions match one or more aliases. In fact, aliases are simply ignored when expanding wildcards for such APIs. An error is thrown only when the expression ends up matching no indices at all, and allow_no_indices is set to false. In that case the error is still the generic "404 - no such index".	2017-07-18 15:40:17 +02:00
Clinton Gormley	ff4a2519f2	Update experimental labels in the docs (#25727 ) Relates https://github.com/elastic/elasticsearch/issues/19798 Removed experimental label from: * Painless * Diversified Sampler Agg * Sampler Agg * Significant Terms Agg * Terms Agg document count error and execution_hint * Cardinality Agg precision_threshold * Pipeline Aggregations * index.shard.check_on_startup * index.store.type (added warning) * Preloading data into the file system cache * foreach ingest processor * Field caps API * Profile API Added experimental label to: * Moving Average Agg Prediction Changed experimental to beta for: * Adjacency matrix agg * Normalizers * Tasks API * Index sorting Labelled experimental in Lucene: * ICU plugin custom rules file * Flatten graph token filter * Synonym graph token filter * Word delimiter graph token filter * Simple pattern tokenizer * Simple pattern split tokenizer Replaced experimental label with warning that details may change in the future: * Analysis explain output format * Segments verbose output format * Percentile Agg compression and HDR Histogram * Percentile Rank Agg HDR Histogram	2017-07-18 14:06:22 +02:00
Luca Cavanna	0d8b753325	IndexClosedException to return 400 rather than 403 (#25752 ) 403 can be confused with security. If an API doesn't support working against closed indices and closed indices are referred to in a request, that is a bad request, hence 400 is more appropriate.	2017-07-18 10:26:32 +02:00
Boaz Leskes	194f267110	TruncateTranslogIT.testCorruptTranslogTruncation should wait for replica to allocate The test checks if a file based or ops based recovery happened, but if the replica shard never finished recovering expectations are not met. Fixes #25761	2017-07-18 10:17:39 +02:00
Christoph Büscher	a6e3d356ed	Change parsing of numeric `to` and `from` parameters in `date_range` aggregation (#25376 ) Currently the `to` and `from` parameter in the `date_range` aggregation is not parsed with the correct date field format from the mappings or the aggregation if the argument is numeric, but always treated as a long value specifying `epoch_millis`. This leads to problems e.g. when the format is `epoch_second`, but the `to` and `from` are currently treated as millis. With this change, we interpret these parameters according to the `format` of the target field. If the `format` in the mappings is not compatible with numeric input values, a compatible `format` (e.g. `epoch_millis`, `epoch_second`) must be specified in the `date_range` aggregation itself, otherwise an error is thrown. #Closes #17920	2017-07-18 09:45:28 +02:00
Boaz Leskes	f347bd4a4e	await fix testCorruptTranslogTruncation	2017-07-18 09:15:17 +02:00
Jason Tedor	c63b7f8b0b	Stop disabling explicit GC The problem here is simple: when using direct buffers as in NIO, the JDK relies on explict GC invocataions to trigger cleaning up direct buffers; if such GCs do not occur and the direct buffer limit is reached, the JVM will throw an out of memory exception. With explicit GCs disabled, the JVM is neutered from explicitly cleaning up direct buffers in the act of reserving a new direct buffer and instead relies on a GC occurring for another reason. If such a GC never occurs, the JVM will OOM. This commit removes disabling of explicit GCs. Note that these explicit GCs only occur as a last ditch effort before going OOM when the JVM is trying to reserve more direct memory. This is a known issue, see for example: JDK-8142537. Relates #25759	2017-07-18 15:16:52 +09:00
Jim Ferenczi	c6d9456693	#25747 : Fix check of termVector with and without offsets	2017-07-17 19:46:42 +02:00
Christoph Büscher	56b1250a34	[Docs] Adding highlighting section to high level client docs (#25751 ) Adding a section about how to use highlighting in the SearchSourceBuilder and how to retrieve highlighted fragments from the SearchResponse.	2017-07-17 19:30:58 +02:00
Jim Ferenczi	41ea8fdcec	Picks offset source for the unified highlighter directly from the es mapping (#25747 ) This commit changes how the offset source is picked for each field using the es mapping rather than the underlying Lucene field infos. It's mandatory for large mappings where field infos retrieval can be costly (the global field infos is merged for each highlighted field in every hit by the Lucene impl). Fixes #25699	2017-07-17 19:10:46 +02:00
Lee Hinman	610ba7e427	Register data node stats from info carried back in search responses (#25430 ) * Register data node stats from info carried back in search responses This is part of #24915, where we now calculate the EWMA of service time for tasks in the search threadpool, and send that as well as the current queue size back to the coordinating node. The coordinating node now tracks this information for each node in the cluster. This information will be used in the future the determining the best replica a search request should be routed to. This change has no user-visible difference. * Move response time timing into ResponseListenerWrapper * Move ResponseListenerWrapper to ActionListener instead of SearchActionListener Also removes the logger * Move `requestIndex` back to private * De-guice-ify ResponseCollectorService \o/ * Undo all changes to SearchQueryThenFetchAsyncAction * Remove unneeded response collector from TransportSearchAction * Undo all changes to SearchDfsQueryThenFetchAsyncAction * Completely rewrite the inside of ResponseCollectorService's record keeping * Documentation and cleanups for ResponseCollectorService * Add unit test for collection of queue size and service time * Fix Guice construction error * Add basic unit tests for ResponseCollectorService * Fix version constant for the master merge * Fix test compilation after master merge * Add a test for node removal on cluster changed event * Remove integration test as there are now unit tests * Rename ResponseListenerWrapper -> SearchExecutionStatsCollector * Fix line-length * Make classes private and final where appropriate * Pass nodeId into SearchExecutionStatsCollector and use only ActionListener * Get nodeId from connection so searchShardTarget can be private * Remove threadpool from SearchContext, get it from IndexShard instead * Add missing import * Use BiFunction for responseWrapper rather than passing in collector service	2017-07-17 11:04:51 -06:00
Simon Willnauer	cb4eebcd6a	Make `index` in TermsLookup mandatory (#25753 ) This change removes the leniency of having a `null` index to fetch terms from in 6.0 onwards. This feature will be deprecated in the 5.x series and 6.0 nodes will require the index to be set. Closes #25750	2017-07-17 18:50:30 +02:00
Simon Willnauer	9ff259c260	Use concrete version for BWC checks in SearchTransportService (#25748 ) We used to compare agaisnt the min compatible version which is misleading since it might move over time and since we backported the `can_match` API entirely it's better to compare against a version constant.	2017-07-17 18:49:50 +02:00
Clinton Gormley	25a89e613a	Broke recipes into separate pages	2017-07-17 18:21:39 +02:00
Boaz Leskes	c0751c8650	deubg logging to TruncateTranslogIT To see what data paths are used.	2017-07-17 17:18:05 +02:00
Adrien Grand	949db39fad	Fix reproducibility of UUIDTests. Closes #25714	2017-07-17 15:43:28 +02:00
Adrien Grand	78a6c3427b	Optimize `terms` queries on `ip` addresses to use a `PointInSetQuery` whenever possible. (#25669 ) We can't do it in the general case because of prefix queries, but I believe this is mostly used in query strings and not in explicit `terms` queries. Closes #25667	2017-07-17 15:39:01 +02:00
Adrien Grand	264088f1c4	Deprecate the `_default_` mapping. (#25652 ) Now that indices cannot have types anymore, this feature does not buy anything anymore. Closes #25500	2017-07-17 15:37:59 +02:00
Jason Tedor	e9aa60dc9d	Skip shrink ignores template mapping in BWC tests This commit reverts some changes to the shrink API ignore template mapping REST test in favor of simply skipping the test for BWC purposes. The complexity here is due to deprecations and lacking the infrastructure to gracefully handle a situation like this.	2017-07-17 20:32:18 +09:00
Colin Goodheart-Smithe	7a401cd1d2	[TEST] skips shrink source mapping rest test This change skips the rest test in `rest-api-spec/test/indices.shrink/20_source_mapping.yml` as it currently fails because if we don’t expect the deprecation warning the normal rest tests fail because they get a warning they don’t expect but if we do expect the deprecation warning the mixed cluster tests fail because they don’t get a warning which they expected.	2017-07-17 12:24:07 +01:00
Jason Tedor	b1f8b75ac3	Fix warnings in shrink ignore templates test This commit fixes an issue with the REST test that the shrink API ignores templates. The problem is that we have to use a BWC version of the API (for the BWC tests) but this raises deprecation warnings. This commit adds an expectation for these deprecation warnings.	2017-07-17 18:25:37 +09:00
Boaz Leskes	7739aad1aa	Add testing around recovery to TruncateTranslogIT	2017-07-17 10:48:26 +02:00
Jason Tedor	f121cd3beb	Fix pre-6.0 response to unknown replication actions When sending replica requests for replication operations, we skip sending the request to pre-6.0 nodes for operations that such nodes would not be aware of (e.g., the background global checkpoint sync, or the primary/replica resync) since they would not know what to do with these requests. Yet, we simulate that we received responses from these nodes. Today, this is done by simulating that they sent us that their local checkpoint is unassigned sequence number. However, for pre-6.0 nodes we have introduced a special local checkpoint used in the global checkpoint tracker for such nodes and that is what we should use here too. This commit fixes this issue. Relates #25744	2017-07-17 17:47:48 +09:00
Simon Willnauer	2da79f2b5e	[TEST] Use 5.x compatible API in shrink tests	2017-07-17 09:45:49 +02:00
Jason Tedor	5b25b5d80a	Fix comment on shrink indices test This commit fixes a comment on a shrink indices test; the comment is wrong because the fix in question was applied starting 5.6.0.	2017-07-17 16:28:09 +09:00
Martijn van Groningen	8003171a0c	Move more token filters to analysis-common module The following token filters were moved: arabic_normalization, german_normalization, hindi_normalization, indic_normalization, persian_normalization, scandinavian_normalization, serbian_normalization, sorani_normalization, cjk_width and cjk_width Relates to #23658	2017-07-17 08:29:44 +02:00
Glen Smith	e9dfb2a215	Fix another simulate example in ingest docs When simulating an ingest pipeline against an existing pipeline, the _source field is required to wrap each doc. This commit fixes another example in the docs that is missing this. Relates #25743, relates e3a0c11239c3923f876ecbb310346aadadf1d902	2017-07-17 15:17:42 +09:00
Glen Smith	e3a0c11239	Fix simulate example in ingest docs When simulating an ingest pipeline against an existing pipeline, the _source field is required to wrap each doc. This commit fixes an example in the docs that is missing this. Relates #25742	2017-07-17 14:17:41 +09:00
Jason Tedor	fd98f7abc2	Adjust skip version for shrink index test This commit adjusts the skip version for a shrink index test that ensures that a shrunken index ignores templates; the version can be adjusted after the fix was backported targeting 5.6.0 and later. Relates #25380	2017-07-17 12:56:12 +09:00
Simon Willnauer	8364279b98	Prevent skipping shards if a suggest builder is present (#25739 ) Even if the query part can rewrite to match none we can't skip the suggest execution since it might yield results. Relates to #25658	2017-07-16 19:06:47 +02:00
Simon Willnauer	ccda0441e1	Bump BWC versions after #25658 backport to 5.6	2017-07-15 11:34:16 +02:00
Boaz Leskes	a6bea1bf97	testMockFailToSendNoConnectRule should wait for connection close to bubble up and disconnect the node #25521 changed channel closing to be handled async on anything but transport stop. This means it may take a while before calling `connection.close()` and the node being removed from the `connectedNodes` list (but the connection is immediately unusuable). Fixes #25686	2017-07-15 09:28:17 +02:00
Ryan Ernst	072402463b	Scripting: Remove search template actions (#25717 ) The dedicated search template put/get/delete actions are deprecated in 5.6. This commit removes them from 6.0.	2017-07-14 23:12:05 -07:00
javanna	2c38e93e96	[DOCS] Added note to high level client docs on version The alpha2 docs is built out of master which may make users think that the high level client was already released as part of alpha2 which it was not. This note should clarify that the client will be released with 6.0.0-beta1	2017-07-15 07:50:25 +02:00
Ryan Ernst	b1762d69b5	Setup: Change default heap to 1G (#25695 ) This commit changes the default heap size to 1 GB. Experimenting with elasticsearch is often done on laptops, and 1 GB is much friendlier to laptop memory. It does put more pressure on the gc, but the tradeoff is a smaller default footprint. Users running in production can (and should) adjust the heap size as necessary for their usecase.	2017-07-14 09:38:08 -07:00
Christoph Büscher	5387ed00d2	[Docs] Adding suggestion sections to high level client docs (#25724 ) This adds a section about how to add suggestions to the SearchSourceBuilder and how to retrieve them from a SearchResponse.	2017-07-14 18:33:28 +02:00
Yannick Welsch	8f0b357651	Let primary own its replication group (#25692 ) Currently replication and recovery are both coordinated through the latest cluster state available on the ClusterService as well as through the GlobalCheckpointTracker (to have consistent local/global checkpoint information), making it difficult to understand the relation between recovery and replication, and requiring some tricky checks in the recovery code to coordinate between the two. This commit makes the primary the single owner of its replication group, which simplifies the replication model and allows to clean up corner cases we have in our recovery code. It also reduces the dependencies in the code, so that neither RecoverySourceXXX nor ReplicationOperation need access to the latest state on ClusterService anymore. Finally, it gives us the property that in-sync shard copies won't receive global checkpoint updates which are above their local checkpoint (relates #25485).	2017-07-14 13:52:53 +02:00
Christoph Büscher	f809a12493	[Docs] Adding aggregation sections to high level client docs (#25707 ) This adds a section about how to add aggregations to the SearchSourceBuilder and how to retrieve them from a SearchRepsonse to the documentation for the high level rest client.	2017-07-14 12:47:47 +02:00
Bodecker DellaMaria	4f0dc5bf32	Mark filtered query example as not to be used (#25661 ) The Filtered Query has been deprecated in favour of the Bool Query with a filter context. However, this deleted page for the Filtered Query is often ranked highly in search results when searching for documentation on "filtered queries". Often people just copy the first code snippet they see, which in this case is the INCORRECT syntax (the correct syntax follows). I think reordering the examples would help avoid a lot of confusion (I have seen people make this same mistake 3 times now) Adding a comment to indicate that the first example shouldn't be used	2017-07-14 11:45:21 +02:00

1 2 3 4 5 ...

28256 Commits