OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	b83e14858a	Correcting some minor typos in comments	2017-12-07 16:39:23 +01:00
Tim Brooks	5b3230cbae	Fix issue where the incorrect buffers are written (#27695 ) This is a followup to #27551. That commit introduced a bug where the incorrect byte buffers would be returned when we attempted a write. This commit fixes the logic.	2017-12-06 20:57:46 -07:00
Tim Brooks	2aa62daed4	Introduce resizable inbound byte buffer (#27551 ) This is related to #27563. In order to interface with java nio, we must have buffers that are compatible with ByteBuffer. This commit introduces a basic ByteBufferReference to easily allow transferring bytes off the wire to usage in the application. Additionally it introduces an InboundChannelBuffer. This is a buffer that can internally expand as more space is needed. It is designed to be integrated with a page recycler so that it can internally reuse pages. The final piece is moving all of the index work for writing bytes to a channel into the WriteOperation.	2017-12-06 11:02:25 -07:00
Jim Ferenczi	caea6b70fa	Add a new cluster setting to limit the total number of buckets returned by a request (#27581 ) This commit adds a new dynamic cluster setting named `search.max_buckets` that can be used to limit the number of buckets created per shard or by the reduce phase. Each multi bucket aggregator can consume buckets during the final build of the aggregation at the shard level or during the reduce phase (final or not) in the coordinating node. When an aggregator consumes a bucket, a global count for the request is incremented and if this number is greater than the limit an exception is thrown (TooManyBuckets exception). This change adds the ability for multi bucket aggregator to "consume" buckets in the global limit, the default is 10,000. It's an opt-in consumer so each multi-bucket aggregator must explicitly call the consumer when a bucket is added in the response. Closes #27452 #26012	2017-12-06 09:15:28 +01:00
Luca Cavanna	f4fb4d3bf5	Add support for filtering mappings fields (#27603 ) Add support for filtering fields returned as part of mappings in get index, get mappings, get field mappings and field capabilities API. Plugins can plug in their own function, which receives the index as argument, and return a predicate which controls whether each field is included or not in the returned output.	2017-12-05 20:31:29 +01:00
Jason Tedor	42a4ad35da	Add node name to thread pool executor name This commit adds the node name to the names of thread pool executors so that the node name is visible in rejected execution exception messages. Relates #27663	2017-12-05 07:45:40 -05:00
Lee Hinman	1ff5ef9055	[TEST] Check accounting breaker is equal to segment stats rather than 0 If there are existing indices, it may not be 0	2017-12-04 14:15:23 -07:00
Simon Willnauer	84ec472428	Include internal refreshes in refresh stats (#27615 ) Today we exclude internal refreshes in the refresh stats. Yet, it's very much confusing to not take these into account. This change includes internal refreshes into the stats until we have a dedicated stats for this.	2017-12-04 16:33:47 +01:00
Boaz Leskes	f58a3d0b96	testRelocationWithConcurrentIndexing: wait for green (on relevant index) and shard initialization to settle down before starting relocation	2017-12-04 13:18:42 +01:00
Boaz Leskes	1a976ea7a4	Cherry pick tests and seqNo recovery hardening from #27580	2017-12-04 13:15:40 +01:00
Lee Hinman	623d3700f0	Add accounting circuit breaker and track segment memory usage (#27116 ) * Add accounting circuit breaker and track segment memory usage This commit adds a new circuit breaker "accounting" that is used for tracking the memory usage of non-request-tied memory users. It also adds tracking for the amount of Lucene segment memory used by a shard as a user of the new circuit breaker. The Lucene segment memory is updated when the shard refreshes, and removed when the shard relocates away from a node or is deleted. It should also be noted that all tracking for segment memory uses `addWithoutBreaking` so as not to fail the shard if a limit is reached. The `accounting` breaker has a default limit of 100% and will contribute to the parent breaker limit. Resolves #27044	2017-12-01 07:59:45 -07:00
Luca Cavanna	3e8ca38fca	Deprecate the transport client in favour of the high-level REST client (#27085 )	2017-12-01 12:24:16 +01:00
Tim Brooks	b8557651aa	Add exception handling for write listeners (#27590 ) This potential issue was exposed when I saw this PR #27542. Essentially we currently execute the write listeners all over the place without consistently catching and handling exceptions. Some of these exceptions will be logged in different ways (including as low as `debug`). This commit adds a single location where these listeners are executed. If the listener throws an exception, the exception is caught and logged at the `warn` level.	2017-11-29 15:47:12 -07:00
David Turner	00867e618d	Transpose expected and actual, and remove duplicate info from message. (#27515 ) Previously: ``` > Throwable #1: java.lang.AssertionError: Expected all shards successful but got successful [8] total [9] > Expected: <8> > but: was <9> ``` Now: ``` > Throwable #1: java.lang.AssertionError: Expected all shards successful > Expected: <9> > but: was <8> ```	2017-11-24 17:45:34 +00:00
Tanguy Leroux	5dc5580eac	Delete shard store files before restoring a snapshot (#27476 ) Pull request #20220 added a change where the store files that have the same name but are different from the ones in the snapshot are deleted first before the snapshot is restored. This logic was based on the `Store.RecoveryDiff.different` set of files which works by computing a diff between an existing store and a snapshot. This works well when the files on the filesystem form a valid shard store, i.e. there's a `segments` file and store files are not corrupted. Otherwise, the existing store's snapshot metadata cannot be read (using Store#snapshotStoreMetadata()) and an exception is thrown (CorruptIndexException, IndexFormatTooOldException etc) which is later caught at the beginning of the restore process (see RestoreContext#restore()) and is translated into an empty store metadata (Store.MetadataSnapshot.EMPTY). This will make the deletion of different files introduced in #20220 useless as the set of files will always be empty even when store files exist on the filesystem. And if some files are present within the store directory, then restoring a snapshot with files with same names will fail with a FileAlreadyExistsException. This is part of the #26865 issue. There are various cases where some files could exist in the store directory before a snapshot is restored. One that Igor identified is a restore attempt that failed on a node and only first files were restored, then the shard is allocated again to the same node and the restore starts again (but fails because of existing files). Another one is when some files of a closed index are corrupted / deleted and the index is restored. This commit adds a test that uses the infrastructure provided by IndexShardTestCase in order to test that restoring a shard succeeds even when files with same names exist on filesystem. Related to #26865	2017-11-24 13:15:34 +01:00
Martijn van Groningen	f1ebf366bf	unmuted test, this has been fixed by #27397 Closes #27497	2017-11-24 08:53:00 +01:00
David Turner	89ba8996c6	Consolidate version numbering semantics (#27397 ) Fixes to the build system, particularly around BWC testing, and to make future version bumps less painful.	2017-11-23 20:21:53 +00:00
Martijn van Groningen	ca9c476d88	muted test	2017-11-22 19:18:35 +01:00
Tim Brooks	ef34555b29	Decouple nio constructs from the tcp transport (#27484 ) This is related to #27260. Currently, basic nio constructs (nio channels, the channel factories, selector event handlers, etc) implement logic that is specific to the tcp transport. For example, NioChannel implements the TcpChannel interface. These nio constructs at some point will also need to support other protocols (ex: http). This commit separates the TcpTransport logic from the nio building blocks.	2017-11-22 11:39:31 -06:00
Jim Ferenczi	6319424e4a	Move composite aggregation to core (#27474 ) This change removes the module named aggs-composite and adds the `composite` aggs as a core aggregation. This allows other plugins to use this new aggregation and simplifies the integration in the HL rest client.	2017-11-21 13:31:01 +01:00
Tim Brooks	f37eb1b403	Remove tcp profile from low level nio channel (#27441 ) This is related to #27260. Currently every nio channel has a profile field. Profile is a concept that only relates to the tcp transport. Http channels will not have profiles. This commit moves the profile from the nio channel to the read context. The context is the level that protocol specific features and logic should live.	2017-11-20 12:20:42 -07:00
Tim Brooks	0a8f48d592	Transition transport apis to use void listeners (#27440 ) Currently we use ActionListener<TcpChannel> for connect, close, and send message listeners in TcpTransport. However, all of the listeners have to capture a reference to a channel in the case of the exception api being called. This commit changes these listeners to be type <Void> as passing the channel to onResponse is not necessary. Additionally, this change makes it easier to integrate with low level transports (which use different implementations of TcpChannel).	2017-11-20 10:47:47 -07:00
Michael Basnight	2949c53174	Remove config prompting for secrets and text (#27216 ) This commit removes the ability to use ${prompt.secret} and ${prompt.text} as valid config settings. Secure settings has obsoleted the need for this, and it cleans up some of the code in Bootstrap.	2017-11-19 22:33:17 -06:00
Michael Basnight	cb3e8f4763	Move the CLI into its own subproject (#27114 ) Projects that depend on the CLI currently depend on core. This should not always be the case. The EnvironmentAwareCommand will remain in :core, but the rest of the CLI components have been moved into their own subproject of :core, :core:cli.	2017-11-18 21:42:57 -06:00
Tim Brooks	ce45e29be7	Remove manual tracking of registered channels (#27445 ) This is related to #27260. Currently, every ESSelector keeps track of all channels that are registered with it. ESSelector is just an abstraction over a raw java nio selector. The java nio selector already tracks its own selection keys. This commit removes our tracking and relies on the java nio selector tracking.	2017-11-17 16:20:09 -07:00
David Turner	08a257327f	Remove newline from log message (#27425 ) It leads to harder-to-parse logs that look like this: ``` 1> [2017-11-16T20:46:21,804][INFO ][o.e.t.r.y.ClientYamlTestClient] Adding header Content-Type 1> with value application/json 1> [2017-11-16T20:46:21,812][INFO ][o.e.t.r.y.ClientYamlTestClient] Adding header Content-Type 1> with value application/json 1> [2017-11-16T20:46:21,820][INFO ][o.e.t.r.y.ClientYamlTestClient] Adding header Content-Type 1> with value application/json 1> [2017-11-16T20:46:21,966][INFO ][o.e.t.r.y.ClientYamlTestClient] Adding header Content-Type 1> with value application/json ```	2017-11-17 14:12:06 +00:00
Tim Brooks	f761a0e0e4	Remove unneeded Throwable handling in nio (#27412 ) This is related to #27260. In the nio transport work we do not catch or handle `Throwable`. There are a few places where we have exception handlers that accept `Throwable`. This commit removes those cases.	2017-11-16 18:24:06 -07:00
David Turner	9766b858d0	Prepare for bump to 6.0.1 on the master branch (#27391 ) An assortment of fixes, particularly to version number calculations, in preparation for the bump to 6.0.1.	2017-11-16 18:38:54 +00:00
Tim Brooks	80ef9bbdb1	Remove parameterization from TcpTransport (#27407 ) This commit is a follow up to the work completed in #27132. Essentially it transitions two more methods (sendMessage and getLocalAddress) from Transport to TcpChannel. With this change, there is no longer a need for TcpTransport to be aware of the specific type of channel a transport returns. So that class is no longer parameterized by channel type.	2017-11-16 11:19:36 -07:00
Tim Brooks	35a5922927	Delete unneeded nio client (#27408 ) This is a follow up to #27132. As that PR greatly simplified the connection logic inside a low level transport implementation, much of the functionality provided by the NioClient class is no longer necessary. This commit removes that class.	2017-11-16 09:22:40 -07:00
Jim Ferenczi	623367d793	Add composite aggregator (#26800 ) * This change adds a module called `aggs-composite` that defines a new aggregation named `composite`. The `composite` aggregation is a multi-buckets aggregation that creates composite buckets made of multiple sources. The sources for each bucket can be defined as: * A `terms` source, values are extracted from a field or a script. * A `date_histogram` source, values are extracted from a date field and rounded to the provided interval. This aggregation can be used to retrieve all buckets of a deeply nested aggregation by flattening the nested aggregation in composite buckets. A composite bucket is composed of one value per source and is built for each document as the combinations of values in the provided sources. For instance the following aggregation: ```` "test_agg": { "terms": { "field": "field1" }, "aggs": { "nested_test_agg": { "terms": { "field": "field2" } } } } ```` ... which retrieves the top N terms for `field1` and for each top term in `field1` the top N terms for `field2`, can be replaced by a `composite` aggregation in order to retrieve all the combinations of `field1`, `field2` in the matching documents: ```` "composite_agg": { "composite": { "sources": [ { "field1": { "terms": { "field": "field1" } } }, { "field2": { "terms": { "field": "field2" } } } ] } } ```` The response of the aggregation looks like this: ```` "aggregations": { "composite_agg": { "buckets": [ { "key": { "field1": "alabama", "field2": "almanach" }, "doc_count": 100 }, { "key": { "field1": "alabama", "field2": "calendar" }, "doc_count": 1 }, { "key": { "field1": "arizona", "field2": "calendar" }, "doc_count": 1 } ] } } ```` By default this aggregation returns 10 buckets sorted in ascending order of the composite key. Pagination can be achieved by providing `after` values, the values of the composite key to aggregate after. 
For instance the following aggregation will aggregate all composite keys that sort after `alabama, calendar`: ```` "composite_agg": { "composite": { "after": {"field1": "alabama", "field2": "calendar"}, "size": 100, "sources": [ { "field1": { "terms": { "field": "field1" } } }, { "field2": { "terms": { "field": "field2" } } } ] } } ```` This aggregation is optimized for indices that set an index sorting that matches the composite source definition. For instance the aggregation above could run faster on indices that define an index sorting like this: ```` "settings": { "index.sort.field": ["field1", "field2"] } ```` In this case the `composite` aggregation can early terminate on each segment. This aggregation also accepts multi-valued fields but disables early termination for these fields even if index sorting matches the sources definition. This is mandatory because index sorting picks only one value per document to perform the sort.	2017-11-16 15:13:36 +01:00
Tim Brooks	ca11085bb6	Add TcpChannel to unify Transport implementations (#27132 ) Right now our different transport implementations must duplicate functionality in order to stay compliant with the requirements of TcpTransport. They must all implement common logic to open channels, close channels, keep track of channels for eventual shutdown, etc. Additionally, there is a weird and complicated relationship between Transport and TransportService. We eventually want to start merging some of the functionality between these classes. This commit starts moving towards a world where TransportService retains all the application logic and channel state. Transport implementations in this world will only be tasked with returning a channel when one is requested, calling transport service when a channel is accepted from a server, and starting / stopping itself. Specifically this commit changes how channels are opened and closed. All Transport implementations now return a channel type that must comply with the new TcpChannel interface. This interface has the methods necessary for TcpTransport to completely manage the lifecycle of a channel. This includes setting the channel up, waiting for connection, adding close listeners, and eventually closing.	2017-11-15 12:38:39 -07:00
Luca Cavanna	382da0f227	REST spec: Validate that api name matches file name that contains it (#27366 ) This commit validates that each spec json file contains an API that has the same name as the file	2017-11-14 14:53:00 +01:00
Simon Willnauer	2299c70371	Allow affix settings to specify dependencies (#27161 ) We use affix settings to group settings / values under a certain namespace. In some cases like login information for instance a setting is only valid if one or more other settings are present. For instance `x.test.user` is only valid if there is an `x.test.passwd` present and vice versa. This change allows to specify such a dependency to prevent settings updates that leave settings in an inconsistent state.	2017-11-13 12:06:36 +01:00
Simon Willnauer	a34c2f0b8d	Ensure external refreshes will also refresh internal searcher to minimize segment creation (#27253 ) We cut over to internal and external IndexReader/IndexSearcher in #26972 which uses two independent searcher managers. This has the downside that refreshes of the external reader will never clear the internal version map which in turn will trigger additional and potentially unnecessary segment flushes since memory must be freed. Under heavy indexing load with low refresh intervals this can cause excessive segment creation which causes high GC activity and significantly increases the required segment merges. This change adds a dedicated external reference manager that delegates refreshes to the internal reference manager that then `steals` the refreshed reader from the internal reference manager for external usage. This ensures that external and internal readers are consistent on an external refresh. As a side effect this also releases old segments referenced by the internal reference manager which can potentially hold on to already merged away segments until it is refreshed due to a flush or indexing activity.	2017-11-09 08:40:22 +00:00
Tim Brooks	dc86b4c2ed	Decouple `ChannelFactory` from Tcp classes (#27286 ) * Decouple `ChannelFactory` from Tcp classes This is related to #27260. Currently `ChannelFactory` is tightly coupled to classes related to the elasticsearch Tcp binary protocol. This commit modifies the factory to be able to construct http or other protocol channels.	2017-11-08 14:30:00 -07:00
Jason Tedor	d5451b2037	Die with dignity while merging If an out of memory error is thrown while merging, today we quietly rewrap it into a merge exception and the out of memory error is lost. Instead, we need to rethrow out of memory errors, and in fact any fatal error here, and let those go uncaught so that the node is torn down. This commit causes this to be the case. Relates #27265	2017-11-06 17:55:11 -05:00
Jason Tedor	766d29e7cf	Correctly encode warning headers The warnings headers have a fairly limited set of valid characters (cf. quoted-text in RFC 7230). While we have assertions that we adhere to this set of valid characters ensuring that our warning messages do not violate the specification, we were neglecting the possibility that arbitrary user input would trickle into these warning headers. Thus, missing here were tests for these situations and encoding of characters that appear outside the set of valid characters. This commit addresses this by encoding any characters in a deprecation message that are not from the set of valid characters. Relates #27269	2017-11-06 13:20:30 -05:00
Simon Willnauer	bd7efa908a	Add ability to split shards (#26931 ) This change adds a new `_split` API that allows splitting indices into a new index with a power of two more shards than the source index. This API works alongside the `_shrink` API but doesn't require any shard relocation before indices can be split. The split operation is conceptually an inverse `_shrink` operation since we initialize the index with a _synthetic_ number of routing shards that are used for the consistent hashing at index time. Compared to indices created with earlier versions this might produce slightly different shard distributions but has no impact on the per-index backwards compatibility. For now, the user is required to prepare an index to be splittable by setting the `index.number_of_routing_shards` at index creation time. The setting allows the user to prepare the index to be splittable in factors of `index.number_of_routing_shards` i.e. if the index is created with `index.number_of_routing_shards: 16` and `index.number_of_shards: 2` it can be split into `4, 8, 16` shards. This is an intermediate step until we can make this the default. This also allows us to safely backport this change to 6.x. The `_split` operation is implemented internally as a DeleteByQuery on the lucene level that is executed while the primary shards execute their initial recovery. Subsequent merges that are triggered due to this operation will not be executed immediately. All merges will be deferred until the shards are started and will then be throttled accordingly. This change is intended for the 6.1 feature release but will not support pre-6.1 indices to be split unless these indices have been shrunk before. In that case these indices can be split backwards into their original number of shards.	2017-11-06 11:37:55 +01:00
Tanguy Leroux	43e7a4a349	Upgrade to Jackson 2.8.10 (#27230 ) While it's not possible to upgrade the Jackson dependencies to their latest versions yet (see #27032 (comment) for more) it's still possible to upgrade to the latest 2.8.x version.	2017-11-06 10:20:05 +01:00
Jim Ferenczi	429275a773	Remove ElasticsearchQueryCachingPolicy (#27190 ) We have a hidden setting called `index.queries.cache.term_queries` that disables caching of term queries in the query cache. However, term queries have not been cached by the Lucene UsageTrackingQueryCachingPolicy since version 6.5. This makes the es policy useless but also makes it impossible to re-enable caching for term queries. This change appeared in Lucene 6.5 so this setting has been a no-op since version 5.4 of Elasticsearch. The change in this PR removes the setting and the custom policy.	2017-11-06 08:26:24 +01:00
David Roberts	749c3ec716	Remove the single argument Environment constructor (#27235 ) Only tests should use the single argument Environment constructor. To enforce this the single arg Environment constructor has been replaced with a test framework factory method. Production code (beyond initial Bootstrap) should always use the same Environment object that Node.getEnvironment() returns. This Environment is also available via dependency injection.	2017-11-04 13:25:09 +00:00
kel	0f21262b36	Do not create directories if repository is readonly (#26909 ) For FsBlobStore and HdfsBlobStore, if the repository is read only, the blob store should be aware of the readonly setting and should not create directories if they don't exist. Closes #21495	2017-11-03 13:10:50 +01:00
Jason Tedor	d6d830ff0b	Fix logic detecting unreleased versions When partitioning version constants into released and unreleased versions, today we have a bug in finding the last unreleased version. Namely, consider the following version constants on the 6.x branch: ..., 5.6.3, 5.6.4, 6.0.0-alpha1, ..., 6.0.0-rc1, 6.0.0-rc2, 6.0.0, 6.1.0. In this case, our convention dictates that: 5.6.4, 6.0.0, and 6.1.0 are unreleased. Today we correctly detect that 6.0.0 and 6.1.0 are unreleased, and then we say the previous patch version is unreleased too. The problem is the logic to remove that previous patch version is broken, it does not skip alphas/betas/RCs which have been released. This commit fixes this by skipping backwards over pre-release versions when finding the previous patch version to remove. Relates #27206	2017-11-01 13:01:45 -04:00
Colin Goodheart-Smithe	99aca9cdfc	Enhances exists queries to reduce need for `_field_names` (#26930 ) * Enhances exists queries to reduce need for `_field_names` Before this change we wrote the names of all the fields in a document to a `_field_names` field and then implemented exists queries as a term query on this field. The problem with this approach is that it bloats the index and also affects indexing performance. This change adds a new method `existsQuery()` to `MappedFieldType` which is implemented by each sub-class. For most field types if doc values are available a `DocValuesFieldExistsQuery` is used, falling back to using `_field_names` if doc values are disabled. Note that only fields where no doc values are available are written to `_field_names`. Closes #26770 * Addresses review comments * Addresses more review comments * implements existsQuery explicitly on every mapper * Reinstates ability to perform term query on `_field_names` * Added bwc depending on index created version * Review Comments * Skips tests that are not supported in 6.1.0 These values will need to be changed after backporting this PR to 6.x	2017-11-01 10:46:59 +00:00
kel	c3e2bdf20c	Raise IllegalArgumentException if query validation failed (#26811 ) Closes #26799	2017-10-31 12:17:27 +01:00
Adrien Grand	3812d3cb43	TopHitsAggregator must propagate calls to `setScorer`. (#27138 ) It is required in order to work correctly with bulk scorer implementations that change the scorer during the collection process. Otherwise sub collectors might call `Scorer.score()` on the wrong scorer. Closes #27131	2017-10-31 09:59:06 +01:00
Jason Tedor	a566942219	Refactor internal engine This commit is a minor refactoring of internal engine to move hooks for generating sequence numbers into the engine itself. As such, we refactor tests that relied on this hook to use the new hook, and remove the hook from the sequence number service itself. Relates #27082	2017-10-30 13:10:20 -04:00
Ryan Ernst	2a8452b513	Reindex: Fix headers in reindex action (#26937 ) The headers passed to reindex were skipped except for the last one. This commit fixes the copying of the headers, as well as adds a base test case for rest client builders to access the headers within the built rest client. relates #22976	2017-10-25 16:37:01 -07:00
olcbean	981b7f4d39	Make yaml test runner stricter by enforcing `required` for paths and parameters (#27035 ) Till now the yaml test runner was verifying that the provided path parts and parameters are supported. With this PR, yaml test runner also checks that all required path parts and parameters are provided.	2017-10-25 19:36:42 +00:00
Luca Cavanna	8caf7d4ff8	Decouple BulkProcessor from ThreadPool (#26727 ) Introduce minimal thread scheduler as a base class for `ThreadPool`. Such a class can be used from the `BulkProcessor` to schedule retries and the flush task. This allows removing the `ThreadPool` dependency from `BulkProcessor`, which required providing settings that contain `node.name` and also needed log4j for logging. Instead, it now needs a `Scheduler` that is much lighter and gets automatically created and shut down on close. Closes #26028	2017-10-25 10:30:23 +02:00
Lee Hinman	fcfbdf1f37	Expose adaptive replica selection stats in /_nodes/stats API This exposes the collected metrics we store for ARS in the nodes stats, as well as the computed rank of nodes. Each node exposes its perspective about the cluster. Here's an example output (with `?human`): ```json ... "adaptive_selection" : { "_k6v1-wERxyUd5ke6s-D0g" : { "outgoing_searches" : 0, "avg_queue_size" : 0, "avg_service_time" : "7.8ms", "avg_service_time_ns" : 7896963, "avg_response_time" : "9ms", "avg_response_time_ns" : 9095598, "rank" : "9.1" }, "VJiCUFoiTpySGmO00eWmtQ" : { "outgoing_searches" : 0, "avg_queue_size" : 0, "avg_service_time" : "1.3ms", "avg_service_time_ns" : 1330240, "avg_response_time" : "4.5ms", "avg_response_time_ns" : 4524154, "rank" : "4.5" }, "DHNGTdzyT9iiaCpEUsIAKA" : { "outgoing_searches" : 0, "avg_queue_size" : 0, "avg_service_time" : "2.1ms", "avg_service_time_ns" : 2113164, "avg_response_time" : "6.3ms", "avg_response_time_ns" : 6375810, "rank" : "6.4" } } ... ```	2017-10-24 08:58:42 -06:00
Tim Brooks	277637f42f	Do not set SO_LINGER on server channels (#26997 ) Right now we are attempting to set SO_LINGER to 0 on server channels when we are stopping the tcp transport. This is not a supported socket option and throws an exception. This also prevents the channels from being closed. This commit 1. doesn't set SO_LINGER for server channels, 2. checks that it is a supported option in nio, and 3. changes the log message to warn for server channel close exceptions.	2017-10-13 13:06:38 -06:00
Jason Tedor	393e73612e	Fix formatting in channel close test This commit fixes the indentation in the transport test case for a channel closing while connecting.	2017-10-10 13:39:45 -04:00
Jason Tedor	4c06b8f1d2	Check for closed connection while opening While opening a connection to a node, a channel can subsequently close. If this happens, a future callback whose purpose is to close all other channels and disconnect from the node will fire. However, this future will not be ready to close all the channels because the connection will not be exposed to the future callback yet. Since this callback is run once, we will never try to disconnect from this node again and we will be left with a closed channel. This commit adds a check that all channels are open before exposing the channel and throws a general connection exception. In this case, the usual connection retry logic will take over. Relates #26932	2017-10-10 13:34:51 -04:00
Simon Willnauer	cdd7c1e6c2	Return List instead of an array from settings (#26903 ) Today we return a `String[]` that requires copying values for every access. Yet, we already store the setting as a list so we can also return the unmodifiable list directly. This makes list / array access in settings a much cheaper operation especially if lists are large.	2017-10-09 09:52:08 +02:00
Nhat	bf4c3642b2	remove _primary and _replica shard preferences (#26791 ) The shard preference _primary, _replica and its variants were useful for the asynchronous replication. However, with the current impl, they are no longer useful and should be removed. Closes #26335	2017-10-08 11:03:06 -04:00
Jason Tedor	470e5e7cfc	Add additional low-level logging handler () * Add additional low-level logging handler We have the trace handler which is useful for recording sent messages but there are times where it would be useful to have more low-level logging about the events occurring on a channel. This commit adds a logging handler that can be enabled by setting a certain log level (org.elasticsearch.transport.netty4.ESLoggingHandler) to trace that provides trace logging on low-level channel events and includes some information about the request/response read/write events on the channel as well. * Remove imports * License header * Remove redundant * Add test * More assertions	2017-10-05 12:10:58 -04:00
Martijn van Groningen	b27e408ed2	Removed void token filter entries and added two tests	2017-10-05 13:25:05 +02:00
Md. Abdulla-Al-Sun	a40c474e10	Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238)	2017-10-05 13:25:05 +02:00
Boaz Leskes	2a04118e88	Promote common rest test utility methods to ESRestTestCase We have duplicates in some classes and I was about to create one more.	2017-10-05 10:08:10 +02:00
Simon Willnauer	00dfdf50cf	Represent lists as actual lists inside Settings (#26878 ) Today we represent each value of a list setting with its own dedicated key that ends with the index of the value in the list. Aside from the obvious weirdness this has several issues especially if lists are massive since it causes massive runtime penalties when validating settings. For example, a list of 100k words will literally cause a create index call to time out and in turn cause a massive slowdown on all subsequent validation runs. With this change we use a simple string list to represent the list. This change also forbids adding a setting that ends with a .0 which was internally used to detect a list setting. Once this has been rolled out for an entire major version all the internal .0 handling can be removed since all settings will be converted. Relates to #26723	2017-10-05 09:27:08 +02:00
Martijn van Groningen	dca787ed8a	upgrade to Lucene 7.1.0 snapshot version	2017-10-05 09:06:56 +02:00
Simon Willnauer	d1533e2397	Remove Settings#getAsMap() (#26845 ) Since `#getAsMap` exposes internal representation we are trying to remove it step by step. This commit is cleaning up some xcontent writing as well as usage in tests	2017-10-04 01:21:38 -06:00
Boaz Leskes	a18bd9caa2	Increase ESRestTestCase.waitForClusterStateUpdatesToFinish time out to 30s It is set to 10 sec but sometimes it takes the cluster longer to settle.	2017-10-03 12:24:36 +02:00
Tim Brooks	d80ad7f097	Check channel is open before setting SO_LINGER (#26857 ) This commit fixes #26855. Right now we set SO_LINGER to 0 if we are stopping the transport. This can throw a ChannelClosedException if the raw channel is already closed. We have a number of scenarios where it is possible this could be called with a channel that is already closed. This commit fixes the issue by checking that the channel is not closed before attempting to set the socket option.	2017-10-02 15:09:52 -06:00
Tim Brooks	9ae7a80ba5	Move raw selector usage into ESSelector (#26825 ) Currently we only log generic messages about errors in logs from the nio event handler. This means that we do not know which channel had issues connecting, reading, writing, etc. This commit changes the logs to include the local and remote addresses and profile for a channel.	2017-10-01 17:59:57 -06:00
Simon Willnauer	7b8d036ab5	Replace group map settings with affix setting (#26819 ) Historically we have used group settings instead of a prefix setting, which is more restrictive and type safe. The majority of the use cases need to access a key, value map based on the _leaf node_ of the setting, i.e. the setting `index.tag.*` might be used to tag an index with `index.tag.test=42` and `index.tag.staging=12`, which would then be turned into a `{"test": 42, "staging": 12}` map. The group settings would always use `Settings#getAsMap`, which loses type information and uses the internal representation of the settings. Using prefix settings now allows accessing such a map type-safely and natively.	2017-09-30 14:27:21 +02:00
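The prefix-to-map behaviour described in the commit above can be sketched as follows. This is only an illustration in Python, not the Elasticsearch implementation: `collect_by_prefix` and the plain-dict settings model are hypothetical names introduced here.

```python
def collect_by_prefix(settings, prefix):
    """Return {leaf: value} for every key in `settings` that starts with `prefix`.

    Hypothetical helper: settings are modelled as a flat dict of
    dotted keys, which is an assumption made for this sketch.
    """
    return {
        key[len(prefix):]: value
        for key, value in settings.items()
        if key.startswith(prefix)
    }


# Example mirroring the commit message: index.tag.* becomes a leaf-keyed map.
settings = {
    "index.tag.test": 42,
    "index.tag.staging": 12,
    "index.number_of_shards": 1,  # unrelated key, ignored by the prefix
}
tags = collect_by_prefix(settings, "index.tag.")
```

Here `tags` holds only the two `index.tag.*` entries, keyed by their leaf names.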
Tim Brooks	bf403ae028	Add information about nio channels in logs (#26806 ) Currently we only log generic messages about errors in logs from the nio event handler. This means that we do not know which channel had issues connecting, reading, writing, etc. This commit changes the logs to include the local and remote addresses and profile for a channel.	2017-09-28 17:11:26 -06:00
Simon Willnauer	25d6778d31	Add comment to TCP transport impls why we set SO_LINGER on close	2017-09-28 13:07:01 +02:00
Armin Braun	af06231d4c	#26701 Close TcpTransport on RST in some Spots to Prevent Leaking TIME_WAIT Sockets (#26764 ) #26701 Added option to RST instead of FIN to TcpTransport#closeChannels	2017-09-26 19:58:11 +00:00
Simon Willnauer	a506ba8602	Remove `Settings#put(Map<String,String>)` (#26785 ) `Map<String,String>` is basically erasing the type while other methods on the `Settings.Builder` are type safe and have corresponding `get` methods.	2017-09-26 12:15:20 +02:00
Simon Willnauer	aab4655e63	Unify Settings xcontent reading and writing (#26739 ) This change adds a fromXContent method to Settings that allows reading the xcontent that is produced by toXContent. It also replaces the entire settings loader infrastructure and removes the structured map representation. Future PRs will also tackle the `getAsMap` that exposes the internal representation of settings for better encapsulation.	2017-09-25 13:23:01 +02:00
Jason Tedor	f35d1de502	Introduce global checkpoint background sync It is the exciting return of the global checkpoint background sync. Long, long ago, in snapshot version far, far away we had and only had a global checkpoint background sync. This sync would fire periodically and send the global checkpoint from the primary shard to the replicas so that they could update their local knowledge of the global checkpoint. Later in time, as we sped ahead towards finalizing the initial version of sequence IDs, we realized that we need the global checkpoint updates to be inline. This means that on a replication operation, the primary shard would piggy back the global checkpoint with the replication operation to the replicas. The replicas would update their local knowledge of the global checkpoint and reply with their local checkpoint. However, this could allow the global checkpoint on the primary to advance again and the replicas would fall behind in their local knowledge of the global checkpoint. If another replication operation never fired, then the replicas would be permanently behind. To account for this, we added one more sync that would fire when the primary shard fell idle. However, this has problems: - the shard idle timer defaults to five minutes, a long time to wait for the replicas to learn of the new global checkpoint - if a replica missed the sync, there was no follow-up sync to catch them up - there is an inherent race condition where the primary shard could fall idle mid-operation (after having sent the replication request to the replicas); in this case, there would never be a background sync after the operation completes - tying the global checkpoint sync to the idle timer was never natural To fix this, we add two additional changes for the global checkpoint to be synced to the replicas. The first is that we add a post-operation sync that only fires if there are no operations in flight and there is a lagging replica. This gives us a chance to sync the global checkpoint to the replicas immediately after an operation so that they are always kept up to date. The second is that we add back a global checkpoint background sync that fires on a timer. This timer fires every thirty seconds, and is not configurable (for simplicity). This background sync is smarter than what we had previously in the sense that it only sends a sync if the global checkpoint on at least one replica is lagging that of the primary. When the timer fires, we can compare the global checkpoint on the primary to its knowledge of the global checkpoint on the replicas and only send a sync if there is a shard behind. Relates #26591	2017-09-21 15:34:13 -04:00
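The two sync conditions described in the commit above can be sketched as simple predicates. This is a minimal illustration in Python, not the Elasticsearch code: the function names and the dict of replica checkpoints are hypothetical.

```python
def lagging_replicas(primary_checkpoint, replica_checkpoints):
    """Names of replicas whose known global checkpoint is behind the primary's.

    `replica_checkpoints` is a hypothetical {replica_name: checkpoint} dict
    standing in for the primary's knowledge of each replica.
    """
    return sorted(
        name
        for name, checkpoint in replica_checkpoints.items()
        if checkpoint < primary_checkpoint
    )


def should_background_sync(primary_checkpoint, replica_checkpoints):
    # Timer-driven sync: only send when at least one replica is behind.
    return bool(lagging_replicas(primary_checkpoint, replica_checkpoints))


def should_post_operation_sync(in_flight_operations, primary_checkpoint, replica_checkpoints):
    # Post-operation sync: nothing in flight AND some replica is lagging.
    return in_flight_operations == 0 and should_background_sync(
        primary_checkpoint, replica_checkpoints
    )
```

With a primary at checkpoint 10 and replicas at 10 and 7, both predicates report that a sync is due, provided no operations are in flight.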
Michael Basnight	f385e0cf26	Add bad_request to the rest-api-spec catch params (#26539 ) This adds another request to the catch params. It also makes sure that the generic request param does not allow 400 either.	2017-09-14 14:24:03 -05:00
Christoph Büscher	c7c6443b10	[Docs] "The the" is a great band, but ... (#26644 ) Removing several occurrences of this typo in the docs and javadocs, seems to be a common mistake. Corrections turn up once in a while in PRs, better to correct some of this in one sweep.	2017-09-14 15:08:20 +02:00
Adrien Grand	93da7720ff	Move non-core mappers to a module. (#26549 ) Today we have all non-plugin mappers in core. I'd like to start moving those that neither map to json datatypes nor are very frequently used like `date` or `ip` to a module. This commit creates a new module called `mappers-extra` and moves the `scaled_float` and `token_count` mappers to it. I'd like to eventually move `range` fields there but it's more complicated due to their intimate relationship with range queries. Relates #10368	2017-09-13 17:58:53 +02:00
Simon Willnauer	42f3129d7b	Allow plugins to validate cluster-state on join (#26595 ) Today we don't have a pluggable way to validate if the cluster state is compatible with the node that joins. We already apply some checks for index compatibility that prevents nodes to join a cluster with indices it doesn't support but for plugins this isn't possible. This change adds a cluster state validator that allows plugins to prevent a join if the cluster-state is incompatible.	2017-09-12 15:32:33 +02:00
Ryan Ernst	5c35bff1c3	Test: Remove leftover static bwc test case (#26584 ) This test case was leftover from the static bwc tests. There was still one use for checking we do not load old indices, but this PR moves the legacy code needed for that directly into the test. I also opened a follow up issue to completely remove the unsupported test: #26583.	2017-09-11 15:38:30 -07:00
Jason Tedor	b2e4bfa0a7	Snapshot fallback should consider build.snapshot When determining if a build is a snapshot build, we look for a field in the JAR manifest. However, when running tests, we are not running with a compiled core Elasticsearch JAR, we are running with the compiled core classes on the classpath. We have a fallback for this, we always assume such a situation is a snapshot build. However, when running builds with -Dbuild.snapshot=false, this is not the case. As such, we need to fallback to the value of build.snapshot. However, there are cases where we are not running with a compiled core Elasticsearch JAR (e.g., when the transport client is embedded in a web container) so we should only do this fallback if we are in tests. To verify we are in tests, we check if randomized runner is on the classpath. Relates #26554	2017-09-11 07:42:11 -04:00
Martijn van Groningen	b391425da1	Added support to the percolate query to percolate multiple documents The percolator will add a `_percolator_document_slot` field to all percolator hits to indicate with what document it has matched. This number matches with the order in which the documents have been specified in the percolate query. Also improved the support for multiple percolate queries in a search request.	2017-09-08 17:28:39 +02:00
Tim Brooks	c1a20f7e48	Merge tsa with ts (#26369 ) We currently have a weird relationship between Transport, TransportService, and TransportServiceAdaptor. At some point I think that we would like to collapse these all into one concept as we only support TCP transports. This commit moves in that direction by eliminating the adaptor and just passing the transport service to the transport.	2017-09-05 09:15:56 -06:00
Boaz Leskes	2fd4af82e4	Move `UNASSIGNED_SEQ_NO` and `NO_OPS_PERFORMED` to SequenceNumbers (#26494 ) Where they better belong.	2017-09-04 16:31:00 +02:00
Alexander Reelsen	80d0a32f8e	ScriptService: Replace max compilation per minute setting with max compilation rate (#26399 ) The current script service has a script compilation limit for a one minute window. This is set to a small default value of 15. Instead of increasing that default value, this commit introduces a new setting that allows to configure a rate per time unit, so that the script service can deal with bursts better. The new setting is named `script.max_compilations_rate`, requires a nonnegative number and a positive time value. The default is `75/5m`, which is equivalent to the existing 15 per minute.	2017-09-01 10:15:27 +02:00
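As a quick check of the equivalence stated above, the rate format can be parsed and normalised to a per-minute figure. This is a hedged sketch in Python: the helper names, the accepted unit suffixes, and the regular expression are assumptions for illustration, not the actual parsing rules of `script.max_compilations_rate`.

```python
import re

# Assumed unit suffixes for this sketch only.
_UNIT_SECONDS = {"s": 1, "m": 60, "h": 3600}


def parse_rate(value):
    """Parse '<count>/<amount><unit>' into (count, window_seconds)."""
    match = re.fullmatch(r"(\d+)/(\d+)([smh])", value)
    if match is None:
        raise ValueError(f"malformed rate: {value!r}")
    count = int(match.group(1))  # nonnegative by construction
    seconds = int(match.group(2)) * _UNIT_SECONDS[match.group(3)]
    if seconds <= 0:
        raise ValueError("time value must be positive")
    return count, seconds


def per_minute(value):
    """Average number of compilations allowed per minute for a rate string."""
    count, seconds = parse_rate(value)
    return count * 60 / seconds
```

`per_minute("75/5m")` works out to 75 * 60 / 300 = 15, matching the old per-minute default, while the five-minute window leaves room for bursts.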
Lee Hinman	c3da66d021	Implement adaptive replica selection (#26128 ) * Implement adaptive replica selection This implements the selection algorithm described in the C3 paper for determining which copy of the data a query should be routed to. By using the service time EWMA, response time EWMA, and queue size EWMA we calculate the score of a node by piggybacking these metrics with each search request. Since Elasticsearch lacks the "broadcast to every copy" behavior that Cassandra has (as mentioned in the C3 paper) to update metrics after a node has been highly weighted, this implementation adjusts a node's response stats using the average of its own and the "best" node's metrics. This is so that a long GC or other activity that may cause a node's rank to increase dramatically does not permanently keep a node from having requests routed to it, instead it will eventually lower its score back to the realm where it is a potential candidate for new queries. This feature is off by default and can be turned on with the dynamic setting `cluster.routing.use_adaptive_replica_selection`. Relates to #24915, however instead of `b=3` I used `b=4` (after benchmarking) * Randomly use adaptive replica selection for internal test cluster * Use an action name prefix for retrieving pending requests * Add unit test for replica selection * don't use adaptive replica selection in SearchPreferenceIT * Track client connections in a SearchTransportService instead of TransportService * Bind `entry` pieces in local variables * Add javadoc link to C3 paper and javadocs for stat adjustments * Bind entry's key and value to local variables * Remove unneeded actionNamePrefix parameter * Use conns.longValue() instead of cached Long * Add comments about removing entries from the map * Pull out bindings for `entry` in IndexShardRoutingTable * Use .compareTo instead of manually comparing * add assert for connections not being null and gte to 1 * Copy map for pending search connections instead of "live" map * Increase the number of pending search requests used for calculating rank when chosen When a node gets chosen, this increases the number of search counts for the winning node so that it will not be as likely to be chosen again for non-concurrent search requests. * Remove unused HashMap import * Rename rank -> rankShardsAndUpdateStats * Rename rankedActiveInitializingShardsIt -> activeInitializingShardsRankedIt * Instead of precalculating winning node, use "winning" shard from ranked list * Sort null ranked nodes before nodes that have a rank	2017-08-30 20:55:11 -06:00
Tal Levy	ed151d829d	Migrate Search requests to use Writeable reading strategies (#26428 ) Migrates many SearchRequest objects to use Writeable conventions and rejects usage of `readFrom` in these new classes.	2017-08-30 11:00:33 -07:00
Sergey Galkin	c075323522	Refactor create index service to be unit testable This commit refactors MetaDataCreateIndexService so that it is unit testable. Relates #25961	2017-08-29 16:55:44 -04:00
Michael Basnight	cfd14cd2b8	Revert shading for the low level rest client (#26367 ) Currently, we do not feel there is enough of a reason to shade the low level rest client. It caused problems with commons logging and IDEs during the brief time it was used. We did not know exactly how many users will need this, and decided that leaving shading out until we gather more information is best. Users can still shade the jar themselves. For information and feedback, see issue #26366. Closes #26328 This reverts commit `3a20922046`. This reverts commit `2c271f0f22`. This reverts commit `9d10dbea39`. This reverts commit `e816ef89a2`.	2017-08-25 14:13:12 -05:00
Nik Everett	b3edd11aa0	Allow plugins to plug rescore implementations (#26368 ) This allows plugins to plug rescore implementations into Elasticsearch. While this is a fairly expert thing to do I've done my best to point folks to the QueryRescorer as one that at least documents the tradeoffs that it makes. I've attempted to limit the API surface area by removing `SearchContext` from the exposed interface, instead exposing just the IndexSearcher and `QueryShardContext`. I also tried to make some of the class names more consistent and do some general cleanup while I was there. I entertained the notion of moving the `QueryRescorer` to a module. After all, it'd be a wonderful test to prove that you can plug a rescore implementation into Elasticsearch if the only built in rescore implementation is in the module. But I decided against it because the new module would require a client jar and it'd require moving some more things around. I think if we really want to do it, we should do it as a followup. I did, on the other hand, create an "example" rescore plugin which should both be a nice example for anyone wanting to plug in their own rescore implementation and serves as a good integration test to make sure that you can indeed plug one in. Closes #26208	2017-08-25 13:46:57 -04:00
Yannick Welsch	0390c76f0a	Remove reinitShadowPrimary (#26349 ) With shadow replicas gone, there is no need to have this method anymore.	2017-08-25 10:37:51 +09:30
Tal Levy	6ab4b6b0ac	revamp TransportRequest handlers to support Writeable (#26315 ) This PR begins the long journey to deprecating Streamable. The idea here is to add additional method signatures that support Writeable.Reader, so that the work to migrate TransportMessage objects to implement Writeable and not Streamable can proceed. One example conversion is done in this PR: SimulatePipelineRequest.	2017-08-22 15:47:05 -07:00
Yannick Welsch	3d8feff66e	Use Java 9 FilePermission model (#26302 ) This commit makes the security code aware of the Java 9 FilePermission changes (see #21534) and allows us to remove the `jdk.io.permissionsUseCanonicalPath` system property.	2017-08-22 11:22:00 +09:30
Ryan Ernst	96b0d3e0cc	Script: Convert script query to a dedicated script context (#26003 ) This commit converts script query to use a new FilterScript context. The new context returns a boolean, so the error that would have previously happened at runtime if a non boolean was returned would now happen at script compilation. Also, the leniency of supporting returning a number and 0 mapping to false, non-zero to true is gone, but it was never documented. With the new context compilation will now also fail if special variables are used at compilation time, instead of runtime, eg ctx.	2017-08-18 15:18:35 -07:00
Tim Brooks	5d7a78fcdb	Use PlainListenableActionFuture for CloseFuture (#26242 ) Right now we use a custom future for the CloseFuture associated with a channel. This is because we need special unwrapping logic to ensure that exceptions from a future failure are a certain type (opposed to an UncategorizedException). However, the current version is limiting because we can only attach one listener. This commit changes the CloseFuture to extend the PlainListenableActionFuture. This change allows us to attach multiple listeners.	2017-08-18 13:38:38 -05:00
Luca Cavanna	1309dfd44d	Add links to external classes in clients javadoc (#25998 ) The client sniffer depends on the low-level REST client, while the Java high-level REST client and the transport client depend on Elasticsearch itself. Javadoc are not that useful unless they have links to the Elasticsearch classes in the latter case, and to the low-level REST client in the sniffer javadoc. This commit adds those links.	2017-08-17 21:03:47 +02:00
Colin Goodheart-Smithe	a975f4e5d6	Moves more classes over to ToXContentObject/Fragment (#26234 ) * Moves more classes over to ToXContentObject/Fragment * review comments	2017-08-16 15:40:40 +01:00
Simon Willnauer	a9169e536b	Several internal improvements to internal test cluster infra (#26214 ) This change adds several random test infrastructure improvements that caused issues in on-going developments but are generally useful. For instance, it is impossible to restart a node with a secure setting source since we close it after the node is started. This change makes it cloneable such that we can reuse it for a restart.	2017-08-15 17:42:15 +02:00
Martijn van Groningen	1146a35870	Move more token filters to analysis-common module The following token filters were moved: arabic_stem, brazilian_stem, czech_stem, dutch_stem, french_stem, german_stem and russian_stem. Relates to #23658	2017-08-11 17:39:24 +02:00
Andy Bristol	7e3cd6a019	reindex: automatically choose the number of slices (#26030 ) In reindex APIs, when using the `slices` parameter to choose the number of slices, adds the option to specify `slices` as "auto" which will choose a reasonable number of slices. It uses the number of shards in the source index, up to a ceiling. If there is more than one source index, it uses the smallest number of shards among them. This gives users an easy way to use slicing in these APIs without having to make decisions about how to configure it, as it provides a good-enough configuration for them out of the box. This may become the default behavior for these APIs in the future.	2017-08-11 08:25:25 -07:00
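The selection rule described above (smallest shard count among the source indices, capped by a ceiling) can be sketched as below. This is an illustration in Python only: `auto_slices` is a hypothetical name, and the ceiling is passed in as a parameter because the commit message does not state its value.

```python
def auto_slices(shard_counts, ceiling):
    """Pick a slice count for slices="auto".

    `shard_counts` lists the number of shards of each source index;
    `ceiling` is an assumed upper bound supplied by the caller.
    """
    if not shard_counts:
        raise ValueError("at least one source index is required")
    # Smallest shard count among the sources, never above the ceiling.
    return min(min(shard_counts), ceiling)
```

For example, with a ceiling of 20, a single 5-shard index yields 5 slices, sources with 5, 3 and 8 shards yield 3, and a 50-shard index is capped at 20.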
Simon Willnauer	6f82b0c6e2	Allow `ClusterState.Custom` to be created on initial cluster states (#26144 ) Today we have a `null` invariant on all `ClusterState.Custom`. This makes several code paths complicated and requires complex state handling in some cases. This change allows to register a custom supplier that is used to initialize the initial clusterstate with these transient customs.	2017-08-11 09:51:49 +02:00
Nik Everett	99ac7beb8e	Teach the build about betas and rcs (#26066 ) The build was ignoring suffixes like "beta1" and "rc1" on the version numbers which was causing the backwards compatibility packaging tests to fail because they expected to be upgrading from 6.0.0 even though they were actually upgrading from 6.0.0-beta1. This adds the suffixes to the information that the build scrapes from Version.java. It then uses those suffixes when it resolves artifacts built from the bwc branch and for testing. Closes #26017	2017-08-10 14:30:00 -04:00
Colin Goodheart-Smithe	dfbaf90951	Adds ToXContentFragment (#25771 ) * Adds ToXContentFragment This interface is meant for objects that implement `ToXContent` but are not complete objects. It is basically the opposite of `ToXContentObject`. It means that it will be easier to track the migration of classes over to the fragment/not fragment ToXContent model as it will be clear which classes are not migrated. When no classes directly implement `ToXContent` we can make `ToXContent` package private to be sure that all new classes must implement `ToXContentObject` or `ToXContentFragment`. * review comments * more review comments * javadocs * iter * Adds tests * iter * adds toString test for aggs * improves tests following review comments * iter * iter	2017-08-09 15:53:30 +01:00
Simon Willnauer	82fa531ab4	Remove `_index` fielddata hack if cluster alias is present (#26082 ) We introduced a hack in #25885 to respect the cluster alias if available on the `_index` field. This is important if aggregations or other field data related operations are executed. Yet, we added a small hack that duplicated an implementation detail from the `_index` field data builder to make this work. This change adds a necessary but simple API change that allows us to remove the hack and only have a single implementation.	2017-08-08 09:24:24 +02:00
Adrien Grand	f0cba4fce5	Add a scripted similarity. (#25831 ) The goal of this similarity is to help users who would like to keep the functionality of the `tf-idf` similarity that we want to remove, or to allow for specific use-cases (disabling idf, disabling tf, disabling length norm, etc.) without having to build a custom plugin and familiarize themselves with the low-level Lucene API.	2017-08-08 08:55:12 +02:00
Martijn van Groningen	99d79d5a0f	tests: do not generate random unicode strings for field names, but instead random alpha ascii strings Should fix build failures like this one: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.0+multijob-unix-compatibility/	2017-08-07 15:09:01 +02:00
Luca Cavanna	14ba36977e	[TEST] prevent yaml tests from using raw requests (#26044 ) Raw requests are supported only by the java yaml test runner and were introduced to test docs snippets. Some yaml tests ended up using them (see #23497) which causes failures for other language clients. This commit migrates those yaml tests to Java tests that send requests through the Java low-level REST client, and also moves the ability to send raw requests to a special client that's only available when testing docs snippets. Closes #25694	2017-08-07 11:02:16 +02:00
Boaz Leskes	e11cbed534	Adding a refresh listener to a recovering shard should be a noop (#26055 ) When `refresh=wait_for` is set on an indexing request, we register a listener on the shards that is called during the next refresh. During the recover translog phase, when the engine is open, we have a window of time when indexing operations succeed and they can add their listeners. Those listeners will only be called when the recovery finishes as we do not refresh during recoveries (unless the indexing buffer is full). Next to being a bad user experience, it can also cause deadlocks with an ongoing peer recovery that may wait for those operations to mark the replica in sync (details below). To fix this, this PR changes refresh listeners to be a noop when the shard is not yet serving reads (implicitly covering the recovery period). It doesn't matter anyway. Deadlock with recovery: When finalizing a peer recovery we mark the peer as "in sync". To do so we wait until the peer's local checkpoint is at least as high as the global checkpoint. If an operation with `refresh=wait_for` is added as a listener on that peer during recovery, it is not completed from the perspective of the primary. The primary then may wait for it to complete before advancing the local checkpoint for that peer. Since that peer is not considered in sync, the global checkpoint on the primary can be higher, causing a deadlock. Operation waits for recovery to finish and a refresh to happen. Recovery waits on the operation.	2017-08-04 19:51:15 +02:00
Tim Brooks	0401df81e0	Revert "Tests: Disable NIO transport mechanism in tests" This reverts commit `c24dbec6f5`.	2017-08-02 09:59:07 -05:00
Colin Goodheart-Smithe	87c6e63e73	Adds mutate function to various tests (#25999 ) * Adds mutate function to various tests Relates to #25929 * fix test * implements mutate function for all single bucket aggs * review comments * convert getMutateFunction to mutateInstance	2017-08-02 11:38:31 +01:00
Alexander Reelsen	c24dbec6f5	Tests: Disable NIO transport mechanism in tests Due to test instability the new transport mechanism is always disabled and does not randomly pick the new IO transport.	2017-08-02 11:18:12 +02:00
Adrien Grand	58feb5efa0	Fix `_exists_` in query_string on empty indices. (#25993 ) It currently fails if there are no mappings yet. Closes #25956	2017-08-02 10:06:34 +02:00
Luca Cavanna	e2d25c3c89	[TEST] Remove duplicated main response unit test (#25855 ) Also move MainResponseTests to extend AbstractStreamableXContentTestCase	2017-08-02 08:42:38 +02:00
Tim Brooks	58d2dcc54f	Ensure send listener is called on IOException Currently there is an issue where the send listener is not called in the nio transport when an exception is thrown during channel flush. This leads to memory leaks. This commit ensures that the listener is called.	2017-08-01 22:30:04 -05:00
Tim Brooks	0f4f49496f	Use nio transport in test clusters (#25986 ) This commit adds the nio transport as an option in place of the mock tcp transport for tests. Each test will only use one transport type. The transport type is decided by a random boolean generated inside of the `ESTestCase` class.	2017-08-01 16:19:31 -05:00
Ryan Ernst	072281d5aa	Update version to 7.0.0-alpha1 (#25876 ) This commit updates the version for master to 7.0.0-alpha1. It also adds the 6.1 version constant, and fixes many tests, as well as marking some as awaits fix. Closes #25893 Closes #25870	2017-08-01 15:47:48 -04:00
Luca Cavanna	4d589afbc2	AbstractQueryBuilder to no longer extend ToXContentToBytes (#25948 ) ToXContentToBytes is used as a base class that adds toString and buildAsBytes method implementation to classes that implement ToXContent. With the ongoing cleanups, this class is limited and doesn't add a lot of value, given that buildAsBytes can be replaced with XContentHelper.toXContent and toString can be replaced with Strings.toString(this). The plan would be to remove ToXContentToBytes entirely, and AbstractQueryBuilder is the first place where we can remove its usage.	2017-07-31 17:38:24 +02:00
Boaz Leskes	9d10ffd547	Goodbye, Translog Views (#25962 ) During peer recoveries, we need to copy over lucene files and replay the operations they miss from the source translog. Guaranteeing that translog files are not cleaned up has seen many iterations over time. Back in the old 1.0 days, recoveries went through the Engine and actively prevented both translog cleaning and lucene commits. We then moved to a notion called Translog Views, which allowed the recovery code to "acquire" a view into the translog which is then guaranteed to be kept around until the view is closed. The Engine code was free to commit lucene and do whatever it wanted without coordinating with recoveries. Translog file deletion logic was based on reference counting on the file level. Those counters were incremented when a view was acquired but also when the view was used to create a `Snapshot` that allowed you to read operations from the files. At some point we removed the file based counting complexity in favor of constructs on the Translog level that just keep track of "open" views and the minimum translog generation they refer to. To do so, Views had to be kept around until the last snapshot that was made from them was consumed. This was fine in recovery code but led to [a subtle bug](https://github.com/elastic/elasticsearch/pull/25862) in the [Primary Replica Resyncer](https://github.com/elastic/elasticsearch/pull/25862). Concurrently, we have developed the notion of a `TranslogDeletionPolicy` which is responsible for the liveness aspect of translog files. This class makes it very simple to take translog Snapshots into account when keeping translog files around, allowing people that just need a snapshot to just take a snapshot and not worry about views and such. Recovery code which actually does need a view can now prevent trimming by acquiring a simple retention lock (a `Closeable`). This removes the need for the notion of a View.	2017-07-31 17:29:43 +02:00
Colin Goodheart-Smithe	7740cb54a5	Improves AbstractWireSerializingTestCase equals test (#25910 ) * Improves AbstractWireSerializingTestCase equals test `AbstractWireSerializingTestCase.testEqualsAndHashcode()` now uses `EqualsHashcodeTestUtils` to perform the hashCode and equals checks. To support this `AbstractWireSerializingTestCase` has two new methods: `getCopyFunction()` and `getMutateFunction` which are used when calling `EqualsHashcodeTestUtils` * Adds TODO * Makes equivalent changes to AbstractStreamableTestCase * corrects javadoc error	2017-07-31 14:46:58 +01:00
Martijn van Groningen	0b776a1de0	Move more token filters to analysis-common module The following token filters were moved: delimited_payload_filter, keep, keep_types, classic, apostrophe, decimal_digit, fingerprint, min_hash and scandinavian_folding. Relates to #23658	2017-07-31 15:15:04 +02:00
Martijn van Groningen	7c3735bdc4	percolator: Store the QueryBuilder's Writeable representation instead of its XContent representation. The Writeable representation is less heavy to parse and that will benefit percolate performance and throughput. The query builder's binary format now has the same bwc guarantees as the xcontent format. Added a qa test that verifies that percolator queries written in older versions are still readable by the current version.	2017-07-28 12:24:10 +02:00
Yannick Welsch	1a01514081	Move tribe to a module (#25778 ) This commit moves tribe to a module, stripping core from the tribe functionality.	2017-07-28 11:23:50 +02:00
Jason Tedor	1492ccd7ae	Fix environment-aware command tests This commit fixes tests for environment-aware commands. A previous change added a check that es.path.conf is not null. The problem is that this system property is not being set in tests so this check trips every single time. To fix this, we move the check into a method that can be overridden, and then override this method in relevant places in tests to avoid having to set the property in tests. We also add a test that this check works as expected.	2017-07-28 14:37:04 +09:00
Simon Willnauer	b72c71083c	Cleanup IndexFieldData visibility (#25900 ) Today we expose `IndexFieldDataService` outside of IndexService to do maintenance or lookup field data in different ways. Yet, we have a streamlined way to access IndexFieldData via `QueryShardContext` that should encapsulate all access to it. This also ensures that we control all other functionality like cache clearing etc. This change also removes the `recycler` option from `ClearIndicesCacheRequest` this option is a no-op and should have been removed long ago.	2017-07-26 20:03:42 +02:00
Tim Brooks	6d02b45f10	Support client-only mode for NioTransport (#25839 ) Currently, NioTransport does not start normal socket selectors and the client when the network server setting is set to false. This commit makes it so that the client will be started even when the network server is not enabled. Additionally, it randomly introduces the NioTransport as an option for the MockTransportClient throughout tests.	2017-07-26 10:27:15 -05:00
Luca Cavanna	d8203f19fd	Remove XContentHelper#toString(ToXContent) in favour of Strings#toString(ToXContent) (#25866 ) These two methods do the same thing. The subtle difference between the two is that the former prints out pretty printed content by default while the latter doesn't. There are way more usages of the latter throughout the codebase hence I kept that variant although I do think that it would be much better to print out prettified content by default from a `toString`. That breaks quite some tests so I didn't make that change yet. Also XContentHelper#toString was outdated as it didn't check the ToXContent#isFragment method to decide whether a new anonymous object has to be created or not. It would simply fail with any ToXContentObject.	2017-07-26 16:00:59 +02:00
Simon Willnauer	634ce90dc0	Respect cluster alias in `_index` aggs and queries (#25885 ) Today when we aggregate on the `_index` field the cross cluster search alias is not taken into account. Neither is it respected when we search on the field. This change adds support for cluster alias when the cluster alias is present on the `_index` field. Closes #25606	2017-07-26 09:16:52 +02:00
Tim Brooks	2d22bad53f	Simplify selector close method (#25838 ) Currently we have an option to interrupt the selector thread on close. This option is not needed as we do not call this method and we should not be blocking on the network thread. Instead we only need to ever call wakeup() on the raw selector.	2017-07-25 10:52:15 -05:00
Michael Basnight	e816ef89a2	Shade external dependencies in the rest client jar This commit removes all external dependencies from the rest client jar and shades them in an 'org.elasticsearch.client' package within the jar using shadowJar gradle plugin. All projects that depended on the existing jar have been converted to using the 'org.elasticsearch.client' package prefixes to interact with the rest client. Closes #25208	2017-07-24 12:55:43 -05:00
Tim Brooks	0a4b38b60c	Close raw channel when bind / connect fails (#25840 ) Currently we are failing to close socket channels when the initial bind or connect operation fails. This leaves the file descriptor hanging around. This closes the channel when an exception occurs during bind or connect.	2017-07-22 13:55:33 -05:00
Tim Brooks	c7a7c69b2b	Simplify NioChannel creation and closing process (#25504 ) Currently an NioChannel is created and it is UNREGISTERED. At some point it is registered with a selector. From that point on, the channel can only be closed by the selector. The fact that a channel might not be associated with a selector has significant implications for concurrency and the channel shutdown process. The only thing that is simplified by allowing channels to be in a state independent of a selector is some testing scenarios. This PR modifies channels so that they are given a selector at creation time and are always associated with that selector. Only that selector can close that channel. This simplifies the channel lifecycle and closing intricacies.	2017-07-21 11:55:23 -05:00
Yannick Welsch	a2624dfcef	Move primary term from ReplicationRequest to ConcreteShardRequest (#25822 ) Removes the primary term from the replication request and pushes it into the transport envelope. This makes it possible to remove the term from the ReplicationOperation universe. The primary term that is to be used for a replication operation is now determined in the reroute phase when the node decides to execute a primary action (and validated once the primary action gets to execute). This makes it possible to validate that the primary action was sent to the correct primary shard instance that it was meant to be sent to (currently we only validate primary actions using the allocation id, which can be reused for failed and reallocated primaries).	2017-07-21 15:57:42 +02:00
Boaz Leskes	7488877d1a	Validate a joining node's version with version of existing cluster nodes (#25808 ) When a node tries to join a cluster, it goes through a validation step to make sure the node is compatible with the cluster. Currently we validate that the node can read the cluster state and that it is compatible with the indexes of the cluster. This PR adds validation that the joining node's version is compatible with the versions of existing nodes. Concretely we check that: 1) The node's min compatible version is higher or equal to any node in the cluster (this prevents a too-new node from joining) 2) The node's version is higher or equal to the min compat version of all cluster nodes (this prevents a too old join where, for example, the master is on 5.6, there's another 6.0 node in the cluster and a 5.4 node tries to join). 3) The node's major version is at least as high as the lowest node in the cluster. This is important as we use the minimum version in the cluster to stop executing bwc code for operations that require multiple nodes. If the nodes are already operating in "new cluster mode", we should prevent nodes from the previous major from joining (even if they are wire level compatible). This does mean that if you have a very unlucky partition during the upgrade which partitions all old nodes which are also a minority / data nodes only, they may not be able to re-join the cluster. We feel this edge case risk is well worth the simplification it brings to BWC layers only going one way. This restriction only holds if the cluster state has been recovered (i.e., the cluster has properly formed). Also, the node join validation can now selectively fail specific nodes (previously the entire batch was failed). This is an important preparation for a follow up PR where we plan to have a rejected joining node die with dignity.	2017-07-20 20:11:29 +02:00
Simon Willnauer	5e629cfba0	Ensure query resources are fetched asynchronously during rewrite (#25791 ) The `QueryRewriteContext` used to provide a client object that can be used to fetch geo-shapes, terms or documents for percolation. Unfortunately all client calls used to be blocking calls which can have significant impact on the rewrite phase since it occupies an entire search thread until the resource is received. In the case that the index the resource is fetched from isn't on the local node this can have significant impact on query throughput. Note: this doesn't fix MLT since it fetches stuff in doQuery which is a different beast. Yet, it is a huge step in the right direction	2017-07-20 15:37:50 +02:00
Boaz Leskes	9989ac69a4	Revert "Validate a joining node's version with version of existing cluster nodes (#25770 )" This reverts commit `1e1f8e6376`.	2017-07-19 17:34:53 +02:00
Simon Willnauer	4d78935df7	Introduce a new Rewriteable interface to streamline rewriting (#25788 ) Today we have duplicated and quite complicated code to iterate over rewriteables (mainly `QueryBuilders`). This change introduces a `Rewriteable` interface that allows sharing the code that does the rewriting as well as encapsulation and composition of queries.	2017-07-19 15:06:49 +02:00
Adrien Grand	55ad318541	Reduce the overhead of timeouts and low-level search cancellation. (#25776 ) Setting a timeout or enforcing low-level search cancellation used to make us wrap the collector and check either the current time or whether the search task was cancelled for every collected document. This can be significant overhead on cheap queries that match many documents. This commit changes the approach to wrap the bulk scorer rather than the collector and exponentially increase the interval between two consecutive checks in order to reduce the overhead of those checks.	2017-07-19 14:15:53 +02:00
Boaz Leskes	1e1f8e6376	Validate a joining node's version with version of existing cluster nodes (#25770 ) When a node tries to join a cluster, it goes through a validation step to make sure the node is compatible with the cluster. Currently we validate that the node can read the cluster state and that it is compatible with the indexes of the cluster. This PR adds validation that the joining node's version is compatible with the versions of existing nodes. Concretely we check that: 1) The node's min compatible version is higher or equal to any node in the cluster (this prevents a too-new node from joining) 2) The node's version is higher or equal to the min compat version of all cluster nodes (this prevents a too old join where, for example, the master is on 5.6, there's another 6.0 node in the cluster and a 5.4 node tries to join). 3) The node's major version is at least as high as the lowest node in the cluster. This is important as we use the minimum version in the cluster to stop executing bwc code for operations that require multiple nodes. If the nodes are already operating in "new cluster mode", we should prevent nodes from the previous major from joining (even if they are wire level compatible). This does mean that if you have a very unlucky partition during the upgrade which partitions all old nodes which are also a minority / data nodes only, they may not be able to re-join the cluster. We feel this edge case risk is well worth the simplification it brings to BWC layers only going one way. Also, the node join validation can now selectively fail specific nodes (previously the entire batch was failed). This is an important preparation for a follow up PR where we plan to have a rejected joining node die with dignity.	2017-07-19 12:57:29 +02:00
Lee Hinman	610ba7e427	Register data node stats from info carried back in search responses (#25430 ) * Register data node stats from info carried back in search responses This is part of #24915, where we now calculate the EWMA of service time for tasks in the search threadpool, and send that as well as the current queue size back to the coordinating node. The coordinating node now tracks this information for each node in the cluster. This information will be used in the future to determine the best replica a search request should be routed to. This change has no user-visible difference. * Move response time timing into ResponseListenerWrapper * Move ResponseListenerWrapper to ActionListener instead of SearchActionListener Also removes the logger * Move `requestIndex` back to private * De-guice-ify ResponseCollectorService \o/ * Undo all changes to SearchQueryThenFetchAsyncAction * Remove unneeded response collector from TransportSearchAction * Undo all changes to SearchDfsQueryThenFetchAsyncAction * Completely rewrite the inside of ResponseCollectorService's record keeping * Documentation and cleanups for ResponseCollectorService * Add unit test for collection of queue size and service time * Fix Guice construction error * Add basic unit tests for ResponseCollectorService * Fix version constant for the master merge * Fix test compilation after master merge * Add a test for node removal on cluster changed event * Remove integration test as there are now unit tests * Rename ResponseListenerWrapper -> SearchExecutionStatsCollector * Fix line-length * Make classes private and final where appropriate * Pass nodeId into SearchExecutionStatsCollector and use only ActionListener * Get nodeId from connection so searchShardTarget can be private * Remove threadpool from SearchContext, get it from IndexShard instead * Add missing import * Use BiFunction for responseWrapper rather than passing in collector service	2017-07-17 11:04:51 -06:00
Adrien Grand	264088f1c4	Deprecate the `_default_` mapping. (#25652 ) Now that indices cannot have types anymore, this feature does not buy anything anymore. Closes #25500	2017-07-17 15:37:59 +02:00
Martijn van Groningen	8003171a0c	Move more token filters to analysis-common module The following token filters were moved: arabic_normalization, german_normalization, hindi_normalization, indic_normalization, persian_normalization, scandinavian_normalization, serbian_normalization, sorani_normalization, cjk_width and cjk_width Relates to #23658	2017-07-17 08:29:44 +02:00
Boaz Leskes	a6bea1bf97	testMockFailToSendNoConnectRule should wait for connection close to bubble up and disconnect the node #25521 changed channel closing to be handled async on anything but transport stop. This means it may take a while between calling `connection.close()` and the node being removed from the `connectedNodes` list (but the connection is immediately unusable). Fixes #25686	2017-07-15 09:28:17 +02:00
Yannick Welsch	8f0b357651	Let primary own its replication group (#25692 ) Currently replication and recovery are both coordinated through the latest cluster state available on the ClusterService as well as through the GlobalCheckpointTracker (to have consistent local/global checkpoint information), making it difficult to understand the relation between recovery and replication, and requiring some tricky checks in the recovery code to coordinate between the two. This commit makes the primary the single owner of its replication group, which simplifies the replication model and allows to clean up corner cases we have in our recovery code. It also reduces the dependencies in the code, so that neither RecoverySourceXXX nor ReplicationOperation need access to the latest state on ClusterService anymore. Finally, it gives us the property that in-sync shard copies won't receive global checkpoint updates which are above their local checkpoint (relates #25485).	2017-07-14 13:52:53 +02:00
Luca Cavanna	ec66d655b5	Rename client artifacts (#25693 ) It was brought up that our current client artifacts have generic names like 'rest' that may cause conflicts with other artifacts. This commit renames: - rest -> elasticsearch-rest-client - sniffer -> elasticsearch-rest-client-sniffer - rest-high-level -> elasticsearch-rest-high-level-client A couple of small changes are also preparing the high level client for its first release. Closes #20248	2017-07-13 09:44:25 +02:00
Simon Willnauer	b7bc790428	Use a non default port range in MockTransportService We already use a per JVM port range in MockTransportService. Yet, it's possible that if we are executing in the JVM with ordinal 0 that other clusters reuse ports from the mock transport service and some tests try to simulate disconnects etc. By using a non-default port range (starting at 10300) we prevent internal test clusters from reusing any of the mock impls ports. Relates to #25301	2017-07-12 22:29:21 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is no document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and its upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards is higher than the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
Tim Brooks	a3ade99fcf	Fix BytesReferenceStreamInput#skip with offset (#25634 ) There is a bug when a call to `BytesReferenceStreamInput` skip is made on a `BytesReference` that has an initial offset. The offset for the current slice is added to the current index and then subtracted from the length. This introduces the possibility of a negative number of bytes to skip. This happens inside a loop, which leads to an infinite loop. This commit correctly subtracts the current slice index from the slice.length. Additionally, the `BytesArrayTests` are modified to test instances that include an offset.	2017-07-11 09:54:29 -05:00
Simon Willnauer	98c91a3bd0	Limit the number of concurrent shard requests per search request (#25632 ) This is a protection mechanism to prevent a single search request from hitting a large number of shards in the cluster concurrently. If a search is executed against all indices in the cluster this can easily overload the cluster causing rejections etc. which is not necessarily desirable. Instead this PR adds a per request limit of `max_concurrent_shard_requests` that throttles the number of concurrent initial phase requests to `256` by default. This limit can be increased per request and protects single search requests from overloading the cluster. Subsequent PRs can introduce additional improvements ie. limiting this on a `_msearch` level, making defaults a factor of the number of nodes or sort shards iters such that we gain the best concurrency across nodes.	2017-07-11 16:23:10 +02:00
Simon Willnauer	ec1afe30ea	Ensure remote cluster alias is preserved in inner hits aggs (#25627 ) We lost the cluster alias due to some special casing in inner hits and due to the fact that we didn't pass on the alias to the shard request. This change ensures that we have the cluster alias present on the shard to ensure all SearchShardTarget reads preserve the alias. Relates to #25606	2017-07-11 11:34:06 +02:00
Tim Brooks	b22bbf94da	Avoid blocking on channel close on network thread (#25521 ) Currently when we close a channel in Netty4Utils.closeChannels we block until the closing is complete. This introduces the possibility that a network selector thread will block while waiting until a separate network selector thread closes a channel. For instance: T1 closes channel 1 (which is assigned to a T1 selector). Channel 1's close listener executes the closing of the node. That means that T1 now tries to close channel 2. However, channel 2 is assigned to a selector that is running on T2. T1 now must wait until T2 closes that channel at some point in the future. This commit addresses this by adding a boolean to closeChannels indicating if we should block on close. We only set this boolean to true if we are closing down the server channels at shutdown. This call is never made from a network thread. When we call the closeChannels method with that boolean set to false, we do not block on close.	2017-07-10 10:50:51 -05:00
Colin Goodheart-Smithe	3a5a54e83e	Collapses package structure for some bucket aggs (#25579 ) This change collapses some of the packages for the bucket aggregations into their parent packages. This was done for the following aggregations: * The variants of the range aggregation (geo_distance, date and ip) were moved into the `o.e.s.a.bucket.range` package * The `o.e.s.a.bucket.terms.support` package was removed and the classes were moved to `o.e.s.a.bucket.terms` * The filter aggregation was moved to `o.e.s.a.bucket.filter` Since this PR is already relatively large with only the above changes subsequent PRs will do similar operations on relevant metric and pipeline aggregations Relates to #22868	2017-07-10 15:08:15 +01:00
Boaz Leskes	09378f48e4	Add a scheduled translog retention check (#25622 ) We currently check whether translog files can be trimmed whenever we create a new translog generation or close a view. However #25294 added a long translog retention period (12h, max 512MB by default), which means translog files should potentially be cleaned up long after there isn't any indexing activity to trigger flushes/the creation of new translog files. We therefore need a scheduled background check to clean up those files once they are no longer needed. Relates to #10708	2017-07-10 10:28:39 +02:00
Jason Tedor	c084542731	Bump version to 6.0.0-beta1 This commit does two things: - bumps the version from 6.0.0-alpha3 to 6.0.0-beta1 - renames the 6.0.0-alpha3 version constant to 6.0.0-beta1 Relates #25621	2017-07-09 18:12:50 -04:00
Jason Tedor	bc22c1c286	Add disk threshold settings validation This commit adds cross-settings validation for the low/high/flood stage disk watermark settings. This validation was enabled by the introduction of multiple settings validation. Relates #25600	2017-07-07 19:54:36 -04:00
Nik Everett	794257c421	Drop current from the list of released versions (#25187 ) It hasn't been released....	2017-07-07 15:59:57 -04:00
Yannick Welsch	baa87db5d1	Harden global checkpoint tracker This commit refactors the global checkpoint tracker to make it more resilient. The main idea is to make it more explicit what state is actually captured and how that state is updated through replication/cluster state updates etc. It also fixes the issue where the local checkpoint information is not being updated when a shard becomes primary. The primary relocation handoff becomes very simple too, we can just verbatim copy over the internal state. Relates #25468	2017-07-07 14:04:28 -04:00
Lee Hinman	8aa0a5c111	Improve REST error handling when endpoint does not support HTTP verb, add OPTIONS support (#24437 ) * Improved REST endpoint exception handling, see #15335 Also improved OPTIONS http method handling to better conform with the http spec. * Tidied up formatting and comments See #15335 * Tests for #15335 * Cleaned up comments, added section number * Swapped out tab indents for space indents * Test class now extends ESSingleNodeTestCase * Capture RestResponse so it can be examined in test cases Simple addition to surface the RestResponse object so we can run tests against it (see issue #15335). * Refactored class name, included feedback See #15335. * Unit test for REST error handling enhancements Randomizing unit test for enhanced REST response error handling. See issue #15335 for more details. * Cleaned up formatting * New constructor to set HTTP method Constructor added to support RestController test cases. * Refactored FakeRestRequest, streamlined test case. * Cleaned up conflicts * Tests for #15335 * Added functionality to ignore or include path wildcards See #15335 * Further enhancements to request handling Refactored executeHandler to prioritize explicit path matches. See #15335 for more information. * Cosmetic fixes * Refactored method handlers * Removed redundant import * Updated integration tests * Refactoring to address issue #17853 * Cleaned up test assertions * Fixed edge case if OPTIONS method randomly selected as invalid method In this test, an OPTIONS method request is valid, and should not return a 405 error. * Remove redundant static modifier * Hook the multiple PathTrie attempts into RestHandler.dispatchRequest * Add missing space * Correctly retrieve new handler for each Trie strategy * Only copy headers to threadcontext once * Fix test after REST header copying moved higher up * Restore original params when trying the next trie candidate * Remove OPTIONS for invalidHttpMethodArray so a 405 is guaranteed in tests * Re-add the fix I already added and got removed during merge :-/ * Add missing GET method to test * Add documentation to migration guide about breaking 404 -> 405 changes * Explain boolean response, pull into local var * fixup! Explain boolean response, pull into local var * Encapsulate multiple HTTP methods into PathTrie<MethodHandlers> * Add PathTrie.retrieveAll where all matching modes can be retrieved Then TrieMatchingMode can be package private and not leak into RestController * Include body of error with 405 responses to give hint about valid methods * Fix missing usageService handler addition I accidentally removed this :X * Initialize PathTrieIterator modes with Arrays.asList * Use "== false" instead of ! * Missing paren :-/	2017-07-07 09:01:23 -06:00
Adrien Grand	40bb1663ee	Index ids in binary form. (#25352 ) Indexing ids in binary form should help with indexing speed since we would have to compare fewer bytes upon sorting, should help with memory usage of the live version map since keys will be shorter, and might help with disk usage depending on how efficient the terms dictionary is at compressing terms. Since we can only expect base64 ids in the auto-generated case, this PR tries to use an encoding that makes the binary id equal to the base64-decoded id in the majority of cases (253 out of 256). It also specializes numeric ids, since this seems to be common when content that is stored in Elasticsearch comes from another database that uses eg. auto-increment ids. Another option could be to require base64 ids all the time. It would make things simpler but I'm not sure users would welcome this requirement. This PR should bring some benefits, but I expect it to be mostly useful when coupled with something like #24615. Closes #18154	2017-07-07 14:22:47 +02:00
Martijn van Groningen	6db708ef75	Move more token filters to analysis-common module The following token filters were moved: common grams, limit token, pattern capture and pattern replace. Relates to #23658	2017-07-07 10:02:52 +02:00
Simon Willnauer	1f67d079b1	Validate `transport.profiles.` settings (#25508 ) Transport profiles unfortunately have never been validated. Yet, it's very easy to make a mistake when configuring profiles which will most likely stay undetected since we don't validate the settings but allow almost everything based on the wildcard in `transport.profiles.`. This change removes the settings subset based parsing of profiles but rather uses concrete affix settings for the profiles which makes it easier to fall back to higher level settings since the fallback settings are present when the profile setting is parsed. Previously, it was unclear in the code which setting is used ie. if the profiles settings (with removed prefixes) or the global node setting. There is no distinction anymore since we don't pull prefix based settings.	2017-07-07 09:40:59 +02:00
Simon Willnauer	38a1df7da1	Use a port range per JVM in MockTransportService (#25565 ) Some tests use MockTransportService to do network based testing. Yet, we run tests in multiple JVMs, which means concurrent tests could claim a port that another JVM just released and if that test tries to simulate a disconnect it might be smart enough to re-connect depending on what is tested. To reduce the risk, since this is very hard to debug, we use a different default port range per JVM unless the incoming settings override it. Closes #25301	2017-07-06 09:14:52 +02:00
Simon Willnauer	6e5cc424a8	Switch indices read-only if a node runs out of disk space (#25541 ) Today when we run out of disk all kinds of crazy things can happen and nodes are becoming hard to maintain once out of disk is hit. While we try to move shards away if we hit watermarks this might not be possible in many situations. Based on the discussion in #24299 this change monitors disk utilization and adds a flood-stage watermark that causes all indices that are allocated on a node hitting the flood-stage mark to be switched read-only (with the option to be deleted). This allows users to react on the low disk situation while subsequent write requests will be rejected. Users can switch individual indices read-write once the situation is sorted out. There is no automatic read-write switch once the node has enough space. This requires user interaction. The flood-stage watermark is set to `95%` utilization by default. Closes #24299	2017-07-05 22:18:23 +02:00
Christoph Büscher	3185eaece8	QueryBuilders should implement ToXContentObject (#25530 ) All query builders are written as self-contained xContent objects, so we should mark them accordingly using ToXContentObject. This also makes it possible to use things like XContentHelper#toXContent to render query builders in tests.	2017-07-05 09:50:10 +02:00
Christoph Büscher	f576c987ce	Remove QueryParseContext (#25486 ) QueryParseContext is currently only used as a wrapper for an XContentParser, so this change removes it entirely and changes the appropriate APIs that use it so far to only accept a parser instead.	2017-07-03 17:30:40 +02:00
Simon Willnauer	5a7c8bb04e	Cleanup network / transport related settings (#25489 ) This commit makes the use of the global network settings explicit instead of implicit within NetworkService. It cleans up several places where we fall back to the global settings while we should have used tcp or http ones. In addition this change also removes unnecessary settings classes	2017-07-02 10:16:50 +02:00
Tim Brooks	cac2eec7d2	Add NioTransport threads to thread name checks (#25477 ) We have various assertions that check we never block on transport threads. This commit adds the thread names for the NioTransport to these assertions. With this change I had to fix two places where we were calling blocking methods from the transport threads.	2017-06-29 15:16:07 -05:00
Tim Brooks	dd5d165da1	Prevent channel enqueue after selector close (#25478 ) This commit adds additional protection to `ESSelector` and its implementations to ensure that channels are not enqueued after the selector is closed. After a channel has been added to the queue, we check that the selector is open. If it is not, then we remove the channel from the queue. If the channel is removed successfully, we throw an `IllegalStateException`.	2017-06-29 14:02:50 -05:00
Tim Brooks	6c58f0c4e6	Handle ping correctly in NioTransport (#25462 ) Our current TCPTransport logic assumes that we do not pass pings to the TCPTransport level. This commit fixes an issue where NioTransport was passing pings to TCPTransport and leading to exceptions.	2017-06-29 11:03:51 -05:00
Christoph Büscher	acade2b40a	Tests: Remove platform specific assertion in NioSocketChannelTests This check depends on the language settings on the system the test runs on, e.g. it fails on Ubuntu with LANG=de_DE.UTF-8.	2017-06-29 17:32:51 +02:00
Christoph Büscher	927111c91d	Remove QueryParseContext from parsing QueryBuilders (#25448 ) Currently QueryParseContext is only a thin wrapper around an XContentParser that adds little functionality of its own. It provides helpers for long deprecated field names which can be removed and two helper methods that can be made static and moved to other classes. This is a first step in helping to remove QueryParseContext entirely.	2017-06-29 17:10:20 +02:00
Tim Brooks	cad57959e1	Remove finicky exception message assertion In SimpleNioTransportTests we assert that an IOException has a certain message. It appears that this message is not dependable (and might change based on platform). Our other transport tests (mock and netty) do not make this assertion. Instead they only assert on our application exception message. This commit removes the IOException message assertion and retains the ConnectTransportException message assertion.	2017-06-28 14:16:04 -05:00
Tim Brooks	5f8be0e090	Introduce NioTransport into framework for testing (#24262 ) This commit introduces a nio based tcp transport into framework for testing. Currently Elasticsearch uses a simple blocking tcp transport for testing purposes (MockTcpTransport). This diverges from production where our current transport (netty) is non-blocking. The point of this commit is to introduce a testing variant that more closely matches the behavior of production instances.	2017-06-28 10:51:20 -05:00
Yannick Welsch	5a4a47332c	Use a single method to update shard state This commit refactors index shard to provide a single method for updating the shard state on an incoming cluster state update. Relates #25431	2017-06-28 09:48:47 -04:00
Jason Tedor	5a9fc8aa2a	Remove path.conf setting This commit removes path.conf as a valid setting and replaces it with a command-line flag for specifying a non-default path for configuration. Relates #25392	2017-06-26 15:18:29 -04:00
Martijn van Groningen	a34f5fa812	Move more token filters to analysis-common module The following token filters were moved: stemmer, stemmer_override, kstem, dictionary_decompounder, hyphenation_decompounder, reverse, elision and truncate. Relates to #23658	2017-06-26 09:02:16 +02:00
Ryan Ernst	1583f81047	Test: Allow merging mock secure settings (#25387 ) While real secure settings (ie an ES keystore) cannot be merged together, mocked secure settings can and need to be sometimes merged. This commit adds a merge method to allow tests to merge together multiple instances of secure settings.	2017-06-25 10:19:51 -07:00
Martijn van Groningen	9c511bc447	test: Replace OldIndexBackwardsCompatibilityIT#testOldClusterStates with a full cluster restart qa test OldIndexBackwardsCompatibilityIT#testOldClusterStates tested whether global and index metadata could be read from data directory, this can also be tested in full cluster qa test that checks cluster state via api. Relates to #24939	2017-06-23 09:54:05 +02:00
Boaz Leskes	d963882053	Enable a long translog retention policy by default (#25294 ) #25147 added the translog deletion policy but didn't enable it by default. This PR enables a default retention of 512MB (same maximum size of the current translog) and an age of 12 hours (i.e., after 12 hours all translog files will be deleted). This increases the chance of an ops based recovery, even if the primary flushed or the replica was offline for a few hours. In order to see which parts of the translog are committed into lucene the translog stats are extended to include information about uncommitted operations. Views now include all translog ops and guarantee, as before, that those will not go away. Snapshotting a view allows filtering out generations that are not relevant based on a specific sequence number. Relates to #10708	2017-06-22 17:08:14 +02:00
Adrien Grand	44e9c0b947	Upgrade to lucene-7.0.0-snapshot-ad2cb77. (#25349 ) Most notable changes: - better update concurrency: LUCENE-7868 - TopDocs.totalHits is now a long: LUCENE-7872 - QueryBuilder does not remove the boolean query around multi-term synonyms: LUCENE-7878 - removal of Fields: LUCENE-7500 For the `TopDocs.totalHits` change, this PR relies on the fact that the encoding of vInts and vLongs are compatible: you can write and read with any of them as long as the value can be represented by a positive int.	2017-06-22 12:35:33 +02:00
Boaz Leskes	7013cbd927	Update MockTransportService to the age of Transport.Connection (#25320 ) MockTransportServices allows us to simulate network disruptions in our testing infra. Sadly it wasn't updated to the state of the art in Transport land. This PR brings it up to speed. Specifically: 1) Opening a connection is now also blocked (before only node connections were blocked) 2) Simplifies things using the latest connection based notification between TcpTransport and TransportService for when a disconnect happens. 3) By 2, it fixes a race condition where we may fail to respond to a sent request when it is sent concurrently with the closing of a connection. The old code relied on a node based bridge between tcp transport and transport service. Sadly, the following doesn't work any more: ``` if (transport.nodeConnected(node)) { // this a connected node, disconnecting from it will be up the exception transport.disconnectFromNode(node); <-- this may now be a noop and it doesn't mean that the transport service was notified of the disconnect between the nodeConnected check and here. } else { throw new ConnectTransportException(node, reason, e); } ```	2017-06-21 10:27:57 +02:00
Simon Willnauer	86a544de3b	Ensure we never read from a closed MockSecureSettings object (#25322 ) If secure settings are closed after the node has been constructed no key-store access is permitted. We should also try to be as close as possible to the real behavior if we mock secure settings. This change also adds the same behavior as bootstrap has to InternalTestCluster to ensure we fail if we try to read from secure settings after the node has been constructed.	2017-06-21 08:14:38 +02:00
Simon Willnauer	5abb7c4bec	Use IndexMetaData settings as a basis for new index settings (#25310 ) In MockFSDirectory we should use the actual indexes settings to build a new IndexMetaData settings object instead of the node settings. Relates to #25297	2017-06-20 15:44:19 +02:00
Nik Everett	3261586cac	Tweak reindex cancel logic and add many debug logs (#25256 ) I'm still trying to hunt down rare failures in the cancelation tests for reindex and friends. Here is the latest: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=ubuntu/876/console It doesn't show much, other than that one of the tasks didn't kill itself when asked to cancel. So I'm going a bit crazy with debug logging so that the next time this comes up I can trace exactly what happened. Additionally, this tweaks the logic around how rethrottles were performed around cancel. Previously we set the `requestsPerSecond` to `0` when we cancelled the task. That was the "old way" to set them to infinity which was the intent. This switches that from `0` to `Float.MAX_VALUE` which is the "new way" to set the `requestsPerSecond` to infinity. I don't know that this is much better, but it feels better.	2017-06-19 18:46:42 -04:00
Jay Modi	1a6491bc54	Test: do not copy secure settings when creating random directory service (#25297 ) In tests, we sometimes create a random directory service and as part of that the IndexSettings get built again. When we build them again, we need to make sure we do not set the secure settings on the new IndexMetaData object that gets created as the node settings already have the secure settings and the index settings and node settings will be combined. If both have secure settings, the settings builder will throw an AlreadySetException.	2017-06-19 14:52:32 -06:00
Yannick Welsch	1a20760d79	Simplify IndexShard indexing and deletion methods (#25249 ) Indexing or deleting documents through the IndexShard interface is quite complex and error-prone. It requires multiple calls, e.g. first prepareIndexOnPrimary, then do some checks if mapping updates have occurred, then do the actual indexing using index(...) etc. Currently each consumer of the interface (local recovery, peer recovery, replication) has additional custom checks built around it to deal with mapping updates, some of which are even inconsistent. This commit aims at reducing the complexity by exposing a simpler interface on IndexShard. There are no more prepare*** methods and the mapping complexity is also hidden, but still giving callers a possibility to implement custom logic to deal with mapping updates.	2017-06-19 20:11:54 +02:00
Martijn van Groningen	bcaa413b0b	test: Port the remaining old indices search tests to full cluster restart qa module Also tweaked the qa module's gradle file to actually run bwc tests against all index compat versions. Relates to #24939	2017-06-19 12:27:24 +02:00
Simon Willnauer	dc02b32650	Simplify connection closing and cleanups in TcpTransport (#25250 ) Today we maintain a map of open connections in order to close them when a low level channel gets closed or handles a failure. We also spawn a thread due to some tricky concurrency issues especially with respect to netty since the listener might be called on a transport / boss thread. Executions on those threads must not be blocking since otherwise we will likely deadlock the event processing which adds to the complexity of the concurrency model in this class. This change associates the connection with the close callback that every channel invokes once it's closed which allows us to remove the connections map. A relaxed non-blocking concurrency model in the connection close listener allows cleaning up connected nodes without blocking on any lock.	2017-06-19 09:19:45 +02:00
Simon Willnauer	5f18791f1c	[TEST] assertBusy on transport stats since some implementations invoke listeners concurrently	2017-06-18 00:08:34 +02:00
Christoph Büscher	e99ced06cc	[Tests] Check that parsing aggregations works in a forward compatible way (#25219 ) This change adds tests for the aggregation parsing that try to simulate that we can parse existing aggregations in a forward compatible way in the future, ignoring potential newly added fields or substructures to the xContent response.	2017-06-17 13:06:31 +02:00
Simon Willnauer	f18b0d293c	Move TransportStats accounting into TcpTransport (#25251 ) Today TcpTransport is the de-facto base-class for transport implementations. The callbacks we have in TransportServiceAdaptor are no longer necessary since we can simply have the logic inside the base class itself. This change moves the stats metrics directly into TcpTransport removing the need for low level bytes send / received callbacks.	2017-06-16 22:34:11 +02:00
Christoph Büscher	d3442f7d0c	Add unit test for PathHierarchyTokenizerFactory (#24984 )	2017-06-15 19:18:33 +02:00
Martijn van Groningen	428e70758a	Moved more token filters to analysis-common module. The following token filters were moved: `edge_ngram`, `ngram`, `uppercase`, `lowercase`, `length`, `flatten_graph` and `unique`. Relates to #23658	2017-06-15 18:28:31 +02:00
Boaz Leskes	648b4717a4	move assertBusy to use CheckException (#25246 ) We use assertBusy in many places where the underlying code throws exceptions. Currently we need to wrap those exceptions in a RuntimeException which is ugly.	2017-06-15 13:24:07 +02:00
Adrien Grand	0c117145f6	Upgrade to lucene-7.0.0-snapshot-92b1783. (#25222 ) This snapshot has faster range queries on range fields (LUCENE-7828), more accurate norms (LUCENE-7730) and the ability to use fake term frequencies (LUCENE-7854).	2017-06-15 09:52:07 +02:00
Ryan Ernst	caf7792db1	Scripting: Rename SearchScript.needsScores to needs_score (#25235 ) This commit renames the needsScores method so as to make it automatically generatable, based on the name of the `_score` variable which is available in search scripts. It also adds documentation to ScriptContext to explain the naming and signature of such methods.	2017-06-14 22:01:19 -07:00
Nik Everett	ce11b894b4	Extract the snapshot/restore full cluster restart tests from the translog full cluster restart tests (#25204 ) Extract the snapshot/restore full cluster restart tests from the translog full cluster restart tests. That way they are easier to read.	2017-06-14 13:03:59 -04:00
Jay Modi	ed76b9a518	Test: allow setting socket timeout for rest client (#25221 ) In #25201, a setting was added to allow setting the retry timeout for the rest client under the impression that this would allow requests to go longer than 30s. However, there is also a socket timeout that needs to be set to greater than 30s, which this change adds a setting for.	2017-06-14 08:21:56 -06:00
Andy Bristol	48696ab544	expose simple pattern tokenizers (#25159 ) Expose the experimental simplepattern and simplepatternsplit tokenizers in the common analysis plugin. They provide tokenization based on regular expressions, using Lucene's deterministic regex implementation that is usually faster than Java's and has protections against creating too-deep stacks during matching. Both have a not-very-useful default pattern of the empty string because all tokenizer factories must be able to be instantiated at index creation time. They should always be configured by the user in practice.	2017-06-13 12:46:59 -07:00
Jay Modi	190242fb1b	Test: add setting to change request timeout for rest client (#25201 ) This commit adds a setting to change the request timeout for the rest client. This is useful as the default timeout is 30s, which is also the same default for calls like cluster health. If both are the same then the response from the cluster health api will not be received as the client usually times out first making test failures harder to debug. Relates #25185	2017-06-13 12:19:17 -06:00
Simon Willnauer	186c16ea41	Ensure pending transport handlers are invoked for all channel failures (#25150 ) Today if a channel gets closed due to a disconnect we notify the response handler that the connection is closed and the node is disconnected. Unfortunately this is not a complete solution since it only works for published connections. Connections that are unpublished, i.e. for discovery, can hang indefinitely since we never invoke their handlers when we get a failure while a user is waiting for the response. This change adds connection tracking to TcpTransport that ensures we are notifying the corresponding connection if there is a failure on a channel.	2017-06-13 09:37:05 +02:00
Tal Levy	340909582f	remove Ingest's Internal Template Service (#25085 ) Ingest was using its own wrapper around TemplateScripts and the ScriptService. This commit removes that abstraction.	2017-06-08 15:24:03 -07:00
Lee Hinman	119f8ed9f0	Correctly enable _all for older 5.x indices When we disabled `_all` by default for indices created in 6.0, we missed adding a layer that would handle the situation where `_all` was not enabled in 5.x and then the cluster was updated to 6.0, this means that when the cluster was updated the `_all` field would be disabled for 5.x indices and field values would not be added to the `_all` field. This adds a compatibility layer for 5.x indices where we treat the default enabled value for the `_all` field to be `true` if unset on 5.x indices. Resolves #25068	2017-06-08 14:37:44 -06:00
Nik Everett	4a8c09c5f1	Make randomVersionBetween work with unreleased versions (#25042 ) Test: randomVersionBetween works with unreleased Modifies randomVersionBetween so that it works with unreleased versions. This should make switching a version from unreleased to released much simpler.	2017-06-08 10:19:06 -04:00
Yannick Welsch	cd57395c98	Use correct primary term for replicating NOOPs (#25128 ) NOOPs should be, same as for indexing operations, written on the replica using the original operation term instead of the current term of the replica.	2017-06-08 14:20:26 +02:00
Jim Ferenczi	36a5cf8f35	Automatically early terminate search query based on index sorting (#24864 ) This commit refactors the query phase in order to be able to automatically detect queries that can be early terminated. If the index sort matches the query sort, the top docs collection is early terminated on each segment and the computing of the total number of hits that match the query is delegated to a simple TotalHitCountCollector. This change also adds a new parameter to the search request called `track_total_hits`. It indicates if the total number of hits that match the query should be tracked. If false, queries sorted by the index sort will not try to compute this information and will limit the collection to the first N documents per segment. Aggregations are not impacted and will continue to see every document even when the index sort matches the query sort and `track_total_hits` is false. Relates #6720	2017-06-08 12:10:46 +02:00
Jim Ferenczi	21a57c1494	Always use DisjunctionMaxQuery to build cross fields disjunction (#25115 ) This commit modifies query_string, simple_query_string and multi_match queries to always use a DisjunctionMaxQuery when a disjunction over multiple fields is built. The tiebreaker is set to 1 in order to behave like the boolean query in terms of scoring. The removal of the coord factor in Lucene 7 made this change mandatory to correctly handle minimum_should_match. Closes #23966	2017-06-08 11:18:17 +02:00
David Roberts	f9503af0d5	[TEST] Move test skip/blacklist assumptions out of @Before method (#25100 ) This commit moves the assumeFalse() calls that implement test skipping and blacklisting out of the @Before method of ESClientYamlSuiteTestCase. The problem with having them in the @Before method is that if an assumption triggers then the @Before methods of classes that extend ESClientYamlSuiteTestCase will not run, but their @After methods will. This can lead to inconsistencies that cause assertions in the @After methods and fail the test even though it was skipped/blacklisted. Instead the assumeFalse() calls are now at the beginning of the test() method, which runs after all @Before methods (including those in classes that extend ESClientYamlSuiteTestCase) have completed. The only side effect is that overridden test() methods in classes that extend ESClientYamlSuiteTestCase which call super.test() and also do other things must now be designed not to consume any InternalAssumptionViolatedException that may be thrown by the super.test() call. Relates elastic/x-pack-elasticsearch#1650	2017-06-08 09:06:42 +01:00
Jack Conradson	d187fa78fd	Generate Painless Factory for Creating Script Instances (#25120 )	2017-06-07 16:06:11 -07:00
Christoph Büscher	9e741cd13d	Tests: Add ability to generate random new fields for xContent parsing test (#23437 ) For the response parsing we want to be lenient when it comes to parsing new xContent fields. In order to ensure this in our testing, this change adds a utility method to XContentTestUtils that takes xContent bytes representation as input and recursively adds a random field on each object level. Sometimes we also want to exclude a whole subtree from this treatment (e.g. skipping "_source"), other times an element (e.g. "fields", "highlight" in SearchHit) can have arbitrarily named objects. Those cases can be specified as exceptions.	2017-06-07 21:01:20 +02:00
Yannick Welsch	26ec89173b	Remove TranslogRecoveryPerformer (#24858 ) Splits TranslogRecoveryPerformer into three parts: - the translog operation to engine operation converter - the operation performer (that indexes the operation into the engine) - the translog statistics (for which there is already RecoveryState.Translog) This makes it possible for peer recovery to use the same IndexShard interface as bulk shard requests (i.e. Engine operations instead of Translog operations). It also pushes the "fail on bad mapping" logic outside of IndexShard. Future pull requests could unify the BulkShard and peer recovery path even more.	2017-06-07 17:11:27 +02:00
Tim Brooks	233c63fc63	Add version 5.6 to versions (#25084 ) * Add version 5.6 to versions * Fix test * Remove 5.4.2 constant	2017-06-07 09:59:27 -04:00
Tim Brooks	feca0a9f33	Bumping version to v6.0.0-alpha3 (#25077 )	2017-06-06 15:47:23 -05:00
Jim Ferenczi	7e60cf3e54	Move parent_id query to the parent-join module (#25072 ) This change moves the parent_id query to the parent-join module and handles the case when only the parent-join field can be declared on an index (index with single type on). If single type is off it uses the legacy parent join field mapper and switch to the new one otherwise (default in 6). Relates #20257	2017-06-06 19:35:14 +02:00
Nik Everett	73307a2144	Plugins can register pre-configured char filters (#25000 ) Fixes the plumbing so plugins can register char filters and moves the `html_strip` char filter into analysis-common. Relates to #23658	2017-06-05 09:25:15 -04:00
Nik Everett	190f5dce10	Test that gradle and Java version types match (#24943 ) Both gradle and java code attempt to infer the type of each Version constant in Version.java. It is super important that they infer that each constant has the same type. If they disagree we might accidentally not be testing backwards compatibility for some version. This adds a test to make sure that they agree, modulo known and accepted differences (mostly around alphas). It also changes the minimum wire compatible version from the released 5.4.0 to the unreleased 5.5.0 as that lines up with the gradle logic. Relates to #24798 Note that the gradle and java version logic doesn't actually match so this contains a hack to make it look like it matches. Since this is a start, I'm merging it and going to work on some followups to make the logic actually match.	2017-06-02 21:30:47 -04:00
Ryan Ernst	0d8216d5af	Scripting: Convert CompiledTemplate to a ScriptContext (#25032 ) This commit creates TemplateScript and associated classes so that templates no longer need a special ScriptService.compileTemplate method. The execute() method is equivalent to the old run() method. relates #20426	2017-06-02 13:41:26 -07:00
Ali Beyad	e024c67561	Checks the circuit breaker before allocating bytes for a new big array (#25010 ) Previously, when allocating bytes for a BigArray, the array was created (or attempted to be created) and only then would the array be checked for the amount of RAM used to see if the circuit breaker should trip. This is problematic because for very large arrays, if creating or resizing the array, it is possible to attempt to create/resize and get an OOM error before the circuit breaker trips, because the allocation happens before checking with the circuit breaker. This commit ensures that the circuit breaker is checked before all big array allocations (note, this does not affect the array allocations that are less than 16kb which use the [Type]ArrayWrapper classes found in BigArrays.java). If such an allocation or resizing would cause the circuit breaker to trip, then the breaker trips before attempting to allocate and potentially running into an OOM error from the JVM. Closes #24790	2017-06-02 15:16:22 -04:00
Boaz Leskes	aa5b11687d	reduce the number of threads used by testNotBlockingUnsafeStackTraces It sometimes times out. Fixes #24936	2017-06-02 19:06:58 +02:00
Nik Everett	18f16ba555	Test: improve error message on leftover tasks After every REST test we wait for the list of pending cluster tasks to empty before moving on to the next task. If the list doesn't empty in 10 seconds we fail the test. This improves the error message when we fail the test to include the list of running tasks.	2017-06-02 11:02:44 -04:00
Christoph Büscher	a94ac30360	[Tests] Improve error message for failed xContentEquivalent() tests (#24828 ) For comparing actual and parsed object equality for the response parsing we currently rely on comparing the original xContent and the output of the parsed object. Currently we only have cryptic error messages if this comparison fails, which are hard to read partly because we recursively compare lists and maps of the xContent structures. This commit leverages the existing NotEqualMessageBuilder for providing error messages that are more detailed and useful for debugging if an error occurs.	2017-06-01 14:12:26 +02:00
Martijn van Groningen	258be2b135	Moved `keyword_marker`, `trim`, `snowball` and `porter_stemmer` tokenfilter factories from core to common-analysis module. Relates to #23658	2017-05-31 09:34:08 +02:00
Martijn van Groningen	a089dc9dcd	Added more unit test coverage for terms aggregation and removed terms agg integration tests that were replaced by unit tests.	2017-05-31 09:30:10 +02:00
Ryan Ernst	7c1211d2ed	Scripting: Add StatefulFactoryType as optional intermediate factory in script contexts (#24974 ) ScriptContexts currently understand a FactoryType that can produce instances of the script InstanceType. However, for search scripts, this does not work as we have the concept of LeafSearchScript that is created per lucene segment. This commit effectively renames the existing SearchScript class into SearchScript.LeafFactory, which is a new, optional, class that can be defined within a ScriptContext. LeafSearchScript is effectively renamed back into SearchScript. This change allows the model of stateless factory -> stateful factory -> script instance to continue, but in a generic way that any script context may take advantage of. relates #20426	2017-05-30 16:32:14 -07:00
Nik Everett	5da8ce8318	Remove the need for _UNRELEASED suffix in versions (#24798 ) Removes the need for the `_UNRELEASED` suffix on versions by detecting if a version should be unreleased or not based on the versions around it. This should make it simpler to automate the task of adding a new version label.	2017-05-26 18:36:32 -04:00
Jim Ferenczi	9ef414fead	Merge branch 'mattweber-multiple_collapse_inner_hits'	2017-05-26 13:28:08 +02:00
Matt Weber	601a61a91c	Support Multiple Collapse Inner Hits Support multiple named inner hits on a field collapsing request.	2017-05-26 13:23:57 +02:00
Ryan Ernst	74e031e842	Scripting: Rename CompiledType to FactoryType in ScriptContext (#24897 ) This commit renames the concept of the "compiled type" to a "factory type", along with all implementations of this class to be named Factory. This brings it in line with the class's purpose.	2017-05-26 00:02:54 -07:00
Ryan Ernst	8eab1fefa1	Scripting: Make contexts available to ScriptEngine construction (#24896 ) This commit adds collection of all contexts to the parameters of getScriptEngine. This will allow script engines like painless to precache extra information about the contexts.	2017-05-25 16:55:47 -07:00
Ryan Ernst	7d03cff820	Scripting: Make ScriptEngine.compile generic on the script context (#24873 ) This commit changes the compile method of ScriptEngine to be generic in the same way it is on ScriptService. This moves the shim of handling the two existing context classes into each script engine, so that each engine can be worked on independently to convert to real handling of contexts.	2017-05-24 20:06:32 -07:00
Ryan Ernst	1daacd97b0	Scripting: Add instance and compiled classes to script contexts (#24868 ) This commit modifies the compile method of ScriptService to be context aware. The ScriptContext is now a generic class which contains both the instance type and compiled type for a script. Instance type may be stateful (for example, pre loading field information for the index a script will execute on, like in expressions), while the compiled type is stateless and used to construct instance type instances. This change is only a first step to cutover ScriptService to the new paradigm. It only converts callers to the script service, and has a small shim to wrap compilation from the script engines to support the current two fixed instance types, SearchScript and ExecutableScript.	2017-05-24 14:29:02 -07:00
Ryan Ernst	0ddd219423	Scripting: Add default implementation of close() for ScriptEngine (#24851 ) Since groovy was removed, we no longer have any ScriptEngines with resources to release. We may want to keep the option open for a script engine to close resources, but this would not be common. This commit adds a default implementation to ScriptEngine for `close()` to reduce the boiler plate that must be added for a ScriptEngine implementation.	2017-05-24 13:19:27 -07:00
Boaz Leskes	6bc5b1dbcd	Guarantee that translog generations are seqNo conflict free (#24825 ) With #24779 in place, we can now guarantee that a single translog generation file will never have a sequence number conflict that needs to be resolved by looking at primary terms. These conflicts can occur when a replica contains an operation which isn't part of the history of a newly promoted primary. That primary can then assign a different operation to the same slot and replicate it to the replica. PS. Knowing that each generation file is conflict free will simplify repairing these conflicts when we read from the translog. PPS. This PR also fixes some bugs in the piping of primary terms in the bulk shard action. These bugs are a result of the legacy of IndexRequest/DeleteRequest being a ReplicationRequest. We need to change that as a follow up. Relates to #10708	2017-05-24 13:26:39 +02:00
Ryan Ernst	bf49d37ab3	Test: Convert test script engine impls to use MockScriptEngine (#24854 ) This commit cleans up tests which currently use custom script engine implementations, converting them to use a MockScriptEngine with script functions provided by the tests. It also creates a common set of metric scripts which were copied across a couple metric agg tests.	2017-05-23 20:34:12 -07:00
Jason Tedor	2e570fc6fa	Increase max compilations per minute in tests Large test suites with unfortunate seed choices can easily exceed the 1000 script compilations per minute limit. This commit increases the limit in integration tests to 2048.	2017-05-23 20:04:20 -04:00
Nik Everett	13a86fec99	Add magic $_path stash key to docs tests (#24724 ) Adds a "magic" key to the yaml testing stash mostly for use with documentation tests. When unstashing an object, `$_path` is the path into the current position in the object you are unstashing. This means that in docs tests you can use `// TESTRESPONSEs/somevalue/$body.${_path}/` to mean "replace `somevalue` with whatever is the response in the same position." Compare how you must carefully mock out all the numbers in the profile response without this change: ``` // TESTRESPONSE[s/"id": "\[2aE02wS1R8q_QFnYu6vDVQ\]\[twitter\]\[1\]"/"id": $body.profile.shards.0.id/] // TESTRESPONSE[s/"rewrite_time": 51443/"rewrite_time": $body.profile.shards.0.searches.0.rewrite_time/] // TESTRESPONSE[s/"score": 51306/"score": $body.profile.shards.0.searches.0.query.0.breakdown.score/] // TESTRESPONSE[s/"time_in_nanos": "1873811"/"time_in_nanos": $body.profile.shards.0.searches.0.query.0.time_in_nanos/] // TESTRESPONSE[s/"build_scorer": 2935582/"build_scorer": $body.profile.shards.0.searches.0.query.0.breakdown.build_scorer/] // TESTRESPONSE[s/"create_weight": 919297/"create_weight": $body.profile.shards.0.searches.0.query.0.breakdown.create_weight/] // TESTRESPONSE[s/"next_doc": 53876/"next_doc": $body.profile.shards.0.searches.0.query.0.breakdown.next_doc/] // TESTRESPONSE[s/"time_in_nanos": "391943"/"time_in_nanos": $body.profile.shards.0.searches.0.query.0.children.0.time_in_nanos/] // TESTRESPONSE[s/"score": 28776/"score": $body.profile.shards.0.searches.0.query.0.children.0.breakdown.score/] // TESTRESPONSE[s/"build_scorer": 784451/"build_scorer": $body.profile.shards.0.searches.0.query.0.children.0.breakdown.build_scorer/] // TESTRESPONSE[s/"create_weight": 1669564/"create_weight": $body.profile.shards.0.searches.0.query.0.children.0.breakdown.create_weight/] // TESTRESPONSE[s/"next_doc": 10111/"next_doc": $body.profile.shards.0.searches.0.query.0.children.0.breakdown.next_doc/] // 
TESTRESPONSE[s/"time_in_nanos": "210682"/"time_in_nanos": $body.profile.shards.0.searches.0.query.0.children.1.time_in_nanos/] // TESTRESPONSE[s/"score": 4552/"score": $body.profile.shards.0.searches.0.query.0.children.1.breakdown.score/] // TESTRESPONSE[s/"build_scorer": 42602/"build_scorer": $body.profile.shards.0.searches.0.query.0.children.1.breakdown.build_scorer/] // TESTRESPONSE[s/"create_weight": 89323/"create_weight": $body.profile.shards.0.searches.0.query.0.children.1.breakdown.create_weight/] // TESTRESPONSE[s/"next_doc": 2852/"next_doc": $body.profile.shards.0.searches.0.query.0.children.1.breakdown.next_doc/] // TESTRESPONSE[s/"time_in_nanos": "304311"/"time_in_nanos": $body.profile.shards.0.searches.0.collector.0.time_in_nanos/] // TESTRESPONSE[s/"time_in_nanos": "32273"/"time_in_nanos": $body.profile.shards.0.searches.0.collector.0.children.0.time_in_nanos/] ``` To how you can cavalierly mock all the numbers at once with this change: ``` // TESTRESPONSE[s/(?<=[" ])\d+(\.\d+)?/$body.$_path/] ```	2017-05-23 15:33:48 -04:00
Martijn van Groningen	34093735e3	Added unit tests for MatrixStatsAggregator	2017-05-23 16:19:12 +02:00
Jason Tedor	a85c1bcede	Remove unused assertions enabled method This commit removes an unused assertions enabled method in ESTestCase. For future uses of such a method, use the field ENABLED in org.elasticsearch.Assertions.	2017-05-23 09:32:48 -04:00
Jim Ferenczi	9087803cd9	Add the ability to define custom inner hit sub context builder (#24676 ) This commit moves the handling of nested and parent/child inner hits to specialized classes that can be defined outside of ES core. InnerHitBuilderContext is now used by the parent query (nested or hasChild, ...) to build the sub context from the InnerHitBuilder definition. BWC is also ensured so that nodes in previous versions can still send/receive inner hits to/from this version. Relates #20257	2017-05-23 13:06:22 +02:00
Ryan Ernst	52d504bb5f	Scripting: Simplify ScriptContext (#24818 ) As we work towards contexts implying the return type of compilation, we first need ScriptContext to not be an enum. This commit removes the Standard enum and Plugin subclass of ScriptContext.	2017-05-22 13:11:15 -07:00
javanna	7a3e38eb8e	Merge branch 'master' into feature/client_aggs_parsing	2017-05-22 12:25:14 +02:00
Luca Cavanna	726e6c45ab	Move getType to Aggregation interface (#24822 ) Given that both InternalAggregation and ParsedAggregation have this method, it makes sense to move it to the interface they both implement.	2017-05-22 10:49:23 +02:00
Ryan Ernst	2de748859f	Scripting: Remove "inline script enabled" on script engines (#24815 ) ScriptEngine implementations have an overridable method to indicate they are safe to use as inline scripts. Since groovy was removed from 6.0, there are no longer any implementations which use the default false value. Furthermore, the value was not actually read anywhere. This commit removes the method. The ScriptEngineRegistry was also no longer necessary as it only was used to build a map from language to engine.	2017-05-20 12:01:25 -07:00
javanna	db0490343e	Merge branch 'master' into feature/client_aggs_parsing	2017-05-19 18:17:06 +02:00
Nik Everett	b9ea579633	Allow plugins to register pre-configured tokenizers (#24751 ) Allows plugins to register pre-configured tokenizers. Much of the decisions are the same as those in #24223, #24572, and #24223. This only migrates the lowercase tokenizer but I figure that is a good start because it proves out the features.	2017-05-19 12:07:04 -04:00
Tanguy Leroux	83aa00b3f6	Merge remote-tracking branch 'origin/master' into feature/client_aggs_parsing	2017-05-19 13:13:00 +02:00
Tanguy Leroux	4c34ea8fc8	Remove //norelease and clean up some aggregations tests (#24789 )	2017-05-19 12:46:37 +02:00
Jack Conradson	1196dfb6bb	Remove Deprecated Script Settings (#24756 ) Removes all fine-grained script settings replaced by scripts.types_allowed and scripts.contexts_allowed.	2017-05-18 13:32:46 -07:00
Koen De Groote	905eb422f6	Use StringBuilder to construct a String instead of relying on appending where possible (#24753 ) This PR revolves around places in the code where introducing a StringBuilder might make the construction of a String easier to follow and also, maybe avoid a case where the compiler's very safe way of introducing StringBuilder instead of String might not always be optimal for performance.	2017-05-18 12:02:29 +02:00
Simon Willnauer	b19537cae6	Add utility method to get the latest version of the previous minor	2017-05-18 11:21:50 +02:00
Tanguy Leroux	eeef2e6c31	Merge remote-tracking branch 'origin/master' into feature/client_aggs_parsing	2017-05-18 09:43:57 +02:00
Tanguy Leroux	055875392e	Add parsing method for Top Hits aggregation (#24717 ) Related to #23331	2017-05-18 09:37:46 +02:00
Tanguy Leroux	25fceb8c0f	Add parsing method for binary range aggregation (#24706 ) Related to #23331	2017-05-18 09:24:33 +02:00
Koen De Groote	d744d77f61	Fix String concatenation within a StringBuilder append chain This commit replaces String concatenation within a StringBuilder append chain by using explicit append calls.	2017-05-18 08:51:30 +02:00
Ryan Ernst	26e2e933f5	Scripting: Remove native scripts (#24726 ) Native scripts have been replaced in documentation by implementing a ScriptEngine and they were deprecated in 5.5.0. This commit removes the native script infrastructure for 6.0. closes #19966	2017-05-17 14:49:24 -07:00
Ryan Ernst	463fe2f4d4	Scripting: Remove file scripts (#24627 ) This commit removes file scripts, which were deprecated in 5.5. closes #21798	2017-05-17 14:42:25 -07:00
Christoph Büscher	9fc9db26fd	Add parsing for InternalScriptedMetric aggregation (#24738 )	2017-05-17 18:55:57 +02:00
javanna	ce7326eb88	Merge branch 'master' into feature/client_aggs_parsing	2017-05-17 17:59:00 +02:00
Simon Willnauer	2ccc223ff7	Fix Version based BWC and set correct minCompatVersion (#24732 ) Approaching the release of 6.0 we need to sort out the usage of `Version#minimumCompatibilityVersion` which was still set to 5.0.0. Now this change moves it to the latest released version of 5.x (5.4 at this point) to ensure we are compatible with the latest minor of the previous major. This change also removes all the `_UNRELEASED` from the versions that were released and drops versions that were never released and are not expected to be released (bugfixes in minors that are not the latest in the previous major).	2017-05-17 17:27:09 +02:00
Nik Everett	0189a65e6b	Fail rest tests on yaml files (#24740 ) We've switched to supporting only `yml` files but anyone who didn't notice will commit a `yaml` file which won't be executed which is bad because it is easy not to notice. The test to catch this is simple enough that I think it is worth adding just to warn folks about their mistake.	2017-05-17 10:24:57 -04:00
Ryan Ernst	2a65bed243	Tests: Change rest test extension from .yaml to .yml (#24659 ) This commit renames all rest test files to use the .yml extension instead of .yaml. This way the extension used within all of elasticsearch for yaml is consistent.	2017-05-16 17:24:35 -07:00
Nik Everett	7ef390068a	Move remaining pre-configured token filters into analysis-common (#24716 ) Moves the remaining pre-configured token filters into the analysis-common module. There were a couple of tests in core that depended on the pre-configured token filters so I had to touch them: * `GetTermVectorsCheckDocFreqIT` depended on `type_as_payload` but didn't do anything important with it. I dropped the dependency. Then I moved the test to a single node test case because we're trying to cut down on the number of `ESIntegTestCase` subclasses. * `AbstractTermVectorsTestCase` and its subclasses depended on `type_as_payload`. I dropped their usage of the token filter and added an integration test for the termvectors API that uses `type_as_payload` to the `analysis-common` module. * `AnalysisModuleTests` expected a few pre-configured token filters to be registered by default. They aren't any more so I dropped this assertion. We assert that the `CommonAnalysisPlugin` registers these pre-built token filters in `CommonAnalysisFactoryTests` * `SearchQueryIT` and `SuggestSearchIT` had tests that depended on the specific behavior of the token filters so I moved the tests to integration tests in `analysis-common`.	2017-05-16 13:10:24 -04:00
Simon Willnauer	1cae850cf5	Add a cluster block that allows to delete indices that are read-only (#24678 ) Today when an index is `read-only` the index is also blocked from being deleted, which is sometimes undesired since, in order to make changes to a cluster, indices must be deleted to free up space. This is a likely scenario in a hosted environment where disk space is limited: indices are switched to read-only, but deletions should still be allowed to free up space.	2017-05-16 17:34:37 +02:00
Nik Everett	c38b3360b6	Allow unstashing values into keys (#24685 ) This is almost exclusively for docs tests, which frequently match the entire response. This allows something like: ``` - set: {nodes.$master.http.publish_address: host} - match: $body: { "nodes": { $host: { ... stuff in here ... } } } ``` This should make it possible for the docs tests to work with unpredictable keys.	2017-05-16 11:16:12 -04:00
Zachary Tong	1e97184519	Automatically close releasables after test (#24687 ) This moves the releasing logic to the base test, so that individual test cases don't need to worry about releasing the aggregators. It's not a big deal for individual aggs, but once tests start using sub-aggs, it can become tricky to free (without double-freeing) all the aggregators.	2017-05-16 09:01:38 -04:00
Tanguy Leroux	d5fc520741	Add parsing to Significant Terms aggregations (#24682 ) Related to #23331	2017-05-16 14:54:42 +02:00
Christoph Büscher	ef7c2e62c3	Add parsing for InternalAdjacencyMatrix aggregation (#24700 )	2017-05-16 14:35:49 +02:00
Christoph Büscher	059b23e92e	Merge branch 'master' into feature/client_aggs_parsing	2017-05-16 11:54:02 +02:00
Ryan Ernst	6ce597a378	Scripts: Convert template script engines to return String instead of BytesReference (#24447 ) Template script engines (mustache, the only one) currently return a BytesReference that users must know is utf8 encoded. This commit modifies all callers and mustache to have the template engine return String. This is much simpler, and does not require decoding in order to use (for example, in ingest).	2017-05-15 22:37:31 -07:00
Jason Tedor	92ba969804	Remove Jython hacks We had a hack in setting up permissions for tests to support testing the lang-python plugin. We also had a hack to prevent Log4j from loading a shaded version of Jansi provided by Jython. This plugin has been removed so these hacks are no longer necessary. Relates #24681	2017-05-15 12:40:24 -04:00
Tanguy Leroux	5fb04fa603	Merge remote-tracking branch 'origin/master' into feature/client_aggs_parsing	2017-05-15 17:08:26 +02:00
Christoph Büscher	60505c9100	Add parsing for InternalFilters aggregation (#24648 ) This adds parsing to the InternalFilters aggregation.	2017-05-15 15:26:35 +02:00
Christoph Büscher	0b688a8733	Small improvement in InternalAggregationTestCase test setup after changes in master (#24675 )	2017-05-15 15:06:01 +02:00
Jason Tedor	bd5aee8cfa	Split disruption test suite The disruption tests sit in a single test suite which causes these tests to be single-threaded. We can split this test suite into multiple suites (logically, of course) enabling them to be run in parallel reducing the total run time of all integration tests in core. This commit splits the discovery with service disruptions test suite into three suites - master disruptions - discovery disruptions - cluster disruptions The last one could probably be better named, it is meant to represent performing actions in the cluster (indexing, failing a shard, etc.) while a disruption is taking place. Relates #24662	2017-05-15 07:47:42 -04:00
Christoph Büscher	42e8d4b761	Merge branch 'master' into feature/client_aggs_parsing Conflicts: core/src/test/java/org/elasticsearch/search/aggregations/bucket/filter/InternalFilterTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/global/InternalGlobalTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/missing/InternalMissingTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/nested/InternalNestedTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/nested/InternalReverseNestedTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/sampler/InternalSamplerTests.java modules/parent-join/src/test/java/org/elasticsearch/join/aggregations/InternalChildrenTests.java test/framework/src/main/java/org/elasticsearch/search/aggregations/InternalSingleBucketAggregationTestCase.java	2017-05-15 12:25:07 +02:00
Christoph Büscher	bb59ee51b0	Revert changing the InternalSampler type constant (#24667 )	2017-05-15 11:49:47 +02:00
Tanguy Leroux	b9d2ecc3ea	Add parsing methods to Range aggregations (#24583 )	2017-05-12 16:52:47 +02:00
Koen De Groote	878ae8eb3c	Size lists in advance when known When constructing an array list, if we know the size of the list in advance (because we are adding objects to it derived from another list), we should size the array list to the appropriate capacity in advance (to avoid resizing allocations). This commit does this in various places. Relates #24439	2017-05-12 10:36:13 -04:00
Jim Ferenczi	279a18a527	Add parent-join module (#24638 ) * Add parent-join module This change adds a new module named `parent-join`. The goal of this module is to provide a replacement for the `_parent` field but as a first step this change only moves the `has_child`, `has_parent` queries and the `children` aggregation to this module. These queries and aggregations are no longer in core but they are deployed by default as a module. Relates #20257	2017-05-12 15:58:06 +02:00
Tanguy Leroux	29a5694bb7	Add parsing method to GeoHashGrid aggregation (#24589 )	2017-05-12 15:44:39 +02:00
Simon Willnauer	be2a6ce80b	Notify onConnectionClosed rather than onNodeDisconnect to prune transport handlers (#24639 ) Today we prune transport handlers in TransportService when a node is disconnected. This can cause connections to starve in the TransportService if the connection is opened as a short living connection ie. without sharing the connection to a node via registering in the transport itself. This change now moves to pruning based on the connections cache key to ensure we notify handlers as soon as the connection is closed for all connections not just for registered connections. Relates to #24632 Relates to #24575 Relates to #24557	2017-05-12 15:40:40 +02:00
Yannick Welsch	04e08f5e49	Simplify Discovery interface (#24608 ) - Removes clusterState, getInitialClusterState and getMinimumMasterNodes methods from Discovery interface. - Sets PingContextProvider in ZenPing constructor - Renames state in ZenDiscovery to committedState	2017-05-12 14:08:14 +02:00
Christoph Büscher	744b1afcb2	Merge branch 'master' into feature/client_aggs_parsing Conflicts: core/src/test/java/org/elasticsearch/search/aggregations/bucket/histogram/InternalHistogramTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/terms/DoubleTermsTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/terms/LongTermsTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/terms/StringTermsTests.java	2017-05-12 10:55:05 +02:00
Ryan Ernst	f477a6472d	Settings: Deprecate settings in .yml and .json (#24059 ) This commit adds a deprecation warning when elasticsearch.yml or elasticsearch.json is read during startup. relates #19391	2017-05-11 13:11:18 -07:00
Simon Willnauer	1155615536	Move DeleteByQuery and Reindex requests into core (#24578 ) This allows other plugins to use a client to call the functionality that is in the core modules without duplicating the logic. Plugins can now safely send the request and response classes via the client even if the requests are executed locally. All relevant classes are loaded by the core classloader such that plugins can share them. This re-adds the commit that was reverted in `952feb58e4`	2017-05-11 20:22:30 +02:00
qwerty4030	e7d352b489	Compound order for histogram aggregations. (#22343 ) This commit adds support for histogram and date_histogram agg compound order by refactoring and reusing terms agg order code. The major change is that the Terms.Order and Histogram.Order classes have been replaced/refactored into a new class BucketOrder. This is a breaking change for the Java Transport API. For backward compatibility with previous ES versions the (date)histogram compound order will use the first order. Also the _term and _time aggregation order keys have been deprecated; replaced by _key. Relates to #20003: now that all these aggregations use the same order code, it should be easier to move validation to parse time (as a follow up PR). Relates to #14771: histogram and date_histogram aggregation order will now be validated at reduce time. Closes #23613: if a single BucketOrder that is not a tie-breaker is added with the Java Transport API, it will be converted into a CompoundOrder with a tie-breaker.	2017-05-11 18:06:26 +01:00
Simon Willnauer	952feb58e4	Revert "Move DeleteByQuery and Reindex requests into core (#24578 )" This reverts commit `6ea2ae32b8`.	2017-05-11 18:26:40 +02:00
Lee Hinman	57fddce8c4	[TEST] Use at least 1ms for FunctionScoreQueryBuilderTests Previously micros or nanoseconds could be used, which was reduced to 0 milliseconds and `scale` must be higher than 0.	2017-05-11 10:10:55 -06:00
Simon Willnauer	6ea2ae32b8	Move DeleteByQuery and Reindex requests into core (#24578 ) This allows other plugins to use a client to call the functionality that is in the core modules without duplicating the logic. Plugins can now safely send the request and response classes via the client even if the requests are executed locally. All relevant classes are loaded by the core classloader such that plugins can share them.	2017-05-11 16:20:40 +02:00
Nik Everett	8188569fd1	Add qa module that tests reindex-from-remote against pre-5.0 versions of Elasticsearch (#24561 ) Adds tests for reindex-from-remote for the latest 2.4, 1.7, and 0.90 releases. 2.4 and 1.7 are fairly popular versions but 0.90 is a point of pride. This fixes any issues those tests revealed. Closes #23828 Closes #24520	2017-05-11 10:06:20 -04:00
Christoph Büscher	c4fc8edc03	Add parsing for single bucket aggregations (#24564 ) This adds parsing to all implementations of SingleBucketAggregations. They are mostly similar, so they share the common base class `ParsedSingleBucketAggregation` and the shared base test `InternalSingleBucketAggregationTestCase`.	2017-05-11 11:50:35 +02:00
Christoph Büscher	570390ac36	Merge branch 'master' into feature/client_aggs_parsing	2017-05-11 11:20:39 +02:00
Nik Everett	65f2717ab7	Make PreConfiguredTokenFilter harder to misuse (#24572 ) There are now three public static methods to build instances of PreConfiguredTokenFilter and the ctor is private. I chose static methods instead of constructors because those allow us to change out the implementation returned if we so desire. Relates to #23658	2017-05-10 22:39:43 -04:00
Jack Conradson	6ac8a1eb85	Deprecate Fine Grain Settings for Scripts (#24573 )	2017-05-10 13:09:31 -07:00
Christoph Büscher	fbc8345db5	Tests: Fix VersionUtilsTests after version bump	2017-05-10 17:36:12 +02:00
Tanguy Leroux	3201e22710	Fix merging conflicts	2017-05-10 14:05:43 +02:00
Tanguy Leroux	bf718a686f	Merge remote-tracking branch 'origin/master' into feature/client_aggs_parsing # Conflicts: # core/src/test/java/org/elasticsearch/search/aggregations/bucket/terms/InternalTermsTestCase.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/InternalExtendedStatsTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/InternalMaxTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/InternalStatsTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/avg/InternalAvgTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/cardinality/InternalCardinalityTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/geobounds/InternalGeoBoundsTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/geocentroid/InternalGeoCentroidTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/min/InternalMinTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/percentiles/InternalPercentilesTestCase.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/percentiles/hdr/InternalHDRPercentilesRanksTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/percentiles/tdigest/InternalTDigestPercentilesRanksTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/sum/InternalSumTests.java # core/src/test/java/org/elasticsearch/search/aggregations/metrics/valuecount/InternalValueCountTests.java # core/src/test/java/org/elasticsearch/search/aggregations/pipeline/InternalSimpleValueTests.java # core/src/test/java/org/elasticsearch/search/aggregations/pipeline/bucketmetrics/percentile/InternalPercentilesBucketTests.java # core/src/test/java/org/elasticsearch/search/aggregations/pipeline/derivative/InternalDerivativeTests.java # test/framework/src/main/java/org/elasticsearch/test/InternalAggregationTestCase.java	2017-05-10 13:40:39 +02:00
Martijn van Groningen	51c74ce547	Added unit tests for InternalMatrixStats. Also moved InternalAggregationTestCase to test-framework module in order to make use of it from other modules than core. Relates to #22278	2017-05-10 11:06:18 +02:00
Matt Weber	b24326271e	Add ICUCollationFieldMapper (#24126 ) Adds a new "icu_collation" field type that exposes lucene's ICUCollationDocValuesField. ICUCollationDocValuesField is the replacement for ICUCollationKeyFilter which has been deprecated since Lucene 5.	2017-05-10 10:35:11 +02:00
Ryan Ernst	9ca7d28552	Scripting: Remove "service" from ScriptEngine interface name (#24574 ) This commit renames ScriptEngineService to ScriptEngine. It is often confusing because we have the ScriptService, and then ScriptEngineService implementations, but the latter are not services as we see in other places in elasticsearch.	2017-05-10 00:47:33 -07:00
Ryan Ernst	ebd3e5f73f	Scripting: Deprecate file script settings (#24555 ) File scripts have 2 related settings: the path of file scripts, and whether they can be dynamically reloaded. This commit deprecates those settings. relates #21798	2017-05-09 16:14:57 -07:00
Jason Tedor	8f873620ee	Inline global checkpoints Today we rely on background syncs to relay the global checkpoint under the mandate of the primary to its replicas. This means that the global checkpoint on a replica can lag far behind the primary. The commit moves to inlining global checkpoints with replication requests. When a replication operation is performed, the primary will send the latest global checkpoint inline with the replica requests. This keeps the replicas closer in-sync with the primary. However, consider a replication request that is not followed by another replication request for an indefinite period of time. When the replicas respond to the primary with their local checkpoint, the primary will advance its global checkpoint. During this indefinite period of time, the replicas will not be notified of the advanced global checkpoint. This necessitates a need for another sync. To achieve this, we perform a global checkpoint sync when a shard falls idle. Relates #24513	2017-05-09 15:08:11 -04:00
Nik Everett	bb06d8ec4f	Allow plugins to build pre-configured token filters (#24223 ) This changes the way we register pre-configured token filters so that plugins can declare them and starts to move all of the pre-configured token filters out of core. It doesn't finish the job because doing so would make the change unreviewably large. So this PR includes a shim that keeps the "old" way of registering pre-configured token filters around. The Lowercase token filter is special because there is a "special" interaction between it and the lowercase tokenizer. I'm not sure exactly what to do about it so for now I'm leaving it alone with the intent of figuring out what to do with it in a followup. This also renames these pre-configured token filters from "pre-built" to "pre-configured" because that seemed like a more descriptive name. This is a part of #23658	2017-05-09 14:50:49 -04:00
Jim Ferenczi	b6c714ccc8	Fix BWC for query_and_fetch	2017-05-09 18:52:53 +02:00
Adrien Grand	a72eaa8e0f	Identify documents by their `_id`. (#24460 ) Now that indices have a single type by default, we can move to the next step and identify documents using their `_id` rather than the `_uid`. One notable change in this commit is that I made deletions implicitly create types. This helps with the live version map in the case that documents are deleted before the first type is introduced. Otherwise there would be no way to differentiate `DELETE index/foo/1` followed by `PUT index/foo/1` from `DELETE index/bar/1` followed by `PUT index/foo/1`, even though those are different if versioning is involved.	2017-05-09 16:33:52 +02:00
Hendrik Muhs	f41ddb3607	Move MockLogAppender to elasticsearch test (#24542 ) In order to make MockLogAppender (utility to test logging) available outside of es-core move MockLogAppender from test core-tests to test framework. As package names do not change, no need to change clients.	2017-05-08 13:02:27 +02:00
Koen De Groote	13c17c75b5	Remove unneeded empty string concatenation This commit removes concatenation by empty string in places where it is simply not needed to obtain a string representation. Relates #24411	2017-05-06 00:28:53 -04:00
Yannick Welsch	c8712e9531	Limit AllocationService dependency injection hack (#24479 ) Changes the scope of the AllocationService dependency injection hack so that it is at least contained to the AllocationService and does not leak into the Discovery world.	2017-05-05 08:39:18 +02:00
Jason Tedor	61d5eddbd6	Fix typo in comment in IndexShardTestCase This commit fixes a silly typo in IndexShardTestCase.java.	2017-05-04 21:04:35 -04:00
Yannick Welsch	be19ccef57	Discard stale node responses from async shard fetching (#24434 ) Async shard fetching only uses the node id to correlate responses to requests. This can lead to a situation where a response from an earlier request is mistaken as response from a new request when a node is restarted. This commit adds unique round ids to correlate responses to requests.	2017-05-03 09:47:21 +02:00
Simon Willnauer	2f9e9460d4	Move RemoteClusterService into TransportService (#24424 ) TransportService and RemoteClusterService are closely coupled already today and to simplify remote cluster integration down the road it can be a direct dependency of TransportService. This change moves RemoteClusterService into TransportService with the goal to make it a hidden implementation detail of TransportService in followup changes.	2017-05-02 18:09:32 +02:00
Koen De Groote	0fef5acd01	Cleanup collections construction This commit cleans up some cases where a list or map was being constructed, and then an existing collection was copied into the new collection. The cleanup is to instead use an appropriate constructor to directly copy the existing collection in during collection construction. The advantage of this is that the new collection is sized appropriately. Relates #24409	2017-04-30 21:26:51 -04:00
Yannick Welsch	35f78d098a	Separate publishing from applying cluster states (#24236 ) Separates cluster state publishing from applying cluster states: - ClusterService is split into two classes MasterService and ClusterApplierService. MasterService has the responsibility to calculate cluster state updates for actions that want to change the cluster state (create index, update shard routing table, etc.). ClusterApplierService has the responsibility to apply cluster states that have been successfully published and invokes the cluster state appliers and listeners. - ClusterApplierService keeps track of the last applied state, but MasterService is stateless and uses the last cluster state that is provided by the discovery module to calculate the next prospective state. The ClusterService class is still kept around, which now just delegates actions to ClusterApplierService and MasterService. - The discovery implementation is now responsible for managing the last cluster state that is used by the consensus layer and the master service. It also exposes the initial cluster state which is used by the ClusterApplierService. The discovery implementation is also responsible for adding the right cluster-level blocks to the initial state. - NoneDiscovery has been renamed to TribeDiscovery as it is exclusively used by TribeService. It adds the tribe blocks to the initial state. - ZenDiscovery is synchronized on state changes to the last cluster state that is used by the consensus layer and the master service, and does not submit cluster state update tasks anymore to make changes to the disco state (except when becoming master). Control flow for cluster state updates is now as follows: - State updates are sent to MasterService - MasterService gets the latest committed cluster state from the discovery implementation and calculates the next cluster state to publish - MasterService submits the new prospective cluster state to the discovery implementation for publishing - Discovery implementation publishes cluster states to all nodes and, once the state is committed, asks the ClusterApplierService to apply the newly committed state. - ClusterApplierService applies state to local node.	2017-04-28 09:34:31 +02:00
Yannick Welsch	2fa1c9fff1	Provide target allocation id as part of start recovery request (#24333 ) This makes it possible for the recovery source to verify that it is talking to the shard it thinks it is talking to. Closes #24167	2017-04-27 14:45:44 +02:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Jason Tedor	74acc594a9	Fix inconsistencies in long GC disruption This commit fixes some inconsistencies in long GC disruption where we mixed stopping and suspending when the action we are performing on threads is suspending which is distinct from stopping a thread.	2017-04-26 21:23:19 -04:00
Nik Everett	bc45d10e82	Remove most usages of 1-arg Script ctor (#24325 ) The one argument ctor for `Script` creates a script with the default language but most usages are for testing and either don't care about the language or are for use with `MockScriptEngine`. This replaces most usages of the one argument ctor on `Script` with calls to `ESTestCase#mockScript` to make it clear that the tests don't need the default scripting language. I've also factored out some copy and pasted script generation code into a single place. I would have had to change that code to use `mockScript` anyway, so it was easier to perform the refactor. Relates to #16314	2017-04-26 16:04:38 -04:00
Jason Tedor	2ed1f7a339	Avoid leaks in Long GC disruption tests We can leak disrupted threads here since we never wait for them to complete after freeing them from their loops. This commit addresses this by joining on disrupted threads, and addresses fallout from trying to join here. Relates #24338	2017-04-26 15:26:36 -04:00
Nik Everett	7c3efb829b	Move char filters into analysis-common (#24261 ) Another step down the road to dropping the lucene-analyzers-common dependency from core. Note that this removes some tests that no longer compile from core. I played around with adding them to the analysis-common module where they would compile but we already test these in the tests generated from the example usage in the documentation. I'm not super happy with the way that `requiresAnalysisSettings` works with regards to plugins. I think it'd be fairly bug-prone for plugin authors to use. But I'm making it visible as is for now and I'll rethink later. A part of #23658	2017-04-26 13:25:34 -04:00
Ryan Ernst	51b33f1fd5	S3 Repository: Deprecate remaining `repositories.s3.*` settings (#24144 ) Most of these settings should always be pulled from the repository settings. A couple were leftover that should be moved to client settings. The path style access setting should be removed altogether. This commit adds deprecations for all of these existing settings, as well as adding new client specific settings for max retries and throttling. relates #24143	2017-04-25 23:43:20 -07:00
Nik Everett	fc97e25b56	Add task to look for tests in src/main (#24298 ) Creates a new task `namingConventionsMain`, that runs on the `buildSrc` and `test:framework` projects and fails the build if any of the classes in the main artifacts are named like tests or are non-abstract subclasses of ESTestCase. It also fixes the three tests that would cause it to fail.	2017-04-25 21:11:47 -04:00
Simon Willnauer	e69147a870	Add support for `tests.enable_mock_modules` to ESIntegTestCase (#24309 ) `tests.enable_mock_modules` is a documented but unrespected / unused option to disable all mock modules / plugins during test runs. This will basically side-step mock assertions like check-index on shard closing. This can speed up test-execution dramatically on nodes with slow disks etc. Relates to #24304	2017-04-25 17:34:25 +02:00
Koen De Groote	88de33d43d	Minor changes to collection creation from enums (#24274 ) These changes are mainly cosmetic with minor perf advantages drawn from checkstyle.	2017-04-25 13:13:55 +02:00
Jason Tedor	1500beafc7	Check for default.path.data included in path.data If the user explicitly configured path.data to include default.path.data, then we should not fail the node if we find indices in default.path.data. This commit addresses this. Relates #24285	2017-04-24 09:31:54 -04:00
Ryan Ernst	aadc33d260	Scripts: Remove unwrap method from executable scripts (#24263 ) The unwrap method was left over from supporting javascript and python. Since those languages are removed in 6.0, this commit removes the unwrap feature from scripts.	2017-04-21 17:50:22 -07:00
Simon Willnauer	2ca7072b24	Fill missing sequence IDs up to max sequence ID when recovering from store (#24238 ) Today we might promote a primary and recover from store where after translog recovery the local checkpoint is still behind the maximum sequence ID seen. To fill the holes in the sequence ID history this PR adds a utility method that fills up all missing sequence IDs up to the maximum seen sequence ID with no-ops. Relates to #10708	2017-04-21 20:28:00 +02:00
Adrien Grand	2b8fa64cf7	ESIntegTestCase.indexRandom should not introduce types. (#24202 ) Since we plan on removing types, `indexRandom` should not introduce new types. This commit refactors `indexRandom` to reuse existing types.	2017-04-21 10:38:36 +02:00
Nik Everett	caf376c8af	Start building analysis-common module (#23614 ) Start moving built in analysis components into the new analysis-common module. The goal of this project is: 1. Remove core's dependency on lucene-analyzers-common.jar which should shrink the dependencies for transport client and high level rest client. 2. Prove that analysis plugins can do all the "built in" things by moving all "built in" behavior to a plugin. 3. Force tests not to depend on any oddball analyzer behavior. If tests need anything more than the standard analyzer they can use the mock analyzer provided by Lucene's test infrastructure.	2017-04-19 18:51:34 -04:00
Ali Beyad	3c82eea5fb	Wait for cluster to become quiescent between REST tests (#24148 ) [TEST] ensures REST tests wait for cluster state updates to finish processing before moving to the next test	2017-04-19 13:17:09 -04:00
Jim Ferenczi	f05af0a382	Enable index-time sorting (#24055 ) This change adds an index setting to define how the documents should be sorted inside each Segment. It allows any numeric, date, boolean or keyword field inside a mapping to be used to sort the index on disk. It is not allowed to use `nested` fields inside an index that defines an index sorting since `nested` fields rely on the original sort of the index. This change does not add early termination capabilities in the search layer. This will be added in a follow up. Relates #6720	2017-04-19 14:36:11 +02:00
Ryan Ernst	212f24aa27	Tests: Clean up rest test file handling (#21392 ) This change simplifies how the rest test runner finds test files and removes all leniency. Previously multiple prefixes and suffixes would be tried, and tests could exist inside or outside of the classpath, although outside of the classpath never quite worked. Now only classpath tests are supported, and only one resource prefix is supported, `/rest-api-spec/tests`. closes #20240	2017-04-18 15:07:08 -07:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now a function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Jason Tedor	8033c576b7	Detect remnants of path.data/default.path.data bug In Elasticsearch 5.3.0 a bug was introduced in the merging of default settings when the target setting existed as an array. When this bug concerns path.data and default.path.data, we ended up in a situation where the paths specified in both settings would be used to write index data. Since our packaging sets default.path.data, users that configure multiple data paths via an array and use the packaging are subject to having shards land in paths in default.path.data when that is very likely not what they intended. This commit is an attempt to rectify this situation. If path.data and default.path.data are configured, we check for the presence of indices there. If we find any, we log messages explaining the situation and fail the node. Relates #24099	2017-04-17 07:03:46 -04:00
Ali Beyad	0afcaf5627	[TEST] fix BytesReference tests to never have a negative slice offset	2017-04-13 16:16:53 -04:00
Lee Hinman	5cace8e48a	Remove shadow replicas Resolves #22024	2017-04-11 11:26:26 -06:00
Colin Goodheart-Smithe	0114f0061c	Removes version 2.x constants from Version (#24011 ) * Removes version 2.x constants from Version Closes #21887 * Addresses review comments	2017-04-11 08:31:22 +01:00
Ryan Ernst	65f7a76630	Settings: Add secure file setting to keystore (#24001 ) Some systems like GCE rely on a plaintext file containing credentials. Rather than extract the information out of that credentials file and store each piece individually in the keystore, it is cleaner to just store the entire file. This commit adds support to the keystore wrapper for secure file settings. These are settings that contain an entire file that would normally be stored on the local filesystem. Retrieving the file returns an input stream to the file contents. This also adds an `add-file` command to the keystore cli. In order to support both strings and files as values for settings, the metadata format of the keystore has also been updated (with backcompat) to keep a map of setting name to type.	2017-04-10 13:10:42 -07:00
Jay Modi	42b0b05af1	Test: add support for replacing stashed values within headers of REST tests (#24014 ) This commit adds support for replacing a stashed value within a header of a REST test. This is useful for requests that may want to use a value previously obtained within a header.	2017-04-10 12:10:01 -04:00
javanna	3b7bc8012a	[TEST] increase minimum length of randomly generated fields in RandomObjects We had a couple of unfortunate field name collisions in our CI, where the json duplicate check tripped. Increasing the minimum length of randomly generated field names should decrease the chance of this issue happening again.	2017-04-10 11:32:23 +02:00
Ryan Ernst	d4c0ef0028	Settings: Migrate ec2 discovery sensitive settings to elasticsearch keystore (#23961 ) This change adds secure settings for access/secret keys and proxy username/password to ec2 discovery. It adds the new settings with the prefix `discovery.ec2`, copies other relevant ec2 client settings to the same prefix, and deprecates all other settings (`cloud.aws.` and `cloud.aws.ec2.`). Note that this is simpler than the client configs in repository-s3 because discovery is only initialized once for the entire node, so there is no reason to complicate the configuration with the ability to have multiple sets of client settings. relates #22475	2017-04-07 13:28:15 -07:00
Yannick Welsch	a3cceb8a00	[TEST] Fix testMultipleNodesShutdownNonMasterNodes to wait for the right nodes to rejoin the cluster This test was sporadically failing for the following reason: - 4 nodes (nodes 0, 1, 2, and 3) running with `minimum_master_nodes` set to 3 - we stop 2 nodes (node 0 and 3) - wait for cluster block to be in place on all nodes - start 2 nodes (node 4 and node 5) and do a `prepareHealth().setWaitForNodes("4")` - then do a search request The search request runs into the `ClusterBlockException` as the `prepareHealth().setWaitForNodes("4")` check succeeds on a cluster state that has nodes 1, 2, 3, and 4, i.e., only one of the two new nodes has joined the cluster and only one of the two dead nodes was removed by the master (removing the dead nodes only happens after there are again `minimum_master_nodes` nodes in the cluster). This commit fixes the issue by reusing a method from InternalTestCluster that checks that the right nodes have rejoined the cluster.	2017-04-07 15:26:21 +02:00
Luca Cavanna	13cf8aaa52	[TEST] fix shuffling of xContent keys (#23929 ) ESTestCase has methods to shuffle xContent keys given a builder or a parser. Shuffling wasn't actually doing what was expected but rather reordering the keys in their natural ordering, hence the output was always the same at every run. Corrected that and added tests, also fixed a couple of tests that were affected by this fix.	2017-04-07 10:20:32 +02:00
Lee Hinman	0257a7b97a	Only re-parse operation if a mapping update was needed When executing an index operation on the primary shard, `TransportShardBulkAction` first parses the document, sees if there are any mapping updates that need to be applied, and then updates the mapping on the master node. It then re-parses the document to make sure that the mappings have been applied and propagated. This adds a check that skips the second parsing of the document in the event there was not a mapping update applied in the first case. Fixes a performance regression introduced in #23665	2017-04-05 09:29:44 -06:00
Luca Cavanna	318d365b12	[TEST] make sure that fromXContent doesn't rely on keys ordering (#23901 ) We shuffle the keys before we parse our responses for the high level client so that we make sure we never rely on keys ordering.	2017-04-05 11:12:34 +02:00
Jason Tedor	3136ed1490	Rename random ASCII helper methods This commit renames the random ASCII helper methods in ESTestCase. This is because this method ultimately uses the random ASCII methods from randomized runner, but these methods actually only produce random strings generated from [a-zA-Z]. Relates #23886	2017-04-04 11:04:18 -04:00
Boaz Leskes	2266947ac5	testDifferentRolesMaintainPathOnRestart - fix broken comment	2017-04-04 11:03:44 +02:00
Boaz Leskes	20b274d7b9	testDifferentRolesMaintainPathOnRestart - lower join timeout as split elections are likely The test reduces the wait for the initial cluster state to 0, causing multiple nodes to be started while elections are going on. This means there is a chance of a split election, which shouldn't cause the test to time out.	2017-04-04 10:36:09 +02:00
Jason Tedor	71293a89bf	Introduce single-node discovery This commit adds a single node discovery type. With this discovery type, a node will elect itself as master and never form a cluster with another node. Relates #23595	2017-04-04 03:02:58 -04:00
Boaz Leskes	40eb68c95a	testRestorePersistentSettings doesn't to mess with discovery settings	2017-04-03 16:23:17 +02:00
Boaz Leskes	55a3fd1919	testDifferentRolesMaintainPathOnRestart shouldn't use auto managing of min master nodes It starts nodes in any order and thus disables the wait for the first cluster state at node start-up time; the latter is required for the auto management logic. Closes #23728	2017-04-03 16:23:17 +02:00
Boaz Leskes	5cf1d4ae90	mute testDifferentRolesMaintainPathOnRestart See https://github.com/elastic/elasticsearch/issues/23728	2017-04-03 10:23:04 +02:00
Jason Tedor	1d648a3d46	Fix BootstrapForTesting blowup This commit fixes an issue with BootstrapForTesting where the common case was to invoke a method with a null parameter that does not accept null.	2017-04-01 17:49:40 -04:00
Jason Tedor	8c554215e0	Ban Boolean#getBoolean The method Boolean#getBoolean is dangerous. It is too easy to mistakenly invoke this method thinking that it is parsing a string as a boolean. However, what it actually does is get a system property with the specified string, and then attempts to use usual crappy boolean parsing in the JDK to parse that system property as boolean with complete leniency (it parses every input value into either true or false); that is, this method amounts to invoking Boolean#parseBoolean(String) on the result of System#getProperty(String). Boo. This commit bans usage of this method. Relates #23864	2017-04-01 17:02:19 -04:00
Tim Brooks	5fa80a6521	Pass exception from sendMessage to listener (#23559 ) This commit changes the listener passed to sendMessage from a Runnable to an ActionListener. This change also removes IOException from the sendMessage signature. That signature is misleading as it allows implementers to assume an exception will be thrown in case of failure. That does not happen due to Netty's async nature.	2017-03-30 15:08:23 -05:00
Jason Tedor	48357e43d3	Honor update request timeout When executing an update request, the request timeout is not transferred to the index/delete request executed on behalf of the update request. This leads to update requests not timing out when they should (e.g., if not all shards are available when the request specifies wait_for_shards=all with a small timeout). This commit causes the index/delete requests to honor the update request timeout. Relates #23825	2017-03-30 14:38:34 -04:00
Ryan Ernst	f8453aca57	Packaging: Remove classpath ordering hack (#23596 ) After the removal of the joda time hack we used to have, we can cleanup the codebase handling in security, jarhell and plugins to be more picky about uniqueness. This was originally in #18959 which was never merged. closes #18959	2017-03-21 12:12:16 -07:00
Jason Tedor	7b17689458	Search took time should use a relative clock Search took time uses an absolute clock to measure elapsed time, and then tries to deal with the complexities of using an absolute clock for this purpose. Instead, we should use a high-precision monotonic relative clock that is designed exactly for measuring elapsed time. This commit modifies the search infrastructure to use a relative clock for measuring took time, but still provides an absolute clock for the components of search that require a real clock (e.g., index name expression resolution, etc.). Relates #23662	2017-03-20 18:48:51 -04:00
Igor Motov	1bd66136d7	Task Manager should be able to support non-transport tasks (#23619 ) Currently the task manager is tied to the transport and can only create tasks based on TransportRequests. This commit enables task manager to support tasks created by non-transport services such as the persistent tasks service.	2017-03-17 19:29:18 -04:00
Christoph Büscher	d02b6f58fa	Tests: Adapt ExistsQueryBuilderTests to changes in ExistQueryBuilder#toQuery() (#23462 ) Recent changes in the Lucene query that the ExistsQueryBuilder creates broke this test.	2017-03-02 18:27:30 +01:00
Luca Cavanna	cc65a94fd4	[TEST] improve yaml test sections parsing (#23407 ) Throw error when skip or do sections are malformed, such as they don't start with the proper token (START_OBJECT). That signals bad indentation, which would be ignored otherwise. Thanks to (or due to) our pull parsing code, we were still able to properly parse the sections, yet other runners weren't able to. Closes #21980 * [TEST] fix indentation in matrix_stats yaml tests * [TEST] fix indentation in painless yaml test * [TEST] fix indentation in analysis yaml tests * [TEST] fix indentation in generated docs yaml tests * [TEST] fix indentation in multi_cluster_search yaml tests	2017-03-02 12:43:20 +01:00
Jason Tedor	64e193874f	Properly clean up thread context after tests Today when resetting the deprecation logger after a test is torn down, we attach a new thread context to the deprecation logger. This thread context is never cleared and we are left with a thread context attached to the deprecation logger for every test method that ran in the same JVM. This commit adds a flag when resetting the deprecation logger to not attach a new thread context when the test is being torn down. Relates #23441	2017-03-01 16:34:10 -05:00
Adrien Grand	3134d6b520	Add unit tests to percentile ranks aggregations. (#23240 ) Relates #22278	2017-03-01 13:57:40 +01:00
Jason Tedor	7ce06aeb8c	Fix date format in warning headers This commit fixes the date format in warning headers. There is some confusion around whether or not RFC 1123 requires two-digit days. However, the warning header specification very clearly relies on a format that requires two-digit days. This commit removes the usage of RFC 1123 date/time format from Java 8, which allows for one-digit days, in favor of a format that forces two-digit days (it's otherwise identical to RFC 1123 format, it is just fixed width). Relates #23418	2017-02-28 20:28:07 -05:00
Jason Tedor	ee2f6ccf32	Add convenience method for asserting deprecations This commit adds a convenience method for simultaneously asserting settings deprecations and other warnings and fixes some tests where setting deprecations and general warnings were present.	2017-02-28 18:24:39 -05:00
Ali Beyad	5e2e45cad9	Makes the same_shard host dynamically updatable (#23397 ) Previously, cluster.routing.allocation.same_shard.host was not a dynamic setting and could not be updated after startup. This commit changes the behavior to allow the setting to be dynamically updatable. The documentation already states that the setting is dynamic so no documentation changes are required. Closes #22992	2017-02-28 12:48:54 -05:00
Jim Ferenczi	5c84640126	Upgrade to lucene-6.5.0-snapshot-d00c5ca (#23385 ) Lucene upgrade	2017-02-27 18:39:04 +01:00
Jason Tedor	577e6a5e14	Correct warning header to be compliant The warning header used by Elasticsearch for delivering deprecation warnings has a specific format (RFC 7234, section 5.5). The format specifies that the warning header should be of the form warn-code warn-agent warn-text [warn-date] Here, the warn-code is a three-digit code which communicates various meanings. The warn-agent is a string used to identify the source of the warning (either a host:port combination, or some other identifier). The warn-text is a quoted string which conveys the semantic meaning of the warning. The warn-date is an optional quoted date that can be in a few different formats. This commit corrects the warning header within Elasticsearch to follow this specification. We use the warn-code 299 which means a "miscellaneous persistent warning." For the warn-agent, we use the version of Elasticsearch that produced the warning. The warn-text is unchanged from what we deliver today, but is wrapped in quotes as specified (this is important as a problem that exists today is that multiple warnings can not be split by comma to obtain the individual warnings as the warnings might themselves contain commas). For the warn-date, we use the RFC 1123 format. Relates #23275	2017-02-27 12:14:21 -05:00
javanna	756e26cb33	[TEST] make headers case-insensitive when running yaml tests	2017-02-27 12:27:03 +01:00
javanna	4f487ab1b9	[TEST] randomize request content_type between all of the supported formats	2017-02-27 12:27:03 +01:00
javanna	9a2dba3036	[TEST] add support for binary responses to REST tests infra	2017-02-27 12:27:03 +01:00
javanna	ca858befab	[TEST] create HttpEntity earlier in REST tests This allows setting the content-type together with the body itself. At the moment it is always json, but this change makes it easier to randomize it later	2017-02-27 12:27:03 +01:00
javanna	04aaedc083	[TEST] Remove content type auto-detection while parsing request body in REST tests	2017-02-27 12:27:03 +01:00
Ryan Ernst	48548f6c3d	CLI: Fix prompting for yes/no to handle console returning null (#23320 ) Console.readText may return null in certain cases. This commit fixes a bug in Terminal.promptYesNo which assumed a non-null return value. It also adds a test for this, and modifies mock terminal to be able to handle null input values.	2017-02-24 20:20:17 -08:00
Simon Willnauer	ce625ebdcc	Expose `batched_reduce_size` via `_search` (#23288 ) In #23253 we added the ability to incrementally reduce search results. This change exposes the parameter to control the batch size and therefore the memory consumption of a large search request.	2017-02-21 18:36:59 +01:00
Tanguy Leroux	3a0fc526bb	UpdateRequest implements ToXContent (#23289 ) This commit changes UpdateRequest so that it implements the ToXContentObject interface.	2017-02-21 15:20:15 +01:00
Simon Willnauer	f933f80902	First step towards incremental reduction of query responses (#23253 ) Today all query results are buffered up until we received responses of all shards. This can hold on to a significant amount of memory if the number of shards is large. This commit adds a first step towards incrementally reducing aggregations results if a, per search request, configurable amount of responses are received. If enough query results have been received and buffered all so-far received aggregation responses will be reduced and released to be GCed.	2017-02-21 13:02:48 +01:00
Tanguy Leroux	872412f645	[Tests] Cleans up DocWriteResponse parsing tests (#23233 ) This commit cleans up some parsing tests added from the High Level Rest Client: IndexResponseTests, DeleteResponseTests, UpdateResponseTests, BulkItemResponseTests. These tests are now more uniform with the other test-from-to-XContent tests we have: they now shuffle the XContent fields before parsing, the asserting method for parsed objects does not use a Map<String, Object> anymore, and buggy equals/hashCode methods in ShardInfo and ShardInfo.Failure have been removed.	2017-02-20 09:45:33 +01:00
Jay Modi	b234644035	Enforce Content-Type requirement on the rest layer and remove deprecated methods (#23146 ) This commit enforces the requirement of Content-Type for the REST layer and removes the deprecated methods in transport requests and their usages. While doing this, it turns out that there are many places where *Entity classes are used from the apache http client libraries and many of these usages did not specify the content type. The methods that do not specify a content type explicitly have been added to forbidden apis to prevent more of these from entering our code base. Relates #19388	2017-02-17 14:45:41 -05:00
Boaz Leskes	f83db675c8	Ensure network connections are restored after disruptions (#23135 ) With #22977, network disruption also disconnects nodes from the transport service. That has the side effect that when the disruption is healed, the disconnected node stays disconnected until the `NodeConnectionsService` restores the connection. This can take too long for the tests. This PR adds logic to the cluster healing to restore connections immediately. See https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=debian/611/console for an example failure.	2017-02-15 11:04:52 +02:00
Adrien Grand	8d6a41f671	Nested queries should avoid adding unnecessary filters when possible. (#23079 ) When nested objects are present in the mappings, many queries get deoptimized due to the need to exclude documents that are not in the right space. For instance, a filter is applied to all queries that prevents them from matching non-root documents (`+*:* -_type:__*`). Moreover, a filter is applied to all child queries of `nested` queries in order to make sure that the child query only matches child documents (`_type:__nested_path`), which is required by `ToParentBlockJoinQuery` (the Lucene query behind Elasticsearch's `nested` queries). These additional filters slow down `nested` queries. In 1.7-, the cost was somehow amortized by the fact that we cached filters very aggressively. However, this has proven to be a significant source of slow downs since 2.0 for users of `nested` mappings and queries, see #20797. This change makes the filtering a bit smarter. For instance if the query is a `match_all` query, then we need to exclude nested docs. However, if the query is `foo: bar` then it may only match root documents since `foo` is a top-level field, so no additional filtering is required. Another improvement is to use a `FILTER` clause on all types rather than a `MUST_NOT` clause on all nested paths when possible since `FILTER` clauses are more efficient. Here are some examples of queries and how they get rewritten: ``` "match_all": {} ``` This query gets rewritten to `ConstantScore(+*:* -_type:__*)` on master and `ConstantScore(_type:AutomatonQuery {\norg.apache.lucene.util.automaton.Automaton@4371da44})` with this change. The automaton is the complement of `_type:__*` so it matches the same documents, but is faster since it is now a positive clause. Simplistic performance testing on a 10M index where each root document has 5 nested documents on average gave a latency of 420ms on master and 90ms with this change applied. 
``` "term": { "foo": { "value": "0" } } ``` This query is rewritten to `+foo:0 #(ConstantScore(+*:* -_type:__*))^0.0` on master and `foo:0` with this change: we do not need to filter nested docs out since the query cannot match nested docs. While doing performance testing in the same conditions as above, response times went from 250ms to 50ms. ``` "nested": { "path": "nested", "query": { "term": { "nested.foo": { "value": "0" } } } } ``` This query is rewritten to `+ToParentBlockJoinQuery (+nested.foo:0 #_type:__nested) #(ConstantScore(+*:* -_type:__*))^0.0` on master and `ToParentBlockJoinQuery (nested.foo:0)` with this change. The top-level filter (`-_type:__*`) could be removed since `nested` queries only match documents of the parent space, as well as the child filter (`#_type:__nested`) since the child query may only match nested docs since the `nested` object has both `include_in_parent` and `include_in_root` set to `false`. While doing performance testing in the same conditions as above, response times went from 850ms to 270ms.	2017-02-14 16:05:19 +01:00
Christoph Büscher	5b459a0bdc	[Tests] increase minimal field name when creating random objects I encountered several cases of duplicate field names when generating random fields using the RandomObjects helper. This leads to invalid json in some tests, so increasing the minimum field name length to four to make this less likely to happen.	2017-02-14 11:31:37 +01:00
Jason Tedor	5343b87502	Handle bad HTTP requests When Netty decodes a bad HTTP request, it marks the decoder result on the HTTP request as a failure, and reroutes the request to GET /bad-request. This leads to puzzling responses when a bad request is sent to Elasticsearch (if an index named "bad-request" does not exist then it produces an index not found exception and otherwise responds with the index settings for the index named "bad-request"). This commit addresses this by inspecting the decoder result on the HTTP request and dispatching the request to a bad request handler preserving the initial cause of the bad request and providing an error message to the client. Relates #23153	2017-02-13 17:39:25 -05:00
Jay Modi	61e383813d	Make the version of the remote node accessible on a transport channel (#23019 ) This commit adds a new method to the TransportChannel that provides access to the version of the remote node that the response is being sent on and that the request came from. This is helpful for serialization of data attached as headers.	2017-02-13 15:15:57 -05:00
jaymode	d8d03f45c2	Fix communication with 5.3.0 nodes This commit fixes communication with 5.3.0 nodes to send XContentType to these nodes since #22691 was backported to the 5.3 branch.	2017-02-13 13:15:51 -05:00
Boaz Leskes	6a8ef0ea74	Traces in testAdapterSendReceiveCallbacks should only listen to the relevant actions The traces callback is only called after responses are set. This can lead to concurrency issues where the trace is notified of previously sent responses if it was added after the response was sent (enabling further execution of the test) but before the tracer callbacks are called.	2017-02-12 09:20:18 +02:00
Boaz Leskes	c2494bbaed	log extra information on failure of testAdapterSendReceiveCallbacks	2017-02-11 19:41:19 +02:00
Adrien Grand	709cc9ba65	Upgrade to lucene-6.5.0-snapshot-f919485. (#23087 )	2017-02-10 15:08:47 +01:00
Boaz Leskes	cd1cb41603	Move EvilPeerRecoveryIT to a unit test in RecoveryDuringReplicationTests (#22900 ) EvilPeerRecoveryIT checks a scenario where recovery is happening while there are ongoing indexing operations that have already been assigned a seq#. This is fairly hard to achieve and the test jumps through a couple of hoops via the plugin infra to achieve that. This PR extends the unit tests infra to allow for those hoops to happen in unit tests. This allows the test to be moved to RecoveryDuringReplicationTests. Relates to #22484	2017-02-09 20:14:03 +02:00
Simon Willnauer	ecb01c15b9	Fold InternalSearchHits and friends into their interfaces (#23042 ) We have a bunch of interfaces that have only a single implementation for 6 years now. These interfaces are pretty useless from a SW development perspective and only add unnecessary abstractions. They also require lots of casting in many places where we expect that there is only one concrete implementation. This change removes the interfaces, makes all of the classes final and removes the duplicate `foo` `getFoo` accessors in favor of `getFoo` from these classes.	2017-02-08 14:40:08 +01:00
Yannick Welsch	9154686623	Remove legacy primary shard allocation mode based on versions (#23016 ) Elasticsearch v5.0.0 uses allocation IDs to safely allocate primary shards whereas prior versions of ES used a version-based mode instead. Elasticsearch v5 still has support for version-based primary shard allocation as it needs to be able to load 2.x shards. ES v6 can drop the legacy support.	2017-02-08 10:00:55 +01:00
Boaz Leskes	ba06c14a97	TransportService.connectToNode should validate remote node ID (#22828 ) #22194 gave us the ability to open low level temporary connections to remote node based on their address. With this use case out of the way, actual full blown connections should validate the node on the other side, making sure we speak to who we think we speak to. This helps in case where multiple nodes are started on the same host and a quick node restart causes them to swap addresses, which in turn can cause confusion down the road.	2017-02-07 22:11:32 +02:00
Ryan Ernst	470ad1ae4a	Settings: Add secure settings validation on startup (#22894 ) Secure settings from the elasticsearch keystore were not yet validated. This change improves support in Settings so that secure settings more seamlessly blend in with normal settings, allowing the existing settings validation to work. Note that the setting names are still not validated (yet) when using the elasticsearch-keystore tool.	2017-02-07 09:34:41 -08:00
Tim Brooks	27b7d9bd8d	Add FileSystemUtil method to read 'file:/' URLs (#23020 ) As part of #22116 we are going to forbid usage of api java.net.URL#openStream(). However in a number of places across the codebase we use this method to read files from the local filesystem. This commit introduces a helper method openFileURLStream(URL url) to read files from URLs. It does specific validation to ensure that only file:/ urls are read. Additionally, this commit removes unneeded method FileSystemUtil.newBufferedReader(URL, Charset). This method used the openStream() method which will soon be forbidden. Instead we use the Files.newBufferedReader(Path, Charset).	2017-02-07 10:24:22 -06:00
Boaz Leskes	03ef756539	MockTransportService should physically disconnect when simulating it (#22977 ) This is in order to trigger listeners for disconnect events, most importantly the NodeFaultDetection. MockTransportService now does a slightly better job at mimicking real life failures: connecting to already connected node will be a noop (we don't detect any errors here in production either) and failing to send will cause the target node to be disconnected. This is the cause of failure in https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.2+multijob-unix-compatibility/os=debian/72	2017-02-06 17:44:29 +01:00
Boaz Leskes	5e7d22357f	Connect to new nodes concurrently (#22984 ) When a node receives a new cluster state from the master, it opens up connections to any new node in the cluster state. That has always been done serially on the cluster state thread but it has been a long standing TODO to do this concurrently, which is done by this PR. This is a spin-off of #22828, where an extra handshake is done whenever connecting to a node, which may slow down connecting. Also, the handshake is done in a blocking fashion which triggers assertions w.r.t blocking requests on the cluster state thread. Instead of adding an exception, I opted to implement concurrent connections which both sidesteps the assertion and compensates for the extra handshake.	2017-02-06 16:32:41 +01:00
Jason Tedor	9a0b216c36	Upgrade checkstyle to version 7.5 This commit upgrades the checkstyle configuration from version 5.9 to version 7.5, the latest version as of today. The main enhancement obtained via this upgrade is better detection of redundant modifiers. Relates #22960	2017-02-03 09:46:44 -05:00
Jay Modi	7520a107be	Optionally require a valid content type for all rest requests with content (#22691 ) This change adds a strict mode for xcontent parsing on the rest layer. The strict mode will be off by default for 5.x and in a separate commit will be enabled by default for 6.0. The strict mode, which can be enabled by setting `http.content_type.required: true` in 5.x, will require that all incoming rest requests have a valid and supported content type header before the request is dispatched. In the non-strict mode, the Content-Type header will be inspected and if it is not present or not valid, we will continue with auto detection of content like we have done previously. The content type header is parsed to the matching XContentType value with the only exception being for plain text requests. This value is then passed on with the content bytes so that we can reduce the number of places where we need to auto-detect the content type. As part of this, many transport requests and builders were updated to provide methods that accepted the XContentType along with the bytes and the methods that would rely on auto-detection have been deprecated. In the non-strict mode, deprecation warnings are issued whenever a request with body doesn't provide the Content-Type header. See #19388	2017-02-02 14:07:13 -05:00
Igor Motov	c34b63dadd	Expand AbstractSerializingTestCase and AbstractWireSerializingTestCase to test diff serialization This commit adds two additional test cases that can be used to verify correct diff serialization in addition to binary and xcontent serialization.	2017-02-02 12:19:53 -05:00
Tanguy Leroux	f86fd62821	Parse elasticsearch exception's root causes (#22924 ) This commit changes the ElasticsearchException.failureFromXContent() method so that it now parses root causes which were ignored before, and adds them as suppressed exceptions of the returned exception.	2017-02-02 17:00:16 +01:00
Boaz Leskes	eb36b82de4	Seq Number based recovery should validate last lucene commit max seq# (#22851 ) The seq# based recovery logic relies on rolling back lucene to remove any operations above the global checkpoint. This part of the plan is not implemented yet but we have to have these guarantees. Instead we should make the seq# logic validate that the last commit point (and the only one we have) maintains the invariant and if not, fall back to file based recovery. This commit adds a test that creates a situation where rollback is needed (primary failover with ops in flight) and fixes another issue that was surfaced by it - if a primary can't serve a seq# based recovery request and does a file copy, it still used the incoming `startSeqNo` as a filter. Relates to #22484 & #10708	2017-01-31 20:27:31 +01:00
Ryan Ernst	29f63c78cc	Internal: Convert empty and size checks of settings to not use getAsMap() (#22890 ) With the new secure settings, methods like getAsMap() no longer work correctly as a means of checking for empty settings, or the total size. This change converts the existing uses of that method to use methods directly on Settings. Note this does not update the implementations to account for SecureSettings, as that will require a followup which changes how secure settings work.	2017-01-31 10:44:09 -08:00
Nik Everett	e042c77301	Add tests for reducing top hits (#22837 ) Also adds many `equals` and `hashCode` implementations and moves the failure printing in `MatchAssertion` into a common spot and exposes it over `assertEqualsWithErrorMessageFromXContent` which does an object equality test but then uses `toXContent` to print the differences. Relates to #22278	2017-01-27 20:54:11 -05:00
Nik Everett	2e48fb8294	Move delete by query helpers into core (#22810 ) This moves the building blocks for delete by query into core. This should enable two things: 1. Plugins other than reindex to implement "bulk by scroll" style operations. 2. Plugins to directly call delete by query. Those plugins should be careful to make sure that task cancellation still works, but this should be possible. Notes: 1. I've mostly just moved classes and moved around test methods. 2. I haven't been super careful about cohesion between these core classes and reindex. They are quite interconnected because I wanted to make the change as mechanical as possible. Closes #22616	2017-01-27 16:09:18 -05:00
Ryan Ernst	aad51d44ab	S3 repository: Add named configurations (#22762 ) * S3 repository: Add named configurations This change implements named configurations for s3 repository as proposed in #22520. The access/secret key secure settings which were added in #22479 are reverted, and the only secure settings are those with the new named configs. All other previously used settings for the connection are deprecated. closes #22520	2017-01-27 10:42:45 -08:00
Nik Everett	8abd4101eb	Add tests for reducing top hits Also adds many `equals` and `hashCode` implementations and moves the failure printing in `MatchAssertion` into a common spot and exposes it over `assertEqualsWithErrorMessageFromXContent` which does an object equality test but then uses `toXContent` to print the differences. Relates to #22278	2017-01-27 12:32:17 -05:00
Jason Tedor	930282e161	Introduce sequence-number-based recovery This commit introduces sequence-number-based recovery. When a replica has fallen out of sync, rather than performing a file-based recovery we first attempt to replay operations since the last local checkpoint on the replica. To do this, at the start of recovery the replica tells the primary what its local checkpoint is. The primary will then wait for all operations between that local checkpoint and the current maximum sequence number to complete; this is to ensure that there are no gaps in the operations that will be replayed from the primary to the replica. This is a best-effort attempt as we currently have no guarantees on the primary that these operations will be available; if we are not able to replay all operations in the desired range, we just fallback to file-based recovery. Later work will strengthen the guarantees. Relates #22484	2017-01-27 08:16:38 -08:00
Jim Ferenczi	e48bc2eed7	Add field collapsing for search request (#22337 ) * Add top hits collapsing to search request The field collapsing is done with a custom top docs collector that "collapses" search hits with the same field value. The distributed aspect is resolved using the two passes that the regular search uses. The first pass "collapses" the top hits, then the coordinating node merges/collapses the top hits from each shard. ``` GET _search { "collapse": { "field": "category" } } ``` This change also adds an ExpandCollapseSearchResponseListener that intercepts the search response and expands collapsed hits using the CollapseBuilder#innerHit options. The retrieval of each inner_hits is done by sending a query to all shards filtered by the collapse key. ``` GET _search { "collapse": { "field": "category", "inner_hits": { "size": 2 } } } ```	2017-01-23 16:33:51 +01:00
Simon Willnauer	27b5c2ad54	Pass `forceExecution` flag to transport interceptor (#22739 ) To effectively allow a plugin to intercept a transport handler it needs to know if the handler must be executed even if there is a rejection on the thread pool in the case the wrapper forks a thread to execute the actual handler.	2017-01-23 11:04:27 +01:00
Simon Willnauer	824beea89d	Fix handling of document failure exception in InternalEngine (#22718 ) Today we try to be smart and make a generic decision if an exception should be treated as a document failure but in some cases concurrency in the index writer makes this decision very difficult since we don't have a consistent state in the case another thread is currently failing the IndexWriter/InternalEngine due to a tragic event. This change simplifies the exception handling and makes specific decisions about document failures rather than using a generic heuristic. This prevents exceptions from being treated as document failures that should have failed the engine but backed out of failing since some other thread has already taken over the failure procedure but didn't finish yet.	2017-01-20 16:55:00 +01:00
Ryan Ernst	c5b4bba30b	S3 repository: Deprecate specifying credentials through env vars, sys props, and remove profile files (#22567 ) * S3 repository: Deprecate specifying credentials through env vars and sys props This is a follow up to #22479, where storing credentials in a secure way was added.	2017-01-19 12:36:32 -08:00
Simon Willnauer	24e2847af2	Streamline foreign stored context restore and allow to preserve response headers (#22677 ) Today we do not preserve response headers if they are present on a transport protocol response. While preserving these headers is not always desired, in most cases we should pass on these headers to have consistent results for deprecation headers etc. Yet, this hasn't been much of a problem since most of the deprecations are detected early, i.e. on the coordinating node, such that this bug wasn't uncovered until #22647. This commit allows to optionally preserve headers when a context is restored and also streamlines the context restore since it leaked frequently into the caller's thread context when the caller's context wasn't restored again.	2017-01-18 16:17:54 +01:00
Simon Willnauer	19f9cb307a	Merge branch 'master' into feature/multi_cluster_search	2017-01-18 09:24:35 +01:00
Luca Cavanna	bc5b604cbd	[TEST] parse global parameters from _common.json (#22655 ) Replace the hardcoded global parameters in the yaml test suite with parameters parsed from the newly added _common.json file. Relates to #22569	2017-01-17 16:13:09 +01:00
Ali Beyad	e2977889b8	Allow comma delimited array settings to have a space after each entry (#22591 ) Previously, certain settings that could take multiple comma delimited values would pick up incorrect values for all entries but the first if each comma separated value was followed by a whitespace character. For example, the multi-value "A,B,C" would be correctly parsed as ["A", "B", "C"] but the multi-value "A, B, C" would be incorrectly parsed as ["A", " B", " C"]. This commit allows a comma separated list to have whitespace characters after each entry. The specific settings that were affected by this are: cluster.routing.allocation.awareness.attributes index.routing.allocation.require.* index.routing.allocation.include.* index.routing.allocation.exclude.* cluster.routing.allocation.require.* cluster.routing.allocation.include.* cluster.routing.allocation.exclude.* http.cors.allow-methods http.cors.allow-headers For the allocation filtering related settings, this commit also provides validation of each specified entry if the filtering is done by _ip, _host_ip, or _publish_ip, to ensure that each entry is a valid IP address. Closes #22297	2017-01-17 08:51:04 -06:00
Simon Willnauer	709cb9a39e	Merge branch 'master' into feature/multi_cluster_search	2017-01-17 12:34:36 +01:00
Michael McCandless	ebd38e2a6a	Expose FlattenGraphTokenFilter (#22643 ) FlattenGraphTokenFilter is necessary for using graph-based token streams (e.g. the new SynonymGraphFilter) during indexing.	2017-01-16 16:53:32 -05:00
Simon Willnauer	f30b1f82ee	Remove HttpServer and HttpServerAdapter in favor of a simple dispatch method (#22636 ) Today we have quite some abstractions that are essentially providing a simple dispatch method to the plugins defining a `HttpServerTransport`. This commit removes `HttpServer` and `HttpServerAdapter` and introduces a simple `Dispatcher` functional interface that delegates to `RestController` by default. Relates to #18482	2017-01-16 21:06:08 +01:00
Luca Cavanna	193111919c	move ignore parameter support from yaml test client to low level rest client (#22637 ) All the language clients support a special ignore parameter that doesn't get passed to elasticsearch with the request, but is used to indicate which error codes should not lead to an exception if returned for a specific request. Moving this to the low level REST client will allow the high level REST client to make use of it too, for instance so that it doesn't have to intercept ResponseExceptions when the get api returns a 404.	2017-01-16 18:54:44 +01:00
Simon Willnauer	895124e67e	Merge branch 'master' into feature/multi_cluster_search	2017-01-16 13:20:45 +01:00
Simon Willnauer	5f0344a918	Pass ThreadContext to transport interceptors to allow header modification (#22618 ) TransportInterceptors are commonly used to enrich requests with headers etc. which requires access to the thread context. This is not always easily possible since threadpools are hard to access, for instance if the interceptor is used on a transport client. This commit passes on the thread context to all the interceptors for further consumption. Closes #22585	2017-01-15 13:35:39 +01:00
Simon Willnauer	3f784a4424	Merge branch 'master' into feature/multi_cluster_search	2017-01-15 10:28:34 +01:00
Simon Willnauer	2dd0ec57b2	[TEST] Remove connection listener from all transports in AbstractSimpleTransportTestCase#testSendRandomRequests	2017-01-13 23:19:04 +01:00
Simon Willnauer	63e4552c0d	Merge branch 'master' into feature/multi_cluster_search	2017-01-13 23:07:20 +01:00
Simon Willnauer	4c1ee018f6	Remove setLocalNode from ClusterService and TransportService (#22608 ) ClusterService and TransportService expect the local discovery node to be set before they are started but this requires manual interaction and is error prone since to work absolutely correctly they should share the same instance (same ephemeral ID). TransportService also has 2 modes of operation, mainly related to transport client vs. internal to a node. This change removes the mode where we don't maintain a local node and uses a dummy local node in the transport client since we don't bind to any port in such a case. Local discovery node instances are now managed by the node itself and only suppliers and factories that allow creation only once are passed to TransportService and ClusterService.	2017-01-13 16:12:27 +01:00
Simon Willnauer	d5fa84f869	Harder close and remove reference concurrency in MockTcpTransport (#22613 ) There was still a small race in MockTcpTransport where channels that are concurrently closing are not yet removed from the reference tracking, causing tests to fail. Compared to the other races before, this is a rather small window and requires very very short test durations.	2017-01-13 16:04:05 +01:00
Simon Willnauer	6779ea9c2a	Merge branch 'master' into feature/multi_cluster_search	2017-01-13 12:10:23 +01:00
Simon Willnauer	acf2d2f86f	Ensure new connections won't be opened if transport is closed or closing (#22589 ) Today there are several races / holes in TcpTransport and MockTcpTransport that can allow connections to be opened and remain unclosed while the actual transport implementation is closed. A recently added assertion in #22554 exposes these problems. This commit fixes several issues related to missed locks or channel creations outside of a lock not checking if the resource is still open.	2017-01-12 20:27:09 +01:00
javanna	8072f168a3	Remove ParseFieldMatcher usages from QueryParseContext	2017-01-12 14:43:35 +01:00
Luca Cavanna	7674de9e1f	Move human flag under always accepted query_string params (#22562 ) There are some parameters that are accepted by each and every api we expose. Those (pretty, source, error_trace and filter_path) are not explicitly listed in the spec of every api, rather whitelisted in clients test runners so that they are always accepted. The `human` flag has been treated up until now as a parameter that's accepted by only some stats and info api, but that doesn't reflect reality as es core treats it exactly like `pretty` (relevant especially now that we validate params and throw exception when we find one that is not supported). Furthermore, the human flag has effect on every api that outputs a date, time, percentage or byte size field. For instance the tasks api outputs a date field although they don't have the human flag explicitly listed in their spec. There are other similar cases. This commit removes the human flag from the rest spec and makes it an always accepted query_string param.	2017-01-12 10:04:45 +01:00
Simon Willnauer	00781d24ce	Merge branch 'master' into feature/multi_cluster_search	2017-01-11 23:40:46 +01:00
Simon Willnauer	8a0393f718	Move assertion for open channels under TcpTransport lock TcpTransport has an actual mechanism to stop resources in subclasses. Instead of overriding `doStop` subclasses should override `stopInternal` that is executed under the connection lock guaranteeing that there is no concurrency etc. Relates to #22554	2017-01-11 23:37:12 +01:00
Ryan Ernst	8015fbbf25	Make s3 repository sensitive settings use secure settings (#22479 ) * Settings: Make s3 repository sensitive settings use secure settings This change converts repository-s3 to use the new secure settings. In order to support the multiple ways we allow aws creds to be configured, it also moves the main methods for the keystore wrapper into a SecureSettings interface, in order to allow settings prefixing to work.	2017-01-11 11:19:46 -08:00
Simon Willnauer	d3124dd62b	Merge branch 'master' into feature/multi_cluster_search	2017-01-11 17:03:30 +01:00
Simon Willnauer	6810125a8b	Prevent open channel leaks if handshake times out or is interrupted (#22554 ) The low level TCP handshake can cause channel / connection leaks if it's interrupted since the caller doesn't close the channel / connection if the handshake was not successful. This commit fixes the channel leak and adds general test infrastructure to detect channel leaks in the future.	2017-01-11 17:02:36 +01:00
Simon Willnauer	6d2d878068	Merge branch 'master' into feature/multi_cluster_search	2017-01-11 09:28:00 +01:00
Tanguy Leroux	2dcb05fca8	Add fromxcontent methods to index response (#22229 ) This commit adds the parsing fromXContent() methods to the IndexResponse class. The method is based on an ObjectParser because it is easier to use when parsing parent abstract classes like DocWriteResponse. It also changes the ReplicationResponse.ShardInfo so that it now implements ToXContentObject. This way, the ShardInfo.fromXContent() method can be used by the IndexResponse's ObjectParser.	2017-01-10 20:25:32 +01:00
Yannick Welsch	c35277e623	[TEST] Fix JSON generation of failure in InternalTestCluster Relates to #22387	2017-01-10 17:53:04 +01:00
Boaz Leskes	f387848f83	MockTransportService.doClose assertions should check openConnections under lock	2017-01-10 14:03:31 +01:00
Yannick Welsch	9fc1a735cc	Keep NodeConnectionsService in sync with current nodes in the cluster state (#22509 ) The NodeConnectionsService currently determines which nodes to connect to / disconnect from by inspecting cluster state changes and connecting to added nodes / disconnecting from removed nodes. When a master steps down (for example due to another master-eligible node shutting down which brings the number of master-eligible nodes below minimum_master_nodes), and the connection to other existing nodes was dropped while pinging, however, the connection to these nodes is not re-established while publishing the first cluster state that establishes the node as master. This commit changes the NodeConnectionsService connect / disconnect logic to always rely on the state that is to be / was published, looking not only at the added / removed nodes, but validating that exactly all nodes that are currently registered in NodeConnectionsService are connected (corresponds to a NOOP if the node is already connected).	2017-01-10 13:29:49 +01:00
Simon Willnauer	1ef98ede17	Merge branch 'master' into feature/multi_cluster_search	2017-01-09 12:09:23 +01:00
Nik Everett	12923ef896	Close and flush refresh listeners on shard close Right now closing a shard looks like it strands refresh listeners, causing tests like `delete/50_refresh/refresh=wait_for waits until changes are visible in search` to fail. Here is a build that fails: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+multi_cluster_search+multijob-darwin-compatibility/4/console This attempts to fix the problem by implementing `Closeable` on `RefreshListeners` and rejecting listeners when closed. More importantly the act of closing the instance flushes all pending listeners so we shouldn't have any stranded listeners on close. Because it was needed for testing, this also adds the number of pending listeners to the `CommonStats` object and all APIs to which that flows: `_cat/nodes`, `_cat/indices`, `_cat/shards`, and `_nodes/stats`.	2017-01-06 20:03:32 -05:00
Ryan Ernst	cd6e3f4cea	Merge branch 'master' into keystore	2017-01-06 09:32:08 -08:00
Tim B	b9c2c2f6f0	Move IfConfig.logIfNecessary call into bootstrap (#22455 ) This is related to #22116. A logIfNecessary() call makes a call to NetworkInterface.getInterfaceAddresses() requiring SocketPermission connect privileges. By moving this to bootstrap the logging call can be made before installing the SecurityManager.	2017-01-06 11:10:53 -06:00
Simon Willnauer	418ec62bfb	Merge branch 'master' into feature/multi_cluster_search	2017-01-06 10:24:40 +01:00
Ryan Ernst	eb596d7270	more renames	2017-01-06 01:03:45 -08:00
javanna	d87a30647b	remove ParseFieldMatcher usages from SearchAfterBuilder	2017-01-05 19:33:04 +01:00
Simon Willnauer	0183b0c5a8	More cleanups	2017-01-05 15:23:55 +01:00
Simon Willnauer	80bf01d3c0	Merge branch 'master' into feature/multi_cluster_search	2017-01-05 08:00:03 +01:00
Simon Willnauer	a5daa5d3a2	Execute low level handshake in #openConnection (#22440 ) Today we execute the low level handshake on the TCP layer in #connectToNode. If #openConnection is used directly, which is truly expert, no handshake is executed which allows connecting to nodes that are not necessarily compatible. This change moves the handshake to #openConnection to prevent bypassing this logic.	2017-01-05 07:32:53 +01:00
Tim B	be22a250b6	Replace Socket, ServerSocket, and HttpServer usages in tests with mocksocket versions (#22287 ) This integrates the mocksocket jar with elasticsearch tests. Mocksocket wraps actions requiring SocketPermissions in doPrivileged blocks. This will eventually allow SocketPermissions to be assigned to the mocksocket jar as opposed to the entire elasticsearch codebase.	2017-01-04 14:38:51 -06:00
Adrien Grand	f8998fece5	Upgrade to lucene-6.4.0-snapshot-084f7a0. (#22413 )	2017-01-04 19:03:52 +01:00
Simon Willnauer	e642965804	Cleanup lots of code, add javadocs and tests	2017-01-04 17:26:00 +01:00
Simon Willnauer	c6573e6e56	Filter actions to trace in test Notifications for request tracing are invoked concurrently and can still be in flight once a tracer is installed in the test. This can lead to side-effects since the test relied on exact invocations. This commit adds action filtering to the test tracer to only count invocations for the relevant actions. Closes #22418	2017-01-03 23:40:52 +01:00
Simon Willnauer	422cd1ef77	Add support for proxy nodes This commit adds full support for proxy nodes on the search layer. This allows connecting only to a small set of nodes on a remote cluster to execute the search. The nodes will proxy the request to the correct node in the cluster while the coordinating node doesn't need to be connected to the target node.	2017-01-03 17:24:32 +01:00
javanna	6329a98a97	Remove ParseFieldMatcher usages from SearchContext	2017-01-03 15:52:32 +01:00
javanna	71d6a37032	[TEST] assign blacklistPathMatchers only after the contexts have been assigned There could be an issue creating the REST clients and/or making the first request to the external cluster. If that happens, the blacklist has already been assigned and the following tests will fail because of an assertion that checks that the blacklist is not already assigned when the contexts are not.	2017-01-03 15:25:05 +01:00
Daniel Mitterdorfer	1ed64f0551	Eliminate unnecessary declaration of IOException With this commit we remove the declaration of IOException from assertWarnings and modify all call sites. Checked with @javanna	2017-01-03 12:36:28 +01:00
Igor Motov	ca90d9ea82	Remove PROTO-based custom cluster state components Switches custom cluster state components from PROTO-based de-serialization to named objects based de-serialization	2016-12-28 13:32:35 -05:00
Adrien Grand	2d81750a13	Make ESTestCase resilient to initialization errors.	2016-12-26 14:55:22 +01:00
Adrien Grand	d89757b848	Fix mutate function to always actually modify the failure object.	2016-12-26 10:34:50 +01:00
Jason Tedor	ddf4a463f3	Reject invalid test logging annotations Today we silently ignore invalid test logging annotations. This commit rejects these annotations, failing the processing of the annotation and aborting the test.	2016-12-23 07:51:35 -05:00
Jason Tedor	432ec54347	Apply logging levels in hierarchical order This commit adds a test for applying logging levels in hierarchical order, and addresses an issue with restoring the logging levels at the end of a test or suite.	2016-12-23 07:51:19 -05:00
Yannick Welsch	baea17b53f	Separate cluster update tasks that are published from those that are not (#21912 ) This commit factors out the cluster state update tasks that are published (ClusterStateUpdateTask) from those that are not (LocalClusterUpdateTask), serving as a basis for future refactorings to separate the publishing mechanism out of ClusterService.	2016-12-23 12:23:52 +01:00
Boaz Leskes	215874aff3	process TestLogging annotation value in prefix-first order We have to sort the logger names so they wouldn't override each other. Processing org.elasticsearch:DEBUG after org.elasticsearch.transport:TRACE resets the setting of the latter	2016-12-23 09:03:43 +01:00
Ryan Ernst	fb690ef748	Settings: Add infrastructure for elasticsearch keystore This change is the first towards providing the ability to store sensitive settings in elasticsearch. It adds the `elasticsearch-keystore` tool, which allows managing a java keystore. The keystore is loaded upon node startup in Elasticsearch, and used by the Setting infrastructure when a setting is configured as secure. There are a lot of caveats to this PR. The most important is it only provides the tool and setting infrastructure for secure strings. It does not yet provide for keystore passwords, keypairs, certificates, or even convert any existing string settings to secure string settings. Those will all come in follow up PRs. But this PR was already too big, so this at least gets a basic version of the infrastructure in. The two main things to look at. The first is the `SecureSetting` class, which extends `Setting`, but removes the assumption for the raw value of the setting to be a string. SecureSetting provides, for now, a single helper, `stringSetting()` to create a SecureSetting which will return a SecureString (which is like String, but is closeable, so that the underlying character array can be cleared). The second is the `KeyStoreWrapper` class, which wraps the java `KeyStore` to provide a simpler api (we do not need the entire keystore api) and also extend the serialized format to add metadata needed for loading the keystore with no assumptions about keystore type (so that we can change this in the future) as well as whether the keystore has a password (so that we can know whether prompting is necessary when we add support for keystore passwords).	2016-12-22 16:28:34 -08:00
Nik Everett	f5f2149ff2	Remove much ceremony from parsing client yaml test suites (#22311 ) * Remove a checked exception, replacing it with `ParsingException`. * Remove all Parser classes for the yaml sections, replacing them with static methods. * Remove `ClientYamlTestFragmentParser`. Isn't used any more. * Remove `ClientYamlTestSuiteParseContext`, replacing it with some static utility methods. I did not rewrite the parsers using `ObjectParser` because I don't think it is worth it right now.	2016-12-22 11:00:34 -05:00
Colin Goodheart-Smithe	06576ed13b	Adds abstract test classes for serialisation (#22281 ) This adds test classes that can be used to test the wire serialisation and (optionally) the XContent serialisation of objects that implement Streamable/Writeable and ToXContent. These test classes will enable classes such as InternalAggregation (or at least its implementations) to be tested in a consistent way when it comes to testing serialisation.	2016-12-22 10:49:18 +00:00
Jason Tedor	7946396fe6	Introduce translog no-op As the translog evolves towards a full operations log as part of the sequence numbers push, there is a need for the translog to be able to represent operations for which a sequence number was assigned, but the operation did not mutate the index. Examples of how this can arise are operations that fail after the sequence number is assigned, and gaps in this history that arise when an operation is assigned a sequence number but the operation never completed (e.g., a node crash). It is important that these operations appear in the history so that they can be replicated and replayed during recovery as otherwise the history will be incomplete and local checkpoints will not be able to advance. This commit introduces a no-op to the translog to set the stage for these efforts. Relates #22291	2016-12-21 23:08:16 -05:00
Boaz Leskes	0e9186e137	Simplify Unicast Zen Ping (#22277 ) The `UnicastZenPing` shows its age and is the result of many small changes. The current state of affairs is confusing and is hard to reason about. This PR cleans it up (while following the same original intentions). Highlights of the changes are: 1) Clear 3 round flow - no interleaving of scheduling. 2) The previous implementation did a best effort attempt to wait for ongoing pings to be sent and completed. The pings were guaranteed to complete because each used the total ping duration as a timeout. This did make it hard to reason about the total ping duration and the flow of the code. All of this is removed now and ping should just complete within the given duration or not be counted (note that it was very handy for testing, but I moved the needed sync logic to the test). 3) Because of (2) the pinging scheduling changed a bit, to give a chance for the last round to complete. We now ping at the beginning, 1/3 and 2/3 of the duration. 4) To offset for (3) a bit, incoming ping requests are now added to ongoing ping collections. 5) UnicastZenPing never establishes full blown connections (but does reuse them if there). Relates to #22120 6) Discovery host providers are only used once per pinging round. Closes #21739 7) Usage of the ability to open a connection without connecting to a node ( #22194 ) and shorter connection timeouts helps with connections piling up. Closes #19370 8) Beefed up testing and sped them up. 9) removed light profile from production code	2016-12-21 15:09:58 +01:00
Nik Everett	567c65b0d5	Replace IndicesQueriesRegistry (#22289 ) * Switch query parsing to namedObject * Remove IndicesQueriesRegistry	2016-12-21 09:05:14 -05:00
javanna	7141f6b554	[TEST] improve error message in ESTestCase#assertWarnings	2016-12-21 13:31:02 +01:00
Luca Cavanna	ae01a51b44	[TEST] make ESSingleNodeTestCase tests repeatable (#22283 ) If we conditionally do random things, e.g. initialize a node only after the first test, we have to make sure that we unconditionally create a new seed calling random.nextLong(), then initialize the node under a private randomness context. This makes sure that any random usage through Randomness.get() will retrieve the proper random instance through RandomizedContext.current().getRandom(). When running under private randomness, the context will return the Random instance that was created with the provided seed (forked from the main random instance) rather than the main Random that's exposed to tests as well. Otherwise tests become non repeatable because that initialization part happens only before the first executed test.	2016-12-21 11:44:24 +01:00
Nik Everett	a04dcfb95b	Introduce XContentParser#namedObject (#22003 ) Introduces `XContentParser#namedObject` which works a little like `StreamInput#readNamedWriteable`: on startup components register parsers under names and a superclass. At runtime we look up the parser and call it to parse the object. Right now the parsers take a context object they use to help with the parsing but I hope to be able to eliminate the need for this context as most of what it is used for at this point is to move around parser registries which should be replaced by this method eventually. I make no effort to do so in this PR because it is big enough already. This is meant to be a start down a road that allows us to remove classes like `QueryParseContext`, `AggregatorParsers`, `IndicesQueriesRegistry`, and `ParseFieldRegistry`. The goal here is to reduce the amount of plumbing required to allow parsing pluggable things. With this you don't have to pass registries all over the place. Instead you must pass a super registry to fewer places and use it to wrap the reader. This is the same tradeoff that we use for NamedWriteable and it allows much, much simpler binary serialization. We think we want that same thing for xcontent serialization. The only parsing actually converted to this method is parsing `ScoreFunctions` inside of `FunctionScoreQuery`. I chose this because it is relatively self contained.	2016-12-20 11:05:24 -05:00
Ryan Ernst	850f51db01	Internal: Refactor SettingCommand into EnvironmentAwareCommand (#22175 ) * Internal: Refactor SettingCommand into EnvironmentAwareCommand This change renames and changes the behavior of SettingCommand to have its primary method take in a fully initialized Environment for elasticsearch instead of just a map of settings. All of the subclasses of SettingCommand already did this at some point, so this just removes duplication.	2016-12-19 15:23:44 -08:00
javanna	5dae10db11	[TEST] add warnings check to ESTestCase We are currently checking that no deprecation warnings are emitted in our query tests. That can be moved to ESTestCase (disabled in ESIntegTestCase) as it allows us to easily catch where our tests use deprecated features and assert on the expected warnings.	2016-12-19 19:39:56 +01:00
javanna	6a27628f12	Remove support for strict parsing mode We return deprecation warnings as response headers, besides logging them. Strict parsing mode stayed around, but was only used in query tests, though we also introduced checks for deprecation warnings there that don't need strict parsing anymore (see #20993). We can then safely remove support for strict parsing mode. The final goal is to remove the ParseFieldMatcher class, but there are many many users of it. This commit prepares the field for the removal, by deprecating ParseFieldMatcher and making it effectively not needed. Strict parsing is removed from ParseFieldMatcher, and strict parsing is replaced in tests where needed with deprecation warnings checks. Note that the setting to enable strict parsing was never ported to the new settings infra, hence it cannot be set in production. It is really only used in our own tests. Relates to #19552	2016-12-19 19:39:56 +01:00
javanna	38914f17ed	[TEST] improve ElasticsearchAssertions#assertEquivalent for ToXContent Rename the method to assertToXContentEquivalent to highlight that it's tailored to ToXContent comparisons. Rather than parsing into a map and replacing byte[] in both those maps, add custom equality assertions that recursively walk maps and lists and call Arrays.equals whenever a byte[] is encountered.	2016-12-19 19:32:50 +01:00
Luca Cavanna	3421e54a42	Add fromXContent method to GetResponse (#22082 ) Moved field values `toXContent` logic to `GetField` (from `GetResult`), which outputs its own fields, and can also parse them now. Also added `fromXContent` to `GetResult` and `GetResponse`. The start object and end object for `GetResponse` output have been moved to `GetResult#toXContent`, from the corresponding rest action. This makes it possible to have `toXContent` and `fromXContent` completely symmetric, as parsing requires looping till an end object is found which is weird when the corresponding `toXContent` doesn't print that out. This also introduces the foundation for testing retrieval of _source and stored field values.	2016-12-19 17:21:26 +01:00
Nik Everett	5bec4f8024	Unescape \\r in stash dump Oh windows..... Relates to #22195	2016-12-19 10:57:26 -05:00
Yannick Welsch	63af03a104	Atomic mapping updates across types (#22220 ) This commit makes mapping updates atomic when multiple types in an index are updated. Mappings for an index are now applied in a single atomic operation, which also allows to optimize some of the cross-type updates and checks.	2016-12-19 14:39:50 +01:00
Boaz Leskes	b857b316b6	Add BWC layer to seq no infra and enable BWC tests (#22185 ) Sequence BWC logic consists of two elements: 1) Wire level BWC using stream versions. 2) A change to the global checkpoint maintenance semantics. For the sequence number infra to work with mixed version clusters, we have to consider the situation where the primary is on an old node and replicas are on new ones (i.e., the replicas will receive operations without seq#) and also the reverse (i.e., the primary sends operations to a replica but the replica can't process the seq# and respond with local checkpoint). A new primary with an old replica is rare because we do not allow a replica to recover from a new primary. However, it can occur if the old primary failed and a new replica was promoted or during primary relocation where the source primary is treated as a replica until the master starts the target. 1) Old Primary & New Replica - this case is easy as it is taken care of by the wire level BWC. All incoming requests will have their seq# set to `UNASSIGNED_SEQ_NO`, which doesn't confuse the local checkpoint logic (keeping it at `NO_OPS_PERFORMED`) 2) New Primary & Old replica - this one is trickier as the global checkpoint service currently takes all in sync replicas into consideration for the global checkpoint calculation. In order to deal with old replicas, we change the semantics to say all new node in sync replicas. That means the replicas on old nodes don't count for the global checkpointing. In this state the seq# infra is not fully operational (you can't search on it, because copies may miss it) but it is maintained on shards that can support it. The old replicas will have to go through a file based recovery at some point and will get the seq# information at that point. There is still an edge case where a new primary fails and an old replica takes over. I'll discuss this one with @ywelsch as I prefer to avoid it completely. 
This PR also re-enables the BWC tests which were disabled. As such it had to fix any BWC issue that had crept in. Most notably an issue with the removal of the `timestamp` field in #21670. The commit also includes a fix for the default value of the seq number field in replicated write requests (it was 0 but should be -2), which surfaced some other minor bugs that are fixed as well. Last - I added some debugging tools like more sane node names and forcing replication requests to implement a `toString`	2016-12-19 13:08:24 +01:00
Daniel Mitterdorfer	3ce7b119d2	Enable strict duplicate checks for all XContent types (#22225 ) With this commit we enable the Jackson feature 'STRICT_DUPLICATE_DETECTION' by default for all XContent types (not only JSON). We have also changed the name of the system property to disable this feature from `es.json.strict_duplicate_detection` to the now more appropriate name `es.xcontent.strict_duplicate_detection`. Relates elastic/elasticsearch#19614 Relates elastic/elasticsearch#22073	2016-12-19 09:29:47 +01:00
Simon Willnauer	ccfeac8dd5	Remove `doHandshake` test-only settings from TcpTransport (#22241 ) In #22094 we introduce a test-only setting to simulate transport impls that don't support handshakes. This commit implements the same logic without a setting.	2016-12-18 09:26:53 +01:00
Jason Tedor	58d73bae74	Tighten sequence numbers recovery This commit addresses issues related to recovery and sequence numbers: - A sequence number can be assigned and a Lucene commit created with a maximum sequence number at least as large as that sequence number, yet the operation corresponding to that sequence number can be missing from both the Lucene commit and the translog. This means that upon recovery the local checkpoint will be stuck at or below this missing sequence number. To address this, we force the local checkpoint to the maximum sequence number in the Lucene commit when opening the engine. Note that there can still be gaps in the history in the translog but we do not address those here. - The global checkpoint is transferred to the target shard at the end of peer recovery. - Additionally, we reenable the relocation integration tests. Lastly, this work uncovered some bugs in the assignment of sequence numbers on replica operations: - setting the sequence number on replica write requests was missing, very likely introduced as a result of resolving merge conflicts - handling operations that arrive out of order on a replica and have a version conflict with a previous operation were never marked as processed Relates #22212	2016-12-17 09:20:46 -05:00
Simon Willnauer	1f3eb068d5	Add infrastructure to manage network connections outside of Transport/TransportService (#22194 ) Some expert users like UnicastZenPing today establish real connections to nodes during their ping phase that can be used by other parts of the system. Yet, this is potentially dangerous and undesirable unless the nodes have been fully verified and should be connected to in the case of a cluster state update or if we join a newly elected master. For use-cases like this, this change adds the infrastructure to manually handle connections that are not publicly available on the node, i.e. should not be managed by `Transport`/`TransportService`	2016-12-17 11:49:57 +01:00
Simon Willnauer	0c0353fc7d	[TEST] Add some testlogging	2016-12-16 14:25:17 +01:00
Masaru Hasegawa	a0185c83a7	Merge pull request #21393 from masaruh/alias_boost Resolve index names in indices_boost	2016-12-16 15:07:51 +09:00
Nik Everett	61597f2c20	Send error_trace by default when testing (#22195 ) Sends the `error_trace` parameter with all requests sent by the yaml test framework, including the doc snippet tests. This can be overridden by setting `error_trace: false`. While this drifts core's handling of the yaml tests from the client's slightly, this should only be a problem for tests that rely on the default value, both of which I've fixed by setting the value explicitly. This also escapes `\n` and `\t` in the `Stash dump on failure` so the `stack_trace` is more readable. Also fixes `RestUpdateSettingsAction` to not think of the `error_trace` parameter as a setting.	2016-12-15 13:35:14 -05:00
Boaz Leskes	b6cbcc49ba	ClusterService should expose "applied" cluster states (i.e., remove ClusterStateStatus) (#21817 ) `ClusterService` is responsible for updating the cluster state on every node (as a response to an API call on the master and when non-masters receive a new state from the master). When a new cluster state is processed, it is made visible via the `ClusterService#state` method and is sent to a series of listeners. Those listeners come in two flavours - one is to change the state of the node in response to the new cluster state (call these cluster state appliers), the other is to start a secondary process. Examples of the latter include an indexing operation waiting for a shard to be started or a master node action waiting for a master to be elected. The fact that we expose the state before applying it means that samplers of the cluster state had to worry about two things - working based on a stale CS and working based on a future, i.e., "being applied" CS. The `ClusterStateStatus` was used to allow distinguishing between the two. Working with a stale cluster state is not avoidable. This PR changes things to make sure consumers don't need to worry about future CS, removing the need for the status and simplifying the waiting logic. This change does come with a price as "cluster state appliers" can't sample the cluster state from `ClusterService` whenever they want as the cluster state isn't exposed yet. However, recent clean ups made this situation easier and this PR takes the last steps to remove such sampling. This also helps clarify the "information flow" and helps component separation (and thus potential unit testing). It also adds an assertion that will trigger if the cluster state is sampled by such listeners. Note that there are still many "appliers" that could be made a simpler, unrestricted "listener" but this can be done in smaller bits in the future. 
The commit also makes it clear what the `appliers` and what the `listeners` are by using dedicated interfaces. Also, since I had to change the listener types I went ahead and changed the data structure for temporary/timeout listeners (used for the observer) so addition and removal won't be an O(n) operation.	2016-12-15 17:06:25 +01:00
Simon Willnauer	ef610636b6	Remove TCP handshake BWC from master (#22151 ) Since #22094 has been back-ported to 5.2 we can remove all BWC layers from master since all supported version will handle handshake requests. Relates to #22094	2016-12-15 12:47:01 +01:00
Simon Willnauer	d27a12510b	Handle race-condition when connection is closed before handshake listener was added Today sending a message on a closed channel doesn't throw an exception. The channel might just swallow the exception and informs the internal async exception handler that a channel got disconnected. This change adds a safety check that we fail the handshake if we registered a handler but the channel has been closed already for instance due to a reset by peer.	2016-12-15 12:41:50 +01:00
Simon Willnauer	80d6539e9c	Handle connection close / reset events gracefully during handshake (#22178 ) Low level handshake code doesn't handle situations gracefully if the connection is concurrently closed or reset by peer. This commit adds the relevant code to fail the handshake if the connection is closed.	2016-12-14 23:04:14 +01:00
Boaz Leskes	bf65a69bbf	Enforce min master nodes in test cluster (#22065 ) In order to start clusters with min master nodes set without setting `discovery.initial_state_timeout`, #21846 has changed the way we start nodes. Instead of the previous serial start up, we now always start the nodes in an async fashion (internally). This means that starting a cluster is unsafe without `min_master_nodes` being set. We should therefore make it mandatory.	2016-12-14 20:14:16 +01:00
Daniel Mitterdorfer	7e5058037b	Enable strict duplicate checks for JSON content With this commit we enable the Jackson feature 'STRICT_DUPLICATE_DETECTION' by default. This ensures that JSON keys are always unique. While this has a performance impact, benchmarking has indicated that the typical drop in indexing throughput is around 1 - 2%. As a last resort, we allow users to still disable strict duplicate checks by setting `-Des.json.strict_duplicate_detection=false` which is intentionally undocumented. Closes #19614	2016-12-14 09:35:53 +01:00
Nik Everett	49bdd29f91	Consolidate more parser creation into ESTestCase This will make it easier to add the forthcoming required argument, `NamedXContentRegistry`.	2016-12-13 20:28:41 -05:00
Jason Tedor	510ad7b9c7	Add shutdown hook for closing CLI commands This commit enables CLI commands to be closeable and installs a runtime shutdown hook to ensure that if the JVM shuts down (as opposed to aborting) the close method is called. It is not enough to wrap uses of commands in main methods in try-with-resources blocks as these will not run if, say, the virtual machine is terminated in response to SIGINT, or system shutdown event. Relates #22126	2016-12-13 19:10:11 -05:00
Nik Everett	872984d21a	Continue consolidating `XContentParser` construction in tests (#22145 ) Consolidate more parser creation in tests Moves more parser creation in tests to the `createParser` methods in `ESTestCase`.	2016-12-13 17:22:39 -05:00
Simon Willnauer	7a9b667e98	Introduce a low level protocol handshake (#22094 ) Today we rely on the version that the API user passes in together with the DiscoveryNode. This commit introduces a low level handshake where nodes exchange their version to be used with the transport protocol that is executed every time a connection to a node is established. This, on the one hand allows to change the wire protocol based on the version we are talking to even without a full cluster restart. Today we would need to carry on a BWC layer across major versions but with a handshake we can rely on the fact that the latest version of the previous minor executes a handshake and uses the latest protocol version across all communication with the N+1 version nodes. This change is yet fully backwards compatible, a followup PR will remove the BWC in 6.0 once this has been back-ported to the 5.x branch	2016-12-13 21:06:23 +01:00
Nik Everett	ce86405394	Start to centralize creation of XContentParser in tests (#22096 ) Starts to centralize creation of the `XContentParser` in `protected final` methods on `ESTestCase`. The idea is to enable adding `NamedXContentRegistry` relatively easily by giving tests a single place they can override to define the `NamedXContentRegistry`. Since `NamedXContentRegistry` doesn't exist yet neither does the override point. This doesn't attempt to migrate all the tests to calling the new methods to build the parsers. I wanted to make this so we could review the concept and then I'll merge a followup to migrate the tests.	2016-12-13 11:22:15 -05:00
Simon Willnauer	b667ff46c4	Allow plugins to install bootstrap checks (#22110 ) Plugins also have the need to provide better OOTB experience by configuring defaults unless the plugin is used in _production_ mode. This change exposes the bootstrap check infrastructure as part of the plugin API to allow plugins to specify / install their own bootstrap checks if necessary.	2016-12-12 17:35:00 +01:00
Luca Cavanna	6d987a9b69	Remove support for empty queries (#22092 ) Our query DSL supports empty queries (`{}`), which have a different meaning depending on the query that holds it, either ignored, match_all or match_none. We deprecated the support for empty queries in 5.0, where we log a deprecation warning wherever they are used. The way we supported it once we moved query parsing to the coordinating node was having an Optional<QueryBuilder> return type in all of our parse methods (called fromXContent). See #17624. The central place for this was QueryParseContext#parseInnerQueryBuilder. We can now remove all the optional return types and simply throw an exception whenever an empty query is found.	2016-12-12 12:37:12 +01:00
Masaru Hasegawa	3df2a086d4	Resolve index names in indices_boost This change allows specifying alias/wildcard expression in indices_boost. And added another format for specifying indices_boost. It accepts array of index name and boost pair. If an index is included in multiple aliases/wildcard expressions, the first match will be used. With new format, old format is marked as deprecated. Closes #4756	2016-12-11 21:41:49 +09:00
Simon Willnauer	01d67e09b9	Detach handshake from connect to node (#22037 ) Today we connect and publish the node's connection before we execute a handshake with the node we connect to. In the case of connecting to a node that won't pass the handshake this connection is already `published` and other code paths can use it. This commit detaches the connection and the publish of the connection such that `TransportService` can do a handshake before actually connecting and publishing the connection.	2016-12-10 10:03:26 +01:00
Ryan Ernst	b1cef5fdf8	Remove 2.0 prerelease version constants (#22004 ) * Remove 2.0 prerelease version constants This is a start to addressing #21887. This removes: * pre 2.0 snapshot format support * automatic units addition to cluster settings * bwc check for delete by query in pre 2.0 indexes	2016-12-08 21:48:35 -08:00
Nik Everett	e9bb8d8b38	Don't allow yaml tests with `warnings` that don't skip `warnings` (#21989 ) If you write a yaml test with a `warnings` section in a `do` block that doesn't also have a corresponding `skip` section for `warnings` then client test runners that don't support `warnings` will fail. This causes the elasticsearch build to fail so we catch these errors earlier. Related to #21811	2016-12-08 13:17:31 -05:00
Ali Beyad	e6e7bab58c	Prepares allocator decision objects for use with the allocation explain API (#21691 ) This commit enhances the allocator decision result objects (namely, AllocateUnassignedDecision, MoveDecision, and RebalanceDecision) to enable them to be used directly by the cluster allocation explain API. In particular, this commit does the following: - Adds serialization and toXContent methods to the response objects, which will form the explain API responses. - Moves the calculation of the final explanation to the response object itself, removing it from the responsibility of the allocators. - Adds shard store information to the NodeAllocationResult, so that store information is available for each node, when explaining a shard allocation by the PrimaryShardAllocator or the ReplicaShardAllocator. - Removes RebalanceDecision in favor of using MoveDecision for both moving and rebalancing shards. - Removes NodeRebalanceResult in favor of using NodeAllocationResult. - Changes the notion of weight ranking to be relative to the current node, instead of an absolute weight that doesn't convey any added value to the API user and can be confusing. - Introduces a new enum AllocationDecision to convey the decision type, which enables conveying unassigned, moving, and rebalancing scenarios with more detail as opposed to just Decision.Type and AllocationStatus.	2016-12-07 17:37:51 -05:00
Adrien Grand	c746854e03	Pre-built analysis factories do not implement MultiTermAware correctly. (#21981 ) We had tests for the regular factories, but not for the pre-built ones, that ship by default without requiring users to define them in the analysis settings.	2016-12-07 10:32:25 +01:00
Boaz Leskes	4519bdfeb0	InternalTestCluster shouldn't auto heal an active disruption when a new one is set Instead people should explicitly clear the existing one so it's clear what's going on.	2016-12-06 19:58:11 +01:00
Boaz Leskes	a7050b2d56	Remove `InternalTestCluster.startNode(s)Async` (#21846 ) Since the removal of local discovery in https://github.com/elastic/elasticsearch/pull/20960 we rely on minimum master nodes to be set in our test cluster. The setting is automatically managed by the cluster (by default) but current management doesn't work with concurrent single node async starting. On the other hand, with `MockZenPing` and the `discovery.initial_state_timeout` set to `0s` node starting and joining is very fast, making async starting an unneeded complexity. Tests that still need async starting could, in theory, still do so themselves via background threads. Note that this change also removes the usage of `INITIAL_STATE_TIMEOUT_SETTINGS` as the starting of nodes is done concurrently (but building them is sequential)	2016-12-06 12:06:15 +01:00
Nik Everett	2087234d74	Timeout improvements for rest client and reindex (#21741 ) Changes the default socket and connection timeouts for the rest client from 10 seconds to the more generous 30 seconds. Defaults reindex-from-remote to those timeouts and make the timeouts configurable like so: ``` POST _reindex { "source": { "remote": { "host": "http://otherhost:9200", "socket_timeout": "1m", "connect_timeout": "10s" }, "index": "source", "query": { "match": { "test": "data" } } }, "dest": { "index": "dest" } } ``` Closes #21707	2016-12-05 10:54:51 -05:00
Nik Everett	0c724b1878	Keep context during reindex's retries (#21941 ) * Keep context during reindex's retries This fixes reindex and friend's retries to keep the context. * Docs	2016-12-02 13:48:51 -05:00
Tanguy Leroux	fe95aef6a9	[TEST] Remove CompositeTestCluster and ExternalNode (#21933 ) They are not used anymore. Related #21915	2016-12-02 13:25:40 +01:00
Simon Willnauer	20177f6eee	[TEST] Add back ExternalTestCluster - downstream tests still use it	2016-12-02 10:54:27 +01:00
Simon Willnauer	adf9bd90a4	Remove legacy BWC test infrastructure and tests (#21915 ) We don't use the test infra nor do we run the tests. They might all be entirely out of date. We also have a different BWC test infra in-place. This change removes all of the legacy infra.	2016-12-02 08:06:20 +01:00
Simon Willnauer	6522538033	Add validation for supported index version on node join, restore, upgrade & open index (#21830 ) Today we can easily join a cluster that holds an index we don't support since we currently allow rolling upgrades from 5.x to 6.x. Along the same lines we don't check if we can support an index based on the nodes in the cluster when we open, restore or metadata-upgrade an index. This commit adds additional safety that fails cluster state validation, open, restore and/or upgrade if there is an open index with an incompatible index version created in the cluster. Relates to #21670	2016-12-01 15:40:35 +01:00
Simon Willnauer	155de53fe3	Add a connect timeout to the ConnectionProfile to allow per node connect timeouts (#21847 ) Timeouts are global today across all connections. This commit allows specifying a connection timeout per node such that, depending on the context, connections can be established with different timeouts. Relates to #19719	2016-12-01 15:39:49 +01:00
Boaz Leskes	087a85a4e7	always auto manage min master node in testTwoNodeCluster	2016-12-01 12:57:44 +01:00
Boaz Leskes	9097abee04	Add before and after logging for unit tests Currently we have these logs for integration tests only. This adds the following log at the start: ``` logger.info("[{}]: before test", getTestName()); ``` and this is logged at the end, but before any clean up done in sub classes ``` logger.info("[{}]: after test", getTestName()); ```	2016-12-01 12:56:37 +01:00
Luca Cavanna	103984a4a1	Remove indices query (#21837 ) The indices query is deprecated since 5.0.0 (#17710). It can now be removed in master (future 6.0 version).	2016-11-30 19:37:01 +01:00
Adrien Grand	34e682d3bc	Prevent testing on double values whose toString may use the scientific notation. This might break query parsers because the standard analyzer splits on punctuation.	2016-11-30 16:48:46 +01:00
Adrien Grand	6231009a8f	Remove 2.x backward compatibility of mappings. (#21670 ) For the record, I also had to remove the geo-hash cell and geo-distance range queries to make the code compile. These queries already throw an exception in all cases with 5.x indices, so that does not hurt any more. I also had to rename all 2.x bwc indices from `index-${version}` to `unsupported-${version}` to make `OldIndexBackwardCompatibilityIT` happy.	2016-11-30 13:34:46 +01:00
Boaz Leskes	be4074e13d	improve debug logging when node waits for initial cluster state And enabled debug logging in InternalTestClusterTests so we can see it.	2016-11-29 20:38:19 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Simon Willnauer	f5ff69fabe	Remove connectToNodeLight and replace it with a connection profile (#21799 ) The Transport#connectToNodeLight concept is confusing and not very flexible, nor is it really testable on a unit test level. This commit cleans up the code used to connect to nodes and simplifies transport implementations to share more code. This also allows connecting to nodes with custom profiles if needed, for instance future improvements can be added to connect to/from nodes that are non-data nodes without dedicated bulks and recovery connections.	2016-11-29 09:35:07 +01:00
Luca Cavanna	360b74eda8	[TEST] Don't reinitialize YamlTestClient and RestClient before each single test (#21807 ) In the past we ran yaml tests against an internal cluster, which would get restarted after each test failure, hence the client objects needed to eventually be refreshed before each test. That is why we had the initClient method to re-initialize the YamlTestClient in the execution context. We ended up though re-initializing the client unconditionally, which is not needed. Also, ESRestTestCase recreates the RestClient against the external cluster before each test, which is not needed given that nothing changes in the external cluster. This commit removes the initClient method from the yaml tests execution context. The YamlTestClient can be eagerly created before the first yaml test runs and then re-used in subsequent tests. Also api calls to check for nodes versions etc. are moved out of YamlTestClient to ESClientYamlSuiteTestCase. Also the RestClient is now initialized in ESRestTestCase before the first test runs, and kept around afterwards as a static member. Basically each subclass of EsRestTestCase will have its own RestClient instance, but the client will be shared across the different tests within the same class. The yaml test suite is just a special suite, composed of 600+ tests that are loaded from files, which will share the same client instance. This change should speed tests up as well, as we don't recreate the RestClient before each single test, and we don't call _cat/nodes either before each single test.	2016-11-28 18:43:27 +01:00
Simon Willnauer	b7292a6005	Remove TcpTransport#addressSupported since TransportAddress is now final TransportAddress used to be customizable per transport but this has been removed a while ago. Therefore we can remove all usage of this method as well. Relates to #20695	2016-11-28 16:06:59 +01:00
Yannick Welsch	8390648709	Minor clean-ups in MockBigArrays (#21822 ) Removes an unused static variable and an unused instance variable.	2016-11-28 14:09:26 +01:00
Yannick Welsch	7e198f0e41	Detect nodes being blocked by GC-disrupted node (#21797 ) The disruption type LongGCDisruption simulates GCs on a node by suspending all the threads of that node. If the suspended threads are in a code section with shared JVM locks, however, it can prevent the other nodes from doing their thing. The class LongGCDisruption has a list of class names for which we know that this can occur. Whenever a test using the GC disruption type fails in mysterious ways, it becomes a long guessing game to find the offending class. This commit adds code to LongGCDisruption to automatically detect these situations, fail the test early and report the offending class and all relevant context.	2016-11-28 11:24:25 +01:00
Simon Willnauer	41e9ed13d6	[TEST] Fix AbstractBytesReferenceTestCase#testSlice to not assert on offset	2016-11-24 15:31:36 +01:00
Jason Tedor	8416b16dfd	Improve handling of unreleased versions Today when handling unreleased versions for backwards compatibility support, we scatter version constants across the code base and add some asserts to support removing these constants when the version in question is actually released. This commit improves this situation, enabling us to just add a single unreleased version constant that can be renamed when the version is actually released. This should make maintenance of these versions simpler. Relates #21760	2016-11-23 15:49:05 -05:00
Ryan Ernst	6940b2b8c7	Remove groovy scripting language (#21607 ) * Scripting: Remove groovy scripting language Groovy was deprecated in 5.0. This change removes it, along with the legacy default language infrastructure in scripting.	2016-11-22 19:24:12 -08:00
Nik Everett	1791623700	Document `error_trace` The `error_trace` parameter turns on the `stack_trace` field in errors which returns stack traces. Removes documentation for `camelCase` because it hasn't worked in a while.... Documents the internal parameters used to render stack traces as internal only. Closes #21708	2016-11-22 19:16:07 -05:00
Simon Willnauer	a9a2753f0b	Add a HostFailureListener to notify client code if a node got disconnected (#21709 ) Today there is no way to get notified if a node is disconnected. Client code must poll the TransportClient constantly to detect that a node is not connected anymore in order to react and add new nodes or notify altering etc. For instance if a hostname gets resolved to an IP but that host is disconnected clients want to reconnect by resolving the hostname again which is a common situation in cloud environments. Closes #21424	2016-11-22 20:46:28 +01:00
Jason Tedor	9dc65037bc	Lazy resolve unicast hosts Today we eagerly resolve unicast hosts. This means that if DNS changes, we will never find the host at the new address. Moreover, a single host failing to resolve causes startup to abort. This commit introduces lazy resolution of unicast hosts. If a DNS entry changes, there is an opportunity for the host to be discovered. Note that under the Java security manager, there is a default positive cache of infinity for resolved hosts; this means that if a user does want to operate in an environment where DNS can change, they must adjust networkaddress.cache.ttl in their security policy. And if a host fails to resolve, we warn log the hostname but continue pinging other configured hosts. When doing DNS resolutions for unicast hostnames, we wait until the DNS lookups timeout. This appears to be forty-five seconds on modern JVMs, and it is not configurable. If we do these serially, the cluster can be blocked during ping for a lengthy period of time. This commit introduces doing the DNS lookups in parallel, and adds a user-configurable timeout for these lookups. Relates #21630	2016-11-22 14:17:04 -05:00
Areek Zillur	0ccf8a742d	Add support for merging custom meta data in tribe node (#21552 ) * Add support for merging custom meta data in tribe node Currently, when any underlying cluster has custom metadata (via plugin), tribe node does not store custom meta data in its cluster state. This is because the tribe node has no idea how to select the appropriate custom metadata from one or many custom metadata (corresponding to the number of underlying clusters). This change adds an interface that custom metadata implementations can extend to add support for merging multiple custom metadata of the same type for storing in the tribe state. Relates to #20544 Supersedes #20791 * Simplify updating tribe state * Add tests for merging multiple custom metadata types in tribe node * cleanup merging custom md logic in tribe service	2016-11-21 12:03:01 -05:00
Tanguy Leroux	e7b9e65fc3	Add checkstyle rule to forbid empty javadoc comments (#20881 ) This commit adds a RegexpMultiline check to checkstyle that yells when an empty Javadoc comment is found in Java files. Related #20871	2016-11-21 12:36:44 +01:00
Adrien Grand	6581b77198	Remove store throttling. (#21573 ) Store throttling has been disabled by default since Lucene added automatic throttling of merge operations based on the indexing rate.	2016-11-17 09:33:32 +01:00
Jason Tedor	d06a8903fd	Merge branch 'master' into feature/seq_no * master: (22 commits) Add proper toString() method to UpdateTask (#21582) Fix `InternalEngine#isThrottled` to not always return `false`. (#21592) add `ignore_missing` option to SplitProcessor (#20982) fix trace_match behavior for when there is only one grok pattern (#21413) Remove dead code from GetResponse.java Fixes date range query using epoch with timezone (#21542) Do not cache term queries. (#21566) Updated dynamic mapper section Docs: Clarify date_histogram bucket sizes for DST time zones Handle release of 5.0.1 Fix skip reason for stats API parameters test Reduce skip version for stats API parameter tests Strict level parsing for indices stats Remove cluster update task when task times out (#21578) [DOCS] Mention "all-fields" mode doesn't search across nested documents InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted Fixed bad asciidoc in boolean mapping docs Fixed bad asciidoc ID in node stats Be strict when parsing values searching for booleans (#21555) Fix time zone rounding edge case for DST overlaps ...	2016-11-16 09:10:35 -05:00
Adrien Grand	00de8e07fc	Do not cache term queries. (#21566 ) There have been reports that the query cache did not manage to speed up search requests when the query includes a large number of different sub queries since a single request may manage to exhaust the whole history (256 queries) while the query cache only starts caching queries once they appear multiple times in the history (#16031). On the other hand, increasing the size of the query cache is a bit controversial (#20116) so this pull request proposes a different approach that consists of never caching term queries, and not adding them to the history of queries either. The reasoning is that these queries should be fast anyway, regardless of caching, so taking them out of the equation should not cause any slow down. On the other hand, the fact that they are not added to the cache history anymore means that other queries have greater chances of being cached.	2016-11-16 10:02:24 +01:00
Boaz Leskes	d99d02ecc3	InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted This is to deal with potential delays in processing the fact that the node was restarted.	2016-11-15 17:58:08 +00:00
Boaz Leskes	9171407906	remove an unneeded assert busy	2016-11-15 17:36:06 +00:00
Boaz Leskes	2c0338fa87	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 17:09:08 +00:00
Boaz Leskes	d6c2b4f7c5	Adapt InternalTestCluster to auto adjust `minimum_master_nodes` (#21458 ) #20960 removed `LocalDiscovery` and we now use `ZenDiscovery` in all our tests. To keep cluster forming fast, we are using a `MockZenPing` implementation which uses static maps to return instant results making master election fast. Currently, we don't set `minimum_master_nodes` causing the occasional split brain when starting multiple nodes concurrently and their pinging is so fast that it misses the fact that one of the nodes has elected itself master. To solve this, `InternalTestCluster` is modified to behave like a true cluster and manage and set `minimum_master_nodes` correctly with every change to the number of nodes. Tests that want to manage the settings themselves can opt out using a new `autoMinMasterNodes` parameter to the `ClusterScope` annotation. Having `min_master_nodes` set means the started node may need to wait for other nodes to be started as well. To combat this, we set `discovery.initial_state_timeout` to `0` and wait for the cluster to form once all nodes have been started. Also, because a node may wait and ping while other nodes are started, `MockZenPing` is adapted to wait rather than busy-ping.	2016-11-15 13:42:26 +00:00
Boaz Leskes	c9f49039d3	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 10:14:47 +00:00
Ryan Ernst	c7bd4f3454	Tests: Add TestZenDiscovery and replace uses of MockZenPing with it (#21488 ) This change adds a test discovery (which internally uses the existing mock zenping by default). Having the mock the test framework selects be a discovery greatly simplifies discovery setup (no more weird callback to a Node method).	2016-11-14 21:46:10 -08:00
Ryan Ernst	d14c470b89	Remove generics from ActionRequest closes #21368	2016-11-14 15:32:01 -08:00
Jason Tedor	491a945ac8	Add socket permissions for tribe nodes Today when a node starts, we create dynamic socket permissions based on the configured HTTP ports and transport ports. If no ports are configured, we use the default port ranges. When a tribe node starts, a tribe node creates an internal node client for connecting to each remote cluster. If neither an explicit HTTP port nor transport ports were specified, the default port ranges are large enough for the tribe node and its internal node clients. If an explicit HTTP port or transport port was specified for the tribe node, then socket permissions for those ports will be created, but not for the internal node clients. Whether the internal node clients have explicit ports specified, or attempt to bind within the default range, socket permissions for these will not have been created and the internal node clients will hit a permissions issue when attempting to bind. This commit addresses this issue by also accounting for tribe nodes when creating the dynamic socket permissions. Additionally, we add our first real integration test for tribe nodes.	2016-11-14 11:58:44 -05:00
Simon Willnauer	bdc942fa72	Enable 5.x to 6.x BWC tests This commit enables real BWC testing against a 5.1 snapshot. All REST tests plus rolling upgrade test now run against a mixed version cross major version cluster.	2016-11-14 14:26:49 +01:00
Jason Tedor	c7a1b3eb50	Merge branch 'master' into feature/seq_no * master: Hack around cluster service and logging race Do not prematurely shutdown Log4j Support decimal constants with trailing [dD] in painless (#21412) In painless suggest a long constant if int won't do (#21415) Account for different paths for sysctl utilities [TEST] testRebalancePossible() may not have an assigned node id Tests: Disable merge in SearchCancellationTests Tests: clean search scroll at the end of SearchCancellationIT	2016-11-13 20:01:44 -05:00
Jason Tedor	d273419d00	Do not prematurely shutdown Log4j When a node closes, we shutdown logging as the last statement. This statement must be last lest any subsequent attempts to log will blow up by running into security permissions. Yet, in the case of a tribe node this isn't enough. The first internal tribe node to close will shutdown logging, and subsequent node closes will blow up with the aforementioned problem. This commit migrates the Log4j shutdown to occur as part of the shutdown hook that closes the node, after all nodes have closed. Consequently, we can remove a hack in the test infrastructure to prevent Log4j shutdowns when internal test nodes close and instead just register a single shutdown hook that runs when the test JVM exits. Relates #21519	2016-11-13 17:27:30 -05:00
Jason Tedor	1e7c424479	Merge branch 'master' into feature/seq_no * master: ShardActiveResponseHandler shouldn't hold to an entire cluster state Ensures cleanup of temporary index-* generational blobs during snapshotting (#21469) Remove (again) test uses of onModule (#21414) [TEST] Add assertBusy when checking for pending operation counter after tests Revert "Add trace logging when aquiring and releasing operation locks for replication requests" Allows multiple patterns to be specified for index templates (#21009) [TEST] fixes rebalance single shard check as it isn't guaranteed that a rebalance makes sense and the method only tests if rebalance is allowed Document _reindex with random_score	2016-11-11 11:25:27 -05:00
Jason Tedor	d3417fb022	Merge branch 'master' into feature/seq_no * master: (516 commits) Avoid angering Log4j in TransportNodesActionTests Add trace logging when aquiring and releasing operation locks for replication requests Fix handler name on message not fully read Remove accidental import. Improve log message in TransportNodesAction Clean up of Script. Update Joda Time to version 2.9.5 (#21468) Remove unused ClusterService dependency from SearchPhaseController (#21421) Remove max_local_storage_nodes from elasticsearch.yml (#21467) Wait for all reindex subtasks before rethrottling Correcting a typo-Maan to Man-in README.textile (#21466) Fix InternalSearchHit#hasSource to return the proper boolean value (#21441) Replace all index date-math examples with the URI encoded form Fix typos (#21456) Adapt ES_JVM_OPTIONS packaging test to ubuntu-1204 Add null check in InternalSearchHit#sourceRef to prevent NPE (#21431) Add VirtualBox version check (#21370) Export ES_JVM_OPTIONS for SysV init Skip reindex rethrottle tests with workers Make forbidden APIs be quieter about classpath warnings (#21443) ...	2016-11-10 23:40:33 -05:00
Ryan Ernst	48bfb142b9	Remove (again) test uses of onModule (#21414 ) This change was reverted after it caused random test failures. This was due to a copy/paste error in the original PR which caused the mock version of ClusterInfoService to be used whenever the mock ZenPing was used, and the real ClusterInfoService to be used when MockZenPing was not used.	2016-11-10 16:06:14 -08:00
Areek Zillur	7ed195fe93	[TEST] Add assertBusy when checking for pending operation counter after tests Currently, pending operations can complete after tests with disruption scheme completes. This commit waits for the pending operation counter to complete after the tests are run	2016-11-10 18:35:52 -05:00
Alexander Lin	0219a211d3	Allows multiple patterns to be specified for index templates (#21009 ) * Allows for an array of index template patterns to be provided to an index template, and rename the field from 'template' to 'index_pattern'. Closes #20690	2016-11-10 18:00:30 -05:00
javanna	2f32c1173b	Revert "Tests: Remove a couple test uses of onModule (#21414 )" This reverts commit `b326f0bc51`.	2016-11-09 11:32:16 +01:00
Ryan Ernst	b326f0bc51	Tests: Remove a couple test uses of onModule (#21414 ) There were still a couple test use cases and examples that were using onModule. This change cleans those cases up.	2016-11-08 13:50:13 -08:00
Nik Everett	b7531984a9	Ignore IAE when checking for version serialization This allows us to throw IllegalArgumentException from serialization code when the destination node can't support the request.	2016-11-08 11:36:12 -05:00
Yannick Welsch	cd34eed03e	Make ensureGreen and ensureYellow wait for cluster size consistency (#21344 ) We currently often use ensureGreen or ensureYellow to check whether the cluster is in a good state again after shutting down a node. With the change in #21092, however, it can happen that if the node that is stopped is the master node, another node will become master and publish a cluster state where it is master but where the node that was stopped hasn't been removed yet from the cluster state. It will only publish a second state thereafter where the old master is removed. If the ensureGreen/ensureYellow is timed just right, it will get to execute before the second cluster state update removing the old master and the condition ensureGreen / ensureYellow might not hold at that point anymore.	2016-11-08 11:07:54 +01:00
Ryan Ernst	7a2c984bcc	Test: Remove multi process support from rest test runner (#21391 ) At one point in the past when moving out the rest tests from core to their own subproject, we had multiple test classes which evenly split up the tests to run. However, we simplified this and went back to a single test runner to have better reproducibility in tests. This change removes the remnants of that multiplexing support.	2016-11-07 15:07:34 -08:00
Nik Everett	a13a050271	Add automatic parallelization support to reindex and friends (#20767 ) Adds support for `?slices=N` to reindex which automatically parallelizes the process using parallel scrolls on `_uid`. Performance testing sees a 3x performance improvement for simple docs on decent hardware, maybe 30% performance improvement for more complex docs. Still compelling, especially because clusters should be able to get closer to the 3x than the 30% number. Closes #20624	2016-11-04 20:59:15 -04:00
Jason Tedor	f16c308efd	Assert status logger does not warn on Log4j usage Today if you start Elasticsearch with the status logger configured to the warn level, or use a transport client with the default status logger level, you will see warn messages about deprecation loggers being created with different message factories and that formatting might be broken. This happens because the deprecation logger is constructed using the message factory from its parent, an artifact leftover from the first Log4j 2 implementation that used a custom message factory. When that custom message factory was removed, this constructor invocation should have been changed to not explicitly use the message factory from the parent. This commit fixes this invocation. However, we also add some status checking to all tests to ensure that there are no warn status log messages that might indicate a configuration problem with Log4j 2. These assertions blow up badly without the fix for the deprecation logger construction, and also caught a misconfiguration in one of the logging tests. Relates #21339	2016-11-04 14:19:59 -04:00
Nik Everett	8943421494	Only log rest connection setup once per suite (#21280 ) This is a bit funky to do with junit because we need per test state but we only want to log it per suite. So we use a static flag that we test per test and reset before every suite.	2016-11-03 21:47:11 -04:00
Yannick Welsch	39f4229594	Add information about in-flight requests when checking IndexShard operation counter (#21308 ) Our test infrastructure checks after running each test that there are no more in-flight requests on the shard level. Whenever the check fails, we only know that there were in-flight requests but don't know what requests were causing this issue. This commit adds the replication tasks that are still active at that moment to the assertion error.	2016-11-03 18:36:07 +01:00
Ryan Ernst	dc6ed7b8d4	Remove pluggability of ZenPing (#21049 ) Plugins: Remove pluggability of ZenPing ZenPing is the part of zen discovery which knows how to ping nodes. There is only one alternative implementation, which is just for testing. This change removes the ability to add custom zen pings, and instead hooks in the MockZenPing for tests through an overridden method in MockNode. This also folds in the ZenPingService (which was really just a single method) into ZenDiscovery, and removes the idea of having multiple ZenPing instances. Finally, this was the last usage of the ExtensionPoint classes, so that is also removed here.	2016-11-03 08:20:20 -07:00
Boaz Leskes	be1772b70d	pending states assertion should dump states This was removed in a cleanup assuming that Hamcrest will dump the array content. Sadly it only dumps the size.	2016-11-03 09:02:29 +01:00
Christoph Büscher	b3370de715	Tests: Add warning header checks to QueryBuilder tests and QueryParseContextTests This adds checks for expected warning headers to the query builder test infrastructure. Tests that are adding deprecation warnings to the response headers need to check those, otherwise the abstract base class for the test class will complain at teardown.	2016-11-02 15:45:33 +01:00
Yannick Welsch	6930a4846c	[TEST] Check static test state after suite scoped cluster is shut down (#21256 ) Checks on static test state are run by an @After method in ESTestCase. Suite-scoped tests in ESIntegTestCase only shut down in an @AfterClass method, which executes after the @After method in ESTestCase. The suite-scoped cluster can thus still execute actions that will violate the checks in @After without those being caught. A subsequent test executing within the same JVM will fail these checks however when @After gets called for that test. This commit adds an explicit call to check the static test state after the suite-scoped cluster has been shut down.	2016-11-02 15:00:16 +01:00
Boaz Leskes	0daf483587	Change ClusterState and PendingClusterTasksResponse's toString() to their prettyPrint format (#21245 ) The current XContent output is much harder to read than the prettyPrint format. This commit folds prettyPrint into toString and removes it.	2016-11-02 13:43:39 +01:00
Simon Willnauer	cf1457ed22	Allow skip test by version OR feature (#21240 ) Today these two are considered mutual exclusive but they are not in practice. For instance a mixed version cluster might not return a given warning depending on which node we talk to but on the other hand some runners might not even support warnings at all so the test might be skipped either by version or by feature.	2016-11-02 12:24:20 +01:00
Adrien Grand	aa6cd93e0f	Require arguments for QueryShardContext creation. (#21196 ) The `IndexService#newQueryShardContext()` method creates a QueryShardContext on shard `0`, with a `null` reader and that uses `System.currentTimeMillis()` to resolve `now`. This may hide bugs, since the shard id is sometimes used for query parsing (it is used to salt random score generation in `function_score`), passing a `null` reader disables query rewriting and for some use-cases, it is simply not ok to rely on the current timestamp (eg. percolation). So this pull request removes this method and instead requires that all call sites provide these parameters explicitly.	2016-11-02 09:48:49 +01:00
Simon Willnauer	2ba4dadea0	[TEST] fix extrasFS file filtering in OldIndexUtils	2016-11-02 09:38:51 +01:00
Simon Willnauer	4db1ac931f	Fix InternalEngineTests#testUpgradeOldIndex for 5.0.0 BWC indices Relates to #21147	2016-11-02 09:38:44 +01:00
Jason Tedor	7751049c14	Add version for 5.0.0 This commit adds the version constant for 5.0.0. Relates #21244	2016-11-01 14:09:00 -04:00
Boaz Leskes	523f7ea71e	Fix a race condition in MockTransportService#addUnresponsiveRule where a request can be delayed even if the rule was removed. Relates to #21129 Also properly reset DiscoveryWithServiceDisruptionsIT#disableBeforeIndexDeletion	2016-11-01 14:08:18 +01:00
Boaz Leskes	ef192ff2cf	ESIntegTestCase.java: use ClusterState.prettyPrint for pending ClusterState assertions	2016-11-01 12:54:20 +01:00
Yannick Welsch	d7d5909e69	Disconnect from newly added nodes if cluster state publishing fails (#21197 ) Before publishing a cluster state the master connects to the nodes that are added in the cluster state. When publishing fails, however, it does not disconnect from these nodes, leaving NodeConnectionsService out of sync with the currently applied cluster state.	2016-10-31 15:09:43 +01:00
Simon Willnauer	9598616dfe	Fallback to '/' info call to fetch cluster version The `_cat/nodes` API might not be available in all clusters for instance if they have authorization enabled. This change falls back to the previously used method of using the '/' endpoint to fetch the nodes version, this is best effort and will emit a warning.	2016-10-28 16:22:53 +02:00
Adrien Grand	b3cc54cf0d	Upgrade to lucene-6.3.0-snapshot-ed102d6 (#21150 ) Lucene 6.3 is expected to be released in the next weeks so it'd be good to give it some integration testing. I had to upgrade randomized-testing too so that both Lucene and Elasticsearch are on the same version.	2016-10-28 14:47:15 +02:00
Simon Willnauer	43dbf9c7b6	Use all available hosts in REST tests and allow for real master election (#21161 ) Today we only use a single node to send requests to when we run REST tests. In some cases we have more than one node (ie. in the BWC case) where we should send requests to all nodes in a round-robin fashion. This change passes all available node endpoints to the rest test. Additionally, this change adds the setting of `discovery.zen.minimum_master_nodes` to the cluster formation forcing the nodes to wait for all other nodes until the cluster is formed. This allows for a more realistic master election and allows all master eligible nodes to become master while before always the first node in the cluster became the master. This also adds logging to each test run to log the master node's version and the minimum node version in the cluster to help debugging BWC test failures.	2016-10-28 12:18:47 +02:00
Simon Willnauer	97cc426a89	Fix bwc cluster formation in order to run BWC tests against a mixed version cluster (#21145 ) This fixes our cluster formation task to run REST tests against a mixed version cluster. Yet, due to some limitations in our test framework `indices.rollover` tests are currently disabled for the BWC case since they select the current master as the merge node which happens to be a BWC node and we can't relocate all shards to it since the primaries are on a higher version node. This will be fixed in a followup. Closes #21142 Note: This has been cherry-picked from 5.0 and fixes several rest tests as well as a BWC break in `OsStats.java`	2016-10-27 17:03:53 +02:00
Yannick Welsch	f3e578f942	Stop delaying existing requests after network delay rule is cleared (#21129 ) The network disruption type "network delay" continues delaying existing requests even after the disruption has been cleared. This commit ensures that the requests get to execute right after the delay rule is cleared.	2016-10-27 13:48:17 +02:00
Jason Tedor	9c3e4d6e22	Add correct Content-Length on HEAD requests This commit fixes responses to HEAD requests so that the value of the Content-Length is correct per the HTTP spec. Namely, the value of this header should be equal to the Content-Length if the request were not a HEAD request. This commit also fixes a memory leak on HEAD requests to the main action that arose from the bytes on a builder not being released due to them being dropped on the floor to ensure that the response to the main action did not have a body. Relates #21123	2016-10-25 23:08:19 -04:00
Igor Motov	17ad88d539	Makes search action cancelable by task management API Long running searches now can be cancelled using standard task cancellation mechanism.	2016-10-25 12:27:34 -10:00
Christoph Büscher	f6f129b21f	Consolidate code for equals/hashCode testing in central utility class Currently tests that check that equals() and hashCode() are working as expected for classes implementing them are quite similar. This change moves common assertions in these tests to a common utility class. In addition, another common utility function in most of these test classes that creates copies of input objects by running them through a StreamOutput and reading them back in, is moved to ESTestCase so it can be shared across all these classes. Closes #20629	2016-10-24 15:50:40 +02:00
Simon Willnauer	0a410d3916	Pass executor name to request interceptor to support async intercept calls (#21089 ) Today the request interceptor can't support async calls since the response of the async call would execute on a different thread ie. a client or listener thread. This in turn means that the intercepted handler is not executed with the thread it was supposed to run on and can therefore, if it's executing blocking operations, potentially deadlock an entire server.	2016-10-24 13:57:07 +02:00
Ryan Ernst	53cff0f00f	Move all zen discovery classes into o.e.discovery.zen (#21032 ) * Move all zen discovery classes into o.e.discovery.zen This collapses sub packages of zen into zen. These all had just a couple classes each, and there is really no reason to have the subpackages. * fix checkstyle	2016-10-20 00:44:48 -07:00
javanna	c92b550df2	[TEST] Remove create special case in yaml test client Now that the create api has its own spec, we can remove the special case in the yaml test client for it Relates to #20924	2016-10-20 08:48:15 +02:00
Boaz Leskes	c3987156ab	Remove local discovery in favor of a simpler `MockZenPings` (#20960 ) `LocalDiscovery` is a discovery implementation that uses static in memory maps to keep track of current live nodes. This is used extensively in our tests in order to speed up cluster formation (i.e., shortcut the 3 second ping period used by `ZenDiscovery` by default). This is sad as that means that most of the tests run using different discovery semantics than what is used in production. Instead of replacing the entire discovery logic, we can use a similar approach to only shortcut the pinging components.	2016-10-18 21:12:15 +02:00
Boaz Leskes	eaa105951f	Simplify GlobalCheckpointService and properly hook it for cluster state updates (#20720 ) During a recent merge from master, we lost the bridge from IndicesClusterStateService to the GlobalCheckpointService of primary shards, notifying them of changes to the current set of active/initializing shards. This commit adds the bridge back (with unit tests). It also simplifies the GlobalCheckpoint tracking to use a simpler model (which makes use of the fact that the global checkpoint sync is done periodically). The old integration CheckpointIT test is moved to IndexLevelReplicationTests. I also added similar assertions to RelocationsIT, which surfaced a bug in the primary relocation logic and how it plays with global checkpoint updates. The test is currently await-fixed and will be fixed in a follow up issue.	2016-10-17 16:33:03 +02:00
Tanguy Leroux	1755cc08f3	REST API parser should fail on duplicate params/paths/methods/parts (#20940 ) This commit changes the current REST API parser to make it fail and throw an exception when a REST specification file contains a duplicated parameter, path, method, or path part.	2016-10-17 09:19:07 +02:00
Simon Willnauer	5137f44bd6	[TEST] return empty array if AbstractQueryTestCase#currentTypes is null This is important to allow any test to use RandomQueryBuilder#createQuery() since some of the query builders that are used in this test check the length of the types array and otherwise will throw an NPE if the test is not a subclass of AbstractQueryTestCase.	2016-10-15 14:46:54 +02:00
Boaz Leskes	bc8ad8de5a	MockBigArrays should tell you who originally released them	2016-10-12 13:03:40 +02:00
Tanguy Leroux	44ac5d057a	Remove empty javadoc (#20871 ) This commit removes as many empty javadoc comments as my regexp has found	2016-10-12 10:27:09 +02:00
Adrien Grand	1914df7b5f	Do not cache script queries. (#20799 ) The cache relies on the equals() method so we just need to make sure script queries can never be equals, even to themselves in the case that a weight is used to produce a Scorer on the same segment multiple times. Closes #20763	2016-10-11 09:17:21 +02:00
Simon Willnauer	4fd1276542	Prevent AbstractArrays from releasing bytes more than once (#20819 ) Today we throw an assertion error if we release an AbstractArray more than once. Yet, it's recommended to implement close methods such that they can be invoked more than once. Guaranteed single release calls are hard to implement and some situations might not be tested causing for instance `CircuitBreaker` to operate on corrupted memory stats.	2016-10-10 17:30:37 +02:00
javanna	e154e6a758	[TEST] reformatted comment in query tests	2016-10-10 10:53:17 +02:00
Nik Everett	cf4038b668	DeGuice some of IndicesModule UpdateHelper, MetaDataIndexUpgradeService, and some recovery stuff. Move ClusterSettings to nullable ctor parameter of TransportService so it isn't forgotten.	2016-10-07 11:14:38 -04:00
Simon Willnauer	7452028e50	Simplify TransportAddress (#20798 ) since TransportAddress is now final we can simplify its interface a bit and remove methods that are only used in tests or are plain delegates.	2016-10-07 15:56:54 +02:00
Simon Willnauer	194a6b1df0	Remove LocalTransport in favor of MockTcpTransport (#20695 ) This change proposes the removal of all non-tcp transport implementations. The mock transport can be used by default to run tests instead of local transport that has roughly the same performance compared to TCP or at least not noticeably slower. This is a master only change, deprecation notice in 5.x will be committed as a separate change.	2016-10-07 11:27:47 +02:00
Simon Willnauer	9c9afe3f01	Remove SearchContext#current and all its threadlocals (#20778 ) Today SearchContext exposes the current context as a thread local which makes any kind of sane interface design very very hard. This PR removes the thread local entirely and instead passes the relevant context anywhere needed. This simplifies state management dramatically and will allow for a much leaner SearchContext interface down the road.	2016-10-06 19:51:54 +02:00
Colin Goodheart-Smithe	40f8f281e0	Merge branch 'master' into dont_cache_scripts	2016-10-06 09:09:23 +01:00
Colin Goodheart-Smithe	ce6f6d3835	Review comments	2016-10-06 08:55:31 +01:00
Simon Willnauer	134b1f9b4d	Prevent thread suspension when inside SecurityManager (#20770 ) LongGCDisruption suspends and resumes node threads but respects several `unsafe` class name patterns where it's unsafe to suspend. For instance log4j uses a global lock so we can't suspend a thread that is currently calling into log4j. The same is true for the security manager: like log4j, it is a shared resource between the test and the node that is _suspended_. This change adds `java.lang.SecurityManager` to the unsafe patterns. This prevents the test framework from deadlocking if a node's thread is suspended while it's calling into the security manager, which uses synchronized maps etc.	2016-10-05 21:40:27 +02:00
Simon Willnauer	e556c289b9	use a private rewrite context to prevent exposing isCachable	2016-10-05 11:41:49 +02:00
Simon Willnauer	7ba22bb75b	fix random score function builder to deal with empty seeds	2016-10-05 10:45:24 +02:00
Simon Willnauer	587bdcef38	add extra safety when accessing scripts or now and requests are cached	2016-10-05 09:41:48 +02:00
Simon Willnauer	94b7873b49	Add a #markAsNotCachable() method to context to mark requests as not cachable	2016-10-04 18:05:00 +02:00
Simon Willnauer	56f35baf47	Add date-math support to `_rollover` (#20709 ) Today it's not possible to use date-math efficiently with the `_rollover` API. This change adds support for date-math in the target index as well as support for preserving the math logic: when an existing index was created with a date math expression, all subsequent indices are created with the same expression.	2016-10-03 16:52:33 +02:00
Boaz Leskes	27eab74510	merge from master	2016-09-30 17:19:30 +02:00
Jason Tedor	3a4ffd7b86	Fix failing logging listener tests The logging listener tests started failing after `953a8a959b` when the tests are run with tests.es.logger.level set to any level other than debug. This is because these tests were based around the assumption that the default logging level was info, which was the case before that commit fixed setting the default logging level via that system property. This commit fixes these failing tests by adjusting this assumption to account for the fact that the default logging level could be different.	2016-09-30 08:09:35 +02:00
Boaz Leskes	a16d644c68	allow settings logging level via a sys config in unit tests Pipe in the `tests.es.logger.level` system property to the log4j config file used in tests. We still default to info. Also adapts the logger name to use the first letter of packages.	2016-09-29 03:04:43 +02:00
Jason Tedor	0808611184	Fix failing tests after merge This commit fixes failing tests in feature/seq_no after merging master in.	2016-09-29 03:04:37 +02:00
Boaz Leskes	953a8a959b	allow settings logging level via a sys config in unit tests Pipe in the `tests.es.logger.level` system property to the log4j config file used in tests. We still default to info. Also adapts the logger name to use the first letter of packages.	2016-09-29 01:33:13 +02:00
Jason Tedor	25fd9e26c4	Merge branch 'master' into feature/seq_no * master: (1199 commits) [DOCS] Remove non-valid link to mapping migration document Revert "Default `include_in_all` for numeric-like types to false" test: add a test with ipv6 address docs: clearify that both ip4 and ip6 addresses are supported Include complex settings in settings requests Add production warning for pre-release builds Clean up confusing error message on unhandled endpoint [TEST] Increase logging level in testDelayShards() change health from string to enum (#20661) Provide error message when plugin id is missing Document that sliced scroll works for reindex Make reindex-from-remote ignore unknown fields Remove NoopGatewayAllocator in favor of a more realistic mock (#20637) Remove Marvel character reference from guide Fix documentation for setting Java I/O temp dir Update client benchmarks to log4j2 Changes the API of GatewayAllocator#applyStartedShards and (#20642) Removes FailedRerouteAllocation and StartedRerouteAllocation IndexRoutingTable.initializeEmpty shouldn't override supplied primary RecoverySource (#20638) Smoke tester: Adjust to latest changes (#20611) ...	2016-09-29 00:22:31 +02:00
Jason Tedor	3c8ff45917	Add production warning for pre-release builds This commit adds a usage warning when Elasticsearch is started with a pre-release build. Relates #20674	2016-09-27 20:13:12 -04:00
Boaz Leskes	ee76c1a5c9	Remove NoopGatewayAllocator in favor of a more realistic mock (#20637 ) Many of our unit tests instantiate an `AllocationService`, which requires having a `GatewayAllocator`. Today almost all of our tests use a class called `NoopGatewayAllocator` which does nothing, effectively leaving all shard assignments to the balanced allocator. This is sad as it means we test a system that behaves differently than our production logic in very basic things. For example, a started primary that is lost will be assigned to a node that didn't use to have it. This PR removes `NoopGatewayAllocator` in favor of a new `TestGatewayAllocator` that inherits the standard `GatewayAllocator` and overrides shard information fetching to return information based on historical assignments the allocator has done. The only exception is `BalanceConfigurationTests` which does test only the balancer and I opted to not have it work around the `GatewayAllocator` being in its way.	2016-09-25 20:15:30 +02:00
Ali Beyad	ac1b13dde7	Changes the API of GatewayAllocator#applyStartedShards and (#20642 ) Changes the API of GatewayAllocator#applyStartedShards and GatewayAllocator#applyFailedShards to take both a RoutingAllocation and a list of shards to apply. This allows better mock allocators to be created as being done in #20637. Closes #20642	2016-09-23 09:31:46 -04:00
Ali Beyad	029fc909b5	Removes FailedRerouteAllocation and StartedRerouteAllocation Removes the FailedRerouteAllocation class and StartedRerouteAllocation class, as they were just wrappers for RerouteAllocation that stored started and failed shards, but these started and failed shards can be passed in directly to the methods that needed them, removing the need for this wrapper class and extra level of indirection. Closes #20626	2016-09-23 09:02:36 -04:00
Simon Willnauer	fe1803c957	Remove AnalysisService and reduce it to a simple name to analyzer mapping (#20627 ) Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitive. We fixed this in #19827 and can now get rid of the AnalysisService entirely and replace it with a simple map like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes #19828	2016-09-23 08:53:50 +02:00
Simon Willnauer	0151974500	`_flush` should block by default (#20597 ) This commit changes the default behavior of `_flush` to block if other flushes are ongoing. This also removes the use of `FlushNotAllowedException` and instead simply return immediately by skipping the flush. Users should be aware if they set this option that the flush might or might not flush everything to disk ie. no transactional behavior of some sort. Closes #20569	2016-09-21 14:20:24 +02:00
Tanguy Leroux	7645abaad9	Remove duplicate methods in ByteSizeValue (#20560 ) This commit removes `ByteSizeValue`'s methods that are duplicated (ex: `mbFrac()` and `getMbFrac()`) in order to only keep the `getN` form. It also renames `mb()` -> `getMb()`, `kb()` -> `getKB()` in order to be more coherent with the `ByteSizeUnit` method names.	2016-09-20 14:07:23 +02:00
Ali Beyad	50584c4103	Merge pull request #20532 from rjernst/rolling_upgrades This PR introduces backward compatibility index tests to test the rolling upgrade process amongst Elasticsearch instances within the same major version. The test executes in three phases. In the first phase, we form a cluster of 2 ES instances on an old version. In the second phase, we keep one of the nodes from the old cluster, kill the other node, but preserve its data directory and start an instance of the current version of ES using the same data directory as the killed instance. In the third phase, we kill the other old version ES instance from the first phase and launch a new instance, using the same data directory as the killed instance. Therefore, during phase 3, we have fully migrated and have all current versions of ES running. In each phase, we run REST tests that index documents and search them, ensuring at each stage that the documents from the previous phase are still there. Note that because we haven't released a GA yet of 5.0, the tests currently don't start an old version cluster in the first phase. Once GA is released, this will be changed to make the backward compatibility version 5.0, while the current version in the cluster will be 5.x.	2016-09-19 16:14:38 -04:00
Simon Willnauer	ee8d14798f	Unguice Transport and friends (#20526 ) This change removes all guice interaction from Transport, HttpServerTransport, HttpServer and TransportService. All these classes as well as their subclasses or extended version configured via plugins are now created by using plain old bloody java constructors. YAY!	2016-09-19 22:10:47 +02:00
Boaz Leskes	2ee9ab25d9	Remove `RoutingAllocation.Result` (#20538 ) Currently all the reroute-like methods of `AllocationService` return a result object of type `RoutingAllocation.Result`. The result object contains the new `RoutingTable` and `MetaData` plus an indication whether those were changed. The caller is then responsible for updating a cluster state with these. This means that things can easily go wrong and one can take one of these but not the other, causing inconsistencies. We already have a utility method on the `ClusterState` builder that does this but no one forces you to use it. Also 99% of the callers do the same thing: i.e., check if the result was changed and if so update the very same cluster state that was passed to `AllocationService`. This PR folds this pattern into `AllocationService` and changes almost all its methods to return a new cluster state (potentially the original one). This saves some 500 lines of code. The one exception here is the reroute API which executes allocation commands and potentially returns an explanation as well (next to the routing table and metadata). That API now returns a `CommandsResult` object which encapsulates a cluster state and the explanation.	2016-09-19 13:54:35 +02:00
Ali Beyad	98230d035a	Adds a preserveIndicesUponCompletion method to ESRestTestCase that can be overridden by subclasses if the test must not delete indices it created after exiting.	2016-09-16 19:21:26 -04:00
Ali Beyad	ce86ed1fdd	Merge remote-tracking branch 'upstream/master' into rolling_upgrades	2016-09-16 10:43:38 -04:00
Simon Willnauer	f5daa165f1	Remove ability to plug-in TransportService (#20505 ) TransportService is such a central part of the core server, replacing its implementation is risky and can cause serious issues. This change removes the ability to plug in TransportService but allows registering a TransportInterceptor that enables plugins to intercept requests on both the sender and the receiver ends. This is a commonly used and overwritten functionality but encapsulates the custom code in a contained manner.	2016-09-16 09:47:53 +02:00
Boaz Leskes	577dcb3237	Add current cluster state version to zen pings and use them in master election (#20384 ) During a networking partition, cluster state updates (like mapping changes or shard assignments) are committed if a majority of the master nodes received the update correctly. This means that the current master has access to enough nodes in the cluster to continue to operate correctly. When the network partition heals, the isolated nodes catch up with the current state and get the changes they couldn't receive before. However, if a second partition happens while the cluster is still recovering from the previous one and the old master is put in the minority side, it may be that a new master is elected which did not yet catch up. If that happens, cluster state updates can be lost. This commit fixes 95% of this rare problem by adding the current cluster state version to `PingResponse` and using it when deciding which master to join (and thus casting the node's vote). Note: this doesn't fully mitigate the problem as a cluster state update which is issued concurrently with a network partition can be lost if the partition prevents the commit message (part of the two-phase commit of cluster state updates) from reaching any single node in the majority side and the partition does allow for the master to acknowledge the change. We are working on a more comprehensive fix but that requires considerable work and is targeted at 6.0.	2016-09-15 23:39:11 +02:00
Nik Everett	d0be96df7b	Clean up snapshots after each REST test The only repository we can be sure is safe to clean is `fs` so we clean any snapshots in those repositories after each test. Other repositories like url and azure tend to throw exceptions rather than let us fetch their contents during the REST test. So we clean what we can.... Closes #18159	2016-09-15 14:49:11 -04:00
Boaz Leskes	8469c98e34	Fix LongGCDisruption to be aware of log4j2 (#20348 ) LongGCDisruption simulates a Long GC by suspending all threads belonging to a node. That's fine, unless those threads hold shared locks that can prevent other nodes from running. Concretely the logging infrastructure, which is shared between the nodes, can cause some deadlocks. LongGCDisruption has protection for this, but it needs to be updated to point at log4j2 classes, introduced in #20235 This commit also fixes improper handling of retry logic in LongGCDisruption and adds a protection against deadlocking the test code which activates the disruption (and uses logging too! :)). On top of that we have some new, evil and nasty tests.	2016-09-15 08:50:18 +02:00
Ali Beyad	3f79874042	Prevent the rolling upgrades rest tests from cleaning up indices after finishing if the tests.rest.preserve_indices system property is set	2016-09-14 23:34:19 -04:00
Simon Willnauer	17ddee7011	Remove TransportService#registerRequestHandler leniency (#20469 ) `TransportService#registerRequestHandler` allowed registering handlers more than once and issued an annoying warn log message when this happened. This change simply throws an exception to prevent registering the same handler more than once. This commit also removes the ability to remove request handlers. Relates to #20468	2016-09-14 20:32:29 +02:00
Luca Cavanna	14e17f44a1	Replace usage of LuceneTestCase#getBaseTempDirForTestClass (#20484 ) LuceneTestCase#getBaseTempDirForTestClass is deprecated, we should not use it. Closes #15845	2016-09-14 19:35:20 +02:00
Simon Willnauer	89640965d2	Unguice SearchModule (#20456 ) After this change SearchModule doesn't subclass AbstractModule anymore and all wiring happens in `Node.java`. As a side-effect several tests don't need a guice injector anymore.	2016-09-14 10:07:53 +02:00
Jason Tedor	7560101ec7	Complete Elasticsearch logger names This commit modifies the logger names within Elasticsearch to be the fully-qualified class name as opposed to removing the org.elasticsearch prefix and dropping the class name. This change separates the root logger from the Elasticsearch loggers (they were equated from the removal of the org.elasticsearch prefix) and enables log levels to be set at the class level (instead of the package level). Relates #20457	2016-09-13 22:46:54 -04:00
Jason Tedor	fbe27664a6	Fix prefix logging Today we add a prefix when logging within Elasticsearch. This prefix contains the node name, and index and shard-level components if appropriate. Due to some implementation details with Log4j 2, this does not work for integration tests; instead what we see is the node name for the last node to start up. The implementation detail here is that in Log4j 2 there is only one logger for a name, message factory pair, and the key derived from the message factory is the class name of the message factory. So, when the last node starts up and starts setting prefixes on its message factories, it will impact the loggers for the other nodes. Additionally, the prefixes are lost when logging an exception. This is due to another implementation detail in Log4j 2. Namely, since we log exceptions using a parameterized message, Log4j 2 decides that we do not want to use the message factory that we have provided (the prefix message factory) and so logs the exception without the prefix. This commit fixes both of these issues. Relates #20429	2016-09-13 14:46:34 -04:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Nicholas Knize	ef926894f4	Cut over geo_point field and queries to new LatLonPoint type This commit cuts over geo_point fields to use Lucene's new point-based LatLonPoint type for indexes created in 5.0. Indexes created prior to 5.0 continue to use their respective encoding type. Below is a description of the changes made to support the new encoding type: * New indexes use a new LatLonPointFieldMapper which provides a parse method for the new type * The new LatLonPoint parse method removes support for lat_lon and geohash parameters * Backcompat testing for deprecated lat_lon and geohash parameters is added to all unit and integration tests * LatLonPointFieldMapper provides DocValues support (enabled by default) which uses Lucene's new LatLonDocValuesField type * New LatLonPoint field data classes are added for aggregation support (wraps LatLonPoint's Numeric Doc Values) * MultiFields use the geohash as the string value instead of the lat,lon string making it easier to perform geo string queries on the geohash instead of a lat,lon comma delimited string. Removed Features: * With the removal of geohash indexing, GeoHashCellQuery support is removed for all new indexes (still supported on existing indexes) * LatLonPoint does not support a Distance Range query because it is super inefficient. Instead, the geo_distance_range query should be accomplished using either the geo_distance aggregation, sorting by descending distance on a geo_distance query, or a boolean must not of the excluded distance (which is what the distance_range query did anyway). TODO: * fix/finish yaml changes for plugin and rest integration tests * update documentation	2016-09-13 12:17:36 -05:00
Jason Tedor	013e3f6fcc	Remove unused import from BootstrapForTesting This commit removes an unused import for o.e.c.l.LogConfigurator from o.e.b.BootstrapForTesting.	2016-09-13 09:49:15 -04:00
Tanguy Leroux	6090c51fc5	Add quiet option to disable console logging (#20422 ) This commit adds a -q/--quiet option to Elasticsearch so that it does not log anything in the console and closes stdout & stderr streams. This is useful for SystemD to avoid duplicate logs in both journalctl and /var/log/elasticsearch/elasticsearch.log while still allowing the JVM to print error messages in stdout/stderr if needed. closes #17220	2016-09-13 14:08:24 +02:00
Lee Hinman	44278db1bc	Merge pull request #20433 from dakrone/remove-cluster-name-folder-fallback No longer allow cluster name in data path	2016-09-12 17:01:49 -05:00
Lee Hinman	94625d74e4	No longer allow cluster name in data path In 5.x we allowed this with a deprecation warning. This removes the code added for that deprecation, requiring the cluster name to not be in the data path. Resolves #20391	2016-09-12 15:47:01 -06:00
Simon Willnauer	686994ae2d	Deguice SearchService and friends (#20423 ) This change removes the guice dependency handling for SearchService and several related classes like SearchTransportController and SearchPhaseController. The latter two now have package private constructors and dependencies like FetchPhase are now created by calling their constructors explicitly. This also cleans up several users of the DefaultSearchContext and centralizes its creation inside SearchService.	2016-09-12 22:42:55 +02:00
Ali Beyad	b1e87aa13c	Split allocator decision making from decision application (#20347 ) Splits the PrimaryShardAllocator and ReplicaShardAllocator's decision making for a shard from the implementation of that decision on the routing table. This is a step toward making it easier to use the same logic for the cluster allocation explain APIs.	2016-09-12 16:21:39 -04:00
Boaz Leskes	b08352047d	Introduce IndexShardTestCase (#20411 ) Introduce a base class for unit tests that are based on real `IndexShard`s. The base class takes care of all the little details needed to create and recover shards. This commit also moves `IndexShardTests` and `ESIndexLevelReplicationTestCase` to use the new base class. All tests in `IndexShardTests` that required a full node environment were moved to a new `IndexShardIT` suite.	2016-09-12 18:20:25 +02:00
Ali Beyad	f39f9b9760	Update discovery nodes after cluster state is published (#20409 ) Before, when there was a new cluster state to publish, zen discovery would first update the set of nodes to ping based on the new cluster state, then publish the new cluster state. This is problematic because if the cluster state failed to publish, then the set of nodes to ping should not have been updated. This commit fixes the issue by updating the set of nodes to ping for fault detection only after the new cluster state has been published.	2016-09-12 12:07:51 -04:00
Luca Cavanna	4b00cc37a1	Merge pull request #20382 from javanna/enhancement/cleanup_parse_elements Cleanup sub fetch phase extension point	2016-09-09 22:47:15 +02:00
Tal Levy	dda32545bb	add ignore_missing option to relevant processors (#20194 )	2016-09-09 12:20:18 -07:00
javanna	90ab460fcc	move parsing of search ext sections to the coordinating node	2016-09-09 19:10:42 +02:00
javanna	65c7f61ad9	decouple registration of SearchExtParsers from sub fetch phases Search section supports an ext section that is used to provide additional config needed from plugins. It is now tied to sub fetch phases because it is the only section that may need additional config, but there is no reason for the two to be tightly coupled. It is now possible to register a searchExtParser independently from a sub fetch phase. All a search ext parser does is parsing some ext section of a search request, whose parsed resulting object is stored in the search context for later retrieval.	2016-09-09 18:05:49 +02:00
javanna	f9530dfe8f	remove FetchSubPhaseContext in favour of generic fetch sub phase builder of type object The context was an object where the parsed info are stored. That is more of what we call the builder since after the search refactoring. No need for generics in FetchSubPhaseParser then. Also the previous setHitsExecutionNeeded wasn't useful, it can be removed as well, given that once there is a parsed ext section, it will become a builder that can be retrieved by the sub fetch phase. The sub fetch phase is responsible for doing nothing in case the builder is not set, meaning that the fetch sub phase is plugged in but the request didn't have the corresponding section.	2016-09-09 18:05:49 +02:00
javanna	dc2ba90f48	clarify that SearchParseElement is only used for custom fetch sub phases and clean up extension point SearchParseElement is renamed to FetchSubPhaseParser and moved to the search.fetch package. Its parse method doesn't get the SearchContext as argument anymore, only the XContentParser, and the return type is what gets parsed (the fetch sub phase context which we may as well rename later). It is the parser that initializes the FetchSubPhaseContext then. SearchService retrieves the parser by name, calls parse against it and stores the result of parsing by name. No need for FetchSubPhase.ContextFactory anymore, which can be removed.	2016-09-09 18:05:49 +02:00
javanna	a33ca70ff5	make docValueFields similar to other standard sub fetch phases Given that doc value fields is our own fetch sub phase, it doesn't need to be implemented like if it was plugged in from the outside. It doesn't need its own fetch sub phase context, but it can just be an instance member in SearchContext	2016-09-09 18:05:49 +02:00
Jason Tedor	d8475488b8	Disable console logging Previously we would disable console logging in certain circumstances (for example, if Elasticsearch is not in the foreground, or if Elasticsearch is in the foreground but an exception was thrown during bootstrap). This commit makes this handling work with Log4j 2. This will prevent users from seeing double bootstrap check failure messages. Relates #20387	2016-09-09 09:15:35 -04:00
Jason Tedor	de43565abc	Do not log full bootstrap checks exception By default, when an exception causes the JVM to terminate, the stack trace is printed. In the case of failing bootstrap checks, this stack trace is useless to the user, and might even distract them from seeing that the bootstrap checks failed for reasons under their control. With this commit, we cause the stack trace for a failing bootstrap check to be truncated. We also modify some methods to not declare that they throw the top level checked exception type Exception, but instead explicitly declare the exceptions that they throw. These exceptions are caught and wrapped in a BootstrapException so that we can percolate only two exception types out of Bootstrap#init as checked exception, BootstrapException and NodeValidationException. Relates #19989	2016-09-08 10:56:11 -04:00
Tanguy Leroux	4fb7ac8254	Clean up XContentBuilder This commit cleans most of the methods of XContentBuilder so that: - Jackson's convenience methods are used instead of our custom ones (ie field(String,long) now uses Jackson's writeNumberField(String, long) instead of calling writeField(String) then writeNumber(long)) - null checks are added for all field names and values - methods are grouped by type in the class source - methods have the same parameters names - duplicated methods like field(String, String...) and array(String, String...) are removed - varargs methods now have the "array" name to reflect that it builds arrays - unused methods like field(String,BigDecimal) are removed - all methods now follow the execution path: field(String,?) -> field(String) then value(?), and value(?) -> writeSomething() method. Methods to build arrays also follow the same execution path.	2016-09-08 15:09:09 +02:00
Alexander Lin	f825e8f4cb	Exposing lucene 6.x minhash filter. (#20206 ) Exposing lucene 6.x minhash tokenfilter Generate min hash tokens from an incoming stream of tokens that can be used to estimate document similarity. Closes #20149	2016-09-07 09:38:12 +02:00
Simon Willnauer	11f2da5f14	Skip loading of jansi from log4j2 (#20334 ) Jython shades `jansi` into its classpath without changing its package or anything like that. This causes attempts to load native code on windows which blows up tests. This change adds `log4j.skipJansi=true` system property to our tests as well as to the JVM properties we set.	2016-09-06 05:53:00 -04:00
Simon Willnauer	5c2d9fa158	Improve error reporting for tests with BackgroundIndexer (#20324 ) The BackgroundIndexer now uses auto-generated IDs randomly. This causes some problems for tests that still rely on the fact that the IDs are increasing integers. This change exposes all IDs via a Set<String> to iterate over for tests.	2016-09-05 16:28:49 +02:00
Nik Everett	549ca3178b	Rename method in OldIndexUtils loadIndexList -> loadDataFilesList. The new method name is more accurate.	2016-09-02 10:16:30 -04:00
javanna	7c03f65c36	[TEST] adjusted EsTestCase#randomPositiveLong	2016-09-02 10:23:49 +02:00
javanna	536d13ff11	ProcessInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
Simon Willnauer	825b80f2a6	[TEST] fix possible NPE in ClientYamlTestExecutionContext	2016-09-02 10:07:58 +02:00
Jason Tedor	1e80adbfbe	Configure test logging with Log4j 2 This commit configures test logging for Log4j 2. The default logger configuration uses the console appender but at the error level, so most tests are missing logging. Instead, this commit provides a configuration for tests which is picked up from the classpath by Log4j 2 when it initializes. However, this now means that we can no longer initialize Log4j with a bare-bones configuration when tests run as doing so will prevent Log4j 2 from attempting to configure logging via the classpath. Consequently, we move this needed initialization (as commented, to avoid a message about a status logger not being configured when we are preparing to configure Log4j from properties files in the config directory) to only run when we are explicitly configuring Log4j from properties files. Relates #20284	2016-09-01 14:00:47 -04:00
Simon Willnauer	a0becd26b1	Optimize indexing for the autogenerated ID append-only case (#20211 ) If elasticsearch controls the ID values as well as the documents version we can optimize the code that adds / appends the documents to the index. Essentially we can skip the version lookup for all documents unless the same document is delivered more than once. On the lucene level we can simply call IndexWriter#addDocument instead of #updateDocument but on the Engine level we need to ensure that we deoptimize the case once we see the same document more than once. This is done as follows: 1. Mark every request with a timestamp. This is done once on the first node that receives a request and is fixed for this request. This can be even the machine local time (see why later). The important part is that retry requests will have the same value as the original one. 2. In the engine we make sure we keep the highest seen time stamp of "retry" requests. This is updated while the retry request has its doc id lock. Call this `maxUnsafeAutoIdTimestamp` 3. When an "optimized" request comes in, the engine compares its timestamp with the current `maxUnsafeAutoIdTimestamp` (but doesn't update it). If the request timestamp is higher it is safe to execute it as optimized (no retry request with the same timestamp has been run before). If not we fall back to "non-optimized" mode and run the request as a retry one and update the `maxUnsafeAutoIdTimestamp` unless it's been updated already to a higher value. Relates to #19813	2016-09-01 10:39:40 +02:00
Simon Willnauer	419627c460	Ensure ESTestCase is initialized before we run tests	2016-09-01 09:39:44 +02:00
Jason Tedor	76ab02e002	Merge branch 'master' into log4j2 * master: Avoid NPE in LoggingListener Randomly use Netty 3 plugin in some tests Skip smoke test client on JDK 9 Revert "Don't allow XContentBuilder#writeValue(TimeValue)" [docs] Remove coming in 2.0.0 Don't allow XContentBuilder#writeValue(TimeValue) [doc] Remove leftover from CONSOLE conversion Parameter improvements to Cluster Health API wait for shards (#20223) Add 2.4.0 to packaging tests list Docs: clarify scale is applied at origin+offset (#20242)	2016-08-31 16:37:55 -04:00
Stian Lindhom	c2eddaf2c9	Avoid NPE in LoggingListener This commit avoids an NPE that could arise when implementing an ESTestCase for test classes placed in the default package. Relates #20269	2016-08-31 16:11:12 -04:00
Ali Beyad	4641254ea6	Parameter improvements to Cluster Health API wait for shards (#20223 ) * Params improvements to Cluster Health API wait for shards Previously, the cluster health API used a strictly numeric value for `wait_for_active_shards`. However, with the introduction of ActiveShardCount and the removal of write consistency level for replication operations, `wait_for_active_shards` is used for write operations to represent values for ActiveShardCount. This commit moves the cluster health API's usage of `wait_for_active_shards` to be consistent with its usage in the write operation APIs. This commit also changes `wait_for_relocating_shards` from a numeric value to a simple boolean value `wait_for_no_relocating_shards` to set whether the cluster health operation should wait for all relocating shards to complete relocation. * Addresses code review comments * Don't be lenient if `wait_for_relocating_shards` is set	2016-08-31 11:58:19 -04:00
Jason Tedor	e166459bbe	Merge branch 'master' into log4j2 * master: Increase visibility of deprecation logger Skip transport client plugin installed on JDK 9 Explicitly disable Netty key set replacement percolator: Fail indexing percolator queries containing either a has_child or has_parent query. Make it possible for Ingest Processors to access AnalysisRegistry Allow RestClient to send array-based headers Silence rest util tests until the bogusness can be simplified Remove unknown HttpContext-based test as it fails unpredictably on different JVMs Tests: Improve rest suite names and generated test names for docs tests Add support for a RestClient base path	2016-08-31 10:59:27 -04:00
Jason Tedor	abf8a1a3f0	Avoid allocating log parameterized messages This commit modifies the call sites that allocate a parameterized message to use a supplier so that allocations are avoided unless the log level is fine enough to emit the corresponding log message.	2016-08-30 18:17:09 -04:00
Ryan Ernst	2a7a187bf8	Silence rest util tests until the bogusness can be simplified	2016-08-30 14:58:44 -07:00
Ryan Ernst	e19f2b6348	Tests: Improve rest suite names and generated test names for docs tests Rest test suites are currently only the directory above the yaml test file. That is confusing when there are more than one directory level which contain yaml tests, as there are in generated docs tests. This change makes rest tests use the full relative path to the rest test root as the suite name, and also makes the test names for docs tests a little clearer (that they are testing an example from a specific line number, instead of just the line number as an opaque test name).	2016-08-30 13:55:44 -07:00
Jason Tedor	7da0cdec42	Introduce Log4j 2 This commit introduces Log4j 2 to the stack.	2016-08-30 13:31:24 -04:00
javanna	61145bfb2f	[TEST] minor cleanups to AbstractQueryTestCase Removed null check for token, if we are outside the null it already means it is null. Fixed typo in comment and remove leftover assignment to unused local variable.	2016-08-29 16:52:11 +02:00
Yannick Welsch	f070c8727b	[TEST] Add additional logging to testStaleMasterNotHijackingMajority This test is periodically failing. As I suspect that the GCDisruption scheme is somehow making the wrong node block on its cluster state update thread, I've added some more logging and a thread dump once the given assertion triggers again.	2016-08-29 13:42:13 +02:00
Yannick Welsch	1b75cb63a2	Add recovery source to ShardRouting (#19516 ) Adds an explicit recoverySource field to ShardRouting that characterizes the type of recovery to perform: - fresh empty shard copy - existing local shard copy - recover from peer (primary) - recover from snapshot - recover from other local shards on same node (shrink index action)	2016-08-27 16:11:10 +02:00
Tanguy Leroux	68b943dc53	Fix MoreLikeThisQueryBuilderTests.testUnknownObjectException() Objects hierarchy must be tracked when entering/leaving an object so that it better knows if the "newField" has been inserted into an arbitrary holding object. Can be reproduced with gradle :core:test -Dtests.seed=760F8BD0F7E46D45 -Dtests.class=org.elasticsearch.index.query.MoreLikeThisQueryBuilderTests -Dtests.method="testUnknownObjectException" -Dtests.security.manager=true -Dtests.locale=ko -Dtests.timezone=Etc/Zulu	2016-08-25 20:54:06 +02:00
Tanguy Leroux	fbcfddbb77	Fix AbstractQueryTestCase.testUnknownObjectException() We need to check the whole hierarchy of objects to know if the newly inserted "newField" object is part of an arbitrary holding object or not. Reproduced with `gradle :modules:percolator:test -Dtests.seed=736B0B67DA7A3632 -Dtests.class=org.elasticsearch.percolator.PercolateQueryBuilderTests -Dtests.method="testUnknownObjectException" -Dtests.security.manager=true -Dtests.locale=es-ES -Dtests.timezone=ART`	2016-08-25 16:24:22 +02:00
Michael McCandless	1fe3e36934	Merge pull request #20147 from mikemccand/lucene_620_upgrade Upgrade to Lucene 6.2.0	2016-08-25 06:03:34 -04:00
Tanguy Leroux	20719f9b2f	Improve AbstractQueryTestCase#unknownObjectExceptionTest() This method fails when a randomized string value contains a double-quote. This commit changes the method so that it is not based on string concatenation anymore. It now uses XContentGenerator & XContentParser to mutate the valid queries. Related #19864	2016-08-25 10:57:30 +02:00
Mike McCandless	5eb66e3378	Mark Scandinavian analysis components as multi term aware	2016-08-24 19:50:25 -04:00
Mike McCandless	7492300544	Remove now unused Store.renameFile, and obsolete commented out code	2016-08-24 18:20:30 -04:00
Mike McCandless	0ccfe69789	Upgrade to Lucene 6.2.0	2016-08-24 17:26:28 -04:00
Jim Ferenczi	4682fc34ae	Add the ability to disable the retrieval of the stored fields entirely This change adds a special field named _none_ that allows disabling the retrieval of the stored fields in a search request or in a TopHitsAggregation. To completely disable stored fields retrieval (including disabling metadata fields retrieval such as _id or _type) use _none_ like this: ```` POST _search { "stored_fields": "_none_" } ````	2016-08-24 16:40:08 +02:00
Nicholas Knize	28ed0e7abf	Deprecate optimize_bbox on geodistance queries Deprecates the optimize_bbox parameter on geodistance queries. This has no longer been needed since version 2.2 because lucene geo distance queries (postings and LatLonPoint) already optimize by bounding box.	2016-08-23 09:14:54 -05:00
Yannick Welsch	771668f380	Use routingResult method to update cluster state after reroute This ensures that the routing table as well as the metadata (with the primary terms and in-sync allocation ids) is updated.	2016-08-19 17:15:02 +02:00
Ryan Ernst	8c60455ed6	Fix checkstyle line length violations in allocation tests	2016-08-17 16:28:31 -07:00
Ryan Ernst	1ff348ed7f	Plugins: Make custom allocation deciders use pull based extensions This change converts AllocationDecider registration from push based on ClusterModule to implementing with a new ClusterPlugin interface. AllocationDecider instances are allowed to use only Settings and ClusterSettings.	2016-08-17 15:55:31 -07:00
Ryan Ernst	2ea50bc162	Merge pull request #20018 from rjernst/split_disk_threshold Internal: Split disk threshold monitoring from decider	2016-08-17 07:57:50 -07:00
Yannick Welsch	27a760f9c1	Add routing changes API to RoutingAllocation (#19992 ) Adds a class that records changes made to RoutingAllocation, so that at the end of the allocation round other values can be more easily derived based on these changes. Most notably, it: - replaces the explicit boolean flag that is passed around everywhere to denote changes to the routing table. The boolean flag is automatically updated now when changes actually occur, preventing issues where it got out of sync with actual changes to the routing table. - records actual changes made to RoutingNodes so that primary term and in-sync allocation ids, which are part of index metadata, can be efficiently updated just by looking at the shards that were actually changed.	2016-08-17 10:46:59 +02:00
Ryan Ernst	b2c0f2d08f	Internal: Split disk threshold monitoring from decider In addition to being an allocation decider, DiskThresholdDecider also monitors the used disk in order to trigger a reroute when the thresholds are crossed. This change splits out the settings for disk thresholds into DiskThresholdSettings, and moves the monitoring to a new DiskThresholdMonitor. DiskThresholdDecider is then in line with other allocation deciders, needing only Settings and ClusterSettings for construction, which will allow deguicing allocation deciders.	2016-08-17 00:22:16 -07:00
Lee Hinman	1825d8060c	Merge remote-tracking branch 'dakrone/lockobtainfailed-replacement'	2016-08-16 14:41:27 -06:00
Lee Hinman	1de3388fa3	Switching LockObtainFailedException over to ShardLockObtainFailedException `LockObtainFailedException` should be reserved for on-disk locks that Lucene attempts (like `write.lock`). This switches our in-memory semaphore locks for shards to use a different exception. Additionally, ShardLockObtainFailedException no longer subclasses IOException, since no IO is being done in this case. Resolves #19978	2016-08-16 14:37:36 -06:00
Nik Everett	46bf8baf2e	Switch aggregation registration from push to pull Adds `getAggregations` to `SearchPlugin` which can be used to register aggregations. Fixup MockNode which wasn't creating MockBigArrays.	2016-08-16 09:08:36 -04:00
Nik Everett	cf6e1a4362	Move all FetchSubPhases to `o.e.search.fetch.subphase` As the most complicated `FetchSubPhase` highlighting gets its own package (`o.e.search.fetch.subphase.highlight`). No other `FetchSubPhase`s get their own package. Instead they all reside together in `o.e.search.fetch.subphase`. Add package descriptions to `o.e.search.fetch` and subpackages.	2016-08-12 18:21:15 -04:00
Jason Tedor	1f0673c9bd	Default max local storage nodes to one This commit defaults the max local storage nodes to one. The motivation for this change is that a default value greater than one is dangerous as users sometimes end up unknowingly starting a second node and start thinking that they have encountered data loss. Relates #19964	2016-08-12 09:26:20 -04:00
Nik Everett	9f8f2ea54b	Remove ESIntegTestCase#pluginList It was a useful method in 1.7 when javac's type inference wasn't as good, but now we can just replace it with `Arrays.asList`.	2016-08-11 15:44:02 -04:00
Yannick Welsch	522b137097	Make NetworkPartition disruption scheme configurable (#19534 ) This commit separates the description of the links in the network that are to be disrupted from the failure that is to be applied to the links (disconnect/unresponsive/delay). Previously we had subclasses for the various kinds of network disruption schemes, each combining the failure mode (disconnect/unresponsive/delay) and the network links to cut (two partitions / bridge partitioning) into a single class.	2016-08-11 14:55:06 +02:00
Adrien Grand	0d6ac57acf	Collapse o.e.index.mapper packages. #19921 I also reduced the visibility of a couple classes and renamed/consolidated some test classes for consistency, eg. removing the `Simple` prefix or using the `<Type>FieldMapperTests` convention for testing field mappers.	2016-08-10 17:51:11 +02:00
javanna	7d4a6499e1	[TEST] add inline comments to AbstractQueryTestCase#unknownObjectExceptionTest	2016-08-10 12:21:25 +02:00
javanna	8391e6de37	[TEST] enable testUnknownObjectException for alternate query versions too	2016-08-10 12:21:25 +02:00
javanna	0a98b5e56e	[TEST] make AbstractQueryTestCase#testUnknownObjectException more accurate testUnknownObjectException used to generate malformed json objects in some cases, due to the existence of arrays as it was not closing the injected object correctly. That is why the test was catching JsonParseException among the exceptions that are expected to be thrown. That is fixed by tracking where the new object is placed and placing its end object marker to the right level rather than always at the end. Also introduced a mechanism to explicitly declare objects that won't cause any exception when they get additional objects injected, so that there is no need to override the method anymore as that caused copy pasting of the whole test method. This also makes sure that changes are reflected in tests, as those inner objects are not skipped but we actually check that what is declared is true (no exceptions get thrown when an additional object is added within them).	2016-08-10 11:48:51 +02:00
Lee Hinman	5849c488b5	Merge remote-tracking branch 'dakrone/compliation-breaker'	2016-08-09 11:57:26 -06:00
Lee Hinman	2be52eff09	Circuit break the number of inline scripts compiled per minute When compiling many dynamically changing scripts, parameterized scripts (<https://www.elastic.co/guide/en/elasticsearch/reference/master/modules-scripting-using.html#prefer-params>) should be preferred. This enforces a limit to the number of scripts that can be compiled within a minute. A new dynamic setting is added - `script.max_compilations_per_minute`, which defaults to 15. If more dynamic scripts are sent, a user will get the following exception: ```json { "error" : { "root_cause" : [ { "type" : "circuit_breaking_exception", "reason" : "[script] Too many dynamic script compilations within one minute, max: [15/min]; please use on-disk, indexed, or scripts with parameters instead", "bytes_wanted" : 0, "bytes_limit" : 0 } ], "type" : "search_phase_execution_exception", "reason" : "all shards failed", "phase" : "query", "grouped" : true, "failed_shards" : [ { "shard" : 0, "index" : "i", "node" : "a5V1eXcZRYiIk8lecjZ4Jw", "reason" : { "type" : "general_script_exception", "reason" : "Failed to compile inline script [\"aaaaaaaaaaaaaaaa\"] using lang [painless]", "caused_by" : { "type" : "circuit_breaking_exception", "reason" : "[script] Too many dynamic script compilations within one minute, max: [15/min]; please use on-disk, indexed, or scripts with parameters instead", "bytes_wanted" : 0, "bytes_limit" : 0 } } } ], "caused_by" : { "type" : "general_script_exception", "reason" : "Failed to compile inline script [\"aaaaaaaaaaaaaaaa\"] using lang [painless]", "caused_by" : { "type" : "circuit_breaking_exception", "reason" : "[script] Too many dynamic script compilations within one minute, max: [15/min]; please use on-disk, indexed, or scripts with parameters instead", "bytes_wanted" : 0, "bytes_limit" : 0 } } }, "status" : 500 } ``` This also fixes a bug in `ScriptService` where requests being executed concurrently on a single node could cause a script to be compiled multiple times 
(many in the case of a powerful node with many shards) due to no synchronization between checking the cache and compiling the script. There is now synchronization so that a script being compiled will only be compiled once regardless of the number of concurrent searches on a node. Relates to #19396	2016-08-09 10:26:27 -06:00
javanna	329eaaea65	[TEST] expand AbstractQueryTestCase#testQueryWrappedInArray to run against query alternate versions	2016-08-08 19:09:43 +02:00
javanna	2437226802	[TEST] restore tests repeatability in AbstractQueryTestCase Some random operations were conditionally performed in the before test, which made tests not repeatable. For instance take the seed chain to repeat a specific iteration and try to reproduce it, this conditional code would get executed in both cases when trying to isolate the failure, but not among the different iterations (as only the first method/iteration executes it), hence the failure will not reproduce. Moved the random operations to beforeClass and left the non random part in the before method, which is needed as it depends on some method that can be overridden by subclasses.	2016-08-05 22:38:31 +02:00
Luca Cavanna	4c1a3b9a53	Merge pull request #19791 from javanna/fix/multiple_fields_queries Query parsers to throw exception when multiple field names are provided	2016-08-05 15:53:35 +02:00
Ali Beyad	f59ca9083b	Snapshot repository cleans up empty index folders (#19751 ) This commit cleans up indices in a snapshot repository when all snapshots containing the index are deleted. Previously, empty indices folders would lie around after all snapshots containing them were deleted.	2016-08-05 09:39:02 -04:00
javanna	7f0bd56094	[TEST] use expectThrows wherever possible in query builder unit tests	2016-08-05 13:55:18 +02:00
Nik Everett	1e587406d8	Fail yaml tests and docs snippets that get unexpected warnings Adds `warnings` syntax to the yaml test that allows you to expect a `Warning` header that looks like: ``` - do: warnings: - '[index] is deprecated' - quotes are not required because yaml - but this argument is always a list, never a single string - no matter how many warnings you expect get: index: test type: test id: 1 ``` These are accessible from the docs with: ``` // TEST[warning:some warning] ``` This should help to force you to update the docs if you deprecate something. You must add the warnings marker to the docs or the build will fail. While you are there you should update the docs to add deprecation warnings visible in the rendered results.	2016-08-04 15:23:05 -04:00
Daniel Mitterdorfer	4598c36027	Fix various concurrency issues in transport (#19675 ) Due to various issues (most notably a missing happens-before edge between socket accept and channel close in MockTcpTransport), MockTcpTransportTests sometimes did not terminate. With this commit we fix various concurrency issues that led to this hanging test. Failing example build: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-os-compatibility/os=oraclelinux/835/console	2016-08-04 21:00:59 +02:00
javanna	cd9388ce66	[TEST] parse query alternate versions in strict mode AbstractQueryTestCase parses the main version of the query in strict mode, meaning that it will fail if any deprecated syntax is used. It should do the same for alternate versions (e.g. short versions). This is the way it is because the two alternate versions for ids query are both deprecated. Moved testing for those to a specific test method that isolates the deprecations and actually tests that the two are deprecated.	2016-08-04 19:49:43 +02:00
javanna	146f02183d	[TEST] remove unused methods and fix some warnings in AbstractQueryTestCase Also fix line length issues	2016-08-04 10:06:25 +02:00
Luca Cavanna	c5a9427293	Merge pull request #19750 from javanna/fix/npe_parse_field_array Throw ParsingException if a query is wrapped in an array	2016-08-03 18:21:39 +02:00
javanna	4805250ecf	Throw ParsingException if a query is wrapped in an array Our parsing code accepted up until now queries in the following form (note that the query starts with `[`): ``` { "bool" : [ { "must" : [] } ] } ``` This would lead to a null pointer exception as most parsers assume that the field name ("must" in this example) is the first thing that can be found in a query if its json is valid, hence always non null while parsing. Truth is that the additional array layer doesn't make the json invalid, hence the following code fragment would cause NPE within ParseField, because null gets passed to `parseContext.isDeprecatedSetting`: ``` if (token == XContentParser.Token.FIELD_NAME) { currentFieldName = parser.currentName(); } else if (parseContext.isDeprecatedSetting(currentFieldName)) { // skip } else if (token == XContentParser.Token.START_OBJECT) { ``` We could add null checks in each of our parsers in lots of places, but we rely on `currentFieldName` being non null in all of our parsers, and we should consider it a bug when these unexpected situations are not caught explicitly. It would be best to find a way to prevent such queries altogether without changing all of our parsers. The reason why such a query goes through is that we've been allowing a query to start with either `[` or `{`. The only reason I found is that we accept `match_all : []`. This seems like an undocumented corner case that we could drop support for. Then we can be stricter and accept only `{` as start token of a query. That way the only next token that the parser can encounter if the json is valid (otherwise the json parser would barf earlier) is actually a field_name, hence the assumption that all our parsers make holds. The downside of this is simply dropping support for `match_all : []` Relates to #12887	2016-08-03 17:05:14 +02:00
Nik Everett	ca8f666c66	Add line number to yaml test failures Old: ``` > Throwable #1: java.lang.AssertionError: expected [2xx] status code but api [reindex] returned [400 Bad Request] [{"error":{"root_cause":[{"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25}],"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25,"caused_by":{"type":"illegal_argument_exception","reason":"[dest] unknown field [asdfadf], parser not found"}},"status":400}] > at __randomizedtesting.SeedInfo.seed([9325F8C5C6F227DD:1B71C71F680E4A25]:0) > at org.elasticsearch.test.rest.yaml.section.DoSection.execute(DoSection.java:119) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.test(ESClientYamlSuiteTestCase.java:309) > at java.lang.Thread.run(Thread.java:745) ``` New: ``` > Throwable #1: java.lang.AssertionError: Failure at [reindex/10_basic:12]: expected [2xx] status code but api [reindex] returned [400 Bad Request] [{"error":{"root_cause":[{"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25}],"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25,"caused_by":{"type":"illegal_argument_exception","reason":"[dest] unknown field [asdfadf], parser not found"}},"status":400}] > at __randomizedtesting.SeedInfo.seed([444DEEAF47322306:CC19D175E9CE4EFE]:0) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.executeSection(ESClientYamlSuiteTestCase.java:329) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.test(ESClientYamlSuiteTestCase.java:309) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.AssertionError: expected [2xx] status code but api [reindex] returned [400 Bad Request] [{"error":{"root_cause":[{"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25}],"type":"parsing_exception","reason":"[reindex] failed to parse field 
[dest]","line":1,"col":25,"caused_by":{"type":"illegal_argument_exception","reason":"[dest] unknown field [asdfadf], parser not found"}},"status":400}] > at org.elasticsearch.test.rest.yaml.section.DoSection.execute(DoSection.java:119) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.executeSection(ESClientYamlSuiteTestCase.java:325) > ... 37 more ``` Sorry for the longer stack trace, but I wanted to be sure I didn't throw anything away by accident.	2016-08-03 10:59:57 -04:00
Britta Weber	abcb4c8a97	[Test] move methods from bwc test to test package for use in plugins (#19738 ) * [Test] move methods from bwc test to test package for use in other plugins	2016-08-03 11:41:46 +02:00
Ryan Ernst	df8dc64e9b	Plugins: Make NamedWriteableRegistry immutable and add extension point for named writeables Currently any code that wants to add NamedWriteables to the NamedWriteableRegistry can do so via guice injection of the registry, and registering at construction time. However, this makes the registry complex: it has both get and register methods synchronized, and there is likely contention on the read side from multiple threads. The registration has mostly already been contained to guice modules at node construction time. This change makes the registry immutable, taking all of the NamedWriteable readers at construction time. It also allows plugins to add arbitrary named writeables that they may use in their own transport actions.	2016-08-02 15:56:25 -07:00
Ali Beyad	c4ae23f5d8	Enables implementations of the BlobContainer interface to (#19749 ) conform with the requirements of the writeBlob method by throwing a FileAlreadyExistsException if attempting to write to a blob that already exists. This change means implementations of BlobContainer should never overwrite blobs - to overwrite a blob, it must first be deleted and then can be written again. Closes #15579	2016-08-02 09:48:21 -04:00
Ali Beyad	456ea56527	Cleans up the BlobContainer interface by removing the (#19727 ) writeBlob method that takes a BytesReference in favor of just the writeBlob method that takes an InputStream. Closes #18528	2016-08-02 09:21:43 -04:00
Ali Beyad	25d8eca62d	Removes the notion of write consistency level across all APIs in favor of waiting for active shard copy count (wait_for_active_shards).	2016-08-01 13:35:29 -04:00
Ali Beyad	9f88a8194a	Merge pull request #19706 from elastic/enhancement/snapshot-blob-handling More resilient blob handling in snapshot repositories	2016-08-01 12:03:53 -04:00
Tanguy Leroux	386902903e	[TEST] Kill remaining lang-groovy messy tests After #13834 many tests that used Groovy scripts (for good or bad reason) in their tests have been moved in the lang-groovy module and the issue #13837 has been created to track these messy tests in order to clean them up. The work started with #19280, #19302 and #19336 and this PR moves the remaining messy tests back in core, removes the dependency on Groovy, changes the scripts in order to use the mocked script engine, and changes the tests to integration tests. It also moves IndexLookupIT test back (even if it has a good chance to be removed soon) and fixes its tests. It also changes AbstractQueryTestCase to use custom script plugins in tests. closes #13837	2016-08-01 16:59:47 +02:00
Alexander Lin	9ac6389e43	Rename operation to result and reworking responses * Rename operation to result and reworking responses * Rename DocWriteResponse.Operation enum to DocWriteResponse.Result These are just easier to interpret names. Closes #19664	2016-08-01 10:42:58 -04:00
Alexander Lin	119026b4fb	Remove isCreated and isFound from the Java API This is cleanup work from #19566, where @nik9000 suggested trying to nuke the isCreated and isFound methods. I've combined nuking the two methods with removing UpdateHelper.Operation in favor of DocWriteResponse.Operation here. Closes #19631.	2016-07-29 14:21:43 -04:00
Nik Everett	2e7336dc10	Add package-info to o.e.test.rest This removes two packages, consolidating them into their parent package and adds `package-info.java` files to describe all of the packages under `org.elasticsearch.test.rest`.	2016-07-28 16:07:44 -04:00
David Pilato	0d2ccf0989	Merge branch 'pr/15724-gce-network-host-master'	2016-07-28 16:59:18 +02:00
Nik Everett	fb45f6a8a8	Add authentication to reindex-from-remote The tests for authentication extend ESIntegTestCase and use a mock authentication plugin. This way the clients don't have to worry about running it. Sadly, that means we don't really have good coverage on the REST portion of the authentication. This also adds ElasticsearchStatusException, an exception on which you can set an explicit status. The nice thing about it is that you can set the RestStatus that it returns to whatever arbitrary status you like based on the status that comes back from the remote system. reindex-from-remote then uses it to wrap all remote failures, preserving the status from the remote Elasticsearch or whatever proxy is between us and the remote Elasticsearch.	2016-07-27 14:17:41 -04:00
David Pilato	e9339a1960	Merge branch 'master' into pr/15724-gce-network-host-master	2016-07-27 11:24:53 +02:00
Boaz Leskes	6f76740a58	await fix testConcurrentSendRespondAndDisconnect	2016-07-26 23:42:10 +02:00
Nik Everett	9270e8b22b	Rename client yaml test infrastructure This makes it obvious that these tests are for running the client yaml suites. Now that there are other ways of running tests using the REST client against a running cluster we can't go on calling the shared client yaml tests "REST tests". They are rest tests, but they aren't the rest tests.	2016-07-26 13:53:44 -04:00
David Pilato	0d3edee928	Merge branch 'master' into pr/15724-gce-network-host-master	2016-07-26 18:51:01 +02:00
David Pilato	fde15ae470	Move custom name resolvers to NetworkService CTOR Instead of using NetworkModule we can directly inject them in NetworkService CTOR. See https://github.com/elastic/elasticsearch/pull/15765#issuecomment-235307974	2016-07-26 18:26:30 +02:00
Boaz Leskes	fabfd425f0	remove socket timeout from MockTcpTransport added in `b208a7dbae`	2016-07-26 18:04:05 +02:00
Boaz Leskes	dbdb6341a5	increase logging information in testConcurrentSendRespondAndDisconnect	2016-07-26 18:02:22 +02:00
Daniel Mitterdorfer	b208a7dbae	Add socket timeout in MockTcpTransport With this commit we set an explicit socket timeout in MockTcpTransport to avoid hanging tests in case of disconnections.	2016-07-26 16:04:51 +02:00
Nik Everett	a95d4f4ee7	Add Location header and improve REST testing This adds a header that looks like `Location: /test/test/1` to the response for the index/create/update API. The requirement for the header comes from https://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html https://tools.ietf.org/html/rfc7231#section-7.1.2 claims that relative URIs are OK. So we use an absolute path which should resolve to the appropriate location. Closes #19079 This makes large changes to our rest test infrastructure, allowing us to write junit tests that test a running cluster via the rest client. It does this by splitting ESRestTestCase into two classes: * ESRestTestCase is the superclass of all tests that use the rest client to interact with a running cluster. * ESClientYamlSuiteTestCase is the superclass of all tests that use the rest client to run the yaml tests. These tests are shared across all official clients, thus the `ClientYamlSuite` part of the name.	2016-07-25 17:02:40 -04:00
Boaz Leskes	b90dff7292	increase log level to debug in testConcurrentSendRespondAndDisconnect	2016-07-25 22:01:09 +02:00
Ali Beyad	2f831c3abb	BytesArray tests fix: offsets don't matter on a zero bytes array Closes #19582	2016-07-25 15:22:08 -04:00
Tanguy Leroux	f745c96949	Clean up more messy tests After #13834 many tests that used Groovy scripts (for good or bad reason) in their tests have been moved in the lang-groovy module and the issue #13837 has been created to track these messy tests in order to clean them up. This commit moves more tests back in core, removes the dependency on Groovy, changes the scripts in order to use the mocked script engine, and changes the tests to integration tests.	2016-07-25 17:02:49 +02:00
Boaz Leskes	cd596772ee	Persistent Node Names (#19456 ) With #19140 we started persisting the node ID across node restarts. Now that we have a "stable" anchor, we can use it to generate a stable default node name and make it easier to track nodes across restarts. Sadly, this means we will not have those random fun Marvel characters but we feel this is the right tradeoff. On the implementation side, this requires a bit of juggling because we now need to read the node id from disk before we can log as the node name is part of each log message. The PR moves the initialization of NodeEnvironment as high up in the starting sequence as possible, with only one logging message before it to indicate we are initializing. Things look now like this: ``` [2016-07-15 19:38:39,742][INFO ][node ] [_unset_] initializing ... [2016-07-15 19:38:39,826][INFO ][node ] [aAmiW40] node name set to [aAmiW40] by default. set the [node.name] settings to change it [2016-07-15 19:38:39,829][INFO ][env ] [aAmiW40] using [1] data paths, mounts [[ /(/dev/disk1)]], net usable_space [5.5gb], net total_space [232.6gb], spins? [unknown], types [hfs] [2016-07-15 19:38:39,830][INFO ][env ] [aAmiW40] heap size [1.9gb], compressed ordinary object pointers [true] [2016-07-15 19:38:39,837][INFO ][node ] [aAmiW40] version[5.0.0-alpha5-SNAPSHOT], pid[46048], build[473d3c0/2016-07-15T17:38:06.771Z], OS[Mac OS X/10.11.5/x86_64], JVM[Oracle Corporation/Java HotSpot(TM) 64-Bit Server VM/1.8.0_51/25.51-b03] [2016-07-15 19:38:40,980][INFO ][plugins ] [aAmiW40] modules [percolator, lang-mustache, lang-painless, reindex, aggs-matrix-stats, lang-expression, ingest-common, lang-groovy, transport-netty], plugins [] [2016-07-15 19:38:43,218][INFO ][node ] [aAmiW40] initialized ``` Needless to say, setting `node.name` explicitly still works as before. The commit also contains some clean ups to the relationship between Environment, Settings and Plugins. 
The previous code suggested the path related settings could be changed after the initial Environment was changed. This did not have any effect as the security manager already locked things down.	2016-07-23 22:46:48 +02:00
Jason Tedor	2d1b0587dd	Introduce Netty 4 This commit adds transport-netty4, a transport and HTTP implementation based on Netty 4. Relates #19526	2016-07-22 22:26:35 -04:00
Ali Beyad	a0a4d67eae	All snapshot metadata files use UUID for the blob ID	2016-07-22 13:52:13 -04:00
gfyoung	d98fd36dad	Added deleteBlob IOException test	2016-07-22 13:48:45 -04:00
javanna	db8beeba3b	Merge branch 'master' into feature/async_rest_client	2016-07-22 15:51:03 +02:00
Boaz Leskes	bd574d92ae	Verify lower level transport exceptions don't bubble up on disconnects (#19518 ) #19096 introduced a generic TCPTransport base class so we can have multiple TCP based transport implementations. These implementations can vary in how they respond internally to situations where we concurrently send, receive and handle disconnects and can have different exceptions. However, disconnects are important events for the rest of the code base and should be distinguished from other errors (for example, it signals TransportMasterAction that it needs to retry and wait for a (new) master to come back). Therefore, we should make sure that all the implementations do the proper translation from their internal exceptions into ConnectTransportException which is used externally. Similarly we should make sure that the transport implementations properly recognize errors that were caused by a disconnect as such and deal with them correctly. This was, for example, the source of a build failure at https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-intake/1080 , where a concurrency issue caused SocketException to bubble out of MockTcpTransport. This PR adds a test which concurrently simulates connects, disconnects, sending and receiving and makes sure the above holds. It also fixes anything (not much!) that was found by it.	2016-07-22 14:35:47 +02:00
Tal Levy	f7cd86ef6d	rethrow script compilation exceptions into ingest configuration exceptions (#19318 ) * rethrow script compilation exceptions into ingest configuration exceptions * update readProcessor to rethrow any exception as an ElasticsearchException	2016-07-20 10:37:56 -07:00
javanna	a9b5c5adbe	restore throws IOException clause on all performRequest sync methods We throw IOException, which is the exception that is going to be thrown in 99% of the cases. A more generic exception can happen, and if it is a runtime one we just let it bubble up as is, otherwise we wrap it into runtime one so that we don't require to catch Exception everywhere, which seems odd. Also adjusted javadocs for all performRequest methods	2016-07-19 15:18:05 +02:00
javanna	1bb33cf572	Remove RestClient#JSON_CONTENT_TYPE constant, already available in ContentType class	2016-07-19 15:17:12 +02:00
javanna	e742d65e02	[TEST] Make sure the last response body is always available in our REST tests With the introduction of the async client, ResponseException doesn't eagerly read the response body anymore into a string. That is better, but raised a problem in our REST tests infra: we were reading the response body twice, while it can only be consumed once. Introduced a RestTestResponseException that wraps a ResponseException and exposes the body which now gets read only once.	2016-07-19 15:16:45 +02:00
javanna	41e97a7cb1	RestClient: take builder out to its own class The RestClient class is getting bigger and bigger, its builder can definitely be taken out to its own top level class: RestClientBuilder	2016-07-19 15:16:45 +02:00
javanna	1fbec71243	Rest client: introduce async performRequest method and use async client under the hood for sync requests too The new method accepts the usual parameters (method, endpoint, params, entity and headers) plus a response listener and an async response consumer. Shortcut methods are also added that make params, entity and the async response consumer optional. There are a few relevant api changes as a consequence of the move to async client that affect sync methods: - Response doesn't implement Closeable anymore, responses don't need to be closed - performRequest throws Exception rather than just IOException, as that is the exception that we get from the FutureCallback#failed method in the async http client - ssl configuration is a bit simpler, one only needs to call setSSLStrategy from a custom HttpClientConfigCallback, that doesn't end up overriding any other default around connection pooling (it used to happen with the sync client and make ssl configuration more complex) Relates to #19055	2016-07-19 15:15:58 +02:00
Nik Everett	a2a7ea1f17	Make ExtendedBounds immutable We used to mutate it as part of building the aggregation. That caused assertVersionSerializable to fail because it assumes that requests aren't mutated after they are sent. Closes #19481	2016-07-19 08:48:14 -04:00
Simon Willnauer	8394544548	Add a dedicated client/transport project for transport-client (#19435 ) The `client/transport` project adds a new jar build project that pulls in all dependencies and configures all required modules. Preinstalled modules are: * transport-netty * lang-mustache * reindex * percolator The `TransportClient` classes are still in core while `TransportClient.Builder` has only a protected constructor such that users are redirected to use the new `TransportClientBuilder` from the new jar. Closes #19412	2016-07-18 15:42:24 +02:00
Martijn van Groningen	e0ebf5da1c	Template cleanup: * Removed `Template` class and unified script & template parsing logic. Templates are scripts, so they should be defined as a script. Unless there will be separate template infrastructure, templates should share as much code as possible with scripts. * Removed ScriptParseException in favour for ElasticsearchParseException * Moved TemplateQueryBuilder to lang-mustache module because this query is hard coded to work with mustache only	2016-07-18 10:16:01 +02:00
Ali Beyad	687e2e12b3	Merge pull request #19450 from elastic/feature/friendly-index-creation Makes index creation more friendly	2016-07-15 11:48:21 -04:00
Ali Beyad	d78f40fb1e	Index creation waits for active shard copies before returning (#18985 ) Before returning, index creation now waits for the configured number of shard copies to be started. In the past, a client would create an index and then potentially have to check the cluster health to wait to execute write operations. With the cluster health semantics changing so that index creation does not cause the cluster health to go RED, this change enables waiting for the desired number of active shards to be active before returning from index creation. Relates #9126	2016-07-15 11:19:27 -04:00
Martijn van Groningen	d0069f0fbb	Provide access to ThreadContext in ingest plugins Also introduced a `Processor.Parameters` class that is a holder for several services processors rely on; the IngestPlugin#getProcessors(...) method has been changed to accept `Processor.Parameters` instead of each service separately.	2016-07-15 08:16:15 +02:00
Jason Tedor	31c648eee8	Rename transport-netty to transport-netty3 This commit renames the Netty 3 transport module from transport-netty to transport-netty3. This is to make room for a Netty 4 transport module, transport-netty4. Relates #19439	2016-07-14 22:03:14 -04:00
Jason Tedor	575fa4e00a	Fix line-length in o/e/t/r/s/Features.java This commit fixes a line-length checkstyle violation in o/e/t/r/s/Features.java.	2016-07-14 18:10:35 -04:00
Honza Král	e21b1e8066	[TEST] add 'yaml' feature for the test runner (#19436 ) Also renamed 30_yaml.yaml to 30_json.yaml since it tests json, not yaml	2016-07-14 17:30:32 +02:00
Simon Willnauer	5616251f22	Remove `node.mode` and `node.local` settings (#19428 ) Today `node.mode` and `node.local` serve almost the same purpose, they are a shortcut for `discovery.type` and `transport.type`. If `node.local: true` or `node.mode: local` is set elasticsearch will start in _local_ mode which means only nodes within the same JVM are discovered and a non-network based transport is used. The _local_ mode is only really used in tests or if nodes are embedded. For both, embedding and tests explicit configuration via `discovery.type` and `transport.type` should be preferred. This change removes all the usage of these settings and by-default doesn't configure a default transport implementation since netty is now a module. Yet, to make the user experience flawless, plugins or modules can set a `http.type.default` and `transport.type.default`. Plugins set this via `PluginService#additionalSettings()` which enforces _set-once_ which prevents node startup if set multiple times. This means that our distributions will just startup with netty transport since it's packaged as a module unless `transport.type` or `http.transport.type` is explicitly set. This change also found a bunch of bugs since several NamedWriteables were not registered if a transport client is used. Now that we don't rely on the `node.mode` leniency which is inherited instead of using explicit settings, `TransportClient` uses `AssertingLocalTransport` which detects these problems since it serializes all messages. Closes #16234	2016-07-14 13:21:10 +02:00
Simon Willnauer	29fd0f1bd8	[TEST] Remove wrong transportName from MockTcpTransport#ctor	2016-07-13 12:50:52 +02:00
Simon Willnauer	067ca1f996	[TEST] Use a semaphore to block until all in-flight requests are released	2016-07-13 10:31:05 +02:00
Simon Willnauer	814c7224f9	Merge pull request #19392 from elastic/modularize_netty This moves all netty related code into modules/transport-netty. The module is built as a zip file as well as a JAR to serve as a dependency for transport client. For the time being this is required, otherwise we have no network based impl. for transport client users. This might be subject to change given that we move forward http client.	2016-07-13 09:52:03 +02:00
Simon Willnauer	eba69ffade	[TEST] First decrement in-flight requests before releasing the latch	2016-07-12 22:58:03 +02:00
Simon Willnauer	ec55f9fff7	[TEST] Make AbstractSimpleTransportTestCase#testTimeoutSendExceptionWithDelayedResponse more robust and wait for in-flight request	2016-07-12 20:41:37 +02:00
Simon Willnauer	4fb79707bd	Fix remaining tests that either need access to the netty module or require explicit configuration Some tests still start http implicitly or miss configuring the transport clients correctly. This commit fixes all remaining tests and adds a dependency to `transport-netty` from `qa/smoke-test-http` and `modules/reindex` since they need an http server running on the nodes. This also moves all required permissions for netty into its module and out of core.	2016-07-12 16:29:57 +02:00
Luca Cavanna	f6aec3fdb5	Merge pull request #19373 from javanna/enhancement/rest_client_builder_callback Rest Client: add callback to customize http client settings	2016-07-12 13:30:27 +02:00
javanna	512b8be791	RestClient: simplify ssl configuration and make http config callback functional friendly	2016-07-12 13:25:55 +02:00
Boaz Leskes	081d04afac	Make NotMasterException a first class citizen (#19385 ) That exception is currently serialized as its current base class IllegalStateException which confuses code supposed to deal with the stepping down of a master. This is an important exception and we should be able to serialize it correctly. This commit fixes it by moving the exception to inherit from ElasticsearchException and properly register it. As a bonus I adapted CapturingTransport to properly simulate serialized exceptions.	2016-07-12 12:44:40 +02:00
javanna	fa0b354e66	Rest Client: add callback to customize http client settings The callback replaces the ability to fully replace the http client instance. By doing that, one used to lose any default that the RestClient had set for the underlying http client. Given that you'd usually override one or two things only, like a couple of timeout values, the ssl factory or the default credentials providers, it is not user friendly if by doing that users end up replacing the whole http client instance and lose any default set by us.	2016-07-12 12:31:28 +02:00
Simon Willnauer	199a5a1f04	Fix TcpTransport#sendRequest to raise NotConnectedException if we get disconnected while sending This also fixes a race in AbstractSimpleTransportTestCase where we never wait long enough for all responses to finish causing expected failures.	2016-07-12 10:56:20 +02:00
Ryan Ernst	93aebbef0f	Merge branch 'master' into modularize_netty	2016-07-11 23:49:00 -07:00
Ryan Ernst	7195d1e0ff	Fix plugins service to not double bind plugin components	2016-07-11 17:05:56 -07:00
Nik Everett	8263873783	Switch search extension from push to pull Switches most search behavior extensions from push (`onModule(SearchModule)`) to pull (`implements SearchPlugin`). This effort in general gives plugin authors a much cleaner view of how to extend Elasticsearch and starts to set up portions of Elasticsearch as "the plugin API". This commit in particular does that for search-time behavior like customized suggesters, highlighters, score functions, and significance heuristics. It also switches most such customization to being done at search module construction time which is much, much easier to reason about from a testing perspective. It also helps significantly in the process of de-guice-ing Elasticsearch's startup. There are at least two major search time extensions that aren't covered in this commit that will simply have to wait for the next commit on the topic because this one has already grown large: custom aggregations and custom queries. These will likely live in the same SearchPlugin interface as well.	2016-07-11 18:49:05 -04:00
Ryan Ernst	99ac65931a	Plugins: Add components creator as bridge between guice and new plugin init world This change adds a createComponents() method to Plugin implementations which they can use to return already constructed componenents/services. Eventually this should be just services ("components" don't really do anything), but for now it allows any object so that preconstructed instances by plugins can still be bound to guice. Over time we should add basic services as arguments to this method, but for now I have left it empty so as to not presume what is a necessary service.	2016-07-11 14:14:06 -07:00
Simon Willnauer	048e4416e7	Move netty transport and http into a module This moves all netty code and its dependency into a module.	2016-07-11 22:21:29 +02:00
Ali Beyad	0faf638710	Blocked allocations on primary causes RED health If the allocation decision for a primary shard was NO, this should cause the cluster health for the shard to go RED, even if the shard belongs to a newly created index or is part of cluster recovery. Relates #9126	2016-07-11 15:32:13 -04:00
Ali Beyad	417bd0cd63	Index creation does not cause the cluster health to go RED Previously, index creation would momentarily cause the cluster health to go RED, because the primaries were still being assigned and activated. This commit ensures that when an index is created or an index is being recovered during cluster recovery and it does not have any active allocation ids, then the cluster health status will not go RED, but instead be YELLOW. Relates #9126	2016-07-11 15:30:47 -04:00
Simon Willnauer	47bd2f9ca5	More cleanups around tests that require HTTP to be enabled. (#19363 ) This commit moves most of the http related integ tests out into their own `qa/smoke-test-http` project where most of the tests can run against the external cluster.	2016-07-11 20:44:57 +02:00
Nik Everett	4b171b84cb	Fix modifier order checkstyle	2016-07-11 12:59:45 -04:00
Christoph Büscher	0d428b6ba8	Add test for GeoHashUtils#bbox()	2016-07-11 10:46:31 -05:00
Simon Willnauer	ee193f7697	[TEST] Catch RejectedOperationException when disconnecting from node in MockTcpTransport	2016-07-11 16:36:26 +02:00
Simon Willnauer	07260d4351	[TEST] Use AbstractRunnable when forking off threads on an executor	2016-07-11 16:27:07 +02:00
Simon Willnauer	3f3c93ec65	Add blocking socket based MockTcpTransport (#19332 ) Today we have a bunch of tests that use netty transport for several reasons; these tests use it because they need to run some tcp based transport. Yet, this couples our tests tightly to the netty implementation which should be tested on its own. This change adds a plain socket based blocking TcpTransport implementation that is used by default in tests if local transport is suppressed or if network is selected. It also adds another tcp network implementation as a showcase of how the interface works.	2016-07-11 12:17:52 +02:00
javanna	942e342662	Rest Client: use short performRequest methods when possible	2016-07-11 10:36:26 +02:00
Jason Tedor	e86aa29f67	Die with dignity Today when a thread encounters a fatal unrecoverable error that threatens the stability of the JVM, Elasticsearch marches on. This includes out of memory errors, stack overflow errors and other errors that leave the JVM in a questionable state. Instead, the Elasticsearch JVM should die when these errors are encountered. This commit causes this to be the case. Relates #19272	2016-07-07 14:44:03 -04:00
Tanguy Leroux	b58f2eb5c2	Move back some messy tests from Groovy plugin to core This commit moves back some messy tests that have been placed in lang-groovy module in https://github.com/elastic/elasticsearch/pull/13834. It removes the dependency on Groovy plugin as well as changes back the tests to integration tests (IT suffix). It also changes the current MockScriptEngine and MockScriptPlugin to make it easier to use.	2016-07-07 15:26:36 +02:00
Alexander Reelsen	71b48fb16c	Dependencies: Update to jopt-5.0 (#19278 ) The new version of jopt allows us to remove a couple of TODOs in the code. Closes #12368	2016-07-07 08:50:10 +02:00
Ryan Ernst	e7818f75e1	Fix checkstyle for TestProcessor	2016-07-05 22:33:08 -07:00
Ryan Ernst	2fc41adeb5	Merge branch 'master' into ingest_plugin_api	2016-07-05 20:53:03 -07:00
Jason Tedor	d0765d0761	Merge branch 'master' into feature/seq_no * master: (192 commits) [TEST] Fix rare OBOE in AbstractBytesReferenceTestCase Reindex from remote Rename writeThrowable to writeException Start transport client round-robin randomly Reword Refresh API reference (#19270) Update fielddata.asciidoc Fix stored_fields message Add missing footer notes in mapper size docs Remote BucketStreams Add doc values support to the _size field in the mapper-size plugin Bump version to 5.0.0-alpha5. Update refresh.asciidoc Update shrink-index.asciidoc Change Debian repository for Vagrant debian-8 box [TEST] fix test to account for internal empyt reference optimization Upgrade to netty 3.10.6.Final (#19235) [TEST] fix histogram test when extended bounds overlaps data Remove redundant modifier Simplify TcpTransport interface by reducing send code to a single send method (#19223) Fix style violation in InstallPluginCommand.java ...	2016-07-05 22:01:07 -04:00
Nik Everett	b3c015e2bb	Reindex from remote This adds a remote option to reindex that looks like ``` curl -POST 'localhost:9200/_reindex?pretty' -d'{ "source": { "remote": { "host": "http://otherhost:9200" }, "index": "target", "query": { "match": { "foo": "bar" } } }, "dest": { "index": "target" } }' ``` This reindex has all of the features of local reindex: * Using queries to filter what is copied * Retry on rejection * Throttle/rethrottle The big advantage of this version is that it goes over the HTTP API which can be made backwards compatible. Some things are different: The query field is sent directly to the other node rather than parsed on the coordinating node. This should allow it to support constructs that are invalid on the coordinating node but are valid on the target node. Mostly, that means old syntax.	2016-07-05 16:13:17 -04:00
Jason Tedor	96f283c195	Rename writeThrowable to writeException This commit renames writeThrowable to writeException. The situation here stems from the fact that the StreamOutput method for serializing Exceptions needs to accept Throwables too as Throwables can be the cause of serialized Exceptions. Yet, we do not serialize Throwables in the Error sub-hierarchy in a way that they can be deserialized into their initial type. This leads to an asymmetry in the StreamOutput method for serializing Exceptions and the StreamInput method for reading Exceptions. Namely, the former will accept Throwables but the latter will only return Exceptions. A goal with the stream methods has always been symmetry in the method names so that serialization/deserialization routines appear symmetrical in code. It is this asymmetry on the input/output types for Exceptions on StreamOutput/StreamInput that clashes with the desired symmetry of naming. Despite this, we should favor symmetry in the naming of the methods. This commit renames StreamOutput#writeThrowable to StreamOutput#writeException which leaves us with Exception StreamInput#readException and void StreamOutput#writeException(Throwable).	2016-07-05 14:37:01 -04:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140 ) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitations, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the node's data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). It does however also have downsides - we now run the risk of two nodes having the same id, if someone copies or clones a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` parameter from `MetaDataStateFormat#write` , it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Tanguy Leroux	0e7faf1005	Enable Checkstyle RedundantModifier	2016-07-04 15:22:12 +02:00
Jason Tedor	3343ceeae4	Do not catch throwable Today throughout the codebase, catch throwable is used with reckless abandon. This is dangerous because the throwable could be a fatal virtual machine error resulting from an internal error in the JVM, or an out of memory error or a stack overflow error that leaves the virtual machine in an unstable and unpredictable state. This commit removes catch throwable from the codebase and removes the temptation to use it by modifying listener APIs to receive instances of Exception instead of the top-level Throwable. Relates #19231	2016-07-04 08:41:06 -04:00
Ryan Ernst	5a66c08ae9	Merge branch 'master' into ingest_plugin_api	2016-07-01 16:27:52 -07:00
Ryan Ernst	822c995367	Internal: Remove generics from LifecycleComponent The only reason for LifecycleComponent taking a generic type was so that it could return that type on its start and stop methods. However, this chaining has no practical necessity. Instead, start and stop can be void, and a whole bunch of confusing generics disappear.	2016-07-01 16:17:42 -07:00

2003 Commits