OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	872984d21a	Continue consolidating `XContentParser` construction in tests (#22145 ) Consolidate more parser creation in tests Moves more parser creation in tests to the `createParser` methods in `ESTestCase`.	2016-12-13 17:22:39 -05:00
Tal Levy	f56097b57a	Fixes GrokProcessor's ignorance of named-captures with same name. (#22131 ) Grok was originally ignoring potential matches to named-capture groups larger than one. For example, If you had two patterns containing the same named field, but only the second pattern matched, it would fail to pick this up. This PR fixes this by exploring all potential places where a named-capture was used and chooses the first one that matched. Fixes #22117.	2016-12-13 13:19:55 -08:00
Simon Willnauer	7a9b667e98	Introduce a low level protocol handshake (#22094 ) Today we rely on the version that the API user passes in together with the DiscoveryNode. This commit introduces a low level handshake where nodes exchange their version to be used with the transport protocol that is executed every time a connection to a node is established. This, on the one hand allows to change the wire protocol based on the version we are talking to even without a full cluster restart. Today we would need to carry on a BWC layer across major versions but with a handshake we can rely on the fact that the latest version of the previous minor executes a handshake and uses the latest protocol version across all communication with the N+1 version nodes. This change is yet fully backwards compatible, a followup PR will remove the BWC in 6.0 once this has been back-ported to the 5.x branch	2016-12-13 21:06:23 +01:00
Adrien Grand	049fd3991c	Remove `AggregationContext`. (#22124 ) This class is just a wrapper around `SearchContext`, so let's use `SearchContext` directly. The change is mechanical, except the `ValuesSourceConfig` class, where I moved the logic to get a `ValuesSource` given a config.	2016-12-13 09:09:40 +01:00
Luca Cavanna	6d987a9b69	Remove support for empty queries (#22092 ) Our query DSL supports empty queries (`{}`), which have a different meaning depending on the query that holds it, either ignored, match_all or match_none. We deprecated the support for empty queries in 5.0, where we log a deprecation warning wherever they are used. The way we supported it once we moved query parsing to the coordinating node was having an Optional<QueryBuilder> return type in all of our parse methods (called fromXContent). See #17624. The central place for this was QueryParseContext#parseInnerQueryBuilder. We can now remove all the optional return types and simply throw an exception whenever an empty query is found.	2016-12-12 12:37:12 +01:00
Simon Willnauer	01d67e09b9	Detach handshake from connect to node (#22037 ) Today we connect and publish the nodes connection before we execute a handshake with the node we connect to. In the case of connecting to a node that won't pass the handshake this connection is already `published` and other code paths can use it. This commit detaches the connection and the publish of the connection such that `TransportService` can do a handshake before actually connect and publish the connection.	2016-12-10 10:03:26 +01:00
Nik Everett	3adefb7b4a	Begin centralizing XContentParser creation into RestRequest (#22041 ) To get #22003 in cleanly we need to centralize as much `XContentParser` creation as possible into `RestRequest`. That'll mean we have to plumb the `NamedXContentRegistry` into fewer places. This removes `RestAction.hasBody`, `RestAction.guessBodyContentType`, and `RestActions.getRestContent`, moving callers over to `RestRequest.hasContentOrSourceParam`, `RestRequest.contentOrSourceParam`, and `RestRequest.contentOrSourceParamParser` and `RestRequest.withContentOrSourceParamParserOrNull`. The idea is to use `withContentOrSourceParamParserOrNull` if you need to handle requests without any sort of body content and to use `contentOrSourceParamParser` otherwise. I believe the vast majority of this PR to be purely mechanical but I know I've made the following behavioral change (I'll add more if I think of more): * If you make a request to an endpoint that requires a request body and has cut over to the new APIs instead of getting `Failed to derive xcontent` you'll get `Body required`. * Template parsing is now non-strict by default. This is important because we need to be able to deprecate things without requests failing.	2016-12-09 20:23:02 -05:00
Nik Everett	fc2060ba7e	Don't close rest client from its callback (#22061 ) If you try to close the rest client inside one of its callbacks then it blocks itself. The thread pool switches the status to one that requests a shutdown and then waits for the pool to shutdown. When another thread attempts to honor the shutdown request it waits for all the threads in the pool to finish what they are working on. Thus thread a is waiting on thread b while thread b is waiting on thread a. It isn't quite that simple, but it is close. Relates to #22027	2016-12-09 10:39:51 -05:00
Adrien Grand	36f598138a	Start using `ObjectParser` for aggs. (#22048 ) This is an attempt to start moving aggs parsing to `ObjectParser`. There is still A LOT to do, but ObjectParser is way better than the way aggregations parsing works today. For instance in most cases, we reject numbers that are provided as strings, which we are supposed to accept since some client languages (looking at you Perl) cannot make sure to use the appropriate types. Relates to #22009	2016-12-09 09:45:16 +01:00
Ryan Ernst	b1cef5fdf8	Remove 2.0 prerelease version constants (#22004 ) * Remove 2.0 prerelease version constants This is a start to addressing #21887. This removes: * pre 2.0 snapshot format support * automatic units addition to cluster settings * bwc check for delete by query in pre 2.0 indexes	2016-12-08 21:48:35 -08:00
Lee Hinman	ef64d230e7	Merge remote-tracking branch 'dakrone/index-seq-id-and-primary-term'	2016-12-08 19:47:21 -07:00
Lee Hinman	ee22a477df	Add internal _primary_term doc values field, fix _seq_no indexing This adds the `_primary_term` field internally to the mappings. This field is populated with the current shard's primary term. It is intended to be used for collision resolution when two document copies have the same sequence id, therefore, doc_values for the field are stored but the filed itself is not indexed. This also fixes the `_seq_no` field so that doc_values are retrievable (they were previously stored but irretrievable) and changes the `stats` implementation to more efficiently use the points API to retrieve the min/max instead of iterating on each doc_value value. Additionally, even though we intend to be able to search on the field, it was previously not searchable. This commit makes it searchable. There is no user-visible `_primary_term` field. Instead, the fields are updated by calling: ```java index.parsedDoc().updateSeqID(seqNum, primaryTerm); ``` This includes example methods in `Versions` and `Engine` for retrieving the sequence id values from the index (see `Engine.getSequenceID`) that are only used in unit tests. These will be extended/replaced by actual implementations once we make use of sequence numbers as a conflict resolution measure. Relates to #10708 Supercedes #21480 P.S. As a side effect of this commit, `SlowCompositeReaderWrapper` cannot be used for documents that contain `_seq_no` because it is a Point value and SCRW cannot wrap documents with points, so the tests have been updated to loop through the `LeafReaderContext`s now instead.	2016-12-08 19:47:03 -07:00
Christoph Büscher	7454a9647b	Add fromXContent to HighlightField This adds a fromXContent method and unit test to the HighlightField class so we can parse it as part of a serch response. This is part of the preparation for parsing search responses on the client side.	2016-12-07 16:32:44 +01:00
Nik Everett	ef83dbfbe6	Reindex: Better error message for pipeline in wrong place (#21985 ) `_update_by_query` supports specifying the `pipeline` to process the documents as a url parameter but `_reindex` doesn't. It doesn't because everything about the `_reindex` request that has to do with writing the documents is grouped under the `dest` object in the request body. This changes the response parameter from `request [_reindex] contains unrecognized parameter: [pipeline]` to `_reindex doesn't support [pipeline] as a query parmaeter. Specify it in the [dest] object instead.`	2016-12-06 14:55:46 -05:00
Ryan Ernst	c8f241f284	Plugins: Remove response action filters (#21950 ) Action filters currently have the ability to filter both the request and response. But the response side was not actually used. This change removes support for filtering responses with action filters.	2016-12-05 16:14:04 -08:00
Nik Everett	2087234d74	Timeout improvements for rest client and reindex (#21741 ) Changes the default socket and connection timeouts for the rest client from 10 seconds to the more generous 30 seconds. Defaults reindex-from-remote to those timeouts and make the timeouts configurable like so: ``` POST _reindex { "source": { "remote": { "host": "http://otherhost:9200", "socket_timeout": "1m", "connect_timeout": "10s" }, "index": "source", "query": { "match": { "test": "data" } } }, "dest": { "index": "dest" } } ``` Closes #21707	2016-12-05 10:54:51 -05:00
Igor Motov	c391b3fff6	Add proper descriptions to reindex, update-by-query and delete-by-query tasks. Related to #21768	2016-12-02 21:46:38 -05:00
Jack Conradson	0ecdef026d	Test fix for def equals test in Painless. (#21945 ) Closes #21801	2016-12-02 14:41:13 -08:00
Nik Everett	0c724b1878	Keep context during reindex's retries (#21941 ) * Keep context during reindex's retries This fixes reindex and friend's retries to keep the context. * Docs	2016-12-02 13:48:51 -05:00
Simon Willnauer	842e00c689	[TEST] Add back skip of external clusters	2016-12-02 11:53:33 +01:00
Simon Willnauer	572b4c3e72	Port assert from 5.x to master I added an assertion to Netty4/Netty3Transport in 5.x that is not in master yet. This commit port the assert to ensure we consumed all connection in `connectToChannels`	2016-12-02 10:34:33 +01:00
Simon Willnauer	adf9bd90a4	Remove legacy BWC test infrastructure and tests (#21915 ) We don't use the test infra nor do we run the tests. They might all be entirely out of date. We also have a different BWC test infra in-place. This change removes all of the legacy infra.	2016-12-02 08:06:20 +01:00
Simon Willnauer	155de53fe3	Add a connect timeout to the ConnectionProfile to allow per node connect timeouts (#21847 ) Timeouts are global today across all connections this commit allows to specify a connection timeout per node such that depending on the context connections can be established with different timeouts. Relates to #19719	2016-12-01 15:39:49 +01:00
Boaz Leskes	fe01c0f83b	fix TemplateQueryBuilderTests & Murmur3FieldMapperTests	2016-12-01 14:21:57 +01:00
Simon Willnauer	dd5256c324	Reduce number of connections per node depending on the nodes role (#21849 ) We currently treat every node equally when we establish connections to a node. Yet, if we are not master eligible or can't hold any data there is no point in creating a dedicated connection for sending the cluster state or running remote recoveries respectively. The usage of STATE and RECOVERY connections on non-master and/or non-data nodes will result in an IllegalStateException.	2016-12-01 08:00:48 +01:00
Jason Tedor	6c45695d52	Add version 5.1.1 This commit removes the version constant for 5.1.0 (due to an inadvertent release) and adds the version constant for 5.1.1. Relates #21890	2016-11-30 11:14:17 -05:00
Luca Cavanna	5b8bdba12e	Remove subrequests method from CompositeIndicesRequest (#21873 )	2016-11-30 15:03:58 +01:00
Adrien Grand	6231009a8f	Remove 2.x backward compatibility of mappings. (#21670 ) For the record, I also had to remove the geo-hash cell and geo-distance range queries to make the code compile. These queries already throw an exception in all cases with 5.x indices, so that does not hurt any more. I also had to rename all 2.x bwc indices from `index-${version}` to `unsupported-${version}` to make `OldIndexBackwardCompatibilityIT` happy.	2016-11-30 13:34:46 +01:00
Luca Cavanna	6eaff9432d	SearchTemplateRequest to implement CompositeIndicesRequest (#21865 ) SearchTemplateRequest to implement CompositeIndicesRequest Given that SearchTemplateRequest effectively delegates to search when a search is being executed, it should implement the CompositeIndicesRequest interface. The subrequests method should return a single search request. When a search is not going to be executed, because we are in simulate mode, there are no inner requests, and there are no corresponding indices to that request either. Closes #21747	2016-11-29 20:52:43 +01:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Simon Willnauer	f5ff69fabe	Remove connectToNodeLight and replace it with a connection profile (#21799 ) The Transport#connectToNodeLight concepts is confusing and not very flexible. neither really testable on a unittest level. This commit cleans up the code used to connect to nodes and simplifies transport implementations to share more code. This also allows to connect to nodes with custom profiles if needed, for instance future improvements can be added to connect to/from nodes that are non-data nodes without dedicated bulks and recovery connections.	2016-11-29 09:35:07 +01:00
Jason Tedor	a6082eb563	Grant Netty permission to read system somaxconn When Netty listens on a socket, it specifies the established connection backlog for the socket. On Linux, Netty tries to read the system-wide configuration for this from /proc/sys/net/core/somaxconn and falls back to a default value when it can not read this value. This commit grants Netty permission to read this file so that it can honor the system-wide configuration for the connection backlog for sockets that it is listening on. This also removes an obnoxious stack trace that appears when Netty logging is set to debug logging. Relates #21840	2016-11-28 18:47:32 -05:00
Luca Cavanna	360b74eda8	[TEST] Don't reinitialize YamlTestClient and RestClient before each single test (#21807 ) In the past we ran yaml tests against an internal cluster, which would get restarted after each test failure, hence the client objects needed to eventually be refreshed before each test. That is why we had the initClient method to re-initialize the YamlTestClient in the execution context. We ended up though re-initializing the client unconditionally, which is not needed. Also, ESRestTestCase recreates the RestClient against the external cluster before each test, which is not needed given that nothing changes in the external cluster. This commit removes the initClient method from the yaml tests execution context. The YamlTestClient can be eagerly created before the first yaml test runs and then re-used in subsequent tests. Also api calls to check for nodes versions etc. are moved out of YamlTestClient to ESClientYamlSuiteTestCase. Also the RestClient is now initialized in ESRestTestCase before the first test runs, and kept around afterwards as a static member. Basically each subclass of EsRestTestCase will have its own RestClient instance, but the client will be shared across the different tests within the same class. The yaml test suite is just a special suite, composed of 600+ tests that are loaded from files, which will share the same client instance. This change should speed tests up as well, as we don't recreate the RestClient before each single test, and we don't call _cat/nodes either before each single test.	2016-11-28 18:43:27 +01:00
Jason Tedor	6f95261632	Remove unused imports from Netty4Utils This commit removes two unused imports from Netty4Utils that were leftover from a previous change.	2016-11-27 13:18:50 -05:00
Jason Tedor	5e73282bbc	Simplify handling of fatal network layer errors This commit simplifies the handling of fatal errors on the network layer. The simplification here is to remove the use of a StringWriter/PrintWriter pair to format the stack trace, removing the need for the method to declare that it throws a checked IOException.	2016-11-27 13:14:24 -05:00
Tanguy Leroux	28dc02f01a	[Test] Mute EqualsTests..testBranch(Not)EqualsDefAndPrimitive It fails regurlarly and it is tracked by https://github.com/elastic/elasticsearch/issues/21801	2016-11-25 17:21:59 +01:00
Ryan Ernst	c3ec8e22b8	Wrap VerifyError in ScriptException (#21769 ) If a bug occurs in painless compilation (not from a user, but from the painless infrastructure), a VerifyError may be thrown when compiling the broken generated class. This commit wraps VerifyErrors in ScriptException so that useful information is returned to the user, which can be passed on to the ES team for analysis.	2016-11-23 14:45:21 -08:00
Jack Conradson	ba2d772668	Fix a VerifyError bug in Painless (#21765 ) This bug would cause a VerifyError when scripts using the === operator were comparing a def type against a primitive type since the primitive type wasn't being appropriately boxed.	2016-11-23 13:57:14 -08:00
Jason Tedor	8416b16dfd	Improve handling of unreleased versions Today when handling unreleased versions for backwards compatilibity support, we scatted version constants across the code base and add some asserts to support removing these constants when the version in question is actually released. This commit improves this situation, enabling us to just add a single unreleased version constant that can be renamed when the version is actually released. This should make maintenance of these versions simpler. Relates #21760	2016-11-23 15:49:05 -05:00
Nik Everett	434fa4bd26	Docs and tests for painless lack of boxing for ?: and ?. (#21756 ) NOTE: The result of `?.` and `?:` can't be assigned to primitives. So `int[] someArray = null; int l = someArray?.length` and `int s = params.size ?: 100` don't work. Do `def someArray = null; def l = someArray?.length` and `def s = params.size ?: 100` instead. Relates to #21748	2016-11-23 14:33:32 -05:00
Ryan Ernst	6940b2b8c7	Remove groovy scripting language (#21607 ) * Scripting: Remove groovy scripting language Groovy was deprecated in 5.0. This change removes it, along with the legacy default language infrastructure in scripting.	2016-11-22 19:24:12 -08:00
Nik Everett	dbdcf9e95c	Move painless yaml tests into painless dir They were in a directory named "plan_a", the old name for painless.	2016-11-22 20:27:14 -05:00
Nik Everett	457c2d8fb0	Add Debug.explain to painless You can use `Debug.explain(someObject)` in painless to throw an `Error` that can't be caught by painless code and contains an object's class. This is useful because painless's sandbox doesn't allow you to call `someObject.getClass()`. Closes #20263	2016-11-22 12:46:02 -05:00
Jason Tedor	446037ccb8	Die with dignity on the network layer When a fatal error is thrown on the network layer, such an error never makes its way to the uncaught exception handler. This prevents the node from being torn down if an out of memory error or other fatal error is thrown while handling HTTP or transport traffic. This commit adds logic to ensure that such errors bubble their way up to the uncaught exception handler, even though Netty tries really hard to swallow everything. Relates #21720	2016-11-21 22:14:30 -05:00
Nik Everett	f5c8c746e6	Implement toString in painless's AST This should make debugging painless' analysis and code generation a little easier. The `toString` implementations mirror the AST somewhat, and look like `(SSource (SReturn (ENumeric 1)))`.	2016-11-21 16:24:10 -05:00
Simon Willnauer	cb5c25ab4f	Add a StreamInput#readArraySize method that ensures sane array sizes (#21697 ) Today we read a vint from the stream to allocate the size of an array up-front before we start reading the values. This can be dangerous if for instance we read from a corrupted stream or if some manipulated bytes are send for instance from an attacker or a fuzzer. In most of the cases we can apply some best effort and validate the array size to be _sane_ by ensuring we can at read at least N bytes where N is the expected size of the array.	2016-11-21 21:39:21 +01:00
Jason Tedor	655c4fe172	Wrap GroovyBugErrors in ScriptExceptions When Groovy detects a bug in its runtime because an internal assertion was violated, it throws an GroovyBugError. This descends from AssertionError and if it goes uncaught will land in the uncaught exception handler and will not deliver any useful information to the user. This commit wraps GroovyBugErrors in ScriptExceptions so that useful information is returned to the user.	2016-11-19 07:11:13 -05:00
Nik Everett	ae468441dc	Implement the ?: operator in painless (#21506 ) Implements a null coalescing operator in painless that looks like `?:`. This form was chosen to emulate Groovy's `?:` operator. It is different in that it only coalesces null values, instead of Groovy's `?:` operator which coalesces all falsy values. I believe that makes it the same as Kotlin's `?:` operator. In other languages this operator looks like `??` (C#) and `COALESCE` (SQL) and `:-` (bash). This operator is lazy, meaning the right hand side is only evaluated at all if the left hand side is null.	2016-11-18 13:54:26 -05:00
Jack Conradson	ced433e9a8	Fix reserved variable availability in lambdas in Painless	2016-11-17 13:39:08 -08:00
Jason Tedor	b08a2e1f31	Expose executor service interface from thread pool This commit exposes the executor service interface from thread pool. This will enable some high-level concurrency primitives that will make some code cleaner and simpler. Relates #21608	2016-11-17 09:18:49 -05:00
Simon Willnauer	de04aad994	Remove `modules/transport_netty_3` in favor of `netty_4` (#21590 ) We kept `netty_3` as a fallback in the 5.x series but now that master is 6.0 we don't need this or in other words all issues coming up with netty 4 will be blockers for 6.0.	2016-11-17 12:44:42 +01:00
Jason Tedor	d06a8903fd	Merge branch 'master' into feature/seq_no * master: (22 commits) Add proper toString() method to UpdateTask (#21582) Fix `InternalEngine#isThrottled` to not always return `false`. (#21592) add `ignore_missing` option to SplitProcessor (#20982) fix trace_match behavior for when there is only one grok pattern (#21413) Remove dead code from GetResponse.java Fixes date range query using epoch with timezone (#21542) Do not cache term queries. (#21566) Updated dynamic mapper section Docs: Clarify date_histogram bucket sizes for DST time zones Handle release of 5.0.1 Fix skip reason for stats API parameters test Reduce skip version for stats API parameter tests Strict level parsing for indices stats Remove cluster update task when task times out (#21578) [DOCS] Mention "all-fields" mode doesn't search across nested documents InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted Fixed bad asciidoc in boolean mapping docs Fixed bad asciidoc ID in node stats Be strict when parsing values searching for booleans (#21555) Fix time zone rounding edge case for DST overlaps ...	2016-11-16 09:10:35 -05:00
Tal Levy	6796464f16	add `ignore_missing` option to SplitProcessor (#20982 ) Closes #20840.	2016-11-16 15:46:09 +02:00
Tal Levy	04b712bdc5	fix trace_match behavior for when there is only one grok pattern (#21413 ) There is an issue in the Grok Processor, where trace_match: true does not inject the _ingest._grok_match_index into the ingest-document when there is just one pattern provided. This is due to an optimization in the regex construction. This commit adds a check for when this is the case, and injects a static index value of "0", since there is only one pattern matched (at the first index into the patterns). To make this clearer, more documentation was added to the grok-processor docs. Fixes #21371.	2016-11-16 15:41:54 +02:00
Boaz Leskes	2c0338fa87	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 17:09:08 +00:00
Adrien Grand	df4482fdc8	Do not cache the QueryShardContext in PercolatorFieldMapper: it is cheap to create.	2016-11-15 15:45:18 +01:00
Adrien Grand	54809065a6	Make PercolatorFieldMapper get a QueryShardContext lazily.	2016-11-15 12:02:40 +01:00
Boaz Leskes	c9f49039d3	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 10:14:47 +00:00
Ryan Ernst	d14c470b89	Remove generics from ActionRequest closes #21368	2016-11-14 15:32:01 -08:00
Adrien Grand	1fd5c47e7f	Upgrade to lucene-6.3.0. (#21464 )	2016-11-14 09:36:45 +01:00
Jason Tedor	c7a1b3eb50	Merge branch 'master' into feature/seq_no * master: Hack around cluster service and logging race Do not prematurely shutdown Log4j Support decimal constants with trailing [dD] in painless (#21412) In painless suggest a long constant if int won't do (#21415) Account for different paths for sysctl utilities [TEST] testRebalancePossible() may not have an assigned node id Tests: Disable merge in SearchCancellationTests Tests: clean search scroll at the end of SearchCancellationIT	2016-11-13 20:01:44 -05:00
Nik Everett	2a328034ef	Support decimal constants with trailing [dD] in painless (#21412 ) This adds support to painless for decimal constants with trailing `d` or `D` to make it compatible with Java. It already supported integer constants with a trailing `d` or `D` but this adds tests for it. Closes #21116	2016-11-12 11:08:39 -05:00
Nik Everett	a26b5a113c	In painless suggest a long constant if int won't do (#21415 ) In painless we prefer explicit types over implicit ones whereas groovy is the other way around. Take this groovy code: ``` > 86400000.class java.lang.Integer > 864000000000.class java.lang.Long ``` Painless accepts `86400000` just fine because that is a valid `int` in the jvm. It rejects `864000000000` as an invlid `int` constant because, in painless as in java, `long` constants always end in `L` or `l`. To ease the transition from groovy to painless, this changes the compilation error returned from these invalid constants from: ``` Invalid int constant [864000000000]. ``` to ``` Invalid int constant [864000000000]. If you want a long constant then change it to [864000000000L]. ``` Inspired by #21313	2016-11-12 11:08:18 -05:00
Jason Tedor	d3417fb022	Merge branch 'master' into feature/seq_no * master: (516 commits) Avoid angering Log4j in TransportNodesActionTests Add trace logging when aquiring and releasing operation locks for replication requests Fix handler name on message not fully read Remove accidental import. Improve log message in TransportNodesAction Clean up of Script. Update Joda Time to version 2.9.5 (#21468) Remove unused ClusterService dependency from SearchPhaseController (#21421) Remove max_local_storage_nodes from elasticsearch.yml (#21467) Wait for all reindex subtasks before rethrottling Correcting a typo-Maan to Man-in README.textile (#21466) Fix InternalSearchHit#hasSource to return the proper boolean value (#21441) Replace all index date-math examples with the URI encoded form Fix typos (#21456) Adapt ES_JVM_OPTIONS packaging test to ubuntu-1204 Add null check in InternalSearchHit#sourceRef to prevent NPE (#21431) Add VirtualBox version check (#21370) Export ES_JVM_OPTIONS for SysV init Skip reindex rethrottle tests with workers Make forbidden APIs be quieter about classpath warnings (#21443) ...	2016-11-10 23:40:33 -05:00
Jack Conradson	aeb97ff412	Clean up of Script. Closes #21321	2016-11-10 09:59:13 -08:00
Nik Everett	4db21db0aa	Wait for all reindex subtasks before rethrottling In the test for reindex and friend's rethrottling feature we were waiting only for a single reindex sub task to start before rethrottling. This mostly worked because starting tasks is fast. But it didn't *always work and CI found that for us. This fixes the test to wait for all subtasks to start before rethrottling. I reproduced this locally semi-consistently with some fairly creative `Thread.sleep` calls and this test fix fixes the issue even with the sleeps so I'm fairly sure this will work consistently. Closes #21446	2016-11-10 10:49:25 -05:00
Luca Cavanna	bd23921a3a	Fix InternalSearchHit#hasSource to return the proper boolean value (#21441 ) The method used to be called `isSourceEmpty`, and was renamed to `hasSource`, but the return value never changed. Updated tests and users accordingly. Closes #21419	2016-11-10 13:13:38 +01:00
Nik Everett	b0f5ea3f59	Skip reindex rethrottle tests with workers They are flakey and spuriously fail the build. I'll hunt down the cause soon and reenabled but for now they should stop. Relates #21446	2016-11-09 17:50:09 -05:00
Nik Everett	d03b8e4abb	Implement reading from null safe dereferences Null safe dereferences make handling null or missing values shorter. Compare without: ``` if (ctx._source.missing != null && ctx._source.missing.foo != null) { ctx._source.foo_length = ctx.source.missing.foo.length() } ``` To with: ``` Integer length = ctx._source.missing?.foo?.length(); if (length != null) { ctx._source.foo_length = length } ``` Combining this with the as of yet unimplemented elvis operator allows for very concise defaults for nulls: ``` ctx._source.foo_length = ctx._source.missing?.foo?.length() ?: 0; ``` Since you have to start somewhere, we started with null safe dereferenes. Anyway, this is a feature borrowed from groovy. Groovy allows writing to null values like: ``` def v = null v?.field = 'cat' ``` And the writes are simply ignored. Painless doesn't support this at this point because it'd be complex to implement and maybe not all that useful. There is no runtime cost for this feature if it is not used. When it is used we implement it fairly efficiently, adding a jump rather than a temporary variable. This should also work fairly well with doc values.	2016-11-09 07:20:11 -05:00
Nik Everett	a3bd6d1ad9	Switch reindex with slices error to IAE If you try to reindex with multiple slices against a node that doesn't support it we throw an `IllegalArgumentException` so `assertVersionSerializable` is ok with it and so if this happens in REST it comes back as a 400 error.	2016-11-08 11:42:07 -05:00
Luca Cavanna	293a3cab01	Rest client: don't reuse that same HttpAsyncResponseConsumer across multiple retries (#21378 ) * Rest client: don't reuse that same HttpAsyncResponseConsumer across multiple retries Turns out that AbstractAsyncResponseConsumer from apache async http client is stateful and cannot be reused across multiple requests. The failover mechanism was mistakenly reusing that same instance, which can be provided by users, across retries in case nodes are down or return 5xx errors. The downside is that we have to change the signature of two public methods, as HttpAsyncResponseConsumer cannot be provided directly anymore, rather its factory needs to be provided which is going to be used to create one instance of the consumer per request attempt. Up until now we tested our RestClient against multiple nodes only in a mock environment, where we don't really send http requests. In that scenario we can verify that retries etc. work properly but the interaction with the http client library in a real scenario is different and can catch other problems. With this commit we also add an integration test that sends requests to multiple hosts, and some of them may also get stopped meanwhile. The specific test for pathPrefix was also removed as pathPrefix is now randomly applied by default, hence implicitly tested. Moved also a small test method that checked the validity of the path argument to the unit test RestClientSingleHostTests. Also increase default buffer limit to 100MB and make it required in default consumer The default buffer limit used to be 10MB but that proved not to be high enough for scroll requests (see reindex from remote). With this commit we increase the limit to 100MB and make it a bit more visibile in the consumer factory.	2016-11-08 16:42:42 +01:00
Ryan Ernst	7a2c984bcc	Test: Remove multi process support from rest test runner (#21391 ) At one point in the past when moving out the rest tests from core to their own subproject, we had multiple test classes which evenly split up the tests to run. However, we simplified this and went back to a single test runner to have better reproduceability in tests. This change removes the remnants of that multiplexing support.	2016-11-07 15:07:34 -08:00
Jason Tedor	23a271f092	Address race condition in HTTP pipeline tests This commit adapts a previous fix to the HTTP pipeline tests for Netty 4 to Netty 3. Relates #19845	2016-11-07 13:20:22 -05:00
Nik Everett	a13a050271	Add automatic parallelization support to reindex and friends (#20767 ) Adds support for `?slices=N` to reindex which automatically parallelizes the process using parallel scrolls on `_uid`. Performance testing sees a 3x performance improvement for simple docs on decent hardware, maybe 30% performance improvement for more complex docs. Still compelling, especially because clusters should be able to get closer to the 3x than the 30% number. Closes #20624	2016-11-04 20:59:15 -04:00
Adrien Grand	2a70f6e7b1	Upgrade to lucene-6.3.0-snapshot-a66a445. (#21309 ) This addresses a bug that was introduced with https://issues.apache.org/jira/browse/LUCENE-7501.	2016-11-04 10:34:04 +01:00
Nik Everett	24d5f31a54	Make painless's assertion about out of bound less brittle Instead of asserting that the message is shaped a certain way we cause the exception and catch it and assert that the messages are the same. This is the way to go because the exception message from the jvm is both local and jvm dependent. This is the CI failure that found this: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+java9-periodic/515/consoleFull	2016-11-02 12:38:51 -04:00
Christoph Büscher	b3370de715	Tests: Add warning header checks to QueryBuilder tests and QueryParseContextTests This adds checks for expected warning headers to the query builder test infrastructure. Tests that are adding deprecation warnings to the response headers need to check those, otherwise the abstract base class for the test class will complain at teardown.	2016-11-02 15:45:33 +01:00
Adrien Grand	aa6cd93e0f	Require arguments for QueryShardContext creation. (#21196 ) The `IndexService#newQueryShardContext()` method creates a QueryShardContext on shard `0`, with a `null` reader and that uses `System.currentTimeMillis()` to resolve `now`. This may hide bugs, since the shard id is sometimes used for query parsing (it is used to salt random score generation in `function_score`), passing a `null` reader disables query rewriting and for some use-cases, it is simply not ok to rely on the current timestamp (eg. percolation). So this pull request removes this method and instead requires that all call sites provide these parameters explicitly.	2016-11-02 09:48:49 +01:00
Nik Everett	a612e5988e	Bump reindex-from-remote's buffer to 200mb It was 10mb and that was causing trouble when folks reindex-from-remoted with large documents. We also improve the error reporting so it tells folks to use a smaller batch size if they hit a buffer size exception. Finally, adds some docs to reindex-from-remote mentioning the buffer and giving an example of lowering the size. Closes #21185	2016-11-01 13:19:28 -04:00
Jason Tedor	38663351dc	Fix logger names for Netty Previously Elasticsearch would only use the package name for logging levels, truncating the package prefix and the class name. This meant that logger names for Netty were just prefixed by netty3 and netty. We changed this for Elasticsearch so that it's the fully-qualified class name now, but never corrected this for Netty. This commit fixes the logger names for the Netty modules so that their levels are controlled by the fully-qualified class name. Relates #21223	2016-10-31 17:23:21 -04:00
Jack Conradson	185dff7346	Cleanup ScriptType (#21179 ) Refactored ScriptType to clean up some of the variable and method names. Added more documentation. Deprecated the 'in' ParseField in favor of 'stored' to match the indexed scripts being replaced by stored scripts.	2016-10-31 13:48:51 -07:00
Nik Everett	1bbd3c5400	Fix painless's out of bounds assertions in java 9 Java 9's exception message when lists have an out of bounds index is much better than java 8 but the painless code asserted on the java 8 message. Now it'll accept either. I'm tempted to weaken the assertion but I like asserting that the message is readable.	2016-10-29 22:21:57 -04:00
Nik Everett	3a7a218e8f	Support negative array ofsets in painless Adds support for indexing into lists and arrays with negative indexes meaning "counting from the back". So for if `x = ["cat", "dog", "chicken"]` then `x[-1] == "chicken"`. This adds an extra branch to every array and list access but some performance testing makes it look like the branch predictor successfully predicts the branch every time so there isn't a in execution time for this feature when the index is positive. When the index is negative performance testing showed the runtime is the same as writing `x[x.length - 1]`, again, presumably thanks to the branch predictor. Those performance metrics were calculated for lists and arrays but `def`s get roughly the same treatment though instead of inlining the test they need to make a invoke dynamic so we don't screw up maps. Closes #20870	2016-10-29 16:12:40 -04:00
Adrien Grand	b3cc54cf0d	Upgrade to lucene-6.3.0-snapshot-ed102d6 (#21150 ) Lucene 6.3 is expected to be released in the next weeks so it'd be good to give it some integration testing. I had to upgrade randomized-testing too so that both Lucene and Elasticsearch are on the same version.	2016-10-28 14:47:15 +02:00
Jack Conradson	512a77a633	Refactor ScriptType to be a top-level class.	2016-10-26 10:21:22 -07:00
Jason Tedor	9c3e4d6e22	Add correct Content-Length on HEAD requests This commit fixes responses to HEAD requests so that the value of the Content-Length is correct per the HTTP spec. Namely, the value of this header should be equal to the Content-Length if the request were not a HEAD request. This commit also fixes a memory leak on HEAD requests to the main action that arose from the bytes on a builder not being released due to them being dropped on the floor to ensure that the response to the main action did not have a body. Relates #21123	2016-10-25 23:08:19 -04:00
Nik Everett	18393a06f3	Fix reindex-from-remote for parent/child from <2.0 Versions before 2.0 needed to be told to return interesting fields like `_parent`, `_routing`, `_ttl`, and `_timestamp`. And they come back inside a `fields` block which we need to parse. Closes #21044	2016-10-21 13:14:33 -04:00
Jason Tedor	f51bf8ee47	Upgrade to Netty 4.1.6 This commit upgrades the transport-netty4 module dependency from Netty version 4.1.5 to version 4.1.6. This is a bug fix release of Netty. Relates #21051	2016-10-20 20:13:29 -04:00
Jack Conradson	ceaae47d38	Remove more equivalents of the now method from the Painless whitelist.	2016-10-20 10:35:26 -07:00
Nik Everett	b5da42905f	Remove publishAddress from reindex whitelist Removes the `publishAddress` parameter from the reindex-from-remote whitelist checking because it isn't in use after #21004.	2016-10-20 12:51:10 -04:00
Fanfan	043a45746c	some misspelled words in code (#21012 ) as the title mentioned, misspelling as follows, "construct" to "constrcut", "cumulation" to "cumalation", "initialize" to "intialize".	2016-10-19 11:42:38 -04:00
Nik Everett	acf7c7430b	Add "simple match" support for reindex-from-remote whitelist This allows you to whitelist `localhost:` or `127.0.10.:9200`. It explicitly checks for patterns like `*` in the whitelist and refuses to start if the whitelist would match everything. Beyond that the user is on their own designing a secure whitelist.	2016-10-18 21:47:21 -04:00
Tal Levy	38c650f376	make painless the default scripting language for ScriptProcessor (#20981 ) - fixes a bug in the docs that mentions `lang` as optional - now `lang` defaults to "painless"	2016-10-18 16:22:01 -07:00
Ryan Ernst	dca614aa3b	Build: Change `gradle run` to use zip distribution (#21001 ) When running `gradle run`, a developer usually intends to get a running instance as if they had run elasticsearch from the command line. This is different than the isolated environment we use for integration testing plugins. This change switches the run task to use the zip distribution, so that all modules included in the normal distribution are included.	2016-10-18 11:48:58 -07:00
Ryan Ernst	3d3dd7185d	Add support for booleans in scripts (#20950 ) * Scripting: Add support for booleans in scripts Since 2.0, booleans have been represented as numeric fields (longs). However, in scripts, this is odd, since you expect doing a comparison against a boolean to work. While languages like groovy will auto convert between booleans and longs, painless does not. This changes the doc values accessor for boolean fields in scripts to return Boolean objects instead of Long objects. closes #20949 * Make Booleans final and remove wrapping of `this` for getValues()	2016-10-17 11:11:42 -07:00
Jason Tedor	c1bdaaf80f	Fix connection keep-alive header handling This commit fixes an issue with the handling of the value "keep-alive" on the Connection header in the Netty 4 HTTP implementation while handling an HTTP 1.0 request. The issue was using the wrong equals method to compare an AsciiString instance and a String instance (they could never be equal). This commit fixes this to use the correct equals method to compare for content equality.	2016-10-16 19:51:00 -04:00
Jason Tedor	cd5777593a	Fix connection close header handling This commit fixes an issue with the handling of the value "close" on the Connection header in the Netty 4 HTTP implementation. The issue was using the wrong equals method to compare an AsciiString instance and a String instance (they could never be equal). This commit fixes this to use the correct equals method to compare for content equality. Relates #20956	2016-10-16 13:18:09 -04:00
Jason Tedor	595ec8c948	Remove artificial default processors limit Today Elasticsearch limits the number of processors used in computing thread counts to 32. This was from a time when Elasticsearch created more threads than it does now and users would run into out of memory errors. It appears the real cause of these out of memory errors was not well understood (it's often due to ulimit settings) and so users were left hitting these out of memory errors on boxes with high core counts. Today Elasticsearch creates less threads (but still a lot) and we have a bootstrap check in place to ensure that the relevant ulimit is not too low. There are some caveats still to having too many concurrent indexing threads as it can lead to too many little segments, and it's not a magical go faster knob if indexing is already bottlenecked by disk, but this limitation is artificial and surprising to users and so it should be removed. This commit also increases the lower bound of the max processes ulimit, to prepare for a world where Elasticsearch instances might be running with more the previous cap of 32 processors. With the current settings, Elasticsearch wants to create roughly 576 + 25 * p / 2 threads, where p is the number of processors. Add in roughly 7 * p / 8 threads for the GC threads and a fudge factor, and 4096 should cover us pretty well up to 256 cores. Relates #20874	2016-10-14 05:47:26 -04:00
Tanguy Leroux	e71c30c71d	Mustache: Add {{#url}}{{/url}} function to URL encode strings (#20838 ) This commit adds a new Mustache function (codename: url) and a new URLEncoder that can be used to URL encode strings in mustache templates.	2016-10-13 16:17:28 +02:00
Simon Willnauer	12392b5425	Ensure port range is readable in the exception message (#20893 ) Both netty3 and netty4 http implementation printed the default toString representation of PortRange if ports couldn't be bound. This commit adds a better default toString method to PortRange and uses the string representation for the error message in the http implementations.	2016-10-12 22:33:47 +02:00
Areek Zillur	133be6631d	Merge branch 'master' into cleanup/transport_bulk	2016-10-12 13:09:29 -04:00
Tanguy Leroux	44ac5d057a	Remove empty javadoc (#20871 ) This commit removes as many as empty javadocs comments my regexp has found	2016-10-12 10:27:09 +02:00
Areek Zillur	481f7909ae	Merge branch 'master' into cleanup/transport_bulk	2016-10-11 16:04:47 -04:00
Areek Zillur	0e8b6532ec	rename DocumentRequest to DocWriteRequest	2016-10-11 16:00:10 -04:00
Tanguy Leroux	e4c7d8183e	XContentBuilder: Avoid building self-referencing objects (#20550 ) Some objects like maps, iterables or arrays of objects can self-reference themselves. This is mostly due to a bug in code but the XContentBuilder should be able to detect such situations and throws an IllegalArgumentException instead of building objects over and over until a stackoverflow occurs. closes #20540 closes #19475	2016-10-11 11:41:54 +02:00
Simon Willnauer	37ca38df3d	Expose `ctx._now` in update scripts (#20835 ) Update scripts might want to update the documents `_timestamp` but need a notion of `now()`. Painless doesn't support any notion of now() since it would make scripts non-pure functions. Yet, in the update case this is a valid value and we can pass it with the context together to allow the script to record the timestamp the document was updated. Relates to #17895	2016-10-10 21:14:14 +02:00
Jim Ferenczi	c80a563a71	Replace org.elasticsearch.common.lucene.search.MatchNoDocsQuery with its Lucene version (org.apache.lucene.search.MatchNoDocsQuery) (#20832 ) * Replace org.elasticsearch.common.lucene.search.MatchNoDocsQuery with its Lucene version (org.apache.lucene.search.MatchNoDocsQuery) This change removes the ES version of the match no docs query and replaces it with the Lucene version. relates #18030 * Add missing change	2016-10-10 17:45:19 +02:00
Simon Willnauer	4fd1276542	Prevent AbstractArrays from release bytes more than once (#20819 ) Today we throw an assertion error if we release an AbstractArray more than once. Yet, it's recommended to implement close methods such that they can be invoked more than once. Guaranteed single release calls are hard to implement and some situations might not be tested causing for instance `CircuitBreaker` to operate on corrupted memory stats.	2016-10-10 17:30:37 +02:00
Areek Zillur	c747085b70	Merge branch 'master' into cleanup/transport_bulk	2016-10-07 17:52:53 -04:00
Areek Zillur	396f80c963	Revert "rename DocumentRequest to DocumentWriteRequest" This reverts commit `b5079ce009`.	2016-10-07 17:50:07 -04:00
Nik Everett	cf4038b668	DeGuice some of IndicesModule UpdateHelper, MetaDataIndexUpgradeService, and some recovery stuff. Move ClusterSettings to nullable ctor parameter of TransportService so it isn't forgotten.	2016-10-07 11:14:38 -04:00
Simon Willnauer	7452028e50	Simplify TransportAddress (#20798 ) since TransportAddress is now final we can simplify it's interface a bit and remove methods that are only used in tests or are plain delegates.	2016-10-07 15:56:54 +02:00
Simon Willnauer	194a6b1df0	Remove LocalTransport in favor of MockTcpTransport (#20695 ) This change proposes the removal of all non-tcp transport implementations. The mock transport can be used by default to run tests instead of local transport that has roughly the same performance compared to TCP or at least not noticeably slower. This is a master only change, deprecation notice in 5.x will be committed as a separate change.	2016-10-07 11:27:47 +02:00
Simon Willnauer	9c9afe3f01	Remove SearchContext#current and all it's threadlocals (#20778 ) Today SearchContext expose the current context as a thread local which makes any kind of sane interface design very very hard. This PR removes the thread local entirely and instead passes the relevant context anywhere needed. This simplifies state management dramatically and will allow for a much leaner SearchContext interface down the road.	2016-10-06 19:51:54 +02:00
Areek Zillur	b5079ce009	rename DocumentRequest to DocumentWriteRequest	2016-10-06 05:05:59 -04:00
Areek Zillur	9b691f0d93	Merge branch 'master' into cleanup/transport_bulk	2016-10-05 17:51:39 -04:00
Simon Willnauer	ce21b607bb	move test to a single node test	2016-10-05 21:55:50 +02:00
Simon Willnauer	50ca10378e	Merge branch 'master' into dont_cache_scripts	2016-10-05 20:43:52 +02:00
Simon Willnauer	838c28eeb4	add percolate with script query test	2016-10-05 20:43:46 +02:00
Jack Conradson	ba88d9af57	Remove all date 'now' methods from the Painless whitelist to ensure Painless scripts are pure functions.	2016-10-05 09:47:20 -07:00
Simon Willnauer	57afbadf33	PercolateQuery is never cacheable	2016-10-05 16:38:47 +02:00
Colin Goodheart-Smithe	7bffe95025	Fix percolator queries to not be cacheable	2016-10-05 15:03:29 +01:00
Simon Willnauer	587bdcef38	add extra safety when accessing scripts or now and reqeusts are cached	2016-10-05 09:41:48 +02:00
Simon Willnauer	80bf44f709	Merge branch 'master' into dont_cache_scripts	2016-10-04 21:09:40 +02:00
Areek Zillur	bd4a03a426	Merge branch 'master' into cleanup/transport_bulk	2016-10-04 14:06:17 -04:00
Jason Tedor	51d53791fe	Remove lenient URL parameter parsing Today when parsing a request, Elasticsearch silently ignores incorrect (including parameters with typos) or unused parameters. This is bad as it leads to requests having unintended behavior (e.g., if a user hits the _analyze API and misspell the "tokenizer" then Elasticsearch will just use the standard analyzer, completely against intentions). This commit removes lenient URL parameter parsing. The strategy is simple: when a request is handled and a parameter is touched, we mark it as such. Before the request is actually executed, we check to ensure that all parameters have been consumed. If there are remaining parameters yet to be consumed, we fail the request with a list of the unconsumed parameters. An exception has to be made for parameters that format the response (as opposed to controlling the request); for this case, handlers are able to provide a list of parameters that should be excluded from tripping the unconsumed parameters check because those parameters will be used in formatting the response. Additionally, some inconsistencies between the parameters in the code and in the docs are corrected. Relates #20722	2016-10-04 12:45:29 -04:00
Simon Willnauer	94b7873b49	Add a #markAsNotCachable() method to context to mark requests as not cachable	2016-10-04 18:05:00 +02:00
Areek Zillur	248ac240ed	Merge branch 'master' into cleanup/transport_bulk	2016-10-03 16:12:11 -04:00
Boaz Leskes	27eab74510	merge from master	2016-09-30 17:19:30 +02:00
Martijn van Groningen	55dce523c2	docs: marked `foreach` processor as experimental Closes #19602	2016-09-30 12:23:42 +02:00
Tal Levy	33b9e2065b	no null values in ingest configuration error messages (#20616 ) The invalid ingest configuration field name used to show itself, even when it was null, in error messages. Sometimes this does not make sense. e.g. ```[null] Only one of [file], [id], or [inline] may be configure``` vs. ```Only one of [file], [id], or [inline] may be configure``` The above deals with three fields, therefore this no one property responsible.	2016-09-29 11:34:52 +02:00
Tal Levy	1872d2ae46	[fix] JSON Processor was not properly added (#20613 )	2016-09-29 03:04:43 +02:00
Jason Tedor	25fd9e26c4	Merge branch 'master' into feature/seq_no * master: (1199 commits) [DOCS] Remove non-valid link to mapping migration document Revert "Default `include_in_all` for numeric-like types to false" test: add a test with ipv6 address docs: clearify that both ip4 and ip6 addresses are supported Include complex settings in settings requests Add production warning for pre-release builds Clean up confusing error message on unhandled endpoint [TEST] Increase logging level in testDelayShards() change health from string to enum (#20661) Provide error message when plugin id is missing Document that sliced scroll works for reindex Make reindex-from-remote ignore unknown fields Remove NoopGatewayAllocator in favor of a more realistic mock (#20637) Remove Marvel character reference from guide Fix documentation for setting Java I/O temp dir Update client benchmarks to log4j2 Changes the API of GatewayAllocator#applyStartedShards and (#20642) Removes FailedRerouteAllocation and StartedRerouteAllocation IndexRoutingTable.initializeEmpty shouldn't override supplied primary RecoverySource (#20638) Smoke tester: Adjust to latest changes (#20611) ...	2016-09-29 00:22:31 +02:00
Tal Levy	92ab44d35c	[fix] JSON Processor was not properly added (#20613 )	2016-09-28 23:04:22 +02:00
Nik Everett	370afa371b	Make reindex-from-remote ignore unknown fields reindex-from-remote should ignore unknown fields so it is mostly future compatible. This makes it ignore unknown fields by adding an option to `ObjectParser` and `ConstructingObjectParser` that, if enabled, causes them to ignore unknown fields. Closes #20504	2016-09-26 00:55:46 +02:00
Simon Willnauer	fe1803c957	Remove AnalysisService and reduce it to a simple name to analyzer mapping (#20627 ) Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitive. We fixed this in #19827 and can now get rid of the AnalysisService entirely and replace it with a simple map like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes #19828	2016-09-23 08:53:50 +02:00
Jack Conradson	5755dd256d	Fix String concatentation bug.	2016-09-21 15:49:56 -07:00
Yannick Welsch	bf5d425ab9	Fix wrong logger usages These misusages were found by the logger usage checker that was re-enabled in the previous commit.	2016-09-21 14:45:28 +02:00
Tanguy Leroux	7645abaad9	Remove duplicate methods in ByteSizeValue (#20560 ) This commit removes `ByteSizeValue`'s methods that are duplicated (ex: `mbFrac()` and `getMbFrac()`) in order to only keep the `getN` form. It also renames `mb()` -> `getMb()`, `kb()` -> `getKB()` in order to be more coherent with the `ByteSizeUnit` method names.	2016-09-20 14:07:23 +02:00
Ryan Ernst	85b8f29415	Build: Remove old maven deploy support (#20403 ) * Build: Remove old maven deploy support This change removes the old maven deploy that we have in parallel to maven-publish, and makes maven-publish fully work with publishing to maven local. Using `gradle publishToMavenLocal` should be used to publish to .m2. Note that there is an unfortunate hack that means for zip artifacts we must first create/publish a dummy pom file, and then follow that with the real pom file. It would be nice to have the pom file contains packaging=zip, but maven central then requires sources and javadocs. But our zips are really just attached artifacts, so we already set the packaging type to pom for our zip files. This change just works around a limitation of the underlying maven publishing library which silently skips attached artifacts when the packaging type is set to pom. relates #20164 closes #20375 * Remove unnecessary extra spacing	2016-09-19 15:10:41 -07:00
Simon Willnauer	ee8d14798f	Unguice Transport and friends (#20526 ) This change removes all guice interaction from Transport, HttpServerTransport, HttpServer and TransportService. All these classes as well as their subclasses or extended version configured via plugins are now created by using plain old bloody java constructors. YAY!	2016-09-19 22:10:47 +02:00
David Pilato	ed4d0881b1	Add profile and explain parameters to template API We can now run templates using `explain` and/or `profile` parameters. Which is interesting when you have defined a complicated profile but want to debug it in an easier way than running the full query again. You can use `explain` parameter when running a template: ```js GET /_search/template { "file": "my_template", "params": { "status": [ "pending", "published" ] }, "explain": true } ``` You can use `profile` parameter when running a template: ```js GET /_search/template { "file": "my_template", "params": { "status": [ "pending", "published" ] }, "profile": true } ```	2016-09-19 17:52:13 +02:00
Simon Willnauer	f5daa165f1	Remove ability to plug-in TransportService (#20505 ) TransportService is such a central part of the core server, replacing it's implementation is risky and can cause serious issues. This change removes the ability to plug in TransportService but allows registering a TransportInterceptor that enables plugins to intercept requests on both the sender and the receiver ends. This is a commonly used and overwritten functionality but encapsulates the custom code in a contained manner.	2016-09-16 09:47:53 +02:00
Luca Cavanna	37489c3274	Add clusterUUID to RestMainAction output (#20503 ) Add clusterUUID to RestMainAction output GET / now returns the clusterUUID as well as part of its output for monitoring purposes	2016-09-15 16:25:17 +02:00
Jim Ferenczi	1764ec56b3	Fixed naming inconsistency for fields/stored_fields in the APIs (#20166 ) This change replaces the fields parameter with stored_fields when it makes sense. This is dictated by the renaming we made in #18943 for the search API. The following list of endpoint has been changed to use `stored_fields` instead of `fields`: * get * mget * explain The documentation and the rest API spec has been updated to cope with the changes for the following APIs: * delete_by_query * get * mget * explain The `fields` parameter has been deprecated for the following APIs (it is replaced by _source filtering): * update: the fields are extracted from the _source directly. * bulk: the fields parameter is used but fields are extracted from the source directly so it is allowed to have non-stored fields. Some APIs still have the `fields` parameter for various reasons: * cat.fielddata: the fields paramaters relates to the fielddata fields that should be printed. * indices.clear_cache: used to indicate which fielddata fields should be cleared. * indices.get_field_mapping: used to filter fields in the mapping. * indices.stats: get stats on fields (stored or not stored). * termvectors: fields are retrieved from the stored fields if possible and extracted from the _source otherwise. * mtermvectors: * nodes.stats: the fields parameter is used to concatenate completion_fields and fielddata_fields so it's not related to stored_fields at all. Fixes #20155	2016-09-13 20:54:41 +02:00
Lee Hinman	94625d74e4	No longer allow cluster name in data path In 5.x we allowed this with a deprecation warning. This removes the code added for that deprecation, requiring the cluster name to not be in the data path. Resolves #20391	2016-09-12 15:47:01 -06:00
Nik Everett	69bf08f6c6	Disable regexes by default in painless Adds a new node level, non-dynamic setting, `script.painless.regex.enabled` can be used to enable regexes. Closes #20397	2016-09-12 14:09:43 -04:00
Tal Levy	9f1f5fdedc	introduce the JSON Processor (#20128 ) introduce the JSON Processor	2016-09-09 14:34:32 -07:00
Luca Cavanna	4b00cc37a1	Merge pull request #20382 from javanna/enhancement/cleanup_parse_elements Cleanup sub fetch phase extension point	2016-09-09 22:47:15 +02:00
Tal Levy	dda32545bb	add ignore_missing option to relevant processors (#20194 )	2016-09-09 12:20:18 -07:00
javanna	90ab460fcc	move parsing of search ext sections to the coordinating node	2016-09-09 19:10:42 +02:00
Martijn van Groningen	245882cde3	* Removed `script.default_lang` setting and made `painless` the hardcoded default script language. ** The default script language is now maintained in `Script` class. * Added `script.legacy.default_lang` setting that controls the default language for scripts that are stored inside documents (for example percolator queries). This defaults to groovy. Added `QueryParseContext#getDefaultScriptLanguage()` that manages the default scripting language. Returns always `painless`, unless loading query/search request in legacy mode then the returns what is configured in `script.legacy.default_lang` setting. In the aggregation parsing code added `ParserContext` that also holds the default scripting language like `QueryParseContext`. Most parser don't have access to `QueryParseContext`. This is for scripts in aggregations. * The `lang` script field is always serialized (toXContent). Closes #20122	2016-09-06 18:44:48 +02:00
Alexander Reelsen	b6bf20c2da	Dependencies: Updates to mustache 0.9.3 (#20337 )	2016-09-06 13:31:59 +02:00
Martijn van Groningen	6f6d17dc9c	ingest: Add `dot_expander` processor that can turn fields with dots in the field name into object fields.	2016-09-05 07:28:38 +02:00
javanna	536d13ff11	ProcessInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
Jack Conradson	d0f9ab06f6	Removed extraneous import.	2016-09-01 16:29:08 -07:00
Jack Conradson	71d8ee5eac	Merge branch 'master' into deprecate	2016-09-01 08:51:29 -07:00
Martijn van Groningen	a110498ad8	settings: Make `action.auto_create_index` setting a dynamic cluster setting. Closes #7513	2016-09-01 12:33:30 +02:00
Jason Tedor	d9064f454e	Fix additional exception logging calls This commit modifies a pair of exception logging calls to use parameterized messages from Log4j.	2016-08-31 23:14:13 -04:00
Jack Conradson	1cdfc3ccfe	Merge branch 'master' into deprecate	2016-08-31 15:58:14 -07:00
Jack Conradson	3b3baa6e6c	Made deprecation of Groovy, Javascript, and Python more explicit.	2016-08-31 15:56:31 -07:00
Jason Tedor	76ab02e002	Merge branch 'master' into log4j2 * master: Avoid NPE in LoggingListener Randomly use Netty 3 plugin in some tests Skip smoke test client on JDK 9 Revert "Don't allow XContentBuilder#writeValue(TimeValue)" [docs] Remove coming in 2.0.0 Don't allow XContentBuilder#writeValue(TimeValue) [doc] Remove leftover from CONSOLE conversion Parameter improvements to Cluster Health API wait for shards (#20223) Add 2.4.0 to packaging tests list Docs: clarify scale is applied at origin+offest (#20242)	2016-08-31 16:37:55 -04:00
Jason Tedor	54083f7d6e	Randomly use Netty 3 plugin in some tests When Netty 4 was introduced, it was not the default network implementation. Some tests were constructed to randomly use Netty 4 instead of the default network implementation. When Netty 4 was made the default implementation, these tests were not updated. Thus, these tests are randomly choosing between the default network implementation (Netty 4) and Netty 4. This commit updates these tests to reverse the role of Netty 3 and Netty 4 so that the randomization is choosing between Netty 3 and the default (again, now Netty 4). Relates #20265	2016-08-31 15:41:39 -04:00
Jason Tedor	e166459bbe	Merge branch 'master' into log4j2 * master: Increase visibility of deprecation logger Skip transport client plugin installed on JDK 9 Explicitly disable Netty key set replacement percolator: Fail indexing percolator queries containing either a has_child or has_parent query. Make it possible for Ingest Processors to access AnalysisRegistry Allow RestClient to send array-based headers Silence rest util tests until the bogusness can be simplified Remove unknown HttpContext-based test as it fails unpredictably on different JVMs Tests: Improve rest suite names and generated test names for docs tests Add support for a RestClient base path	2016-08-31 10:59:27 -04:00
Martijn van Groningen	3fcb95b814	percolator: Fail indexing percolator queries containing either a has_child or has_parent query. Closes #2960	2016-08-31 07:46:17 +02:00
Jason Tedor	abf8a1a3f0	Avoid allocating log parameterized messages This commit modifies the call sites that allocate a parameterized message to use a supplier so that allocations are avoided unless the log level is fine enough to emit the corresponding log message.	2016-08-30 18:17:09 -04:00
Jason Tedor	7da0cdec42	Introduce Log4j 2 This commit introduces Log4j 2 to the stack.	2016-08-30 13:31:24 -04:00
Jack Conradson	c8f57df5cf	Merge branch 'master' into deprecate	2016-08-30 09:11:18 -07:00
Jack Conradson	7930233527	Deprecate Groovy, Python, and Javascript scripts.	2016-08-30 09:06:18 -07:00
Jason Tedor	0df92a8da8	Upgrade to Netty 4.1.5 This commit upgrades the Netty dependencies from version 4.1.4 to version 4.1.5. This upgrade brings several bug fixes including the removal of a obnoxious and scary-looking log message when unsafe is explicitly disabled. Relates #20222	2016-08-30 05:46:40 -04:00
Martijn van Groningen	1925813e09	ingest: Fix rename processor change rename leaf fields into branch fields Instead of get, set and remove we do get, remove and then set to avoid type conflicts in IngestDocument. If the set still fails we try to restore the original field in ingest document. Closes #19892	2016-08-30 07:38:01 +02:00
Tanguy Leroux	9727f123b9	Rename Netty TCP transports thread factories from http_* to transport_* Netty3/4 TcpTransport implementations are creating thread factories with a "http_server" thread prefix whereas it should start with "transport_server" and let the "http_server" prefix for the HttpServerTransport implementations.	2016-08-29 13:49:52 +02:00
Jun Ohtani	2a00c9dc46	Merge pull request #19860 from johtani/fix/validate_empty_field_name Validate blank field name	2016-08-29 11:52:18 +09:00
Martijn van Groningen	48926b4d66	ingest: don't render template twice for append processor	2016-08-26 18:07:32 +02:00
Chris Earle	bd0b06440e	Add "Async" to the end of each Async RestClient method This makes it much harder to accidentally miss the Response.	2016-08-26 10:51:33 -04:00
Jun Ohtani	450f47d5b5	Validate blank field name add validation and validate only 5.0+ Add tests before 5.0 Closes #19251	2016-08-26 20:10:33 +09:00
Jason Tedor	bc136a90d5	Add network types to cluster stats The network types in use on a cluster can be useful information to have, so this commit adds aggregate metrics for the network types in use in a cluster to the cluster stats. Relates #20144	2016-08-25 21:08:05 -04:00
Jason Tedor	5a48ad661d	Address race condition in HTTP pipeline tests The Netty 4 HTTP server pipeline tests contains two different test cases. The general idea behind these tests is to submit some requests to a Netty 4 HTTP server, one test with pipelining enabled and another test with pipelining disabled. These requests are submitted to two endpoints, one with a path like /{id} and another with a path like /slow with a query string parameter sleep. This parameter tells the request handler how long to sleep for before replying. The idea is that in the case of the pipelining enabled tests, the requests should come back exactly in the order submitted, even with some of the requests hitting the slow endpoint with random sleep durations; this is the guarantee that pipelining provides. And in the case of the pipelining disabled tests, requests were randombly submitted to /{id} and /slow with sleep parameters starting at 600ms and increasing by 100ms for each slow request constructed. We would expect the requests to come back with the all the responses to the /{id} requests first because these requests will execute instantaneously, and then the responses to the /slow requests. Further, it was expected that the slow requests would come back ordered by the length of the sleep, the thinking being that 100ms should be enough of a difference between each request that we would avoid any race conditions. Sadly, this is not the case, the threads do sometimes hit race conditions. This commit modifies the HTTP server pipelining tests to address this race condition. The modification is that the query string parameter on the /slow endpoint is removed in favor of just submitting requests to the path /slow/{id}, where id just used a marker to distinguish each request. The server chooses a random sleep of at least 500ms for each request on the slow path. The assertion here then is that the /{id} responses arrive first, then then /slow responses. We can not make an assertion on the order of the responses, but we can assert that we did see every expected response. Relates #19845	2016-08-25 14:34:11 -04:00
Jack Conradson	0fdadf4737	Merge branch 'master' into break	2016-08-25 09:26:04 -07:00
Michael McCandless	1fe3e36934	Merge pull request #20147 from mikemccand/lucene_620_upgrade Upgrade to Lucene 6.2.0	2016-08-25 06:03:34 -04:00
Jack Conradson	3deea3dbde	Made for/each break tests more robust in Painless.	2016-08-24 15:17:18 -07:00
Mike McCandless	0ccfe69789	Upgrade to Lucene 6.2.0	2016-08-24 17:26:28 -04:00
Jack Conradson	c60885b5d4	Fix break bug in for/foreach loops.	2016-08-24 14:25:54 -07:00
Igor Motov	b36fbc4452	Add support for parameters to the script ingest processor The script processor should support `params` to be consistent with all other script consumers.	2016-08-24 16:49:48 -04:00
Jim Ferenczi	4682fc34ae	Add the ability to disable the retrieval of the stored fields entirely This change adds a special field named _none_ that allows to disable the retrieval of the stored fields in a search request or in a TopHitsAggregation. To completely disable stored fields retrieval (including disabling metadata fields retrieval such as _id or _type) use _none_ like this: ```` POST _search { "stored_fields": "_none_" } ````	2016-08-24 16:40:08 +02:00
Clinton Gormley	abc025e18b	Fixed the reindex_rethrottle REST tests The API was renamed from reindex.rethrottle to reindex_rethrottle	2016-08-24 14:55:02 +02:00
Areek Zillur	80ca78479f	Make bulk item-level requests implement DocumentRequest interface Currently, bulk item requests can be any ActionRequest, this commit restricts bulk item requests to DocumentRequest. This simplifies handling failures during bulk requests. Additionally, a new enum is added to DocumentRequest to represent the intended operation to be performed by a document request. Now, index operation type also uses the new enum to specify whether the request should create or index a document.	2016-08-23 10:33:37 -04:00
Jack Conradson	131e370a16	Make Painless the default scripting language. Closes #20017	2016-08-22 17:38:02 -07:00
Nik Everett	312a7d45ba	Wait for task to start in reindex test `RethrottleTests#testReindex` fail in CI: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-intake/1274/console I was unable to reproduce it locally but it looks like a race to start the task. So I've added a wait for it to start just in case.	2016-08-17 12:08:55 -04:00
Nik Everett	39d8f5f123	Reindex tests should expect the right failure Reindex intentionally tries to fail the search operation to make sure that the exception flows back. The exception message changed so we should catch the appropriate exception.	2016-08-17 10:25:38 -04:00
Nik Everett	34bbd27f84	Fix _update_by_query's ingest pipeline support It wasn't being serialized so it wasn't working with the transport client.	2016-08-16 16:03:18 -04:00
Nik Everett	862843ec90	Suppress failing test This test was failing in the presence of transport clients. This turns off transport clients while I fix the test so it doesn't fail for everyone in the mean time.	2016-08-16 15:12:40 -04:00
Ryan Ernst	743d9fd008	Merge branch 'master' into search_parser	2016-08-16 11:28:59 -07:00
Nik Everett	fdd50612ae	Fix reindex under the transport client The big change here is cleaning up the `TaskListResponse` so it doesn't have a breaky `toString` implementation. That was causing the reindex tests to break. Also removed `NetworkModule#registerTaskStatus` which is part of the Plugin API. Use `Plugin#getNamedWriteables` instead.	2016-08-16 12:15:15 -04:00
Nik Everett	46bf8baf2e	Switch aggregation registration for push to pull Adds `getAggregations` to `SearchPlugin` which can be used to register aggregations. Fixup MockNode which wasn't createing MockBigArrays.	2016-08-16 09:08:36 -04:00
Ryan Ernst	7fde410586	Internal: Consolidate search parser registries Parsing a search request is currently split up among a number of classes, using multiple public static methods, which take multiple regstries of elements that may appear in the search request like query parsers and aggregations. This change begins consolidating all this code by collapsing the registries normally used for parsing search requests into a single SearchRequestParsers class. It is also made available to plugin services to enable templating of search requests. Eventually all of the actual parsing logic should move to the class, and the registries should be hidden, but for now they are at least co-located to reduce the number of objects that must be passed around.	2016-08-16 01:59:24 -07:00
Nik Everett	1452ab4b9f	Squash the rest of o.e.rest.action Squashes all the subpackages of `org.elasticsearch.rest.action` down to the following: * `o.e.rest.action.admin` - Administrative actions * `o.e.rest.action.cat` - Actions that make tables for `grep`ing * `o.e.rest.action.document` - Actions that act on documents * `o.e.rest.action.ingest` - Actions that act on ingest pipelines * `o.e.rest.action.search` - Actions that search I'm tempted to merge `search` into `document` but the `document` package feels fairly complete as is and `Suggest` isn't actually always about documents either.... I'm also tempted to merge `ingest` into `admin.cluster` because the latter contains the actions for dealing with stored scripts. I've moved the `o.e.rest.action.support` into `o.e.rest.action`. I've also added `package-info.java`s to all packges in `o.e.rest`. I figure if the package is too small to deserve a `package-info.java` file then it is too small to deserve to be a package.... Also fixes checkstyle in all moved classes.	2016-08-15 21:06:32 -04:00
Tal Levy	84bf24b1e9	remove ability to set field value in script-processor configuration (#19981 )	2016-08-15 10:57:39 -07:00
Igor Motov	10a766704e	Rename Task Persistence into Storing Task Results The term persisted task was used to indicate that a task should store its results upon its completion. We would like to use this term to indicate that a task can survive restart of nodes instead. This commit removes usages of the term "persist" when it means store results.	2016-08-15 10:02:43 -04:00

... 2 3 4 5 6 ...

3913 Commits