OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tal Levy	c53b2ee9cd	introduce KV Processor in Ingest Node (#22272 ) Now you can parse field values of the `key=value` variety and have `key` be inserted as a field name in an ingest document. Closes #22222.	2016-12-20 13:26:17 -08:00
Nik Everett	a04dcfb95b	Introduce XContentParser#namedObject (#22003 ) Introduces `XContentParser#namedObject` which works a little like `StreamInput#readNamedWriteable`: on startup components register parsers under names and a superclass. At runtime we look up the parser and call it to parse the object. Right now the parsers take a context object they use to help with the parsing but I hope to be able to eliminate the need for this context as most of what it is used for at this point is to move around parser registries which should be replaced by this method eventually. I make no effort to do so in this PR because it is big enough already. This is meant to be a start down a road that allows us to remove classes like `QueryParseContext`, `AggregatorParsers`, `IndicesQueriesRegistry`, and `ParseFieldRegistry`. The goal here is to reduce the amount of plumbing required to allow parsing pluggable things. With this you don't have to pass registries all over the place. Instead you must pass a super registry to fewer places and use it to wrap the reader. This is the same tradeoff that we use for NamedWriteable and it allows much, much simpler binary serialization. We think we want that same thing for xcontent serialization. The only parsing actually converted to this method is parsing `ScoreFunctions` inside of `FunctionScoreQuery`. I chose this because it is relatively self contained.	2016-12-20 11:05:24 -05:00
Nik Everett	73320566c1	Reindex test: catch exception name instead of reason It looks like the exception reason can differ in different default locales, so the build would fail in any non-English locale. This switches the catch to the name of the exception which shouldn't vary.	2016-12-20 10:00:14 -05:00
Nik Everett	8de4be9e4d	Reindex test: don't fail if iis is running on port 0	2016-12-19 16:44:08 -05:00
Grzegorz Gajos	f6b6e4e376	Added ability to remove pipelines via wildcards (#22149 ) (#22191 ) This commit is adding an ability to remove pipelines with wildcards.	2016-12-19 10:59:59 -08:00
javanna	5dae10db11	[TEST] add warnings check to ESTestCase We are currently checking that no deprecation warnings are emitted in our query tests. That can be moved to ESTestCase (disabled in ESIntegTestCase) as it allows us to easily catch where our tests use deprecated features and assert on the expected warnings.	2016-12-19 19:39:56 +01:00
javanna	6a27628f12	Remove support for strict parsing mode We return deprecation warnings as response headers, besides logging them. Strict parsing mode stayed around, but was only used in query tests, though we also introduced checks for deprecation warnings there that don't need strict parsing anymore (see #20993). We can then safely remove support for strict parsing mode. The final goal is to remove the ParseFieldMatcher class, but there are many many users of it. This commit prepares the field for the removal, by deprecating ParseFieldMatcher and making it effectively not needed. Strict parsing is removed from ParseFieldMatcher, and strict parsing is replaced in tests where needed with deprecation warnings checks. Note that the setting to enable strict parsing was never ported to the new settings infra, hence it cannot be set in production. It is really only used in our own tests. Relates to #19552	2016-12-19 19:39:56 +01:00
Nik Everett	2d71ced221	Properly fail reindex-from-remote if can't detect content type	2016-12-19 12:51:38 -05:00
Yannick Welsch	63af03a104	Atomic mapping updates across types (#22220 ) This commit makes mapping updates atomic when multiple types in an index are updated. Mappings for an index are now applied in a single atomic operation, which also allows to optimize some of the cross-type updates and checks.	2016-12-19 14:39:50 +01:00
Simon Willnauer	ccfeac8dd5	Remove `doHandshake` test-only settings from TcpTransport (#22241 ) In #22094 we introduce a test-only setting to simulate transport impls that don't support handshakes. This commit implements the same logic without a setting.	2016-12-18 09:26:53 +01:00
Tal Levy	bb37167946	Enables the ability to inject serialized json fields into root of document. (#22179 ) The JSON processor has an optional field called "target_field". If you don't specify target_field then target_field becomes what you specified as "field". There isn't any way to add the fields to the root of a document. By setting `add_to_root`, now serialized fields will be inserted into the top-level fields of the ingest document. Closes #21898.	2016-12-16 10:17:27 -08:00
Jason Tedor	df43c268da	Eagerly initialize Netty 4 Today we initialize Netty in a static initializer. We trigger this method via static initializers from Netty-related classes, but we can trigger this method earlier than we do to ensure that Netty is initialized how we want it to be.	2016-12-15 13:24:47 -05:00
Tal Levy	eaf82a6e7e	compile ScriptProcessor inline scripts when creating ingest pipelines (#21858 ) Inline scripts defined in Ingest Pipelines are now compiled at creation time to preemptively catch errors on initialization of the pipeline. Fixes #21842.	2016-12-14 17:26:51 -08:00
Simon Willnauer	80d6539e9c	Handle connection close / reset events gracefully during handshake (#22178 ) Low level handshake code doesn't handle situations gracefully if the connection is concurrently closed or reset by peer. This commit adds the relevant code to fail the handshake if the connection is closed.	2016-12-14 23:04:14 +01:00
Nik Everett	749039ad4f	Consolidate the last easy parser construction (#22095 ) Moves the last of the "easy" parser construction into `RestRequest`, this time with a new method `RestRequest#contentParser`. The rest of the production code that builds `XContentParser` isn't "easy" because it is exposed in the Transport Client API (a Builder) object.	2016-12-14 15:41:25 -05:00
Adrien Grand	149ef74b26	Fix `missing` on aggs on `boolean` fields. (#22135 ) The creation of the `ValuesSource` used to pass `DateTimeZone.UTC` as a time zone all the time in case of empty fields in spite of the fact that all doc value formats but the date one reject this parameter. This commit centralizes the creation of the `ValuesSource` and adds unit tests to it. Closes #22009	2016-12-14 10:03:09 +01:00
Daniel Mitterdorfer	7e5058037b	Enable strict duplicate checks for JSON content With this commit we enable the Jackson feature 'STRICT_DUPLICATE_DETECTION' by default. This ensures that JSON keys are always unique. While this has a performance impact, benchmarking has indicated that the typical drop in indexing throughput is around 1 - 2%. As a last resort, we allow users to still disable strict duplicate checks by setting `-Des.json.strict_duplicate_detection=false` which is intentionally undocumented. Closes #19614	2016-12-14 09:35:53 +01:00
Nik Everett	49bdd29f91	Consolidate more parser creation into ESTestCase This will make it easier to add the forthcoming required argument, `NamedXContentRegistry`.	2016-12-13 20:28:41 -05:00
Nik Everett	872984d21a	Continue consolidating `XContentParser` construction in tests (#22145 ) Consolidate more parser creation in tests Moves more parser creation in tests to the `createParser` methods in `ESTestCase`.	2016-12-13 17:22:39 -05:00
Tal Levy	f56097b57a	Fixes GrokProcessor's ignorance of named-captures with same name. (#22131 ) Grok was originally ignoring potential matches to named-capture groups larger than one. For example, if you had two patterns containing the same named field, but only the second pattern matched, it would fail to pick this up. This PR fixes this by exploring all potential places where a named-capture was used and chooses the first one that matched. Fixes #22117.	2016-12-13 13:19:55 -08:00
Simon Willnauer	7a9b667e98	Introduce a low level protocol handshake (#22094 ) Today we rely on the version that the API user passes in together with the DiscoveryNode. This commit introduces a low level handshake where nodes exchange their version to be used with the transport protocol that is executed every time a connection to a node is established. This, on the one hand allows to change the wire protocol based on the version we are talking to even without a full cluster restart. Today we would need to carry on a BWC layer across major versions but with a handshake we can rely on the fact that the latest version of the previous minor executes a handshake and uses the latest protocol version across all communication with the N+1 version nodes. This change is yet fully backwards compatible, a followup PR will remove the BWC in 6.0 once this has been back-ported to the 5.x branch	2016-12-13 21:06:23 +01:00
Adrien Grand	049fd3991c	Remove `AggregationContext`. (#22124 ) This class is just a wrapper around `SearchContext`, so let's use `SearchContext` directly. The change is mechanical, except the `ValuesSourceConfig` class, where I moved the logic to get a `ValuesSource` given a config.	2016-12-13 09:09:40 +01:00
Luca Cavanna	6d987a9b69	Remove support for empty queries (#22092 ) Our query DSL supports empty queries (`{}`), which have a different meaning depending on the query that holds it, either ignored, match_all or match_none. We deprecated the support for empty queries in 5.0, where we log a deprecation warning wherever they are used. The way we supported it once we moved query parsing to the coordinating node was having an Optional<QueryBuilder> return type in all of our parse methods (called fromXContent). See #17624. The central place for this was QueryParseContext#parseInnerQueryBuilder. We can now remove all the optional return types and simply throw an exception whenever an empty query is found.	2016-12-12 12:37:12 +01:00
Simon Willnauer	01d67e09b9	Detach handshake from connect to node (#22037 ) Today we connect and publish the nodes connection before we execute a handshake with the node we connect to. In the case of connecting to a node that won't pass the handshake this connection is already `published` and other code paths can use it. This commit detaches the connection and the publish of the connection such that `TransportService` can do a handshake before actually connect and publish the connection.	2016-12-10 10:03:26 +01:00
Nik Everett	3adefb7b4a	Begin centralizing XContentParser creation into RestRequest (#22041 ) To get #22003 in cleanly we need to centralize as much `XContentParser` creation as possible into `RestRequest`. That'll mean we have to plumb the `NamedXContentRegistry` into fewer places. This removes `RestAction.hasBody`, `RestAction.guessBodyContentType`, and `RestActions.getRestContent`, moving callers over to `RestRequest.hasContentOrSourceParam`, `RestRequest.contentOrSourceParam`, and `RestRequest.contentOrSourceParamParser` and `RestRequest.withContentOrSourceParamParserOrNull`. The idea is to use `withContentOrSourceParamParserOrNull` if you need to handle requests without any sort of body content and to use `contentOrSourceParamParser` otherwise. I believe the vast majority of this PR to be purely mechanical but I know I've made the following behavioral change (I'll add more if I think of more): * If you make a request to an endpoint that requires a request body and has cut over to the new APIs instead of getting `Failed to derive xcontent` you'll get `Body required`. * Template parsing is now non-strict by default. This is important because we need to be able to deprecate things without requests failing.	2016-12-09 20:23:02 -05:00
Nik Everett	fc2060ba7e	Don't close rest client from its callback (#22061 ) If you try to close the rest client inside one of its callbacks then it blocks itself. The thread pool switches the status to one that requests a shutdown and then waits for the pool to shutdown. When another thread attempts to honor the shutdown request it waits for all the threads in the pool to finish what they are working on. Thus thread a is waiting on thread b while thread b is waiting on thread a. It isn't quite that simple, but it is close. Relates to #22027	2016-12-09 10:39:51 -05:00
Adrien Grand	36f598138a	Start using `ObjectParser` for aggs. (#22048 ) This is an attempt to start moving aggs parsing to `ObjectParser`. There is still A LOT to do, but ObjectParser is way better than the way aggregations parsing works today. For instance in most cases, we reject numbers that are provided as strings, which we are supposed to accept since some client languages (looking at you Perl) cannot make sure to use the appropriate types. Relates to #22009	2016-12-09 09:45:16 +01:00
Ryan Ernst	b1cef5fdf8	Remove 2.0 prerelease version constants (#22004 ) * Remove 2.0 prerelease version constants This is a start to addressing #21887. This removes: * pre 2.0 snapshot format support * automatic units addition to cluster settings * bwc check for delete by query in pre 2.0 indexes	2016-12-08 21:48:35 -08:00
Lee Hinman	ef64d230e7	Merge remote-tracking branch 'dakrone/index-seq-id-and-primary-term'	2016-12-08 19:47:21 -07:00
Lee Hinman	ee22a477df	Add internal _primary_term doc values field, fix _seq_no indexing This adds the `_primary_term` field internally to the mappings. This field is populated with the current shard's primary term. It is intended to be used for collision resolution when two document copies have the same sequence id, therefore, doc_values for the field are stored but the field itself is not indexed. This also fixes the `_seq_no` field so that doc_values are retrievable (they were previously stored but irretrievable) and changes the `stats` implementation to more efficiently use the points API to retrieve the min/max instead of iterating on each doc_value value. Additionally, even though we intend to be able to search on the field, it was previously not searchable. This commit makes it searchable. There is no user-visible `_primary_term` field. Instead, the fields are updated by calling: ```java index.parsedDoc().updateSeqID(seqNum, primaryTerm); ``` This includes example methods in `Versions` and `Engine` for retrieving the sequence id values from the index (see `Engine.getSequenceID`) that are only used in unit tests. These will be extended/replaced by actual implementations once we make use of sequence numbers as a conflict resolution measure. Relates to #10708 Supersedes #21480 P.S. As a side effect of this commit, `SlowCompositeReaderWrapper` cannot be used for documents that contain `_seq_no` because it is a Point value and SCRW cannot wrap documents with points, so the tests have been updated to loop through the `LeafReaderContext`s now instead.	2016-12-08 19:47:03 -07:00
Christoph Büscher	7454a9647b	Add fromXContent to HighlightField This adds a fromXContent method and unit test to the HighlightField class so we can parse it as part of a search response. This is part of the preparation for parsing search responses on the client side.	2016-12-07 16:32:44 +01:00
Nik Everett	ef83dbfbe6	Reindex: Better error message for pipeline in wrong place (#21985 ) `_update_by_query` supports specifying the `pipeline` to process the documents as a url parameter but `_reindex` doesn't. It doesn't because everything about the `_reindex` request that has to do with writing the documents is grouped under the `dest` object in the request body. This changes the response parameter from `request [_reindex] contains unrecognized parameter: [pipeline]` to `_reindex doesn't support [pipeline] as a query parmaeter. Specify it in the [dest] object instead.`	2016-12-06 14:55:46 -05:00
Ryan Ernst	c8f241f284	Plugins: Remove response action filters (#21950 ) Action filters currently have the ability to filter both the request and response. But the response side was not actually used. This change removes support for filtering responses with action filters.	2016-12-05 16:14:04 -08:00
Nik Everett	2087234d74	Timeout improvements for rest client and reindex (#21741 ) Changes the default socket and connection timeouts for the rest client from 10 seconds to the more generous 30 seconds. Defaults reindex-from-remote to those timeouts and make the timeouts configurable like so: ``` POST _reindex { "source": { "remote": { "host": "http://otherhost:9200", "socket_timeout": "1m", "connect_timeout": "10s" }, "index": "source", "query": { "match": { "test": "data" } } }, "dest": { "index": "dest" } } ``` Closes #21707	2016-12-05 10:54:51 -05:00
Igor Motov	c391b3fff6	Add proper descriptions to reindex, update-by-query and delete-by-query tasks. Related to #21768	2016-12-02 21:46:38 -05:00
Jack Conradson	0ecdef026d	Test fix for def equals test in Painless. (#21945 ) Closes #21801	2016-12-02 14:41:13 -08:00
Nik Everett	0c724b1878	Keep context during reindex's retries (#21941 ) * Keep context during reindex's retries This fixes reindex and friend's retries to keep the context. * Docs	2016-12-02 13:48:51 -05:00
Simon Willnauer	842e00c689	[TEST] Add back skip of external clusters	2016-12-02 11:53:33 +01:00
Simon Willnauer	572b4c3e72	Port assert from 5.x to master I added an assertion to Netty4/Netty3Transport in 5.x that is not in master yet. This commit ports the assert to ensure we consumed all connections in `connectToChannels`	2016-12-02 10:34:33 +01:00
Simon Willnauer	adf9bd90a4	Remove legacy BWC test infrastructure and tests (#21915 ) We don't use the test infra nor do we run the tests. They might all be entirely out of date. We also have a different BWC test infra in-place. This change removes all of the legacy infra.	2016-12-02 08:06:20 +01:00
Simon Willnauer	155de53fe3	Add a connect timeout to the ConnectionProfile to allow per node connect timeouts (#21847 ) Timeouts are global today across all connections. This commit allows specifying a connection timeout per node such that, depending on the context, connections can be established with different timeouts. Relates to #19719	2016-12-01 15:39:49 +01:00
Boaz Leskes	fe01c0f83b	fix TemplateQueryBuilderTests & Murmur3FieldMapperTests	2016-12-01 14:21:57 +01:00
Simon Willnauer	dd5256c324	Reduce number of connections per node depending on the nodes role (#21849 ) We currently treat every node equally when we establish connections to a node. Yet, if we are not master eligible or can't hold any data there is no point in creating a dedicated connection for sending the cluster state or running remote recoveries respectively. The usage of STATE and RECOVERY connections on non-master and/or non-data nodes will result in an IllegalStateException.	2016-12-01 08:00:48 +01:00
Jason Tedor	6c45695d52	Add version 5.1.1 This commit removes the version constant for 5.1.0 (due to an inadvertent release) and adds the version constant for 5.1.1. Relates #21890	2016-11-30 11:14:17 -05:00
Luca Cavanna	5b8bdba12e	Remove subrequests method from CompositeIndicesRequest (#21873 )	2016-11-30 15:03:58 +01:00
Adrien Grand	6231009a8f	Remove 2.x backward compatibility of mappings. (#21670 ) For the record, I also had to remove the geo-hash cell and geo-distance range queries to make the code compile. These queries already throw an exception in all cases with 5.x indices, so that does not hurt any more. I also had to rename all 2.x bwc indices from `index-${version}` to `unsupported-${version}` to make `OldIndexBackwardCompatibilityIT` happy.	2016-11-30 13:34:46 +01:00
Luca Cavanna	6eaff9432d	SearchTemplateRequest to implement CompositeIndicesRequest (#21865 ) SearchTemplateRequest to implement CompositeIndicesRequest Given that SearchTemplateRequest effectively delegates to search when a search is being executed, it should implement the CompositeIndicesRequest interface. The subrequests method should return a single search request. When a search is not going to be executed, because we are in simulate mode, there are no inner requests, and there are no corresponding indices to that request either. Closes #21747	2016-11-29 20:52:43 +01:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Simon Willnauer	f5ff69fabe	Remove connectToNodeLight and replace it with a connection profile (#21799 ) The Transport#connectToNodeLight concept is confusing and not very flexible, nor is it really testable at the unit-test level. This commit cleans up the code used to connect to nodes and simplifies transport implementations to share more code. This also allows connecting to nodes with custom profiles if needed; for instance, future improvements can be added to connect to/from nodes that are non-data nodes without dedicated bulk and recovery connections.	2016-11-29 09:35:07 +01:00


3781 Commits