OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tanguy Leroux	8c52e8814b	Remove ReindexResponse in favor of BulkIndexByScrollResponse	2016-05-09 17:03:16 +02:00
Chris Earle	5be79ed02c	Add Failure Details to every NodesResponse Most of the current implementations of BaseNodesResponse (plural Nodes) ignore FailedNodeExceptions. - This adds a helper function to do the grouping to TransportNodesAction - Requires a non-null array of FailedNodeExceptions within the BaseNodesResponse constructor - Reads/writes the array to output - Also adds StreamInput and StreamOutput methods for generically reading and writing arrays	2016-05-06 14:59:43 -04:00
Adrien Grand	7d8708716e	QueryBuilder does not need generics. #18133 QueryBuilder has generics, but those are never used: all call sites use `QueryBuilder<?>`. Only `AbstractQueryBuilder` needs generics so that the base class can contain a default implementation for setters that returns `this`.	2016-05-06 08:38:20 +02:00
Robert Muir	e3ce6c9048	Painless: add fielddata accessors (.value/.values/.distance()/etc) This gives better coverage and consistency with the scripting APIs, by whitelisting the primary search scripting API classes and using them instead of only Map and List methods. For example, accessing fields can now be done with `.value` instead of `.0` because `getValue()` is whitelisted. For now, access to a document's fields in this way (loads) are fast-pathed in the code, to avoid dynamic overhead. Access to geo fields and geo distance functions is now supported. TODO: date support (e.g. whitelist ReadableDateTime methods as a start) TODO: improve docs (like expressions and groovy have for document's fields) TODO: remove fast-path hack Closes #18169 Squashed commit of the following: commit ec9f24b2424891a7429bb4c0a03f9868cba0a213 Author: Robert Muir <rmuir@apache.org> Date: Thu May 5 17:59:37 2016 -0400 cutover to <Def> instead of <Object> here commit 9edb1550438acd209733bc36f0d2e0aecf190ecb Author: Robert Muir <rmuir@apache.org> Date: Thu May 5 17:03:02 2016 -0400 add fast-path for docvalues field loads commit f8e38c0932fccc0cfa217516130ad61522e59fe5 Author: Robert Muir <rmuir@apache.org> Date: Thu May 5 16:47:31 2016 -0400 Painless: add fielddata accessors (.value/.values/.distance()/etc)	2016-05-05 18:38:41 -04:00
Jack Conradson	2cae575f53	Added single-quoted strings. Closes #18150	2016-05-05 09:26:02 -07:00
Robert Muir	59c135b58d	make internal Def methods private and add basic javadocs	2016-05-05 03:47:56 -04:00
Robert Muir	928e2b904d	painless: optimize/simplify dynamic field and method access	2016-05-05 03:42:14 -04:00
Jason Tedor	784c9e5fb9	Introduce node handshake This commit introduces a handshake when initiating a light connection. During this handshake, node information, cluster name, and version are received from the target node of the connection. This information can be used to immediately validate that the target node is a member of the same cluster, and used to set the version on the stream. This will allow us to extend APIs that are used during initial cluster recovery without a major version change. Relates #15971	2016-05-04 20:06:47 -04:00
Nik Everett	230697c202	[reindex] Switch throttle to Float.POSITIVE_INFITINTY/"unlimited" All other values are errors. Add java test for throttling. We had a REST test but it only ran against one node so it didn't catch serialization errors. Add Simple round trip test for rethrottle request	2016-05-04 16:14:32 -04:00
Christoph Büscher	ca21aa0cb5	Make reset() in QueryShardContext private The query shard reset() method resets some internal state in the query shard context, like clearing query names, the filter flag or named queries. The problem with this method being public is that it currently (miss?) used for modifying an existing context for recursive invocatiob, but the contexts that have been reseted that way cannot be properly set back to their previous state. This PR is a step towards removing reset() entirely by first making it only be used internally in QueryShardContext. In places where reset() was used we can either create new QueryShardContexts or modify the existing context because it is discarded afterwards anyway.	2016-05-03 18:56:16 +02:00
Robert Muir	fff82db681	Add tests/doc for boolean fields with expressions	2016-05-02 18:13:03 -04:00
Robert Muir	693c1f6671	Support geo_point fields in lucene expressions. Closes #18096	2016-05-02 17:49:21 -04:00
Robert Muir	28409e4509	Add support for .empty to expressions, and some docs improvements Closes #18077	2016-05-02 09:07:25 -04:00
Martijn van Groningen	7aca1389e2	ingest: Add `date_index_name` processor. Closes #17814	2016-04-29 17:20:48 +02:00
Ryan Ernst	d12a4bb51d	Merge pull request #17933 from rjernst/camelcase4 Remove camelCase support	2016-04-22 13:46:43 -07:00
Nik Everett	cc1a55423c	Reindex: properly mark things as child tasks Do this by creating a Client subclass that automatically assigns the parentTask to all requests that come through it. Code that doesn't want to set the parentTask can call `unwrap` on the Client to get the inner client instance that doesn't set the parentTask. Reindex uses this for its ClearScrollRequest so that the request will run properly after the reindex request has been canceled.	2016-04-22 14:00:11 -04:00
Ryan Ernst	55388590c1	Remove camelCase support Now that the current uses of magical camelCase support have been deprecated, we can remove these in master (sans remaining issues like BulkRequest). This change removes camel case support from ParseField, query types, analysis, and settings lookup. see #8988	2016-04-22 09:18:10 -07:00
Nik Everett	51621f9d75	Remove ChildTaskRequest and always pass parentTaskId when building a task Passing parentTaskId forces the caller to handle the parentTaskId.	2016-04-22 11:26:18 -04:00
Nik Everett	ffeb5e38fc	Remove parent-less task methods Callers should explicitly handle parents - either using EMPTY_TASK_ID when a parent isn't possible or piping parents from the TransportRequest when possible.	2016-04-22 11:26:18 -04:00
Martijn van Groningen	c5ad2e2865	Changed indexed scripts to be stored in the cluster state instead of the `.scripts` index. Also added max script size soft limit for stored scripts. Closes #16651	2016-04-22 13:42:55 +02:00
Isabel Drost-Fromm	233fe86ee4	Makes Script type writeable Used to be Streamable. Left-over of the PROTOTYPE related refactoring by @nik9000 Closes #17753	2016-04-21 12:40:29 +02:00
Christoph Büscher	e06e122f9f	Wrap xcontent parser creation in try-with-resource statement where possible	2016-04-18 16:13:56 +02:00
Christoph Büscher	cdb36a2b0c	Merge pull request #17417 Clean up QueryParseContext and don't hold it inside QueryRewrite/ShardContext	2016-04-18 15:13:53 +02:00
Nik Everett	d3b1306069	Reindex: never report negative throttled_until Just clamp the value at 0. It isn't useful to tell the user "this thread should have woken 5ms ago". Closes #17783	2016-04-15 16:53:23 -04:00
Christoph Büscher	fbd558382d	Clean up QueryParseContext and don't hold it inside QueryRewrite/ShardContext This change cleans up a few things in QueryParseContext and QueryShardContext that make it hard to reason about the state of these context objects and are thus error prone and should be simplified. Currently the parser that used in QueryParseContext can be set and reset any time from the outside, which makes reasoning about it hard. This change makes the parser a mandatory constructor argument removes ability to later set a different ParseFieldMatcher. If none is provided at construction time, the one set inside the parser is used. If a ParseFieldMatcher is specified at construction time, it is implicitely set on the parser that is beeing used. The ParseFieldMatcher is only kept inside the parser. Also currently the QueryShardContext historically holds an inner QueryParseContext (in the super class QueryRewriteContext), that is mainly used to hold the parser and parseFieldMatcher. For that reason, the parser can also be reset, which leads to the same problems as above. This change removes the QueryParseContext from QueryRewriteContext and removes the ability to reset or retrieve it from the QueryShardContext. Instead, `QueryRewriteContext#newParseContext(parser)` can be used to create new parse contexts with the given parser from a shard context when needed.	2016-04-15 17:13:01 +02:00
Christoph Büscher	de036d63d8	Rename context.parseFieldMatcher() to context.getParseFieldMatcher	2016-04-15 15:15:32 +02:00
Christoph Büscher	15c59d07b3	Remove ParseFieldMatcher from AbstractXContentParser Currently we are able to set a ParseFieldMatcher on XContentParsers, mainly to conveniently carry it around to be available where the actual parsing happens. This was just recently introduced together with ObjectParser so that ObjectParser can make use of deprecation logging and throwing errors while parsing. This however is trappy because we create parsers in so many places in the code and it is easy to forget setting the right ParseFieldMatcher. Instead we should hold the ParseFieldMatcher only in the parse contexts (e.g. QueryParseContext). This PR removes the ParseFieldMatcher from XContentParser. ObjectParser can still make use of it because we can make the otherwise unbounded `context` type to extend an interface that makes sure contexts used in ObjectParser can supply a ParseFieldMatcher. Contexts in ObjectParser are now no longer optional, but it is sufficient to pass in a small lambda expression in places where no other context is available. Relates to #17417	2016-04-15 15:14:46 +02:00
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. - Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Christoph Büscher	e15e7f7e6e	Remove parser argument from methods where we already pass in a parse context When we pass down both parser and QueryParseContext to a method, we cannot make sure that the parser contained in the context and the parser that is parsed as an argument have the same state. This removes the parser argument from methods where we currently have both the parser and the parse context as arguments and instead retrieves the parse from the context inside the method.	2016-04-14 16:18:05 +02:00
Martijn van Groningen	2928fd6ef3	Cleanup query builder for inner hits construction. * Inner hits can now only be provided and prepared via setter in the nested, has_child and has_parent query. * Also made `score_mode` a required constructor parameter. * Moved has_child's min_child/max_children validation from doToQuery(...) to a setter.	2016-04-14 14:43:21 +02:00
Nik Everett	cca3154c43	Rename isSourceEmpty to hasSource And add a test case for {} to reindex.	2016-04-13 08:19:58 -04:00
Nik Everett	c2e745bf3b	reindex: Guard against user disabling fields	2016-04-13 08:19:58 -04:00
Nik Everett	0f9804b0e2	reindex: gracefully handle when _source is disabled Closes #17666	2016-04-13 08:19:58 -04:00
Adrien Grand	013acf9179	Remove MappedFieldType.value. #17557 This commit removes `MappedFieldType.value` and simplifies `MappedFieldType.valueforSearch`. `valueforSearch` was used to post-process values that come for stored fields (eg. to convert a long back to a string representation of a date in the case of a date field) and also values that are extracted from the source but only in the case of GET calls: it would not be called when performing source filtering on search requests. `valueforSearch` is now only called for stored fields, since values that are extracted from the source should already be formatted as expected.	2016-04-12 09:12:56 +02:00
Adrien Grand	a14db8e17e	Remove MappedFieldType.useTermQueryWithQueryString() and isNumeric(). #17599 In both cases, what elasticsearch is really interested in is whether the field is an analyzed string field. So it can just check `tokenized()` instead.	2016-04-12 08:45:28 +02:00
Adrien Grand	496c7fbd84	Upgrade Lucene 6 Release * upgrades numerics to new Point format * updates geo api changes * adds GeoPointDistanceRangeQuery as XGeoPointDistanceRangeQuery * cuts over to ES GeoHashUtils	2016-04-11 16:50:04 -05:00
Adrien Grand	0eb1a816c8	Allow the query cache to be disabled. #16268 This replaces the internal `index.queries.cache.type` setting with a new `index.queries.cache.enabled` setting, which is documented. Closes #15802	2016-04-11 18:06:16 +02:00
Adrien Grand	42526ac28e	Remove Settings.settingsBuilder. We have both `Settings.settingsBuilder` and `Settings.builder` that do exactly the same thing, so we should keep only one. I kept `Settings.builder` since it has my preference but also it is the one that we use in examples of the Java API.	2016-04-08 18:10:02 +02:00
Adrien Grand	bef38a4d12	Fix test bug.	2016-04-07 09:49:27 +02:00
Adrien Grand	e1bfe23c22	ExtendedStatsAggregator should also pass sigma to emtpy aggs. #17388 Because sigma is also used at reduce time, it should be passed to empty aggs. Otherwise it causes bugs when an empty aggregation is used to perform reduction is it would assume a sigma of zero. Closes #17362	2016-04-07 09:34:11 +02:00
Nik Everett	16c12afabe	Rework ScoreFunctionBuilder registration to remove PROTOTYPEs This removes PROTOTYPEs from ScoreFunctionsBuilders. To do so we rework registration so it doesn't need PROTOTYPEs and lines up with the recent changes to query registration.	2016-04-06 13:04:11 -04:00
Nik Everett	2b6866d26b	Fix references to the removed parsers Mostly stuff is just in the builder now.	2016-04-06 11:15:22 -04:00
Adrien Grand	4c4bbb3e45	Replace FieldStatsProvider with a method on MappedFieldType. #17334 FieldStatsProvider had to perform instanceof calls to properly handle dates or ip addresses. By moving the logic to MappedFieldType, each field type can check whether all values are within bounds its way. Note that this commit only keeps rewriting support for dates, which are the only field for which the rewriting mechanism is likely to help (because of time-based indices).	2016-04-01 10:28:58 +02:00
Nik Everett	14d37baa4b	[reindex] Don't get rejected BulkByScrollTaskTest#testDelayAndRethrottle was getting rejected exceptions every once in a while. This was reproducible ~20% of the time for me. I added a CyclicBarrier to prevent the test from shutting down the thread pool before the threads get finished.	2016-03-31 14:50:14 -04:00
Nik Everett	0c762fca35	Fix test mistake	2016-03-31 12:27:35 -04:00
Nik Everett	7f794e7b77	Test for invalid scroll_size	2016-03-31 12:21:32 -04:00
Nik Everett	30a1862339	Remove PROTOTYPE from BulkItemResponse.Failure Closes #17086	2016-03-31 09:10:36 -04:00
javanna	32b6e529f4	Merge branch 'master' into enhancement/discovery_node_one_getter	2016-03-31 10:49:26 +02:00
Jack Conradson	a37e53c50f	Painless clean up including fixing _score issues and improving type error messages. Closes #17428	2016-03-30 16:40:17 -07:00
Nik Everett	78ab6c5b7f	[reindex] Dynamic throttle! This allows the user to update the reindex throttle on the fly, with changes that speed up the throttling being applied immediately and changes that slow down the throttling being applied during the next batch. This means that if a user throttles reindex in such a way that it tries to sleep for 16 years and then realizes that they've done something wrong then they can change the throttle and reindex will wake up again. We don't apply slow downs immediately so we never get in danger of losing the scan context. Also, if reindex is canceled while it is sleeping (how it honor throttling) then it'll immediately wake up and cancel itself.	2016-03-30 16:40:42 -04:00

1 2 3 4 5 ...

3043 Commits