OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	cb1204774b	Include the _index, _type and _id in nested search hits in the top_hits and inner_hits response. Also include _type and _id for parent/child hits inside inner hits. In the case of top_hits aggregation the nested search hits are directly returned and are not grouped by a root or parent document, so it is important to include the _id and _index attributes in order to know which documents these nested search hits belong to. Closes #27053	2017-11-28 14:05:29 +01:00
Colin Goodheart-Smithe	99aca9cdfc	Enhances exists queries to reduce need for `_field_names` (#26930) * Enhances exists queries to reduce need for `_field_names` Before this change we wrote the names of all the fields in a document to a `_field_names` field and then implemented exists queries as a term query on this field. The problem with this approach is that it bloats the index and also affects indexing performance. This change adds a new method `existsQuery()` to `MappedFieldType` which is implemented by each sub-class. For most field types if doc values are available a `DocValuesFieldExistsQuery` is used, falling back to using `_field_names` if doc values are disabled. Note that only fields where no doc values are available are written to `_field_names`. Closes #26770 * Addresses review comments * Addresses more review comments * implements existsQuery explicitly on every mapper * Reinstates ability to perform term query on `_field_names` * Added bwc depending on index created version * Review Comments * Skips tests that are not supported in 6.1.0 These values will need to be changed after backporting this PR to 6.x	2017-11-01 10:46:59 +00:00
Simon Willnauer	8dda827ff4	Don't refresh on `_flush`, `_force_merge` and `_upgrade` (#27000) Today all these API calls have a side effect of making documents visible to search requests. While this is sometimes desired it's an unnecessary side effect and now that we have an internal (engine-private) index reader (#26972) we artificially add a refresh call for bwc. This change removes this side effect in 7.0.	2017-10-16 10:16:35 +02:00
Martijn van Groningen	dca787ed8a	upgrade to Lucene 7.1.0 snapshot version	2017-10-05 09:06:56 +02:00
Simon Willnauer	aab4655e63	Unify Settings xcontent reading and writing (#26739) This change adds a fromXContent method to Settings that allows reading the xcontent that is produced by toXContent. It also replaces the entire settings loader infrastructure and removes the structured map representation. Future PRs will also tackle the `getAsMap` that exposes the internal representation of settings for better encapsulation.	2017-09-25 13:23:01 +02:00
Christoph Büscher	86b00b84bc	Remove parse field deprecations in query builders (#26711 ) The `fielddata` field and the use of the `_name` field in the short syntax of the range query have been deprecated in 5.0 and can be removed. The same goes for the deprecated `score_mode` field in HasParentQueryBuilder, the deprecated `like_text`, `ids` and `docs` parameter in the `more_like_this` query, the deprecated query name in the short version of the `regexp` query, and several deprecated alternative field names in other query builders.	2017-09-20 16:22:21 +02:00
Martijn van Groningen	78e9c96d7f	Added a limit to from + size in top_hits and inner hits. Relates to #11511	2017-09-05 08:44:45 +02:00
Colin Goodheart-Smithe	ce1d85d7d0	Moves deferring code into its own subclass (#26421) * Moves deferring code into its own subclass This change moves the code that deals with deferring collection to a subclass of BucketsAggregator called DeferringBucketAggregator. This means that the code in AggregatorBase is simplified and also means that the code for deferring collection is in one place and easier to maintain. * Makes SingleBucketAggregator an interface This is so aggregators that extend BucketsAggregator directly and those that extend DeferringBucketAggregator can be a single bucket aggregator * review comments * More review comments	2017-08-30 11:15:40 +01:00
olcbean	5c4c1c5e15	Verify that _bulk and _msearch requests are terminated by a newline (#25740 )	2017-08-08 10:45:44 +02:00
Simon Willnauer	82fa531ab4	Remove `_index` fielddata hack if cluster alias is present (#26082 ) We introduced a hack in #25885 to respect the cluster alias if available on the `_index` field. This is important if aggregations or other field data related operations are executed. Yet, we added a small hack that duplicated an implementation detail from the `_index` field data builder to make this work. This change adds a necessary but simple API change that allows us to remove the hack and only have a single implementation.	2017-08-08 09:24:24 +02:00
Adrien Grand	f0cba4fce5	Add a scripted similarity. (#25831) The goal of this similarity is to help users who would like to keep the functionality of the `tf-idf` similarity that we want to remove, or to allow for specific use-cases (disabling idf, disabling tf, disabling length norm, etc.) to not have to build a custom plugin and familiarize themselves with the low-level Lucene API.	2017-08-08 08:55:12 +02:00
Adrien Grand	88d456989e	Make FieldMapper.copyTo() always non-null. (#25994 ) Otherwise it is confusing that both a null copyTo and an empty copyTo should be treated the same.	2017-08-02 10:07:29 +02:00
Jim Ferenczi	562c3744ca	Merge FunctionScoreQuery and FiltersFunctionScoreQuery (#25889) This change merges the functionality of the FiltersFunctionScoreQuery in the FunctionScoreQuery. It also ensures that an exception is thrown when the computed score is equal to Float.NaN or Float.NEGATIVE_INFINITY. These scores are invalid for TopDocsCollectors that rely on score comparison. Fixes #15709 Fixes #23628	2017-07-28 09:22:20 +02:00
Simon Willnauer	634ce90dc0	Respect cluster alias in `_index` aggs and queries (#25885 ) Today when we aggregate on the `_index` field the cross cluster search alias is not taken into account. Neither is it respected when we search on the field. This change adds support for cluster alias when the cluster alias is present on the `_index` field. Closes #25606	2017-07-26 09:16:52 +02:00
Adrien Grand	40bb1663ee	Index ids in binary form. (#25352 ) Indexing ids in binary form should help with indexing speed since we would have to compare fewer bytes upon sorting, should help with memory usage of the live version map since keys will be shorter, and might help with disk usage depending on how efficient the terms dictionary is at compressing terms. Since we can only expect base64 ids in the auto-generated case, this PR tries to use an encoding that makes the binary id equal to the base64-decoded id in the majority of cases (253 out of 256). It also specializes numeric ids, since this seems to be common when content that is stored in Elasticsearch comes from another database that uses eg. auto-increment ids. Another option could be to require base64 ids all the time. It would make things simpler but I'm not sure users would welcome this requirement. This PR should bring some benefits, but I expect it to be mostly useful when coupled with something like #24615. Closes #18154	2017-07-07 14:22:47 +02:00
Martijn van Groningen	d0f9f425bd	parent/child: Removed ParentJoinFieldSubFetchPhase	2017-07-06 13:15:02 +02:00
Martijn van Groningen	407273f81d	parent/child: Support parent id being specified as number in the _source	2017-07-06 11:48:57 +02:00
Christoph Büscher	f576c987ce	Remove QueryParseContext (#25486 ) QueryParseContext is currently only used as a wrapper for an XContentParser, so this change removes it entirely and changes the appropriate APIs that use it so far to only accept a parser instead.	2017-07-03 17:30:40 +02:00
Christoph Büscher	927111c91d	Remove QueryParseContext from parsing QueryBuilders (#25448) Currently QueryParseContext is only a thin wrapper around an XContentParser that adds little functionality of its own. It provides helpers for long-deprecated field names which can be removed and two helper methods that can be made static and moved to other classes. This is a first step in helping to remove QueryParseContext entirely.	2017-06-29 17:10:20 +02:00
olcbean	3518e313b8	Unify the result interfaces from get and search in Java client (#25361 ) As GetField and SearchHitField have the same members, they have been unified into DocumentField. Closes #16440	2017-06-29 11:35:28 +02:00
Simon Willnauer	d338a09812	Remove `mapping.single_type` from parent join test (#25391) This removes the remaining usage of `mapping.single_type` from the parent join module and moves its bwc test to the mixed cluster tests Relates to #24961 Relates to #20257	2017-06-26 17:33:07 +02:00
Simon Willnauer	4ae426a552	Remove remaining `index.mapping.single_type=false` (#25369) This change cleans up remaining tests to not use index.mapping.single_type=false but instead where applicable use a single type or mark the index as created with a pre 6.x version. Yet, there is still one leftover in the client tests that needs special attention. See `org.elasticsearch.client.SearchIT` Relates to #24961	2017-06-23 10:26:06 +02:00
Jim Ferenczi	5e64cd08bc	[Test] restore BWC for parent-join now that the new mapping format is in 5.x	2017-06-15 15:15:48 +02:00
Jim Ferenczi	9ca33e2450	Add a section named "relations" in the ParentJoinFieldMapper (#25248) * Add a section named "relations" in the ParentJoinFieldMapper This commit puts the parent/child definition in an inner section named "relations". Mapping for the parent-join will look like this: ``` "join_field": { "type": "join", "relations": { "parent": "child" } } ```	2017-06-15 14:56:20 +02:00
Tanguy Leroux	27f1206999	Use SPI in High Level Rest Client to load XContent parsers (#25098 ) This commit adds a NamedXContentProvider interface that can be implemented by plugins or modules using Java's SPI feature in order to provide additional NamedXContent parsers to external applications like the Java High Level Rest Client.	2017-06-15 12:50:02 +02:00
Adrien Grand	a8ea2f0df4	Leverage scorerSupplier when applicable. (#25109 ) The `scorerSupplier` API allows to give a hint to queries in order to let them know that they will be consumed in a random-access fashion. We should use this for aggregations, function_score and matched queries.	2017-06-08 10:19:38 +02:00
Jim Ferenczi	3924fd79ef	Add BWC rest test for parent-join after the backport to 5.x	2017-06-07 19:29:01 +02:00
Martijn van Groningen	db8aa8e94e	Changed inner_hits to work with the new join field type while maintaining support for the `_parent` meta field type. Relates to #20257	2017-06-07 10:52:49 +02:00
Jim Ferenczi	7e60cf3e54	Move parent_id query to the parent-join module (#25072 ) This change moves the parent_id query to the parent-join module and handles the case when only the parent-join field can be declared on an index (index with single type on). If single type is off it uses the legacy parent join field mapper and switch to the new one otherwise (default in 6). Relates #20257	2017-06-06 19:35:14 +02:00
Martijn van Groningen	2a71a7bffc	Change `has_child`, `has_parent` queries and `children` aggregation to work with the new join field type while maintaining support for the `_parent` meta field type. Relates to #20257	2017-06-02 23:27:16 +02:00
Jim Ferenczi	4077600035	Disallow the new parent join field on indices with multiple types Relates https://github.com/elastic/elasticsearch/pull/24978	2017-06-02 18:28:03 +02:00
Jim Ferenczi	b8605775df	Add the ability to set eager_global_ordinals in the new parent-join field (#25019 ) Defaults to true	2017-06-02 15:34:22 +02:00
Jim Ferenczi	f4aee1e583	Disallow multiple parent-join fields per mapping (#25002 ) This change ensures that there is a single parent-join field defined per mapping. The verification is done through the addition of a special field mapper (MetaJoinFieldMapper) with a unique name (_parent_join) that is registered to the mapping service when the first parent-join field is defined. If a new parent-join is added, this field mapper will clash with the new one and the update will fail. This change also simplifies the parent join fetch sub phase by retrieving the parent-join field without iterating on all fields in the mapping.	2017-06-02 09:21:15 +02:00
Jim Ferenczi	b5d62ae747	Introduce ParentJoinFieldMapper, a field mapper that creates parent/child relation within documents of the same index (#24978 ) * Introduce ParentJoinFieldMapper, a field mapper that creates parent/child relation within documents of the same index This change adds a new field mapper named ParentJoinFieldMapper. This mapper is a replacement for the ParentFieldMapper but instead of using the types in the mapping it uses an internal field to materialize parent/child relation within a single index. This change also adds a fetch sub phase that automatically retrieves the join name (parent or child name) and the parent id for child documents in the response hit fields. The compatibility with `has_parent`, `has_child` queries and `children` agg will be added in a follow up. Relates #20257	2017-05-31 18:07:21 +02:00
Nik Everett	5da8ce8318	Remove the need for _UNRELEASED suffix in versions (#24798 ) Removes the need for the `_UNRELEASED` suffix on versions by detecting if a version should be unreleased or not based on the versions around it. This should make it simpler to automate the task of adding a new version label.	2017-05-26 18:36:32 -04:00
Jim Ferenczi	4707377cea	Move InnerHitBuilder queries BWC version to 5.5 after the backport Relates #24676	2017-05-23 22:41:39 +02:00
Jim Ferenczi	9087803cd9	Add the ability to define custom inner hit sub context builder (#24676 ) This commit moves the handling of nested and parent/child inner hits to specialized classes that can be defined outside of ES core. InnerHitBuilderContext is now used by the parent query (nested or hasChild, ...) to build the sub context from the InnerHitBuilder definition. BWC is also ensured so that nodes in previous versions can still send/receive inner hits to/from this version. Relates #20257	2017-05-23 13:06:22 +02:00
javanna	db0490343e	Merge branch 'master' into feature/client_aggs_parsing	2017-05-19 18:17:06 +02:00
Jim Ferenczi	d241c4898e	Removes parent child fielddata specialization (#24737 ) This change removes the field data specialization needed for the parent field and replaces it with a simple DocValuesIndexFieldData. The underlying global ordinals are retrieved via a new function called IndexOrdinalsFieldData#getOrdinalMap. The children aggregation is also modified to use a simple WithOrdinals value source rather than the deleted WithOrdinals.Parent. Relates #20257	2017-05-19 17:11:23 +02:00
javanna	ce7326eb88	Merge branch 'master' into feature/client_aggs_parsing	2017-05-17 17:59:00 +02:00
Ryan Ernst	2a65bed243	Tests: Change rest test extension from .yaml to .yml (#24659 ) This commit renames all rest test files to use the .yml extension instead of .yaml. This way the extension used within all of elasticsearch for yaml is consistent.	2017-05-16 17:24:35 -07:00
Christoph Büscher	0b688a8733	Small improvement in InternalAggregationTestCase test setup after changes in master (#24675 )	2017-05-15 15:06:01 +02:00
Christoph Büscher	42e8d4b761	Merge branch 'master' into feature/client_aggs_parsing Conflicts: core/src/test/java/org/elasticsearch/search/aggregations/bucket/filter/InternalFilterTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/global/InternalGlobalTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/missing/InternalMissingTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/nested/InternalNestedTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/nested/InternalReverseNestedTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/sampler/InternalSamplerTests.java modules/parent-join/src/test/java/org/elasticsearch/join/aggregations/InternalChildrenTests.java test/framework/src/main/java/org/elasticsearch/search/aggregations/InternalSingleBucketAggregationTestCase.java	2017-05-15 12:25:07 +02:00
Jim Ferenczi	279a18a527	Add parent-join module (#24638 ) * Add parent-join module This change adds a new module named `parent-join`. The goal of this module is to provide a replacement for the `_parent` field but as a first step this change only moves the `has_child`, `has_parent` queries and the `children` aggregation to this module. These queries and aggregations are no longer in core but they are deployed by default as a module. Relates #20257	2017-05-12 15:58:06 +02:00

94 Commits