OpenSearch

Commit Graph

Author	SHA1	Message	Date
Dimitris Athanasiou	66bef26495	Aggregations: bucket_sort pipeline aggregation (#27152 ) This commit adds a parent pipeline aggregation that allows sorting the buckets of a parent multi-bucket aggregation. The aggregation also offers [from] and [size] parameters in order to truncate the result as desired. Closes #14928	2017-11-09 17:59:57 +00:00
David Turner	1c6f5ce9cb	Improve error message for parse failures of completion fields (#27297 ) Fix spacing/grammar/punctuation, and include the field name and location in the source document.	2017-11-09 10:45:44 +00:00
Simon Willnauer	a34c2f0b8d	Ensure external refreshes will also refresh internal searcher to minimize segment creation (#27253 ) We cut over to internal and external IndexReader/IndexSearcher in #26972 which uses two independent searcher managers. This has the downside that refreshes of the external reader will never clear the internal version map which in-turn will trigger additional and potentially unnecessary segment flushes since memory must be freed. Under heavy indexing load with low refresh intervals this can cause excessive segment creation which causes high GC activity and significantly increases the required segment merges. This change adds a dedicated external reference manager that delegates refreshes to the internal reference manager that then `steals` the refreshed reader from the internal reference manager for external usage. This ensures that external and internal readers are consistent on an external refresh. As a sideeffect this also releases old segments referenced by the internal reference manager which can potentially hold on to already merged away segments until it is refreshed due to a flush or indexing activity.	2017-11-09 08:40:22 +00:00
David Turner	4abb5fa297	Remove optimisations to reuse objects when applying a new `ClusterState` (#27317 ) In order to avoid churn when applying a new `ClusterState`, there are some checks that compare parts of the old and new states and, if equal, the new object is discarded and the old one reused. Since `ClusterState` updates are now largely diff-based, this code is unnecessary: applying a diff also reuses any old objects if unchanged. Moreover, the code compares the parts of the `ClusterState` using their `version()` values which is not guaranteed to be correct, because of a lack of consensus. This change removes this optimisation, and tests that objects are still reused as expected via the diff mechanism.	2017-11-09 08:09:14 +00:00
Costin Leau	6f04b8c9be	Add unreleased 5.6.5 version number	2017-11-08 18:19:04 +02:00
Boaz Leskes	229bf29ba1	testCreateSplitIndexToN: do not set `routing_partition_size` to >= `number_of_routing_shards` It's an illegal value	2017-11-08 17:09:33 +01:00
Igor Motov	0fe2003ae6	Snapshot/Restore: better handle incorrect chunk_size settings in FS repo (#26844 ) Specifying a negative value or null as a chunk_size in FS repository can lead to corrupt snapshots. Closes #26843	2017-11-08 10:43:28 -05:00
Jason Tedor	6810aa8452	Correct comment in index shard test This commit fixes a comment in an index shard test which was inaccurate after it was copied from another test and not modified to reflect the reasoning in the test that it was copied into.	2017-11-08 09:33:12 -05:00
Jason Tedor	927d7f6b6c	Roll translog generation on primary promotion When a primary is promoted, rolling the translog generation here makes simpler reasoning about the relationship between primary terms and translog generation. Note that this is not strictly necessary for correctness (e.g., to avoid duplicate operations with the same sequence number within a single generation). Relates #27313	2017-11-08 09:14:08 -05:00
olcbean	bd5e7002be	ObjectParser: Replace IllegalStateException with ParsingException (#27302 ) Relates to #27147	2017-11-08 14:10:11 +01:00
Reese Levine	74b1e7db51	scripted_metric _agg parameter disappears if params are provided (#27159 ) * Fixes #19768: scripted_metric _agg parameter disappears if params are provided * Test case for #19768 * Compare boolean to false instead of negating it * Added mocked script in ScriptedMetricIT * Fix test in ScriptedMetricIT for implicit _agg map	2017-11-08 08:45:47 +00:00
Boaz Leskes	ace446f335	Update shrink's bwc version to 6.1.0 and enabled bwc tests	2017-11-07 15:35:46 +01:00
Mayya Sharipova	148376c2c5	Add limits for ngram and shingle settings (#27211 ) * Add limits for ngram and shingle settings (#27211) Create index-level settings: max_ngram_diff - maximum allowed difference between max_gram and min_gram in NGramTokenFilter/NGramTokenizer. Default is 1. max_shingle_diff - maximum allowed difference between max_shingle_size and min_shingle_size in ShingleTokenFilter. Default is 3. Throw an IllegalArgumentException when trying to create NGramTokenFilter, NGramTokenizer, ShingleTokenFilter where difference between max_size and min_size exceeds the settings value. Closes #25887	2017-11-07 08:14:55 -05:00
Boaz Leskes	95cf3df6ac	TemplateUpgradeService should only run on the master (#27294 ) The `TemplateUpgradeService` allows plugins to register a call back that mutates index templates upon recovery. This is handy for upgrade logic that needs to make sure that an existing index template is updated once the cluster is upgraded to a new version of the plugin (and ES). Currently, the service has complicated logic to decide which node should perform the upgrade. It will prefer the master node, if it is of the highest version of the cluster and otherwise it will fall back to one of the non-coordinating nodes which are on the latest version. While this attempts to make sure that new nodes can assume their template version is in place (but old node still need to be able to operate under both old and new template), it has an inherent problem in that the master (on an old version) may not be able to process the put template request with the new template - it may miss certain features. This PR changes the logic to be simpler and always rely on the current master nodes. This comes at the price that new nodes need to operate both with old templates and new. That price is small as they need to operate with old indices regardless of the template. On the flip side we reduce a lot of complexity in what can happen in the cluster.	2017-11-07 08:35:00 +01:00
Jason Tedor	d5451b2037	Die with dignity while merging If an out of memory error is thrown while merging, today we quietly rewrap it into a merge exception and the out of memory error is lost. Instead, we need to rethrow out of memory errors, and in fact any fatal error here, and let those go uncaught so that the node is torn down. This commit causes this to be the case. Relates #27265	2017-11-06 17:55:11 -05:00
Zachary Tong	6e9e07d6f8	Fix profiling naming issues (#27133 ) Some code-paths use anonymous classes (such as NonCollectingAggregator in terms agg), which messes up the display name of the profiler. If we encounter an anonymous class, we need to grab the super's name. Another naming issue was that ProfileAggs were not delegating to the wrapped agg's name for toString(), leading to ugly display. This PR also fixes up the profile documentation. Some of the examples were executing against empty indices, which shows different profile results than a populated index (and made for confusing examples). Finally, I switched the agg display names from the fully qualified name to the simple name, so that it's similar to how the query profiles work. Closes #26405	2017-11-06 16:37:33 -05:00
Jason Tedor	766d29e7cf	Correctly encode warning headers The warnings headers have a fairly limited set of valid characters (cf. quoted-text in RFC 7230). While we have assertions that we adhere to this set of valid characters ensuring that our warning messages do not violate the specificaion, we were neglecting the possibility that arbitrary user input would trickle into these warning headers. Thus, missing here was tests for these situations and encoding of characters that appear outside the set of valid characters. This commit addresses this by encoding any characters in a deprecation message that are not from the set of valid characters. Relates #27269	2017-11-06 13:20:30 -05:00
kel	d7fa09153a	Remove duplicated SnapshotStatus (#27276 )	2017-11-06 16:19:16 +01:00
Simon Willnauer	bd7efa908a	Add ability to split shards (#26931 ) This change adds a new `_split` API that allows to split indices into a new index with a power of two more shards that the source index. This API works alongside the `_shrink` API but doesn't require any shard relocation before indices can be split. The split operation is conceptually an inverse `_shrink` operation since we initialize the index with a _syntetic_ number of routing shards that are used for the consistent hashing at index time. Compared to indices created with earlier versions this might produce slightly different shard distributions but has no impact on the per-index backwards compatibility. For now, the user is required to prepare an index to be splittable by setting the `index.number_of_routing_shards` at index creation time. The setting allows the user to prepare the index to be splittable in factors of `index.number_of_routing_shards` ie. if the index is created with `index.number_of_routing_shards: 16` and `index.number_of_shards: 2` it can be split into `4, 8, 16` shards. This is an intermediate step until we can make this the default. This also allows us to safely backport this change to 6.x. The `_split` operation is implemented internally as a DeleteByQuery on the lucene level that is executed while the primary shards execute their initial recovery. Subsequent merges that are triggered due to this operation will not be executed immediately. All merges will be deferred unti the shards are started and will then be throttled accordingly. This change is intended for the 6.1 feature release but will not support pre-6.1 indices to be split unless these indices have been shrunk before. In that case these indices can be split backwards into their original number of shards.	2017-11-06 11:37:55 +01:00
Tanguy Leroux	43e7a4a349	Upgrade to Jackson 2.8.10 (#27230 ) While it's not possible to upgrade the Jackson dependencies to their latest versions yet (see #27032 (comment) for more) it's still possible to upgrade to the latest 2.8.x version.	2017-11-06 10:20:05 +01:00
kel	76f81e002c	Remove unused parameters in AnalysisRegistry (#27232 ) Removes unused parameters for AnalysisRegistry#processAnalyzerFactory and AnalysisRegistry#processNormalizerFactory.	2017-11-06 09:48:57 +01:00
kel	5d661df174	Add more information on `_failed_to_convert_` exception (#27034 )	2017-11-06 09:40:28 +01:00
Jim Ferenczi	429275a773	Remove ElasticsearchQueryCachingPolicy (#27190 ) We have an hidden setting called `index.queries.cache.term_queries` that disables caching of term queries in the query cache. Though term queries are not cached in the Lucene UsageTrackingQueryCachingPolicy since version 6.5. This makes the es policy useless but also makes it impossible to re-enable caching for term queries. This change appeared in Lucene 6.5 so this setting is no-op since version 5.4 of Elasticsearch The change in this PR removes the setting and the custom policy.	2017-11-06 08:26:24 +01:00
Nhat	fd3fac9565	Backport the size-based index rollver to v6.1.0 Relates #27004	2017-11-04 20:14:59 -04:00
Nhat	c7ce5a07f2	Add size-based condition to the index rollover API (#27160 ) This is to add a max_size condition to the index rollover API. We use a totalSizeInBytes from DocsStats to evaluate this condition. Closes #27004	2017-11-04 19:51:48 -04:00
David Roberts	749c3ec716	Remove the single argument Environment constructor (#27235 ) Only tests should use the single argument Environment constructor. To enforce this the single arg Environment constructor has been replaced with a test framework factory method. Production code (beyond initial Bootstrap) should always use the same Environment object that Node.getEnvironment() returns. This Environment is also available via dependency injection.	2017-11-04 13:25:09 +00:00
Chris Earle	964016e228	Fix RestGetAction name typo This changes the name from docuemnt_get_action to document_get_action. Relates #27266	2017-11-04 08:29:00 -04:00
Igor Motov	117f0f3a44	Fix snapshot getting stuck in INIT state (#27214 ) If the master disconnects from the cluster after initiating snapshot, but just before the snapshot switches from INIT to STARTED state, the snapshot can get indefinitely stuck in the INIT state. This error is specific to v5.x+ and was triggered by keeping the master node that stepped down in the node list, the cleanup logic in snapshot/restore assumed that if master steps down it is always removed from the the node list. This commit changes the logic to trigger cleanup even if no nodes left the cluster. Closes #27180	2017-11-03 19:36:08 -04:00
Colin Goodheart-Smithe	20e8005859	Fixes QueryStringQueryBuilderTests Closes #27246	2017-11-03 13:24:56 +00:00
Jim Ferenczi	262422375e	[Test] Fix QueryStringQueryBuilderTests.testExistsFieldQuery BWC Handle BWC version in this test. Closes #27246	2017-11-03 14:17:11 +01:00
kel	0f21262b36	Do not create directories if repository is readonly (#26909 ) For FsBlobStore and HdfsBlobStore, if the repository is read only, the blob store should be aware of the readonly setting and do not create directories if they don't exist. Closes #21495	2017-11-03 13:10:50 +01:00
Christoph Büscher	9abc26ee92	[Test] Fix InternalStatsTests After recent changes in InternalStats#doXContentBody the corresponding xContent output of the parsed aggregation needed to be changed in a similar way.	2017-11-03 11:26:41 +01:00
Jim Ferenczi	d503782699	[Test] Fix QueryStringQueryBuilderTests.testExistsFieldQuery Adapt the test to check for the new NormsFieldExistsQuery. Closes #27246	2017-11-03 11:24:45 +01:00
Colin Goodheart-Smithe	28b4d95cf5	Uses norms for exists query if enabled (#27237 ) * Uses norms for exists query if enabled This change means that for indexes created from 6.1.0, if normas are enabled we will not write the field name to the `_field_names` field and for an exists query we will instead use the NormsFieldExistsQuery which was added in Lucene 7.1.0. If norms are not enabled or if the index was created before 6.1.0 `_field_names` will be used as before. * Fixes tests	2017-11-03 08:51:40 +00:00
Mathias Fußenegger	827ba7f82d	Avoid uid creation in ParsedDocument (#27241 ) The uid bytes (as the type#id) were needlessly being created even though they are no longer needed after the move to single type per index. This commit avoids creating these when parsed documents are constructed. Relates #27241	2017-11-02 20:10:07 -04:00
kel	55b9dfdd52	Rander sum as zero if count is zero for stats aggregation (#26893 ) (#27193 )	2017-11-02 16:02:47 +00:00
Simon Willnauer	b294250aba	Remove unused searcher parameter in SearchService#createContext (#27227 ) This parameter isn't used anywhere and just adds complexity.	2017-11-02 14:58:34 +01:00
Colin Goodheart-Smithe	c1b8140c83	Upgrade to Lucene 7.1 (#27225 )	2017-11-02 13:25:33 +00:00
Simon Willnauer	f928d613ad	Move IndexShard#getWritingBytes() under InternalEngine (#27209 ) We do some accounting in IndexShard that is not necessarily correct since we maintain two different index readers. This change moves the accounting under the engine which knows what reader we are refreshing. Relates to #26972	2017-11-02 10:43:17 +01:00
olcbean	b9896465cd	Introducing took time for _msearch This commit adds the took time to the response for _msearch. Relates #23767	2017-11-01 21:39:04 -04:00
Jason Tedor	59657ad1cb	Lazy initialize checkpoint tracker bit sets This local checkpoint tracker uses collections of bit sets to track which sequence numbers are complete, eventually removing these bit sets when the local checkpoint advances. However, these bit sets were eagerly allocated so that if a sequence number far ahead of the checkpoint was marked as completed, all bit sets between the "last" bit set and the bit set needed to track the marked sequence number were allocated. If this sequence number was too far ahead, the memory requirements could be excessive. This commit opts for a different strategy for holding on to these bit sets and enables them to be lazily allocated. Relates #27179	2017-11-01 21:26:52 -04:00
Jason Tedor	90d6317437	Remove checkpoint tracker bit sets setting We added an index-level setting for controlling the size of the bit sets used to back the local checkpoint tracker. This setting is really only needed to control the memory footprint of the bit sets but we do not think this setting is going to be needed. This commit removes this setting before it is released to the wild after which we would have to worry about BWC implications. Relates #27191	2017-11-01 21:13:01 -04:00
Colin Goodheart-Smithe	99aca9cdfc	Enhances exists queries to reduce need for `_field_names` (#26930 ) * Enhances exists queries to reduce need for `_field_names` Before this change we wrote the name all the fields in a document to a `_field_names` field and then implemented exists queries as a term query on this field. The problem with this approach is that it bloats the index and also affects indexing performance. This change adds a new method `existsQuery()` to `MappedFieldType` which is implemented by each sub-class. For most field types if doc values are available a `DocValuesFieldExistsQuery` is used, falling back to using `_field_names` if doc values are disabled. Note that only fields where no doc values are available are written to `_field_names`. Closes #26770 * Addresses review comments * Addresses more review comments * implements existsQuery explicitly on every mapper * Reinstates ability to perform term query on `_field_names` * Added bwc depending on index created version * Review Comments * Skips tests that are not supported in 6.1.0 These values will need to be changed after backporting this PR to 6.x	2017-11-01 10:46:59 +00:00
Martijn van Groningen	d805c41b28	Added new terms_set query This query returns documents that match with at least one ore more of the provided terms. The number of terms that must match varies per document and is either controlled by a minimum should match field or computed per document in a minimum should match script. Closes #26915	2017-11-01 10:55:18 +01:00
Jack Conradson	fd73e5fa41	Add version 6.0.0	2017-10-31 17:49:52 -07:00
Tanguy Leroux	13cd08b1e6	Convert index blocks to cluster block exceptions (#27050 )	2017-10-31 16:11:18 +01:00
Shai Erera	bd0261916c	Fix Laplace scorer to multiply by alpha (and not add) (#27125 )	2017-10-31 13:08:44 +01:00
javanna	34666844b3	[DOCS] Clarify migrate guide and search request validation Relates to #26811	2017-10-31 12:36:00 +01:00
kel	c3e2bdf20c	Raise IllegalArgumentException if query validation failed (#26811 ) Closes #26799	2017-10-31 12:17:27 +01:00
Armin Braun	a4c159e91e	prevent duplicate fields when mixing parent and root nested includes (#27072 ) Closes #26990	2017-10-31 10:01:33 +01:00

1 2 3 4 5 ...

9020 Commits