OpenSearch

Commit Graph

Author	SHA1	Message	Date
jimczi	0449869511	Fix unchecked warning in RollupIndexerIndexingTests#testSimpleDateHistoWithOverlappingDelay	2019-05-21 12:28:57 +02:00
Jim Ferenczi	ec63160243	Fix max boundary for rollups job that use a delay (#42158 ) Rollup jobs can define how long they should wait before rolling up new documents. However if the delay is smaller or if it's not a multiple of the rollup interval the job can create incomplete buckets because the max boundary for a job is computed from the time when the job started rounded to the interval minus the delay. This change fixes this computation by applying the delay subtraction before the rounding in order to ensure that we never create a boundary that falls in the middle of a bucket.	2019-05-21 08:48:53 +02:00
Zachary Tong	6ae6f57d39	[7.x Backport] Force selection of calendar or fixed intervals (#41906 ) The date_histogram accepts an interval which can be either a calendar interval (DST-aware, leap seconds, arbitrary length of months, etc) or fixed interval (strict multiples of SI units). Unfortunately this is inferred by first trying to parse as a calendar interval, then falling back to fixed if that fails. This leads to confusing arrangement where `1d` == calendar, but `2d` == fixed. And if you want a day of fixed time, you have to specify `24h` (e.g. the next smallest unit). This arrangement is very error-prone for users. This PR adds `calendar_interval` and `fixed_interval` parameters to any code that uses intervals (date_histogram, rollup, composite, datafeed, etc). Calendar only accepts calendar intervals, fixed accepts any combination of units (meaning `1d` can be used to specify `24h` in fixed time), and both are mutually exclusive. The old interval behavior is deprecated and will throw a deprecation warning. It is also mutually exclusive with the two new parameters. In the future the old dual-purpose interval will be removed. The change applies to both REST and java clients.	2019-05-20 12:07:29 -04:00
Zachary Tong	f410f91f13	Cleanup RollupSearch exceptions, disallow partial results (#41272 ) - msearch exceptions should be thrown directly instead of wrapping in a RuntimeException - Do not allow partial results (where some indices are missing), instead throw an exception if any index is missing	2019-05-08 12:38:42 -04:00
Ryan Ernst	6fd8924c5a	Switch run task to use real distro (#41590 ) The run task is supposed to run elasticsearch with the given plugin or module. However, for modules, this is most realistic if using the full distribution. This commit changes the run setup to use the default or oss as appropriate.	2019-05-06 12:34:07 -07:00
Hendrik Muhs	0c03707704	[ML-DataFrame] reset/clear the position after indexer is done (#41736 ) reset/clear the position after indexer is done	2019-05-06 09:41:51 +02:00
Zachary Tong	ec5dd0594f	Disallow null/empty or duplicate composite sources (#41359 ) Adds some validation to prevent duplicate source names from being used in the composite agg. Also refactored to use a ConstructingObjectParser and removed the private ctor and setter for sources, making it mandatory.	2019-04-24 13:23:31 -04:00
Zachary Tong	7e62ff2823	[Rollup] Validate timezones based on rules not string comparison (#36237 ) The date_histogram internally converts obsolete timezones (such as "Canada/Mountain") into their modern equivalent ("America/Edmonton"). But rollup just stored the TZ as provided by the user. When checking the TZ for query validation we used a string comparison, which would fail due to the date_histo's upgrading behavior. Instead, we should convert both to a TimeZone object and check if their rules are compatible.	2019-04-17 13:46:44 -04:00
Hendrik Muhs	3df6798c4c	Rollup/DataFrame: disallow partial results (#41114 ) disallow partial results in rollup and data frame, after this change the client throws an error directly replacing the previous runtime exception thrown, allowing better error handling in implementations.	2019-04-12 07:31:22 +02:00
Hendrik Muhs	c37b127a07	fix a timing issue: isFinished is used for a busy loop in testing, (#41055 ) test: ensure state is persisted before the isFinished is changed fixes #41046	2019-04-10 18:47:34 +02:00
Julie Tibshirani	0d5f86a001	Mute RollupIndexerStateTests#testIndexing. Tracked in #41046.	2019-04-09 17:17:04 -07:00
Hendrik Muhs	d5fcbf2f4a	refactor onStart and onFinish to take runnables and executed them guarded by state (#40855 ) refactor onStart and onFinish to take action listeners and execute them when indexer is in indexing state.	2019-04-07 21:46:37 +02:00
Jim Ferenczi	a15f55b2de	Rollup ignores time_zone on date histogram (#40844 ) When translating the original aggregation for the rollup indices, the timezone of the date histogram is validated against the rollup job but the value is not copied in the newly created date_histogram.	2019-04-04 21:16:50 +02:00
Zachary Tong	abbfc75052	Remove timezone validation on rollup range queries (#40647 ) We enforced the timezone of range queries when using the rollup search endpoint, but this validation is not needed. Since rollup dates are stored in UTC, and range queries are always converted to UTC (even if specifying a `time_zone`) the validation is not needed and can prevent legitimate queries from running.	2019-04-02 14:25:16 -04:00
Jay Modi	697911c31d	Fixed missed stopping of SchedulerEngine (#39193 ) The SchedulerEngine is used in several places in our code and not all of these usages properly stopped the SchedulerEngine, which could lead to test failures due to leaked threads from the SchedulerEngine. This change adds stopping to these usages in order to avoid the thread leaks that cause CI failures and noise. Closes #38875	2019-02-21 14:31:33 -07:00
Alexander Reelsen	daa2ec8a60	Switch mapping/aggregations over to java time (#36363 ) This commit moves the aggregation and mapping code from joda time to java time. This includes field mappers, root object mappers, aggregations with date histograms, query builders and a lot of changes within tests. The cut-over to java time is a requirement so that we can support nanoseconds properly in a future field mapper. Relates #27330	2019-01-23 10:40:05 +01:00
Zachary Tong	de52ba1f78	Fix RollupDocumentation test to wait for job to stop Also adds some extra state debug information to various log messages	2019-01-11 14:14:58 -05:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Josh Soref	1df66d21fe	Spelling: replace uknown with unknown (#37056 )	2019-01-02 17:33:02 +01:00
Zachary Tong	6d49873ab7	Fix Rollup's metadata parser (#36791 ) The parser used for rollup configs in _meta fields was not able to handle unrelated data in the meta field. If an unrelated object was encountered, it would half-consume the JSON object, realize it wasn't a rollup config, then stop parsing. This would leave the object halfway consumed and the parsing framework would throw an exception. This commit replaces the parsing logic with a set of minimal parsers, each for the specific component we care about (`_doc`, `_meta`, `_rollup`) and configured to ignore unknown fields where applicable. More verbose, but less hacky than before and should be more robust. Also adds tests (randomized and explicit) to make sure this doesn't break in the future.	2018-12-18 16:35:39 -05:00
Ryan Ernst	c4f4378006	Core: Rework multi date formatter merging (#36447 ) This commit moves the MergedDateFormatter to a package private class and reworks joda DateFormatter instances to use that instead of a single DateTimeFormatter with multiple parsers. This will allow the java and joda multi formats to share the same format parsing method in a followup.	2018-12-11 23:47:44 -08:00
Nik Everett	03daad9812	Re-deprecate xpack rollup endpoints (#36451 ) Redeprecates the `/_xpack/rollup` endpoints in favor of `/_rollup`. When we cleanup the rollup in a cluster containing 6.x nodes we need to use `/_xpack/rollup` instead of `/_rollup` because the 6.x nodes don't know about `/_rollup`. In those cases we must ignore the deprecation warnings that the 7.0 node will return for the end point. Closes #36044	2018-12-11 19:43:17 -05:00
Jason Tedor	0909a631ba	Add non-X-Pack centric rollup endpoints (#36383 ) * Add non-X-Pack centric rollup endpoints This commit adds new endpoints for rollup that do not have _xpack in their path. The purpose for this change is to take these endpoints into 6.x as well so that they can be available in mixed cluster tests too. A follow-up change will deprecate the use of _xpack in the rollup endpoints. And finally, in the future, we would remove the _xpack endpoints. * Remove import * Fix typo	2018-12-10 14:50:30 -05:00
Ryan Ernst	a27f2efca5	Core: Converge FormatDateTimeFormatter and DateFormatter apis (#36390 ) This commit makes FormatDateTimeFormatter and DateFormatter apis close to each other, so that the former can be removed in favor of the latter. This PR does not change the uses of FormatDateTimeFormatter yet, so that that future change can be purely mechanical.	2018-12-07 17:23:41 -08:00
Nik Everett	ead2b9e08b	HLRC: Add rollup search (#36334 ) Relates to #29827	2018-12-07 14:39:58 -05:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation equals `eq`) or a lower bound of the total (in which case it equals `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: false`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Martijn van Groningen	11935cd480	Replace Streamable w/ Writeable in BaseTasksResponse and subclasses (#36176 ) This commit replaces usages of Streamable with Writeable for the BaseTasksResponse / TransportTasksAction classes and subclasses of these classes. Note that where possible response fields were made final. Relates to #34389	2018-12-05 13:14:10 +01:00
Martijn van Groningen	43773a32a4	Replace Streamable w/ Writeable in BaseTasksRequest and subclasses (#35854 ) * Replace Streamable w/ Writeable in BaseTasksRequest and subclasses This commit replaces usages of Streamable with Writeable for the BaseTasksRequest / TransportTasksAction classes and subclasses of these classes. Relates to #34389	2018-12-03 08:04:29 +01:00
Zachary Tong	61c2db5ebb	Revert "Deprecate X-Pack centric rollup endpoints (#35962 )" This reverts commit `b84f1f6a3a`.	2018-11-29 12:58:23 -05:00
Jim Ferenczi	8a7f3f75f3	Add support for rest_total_hits_as_int (#36051 ) The support for rest_total_hits_as_int has already been merged to 6x in #35848 so this change adds this new option to master. The plan was to add this new option as part of #35848 but we've decided to wait a few days before merging this breaking change so this commit just handles the new option as a noop exactly like 6x for now. This will allow users to migrate to this parameter before #35848 is merged. Relates #33028	2018-11-29 18:36:16 +01:00
Jason Tedor	b84f1f6a3a	Deprecate X-Pack centric rollup endpoints (#35962 ) This commit is part of our plan to deprecate and ultimately remove the use of _xpack in the REST APIs.	2018-11-27 20:34:17 -05:00
Zachary Tong	48fa251812	[Rollup] Add more diagnostic stats to job (#35471 ) * [Rollup] Add more diagnostic stats to job To help debug future performance issues, this adds the min/max/avg/count/total latencies (in milliseconds) for search and bulk phase. This latency is the total service time including transfer between nodes, not just the `took` time. It also adds the count of search/bulk failures encountered during runtime. This information is also in the log, but a runtime counter will help expose problems faster * review cleanup * Remove dead ParseFields	2018-11-27 15:46:10 -05:00
Zachary Tong	c346a0f027	[Rollup] Add `wait_for_completion` option to StopRollupJob API (#34811 ) This adds a `wait_for_completion` flag which allows the user to block the Stop API until the task has actually moved to a stopped state, instead of returning immediately. If the flag is set, a `timeout` parameter can be specified to determine how long (at max) to block the API call. If unspecified, the timeout is 30s. If the timeout is exceeded before the job moves to STOPPED, a timeout exception is thrown. Note: this is just signifying that the API call itself timed out. The job will remain in STOPPING and eventually flip over to STOPPED in the background. If the user asks the API to block, we move over to the generic threadpool so that we don't hold up a networking thread.	2018-11-13 16:37:17 -05:00
Jason Tedor	4f4fc3b8f8	Replicate index settings to followers (#35089 ) This commit uses the index settings version so that a follower can replicate index settings changes as needed from the leader. Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>	2018-11-07 21:20:51 -05:00
Alexander Reelsen	409050e8de	Refactor: Remove settings from transport action CTOR (#35208 ) As settings are not used in the transport action constructor, this removes the passing of the settings in all the transport actions.	2018-11-05 13:08:18 +01:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failures caused by the `settings` removal and drop any now unused `settings` instances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Zachary Tong	f9dd33a0b9	[Rollup] Proactively resolve index patterns in RollupSearch endpoint (#34930 ) This changes the RollupSearch endpoint to proactively resolve index patterns. If the index pattern(s) match more than one rollup index, an exception is thrown as before. But if the pattern only matches one rollup index, execution is allowed to continue (unlike before where it would assume all patterns were for raw data). This also allows the search endpoint to resolve aliases that point to a rollup index. Also tweaks the documentation to make this clear. Closes #34828	2018-10-30 13:50:50 -04:00
Pratik Sanglikar	f1135ef0ce	Core: Replace deprecated Loggers calls with LogManager. (#34691 ) Replace deprecated Loggers calls with LogManager. Relates to #32174	2018-10-29 15:52:30 -04:00
Andrey Atapin	5f588180f9	Improve IndexNotFoundException's default error message (#34649 ) This commit adds the index name to the error message when an index is not found.	2018-10-24 12:53:31 -07:00
Benjamin Trent	cd27b0b996	Revert "Rollup add default metrics to histo groups (#34534 )" (#34815 ) This reverts commit `4236358f5d`.	2018-10-24 14:25:10 -05:00
Julie Tibshirani	90fd15bb56	Mute RollupIndexerIndexingTests#testRandomizedDateHisto as we await a fix.	2018-10-23 11:14:43 -07:00
Zachary Tong	4dbf498721	[Rollup] Job deletion should be invoked on the allocated task (#34574 ) We should delete a job by directly talking to the allocated task and telling it to shutdown. Today we shut down a job via the persistent task framework. This is not ideal because, while the job has been removed from the persistent task CS, the allocated task continues to live until it gets the shutdown message. This means a user can delete a job, immediately delete the rollup index, and then see new documents appear in the just-deleted index. This happens because the indexer in the allocated task is still running and indexes a few more documents before getting the shutdown command. In this PR, the transport action is changed to a TransportTasksAction, and we invoke onCancelled() directly on the matching job. The race condition still exists after this PR (albeit less likely), but this was a precursor to fixing the issue and a self-contained chunk of code. A second PR will followup to fix the race itself.	2018-10-23 12:23:22 -04:00
Benjamin Trent	4236358f5d	Rollup add default metrics to histo groups (#34534 ) * Rollup: Adding default metrics for histo group timefield (#34379) * Rollup: Adding default histo metrics and tests * fixing failing client side test with new default values * Adding HLRC docs for default values * Addressing PR comments * Removing value_count default agg * Updating docs for rollups * Minor interval change	2018-10-19 07:23:25 -05:00
Zachary Tong	45546e71c2	Add GetRollupCaps API to high level rest client (#32880 ) Adds GetRollupCaps API to the HLRC, and tweaks some of the Caps objects to be immutable. Also various style tweaks	2018-10-18 17:12:38 -04:00
Zachary Tong	ca51fb6873	[Rollup] Add support for date histo `format` (#34537 ) Adds support for query-time formatting of the date histo keys when executing a rollup search. Closes #34391	2018-10-18 12:12:17 -04:00
Kazuhiro Sera	d45fe43a68	Fix a variety of typos and misspelled words (#32792 )	2018-10-03 18:11:38 +01:00
Benjamin Trent	10201e06cb	Allowing {index}/_xpack/rollup/data to accept comma delimited list (#34115 ) * Allowing `{index}/_xpack/rollup/data` to accept comma delimited list * Address PR comments	2018-10-02 06:21:46 -07:00
Hendrik Muhs	e2f310b56c	Fix AggregationFactories.Builder equality and hash regarding order (#34005 ) Fixes the equals and hash function to ignore the order of aggregations to ensure equality after serialization and deserialization. This ensures storing configs with aggregation works properly. This also addresses a potential issue in caching when the same query contains aggregations but in different order. 1st it will not hit in the cache, 2nd cache objects which shall be equal might end up twice in the cache.	2018-09-28 13:30:50 +02:00
Ryan Ernst	7800b4fa91	Core: Abstract DateMathParser in an interface (#33905 ) This commit creates a DateMathParser interface, which is already implemented for both joda and java time. While currently the java time DateMathParser is not used, this change will allow a followup which will create a DateMathParser from a DateFormatter, so the caller does not need to know the internals of the DateFormatter they have.	2018-09-26 07:56:25 -07:00
Christoph Büscher	ba3ceeaccf	Clean up "unused variable" warnings (#31876 ) This change cleans up "unused variable" warnings. There are several cases where we most likely want to suppress the warnings (especially in the client documentation test where the snippets contain many unused variables). In a lot of cases the unused variables can just be deleted though.	2018-09-26 14:09:32 +02:00
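
Note on ec63160243 (max boundary for rollup jobs with a delay): the ordering problem described in that commit message can be illustrated with a small arithmetic sketch. This is not the Elasticsearch implementation; the function names are hypothetical, times are plain integer minutes, and only fixed-size intervals are assumed (no calendar or timezone handling).

```python
def round_down(ts, interval):
    """Round a timestamp down to the start of its fixed-size bucket."""
    return ts - (ts % interval)


def max_boundary_old(now, interval, delay):
    # Behavior described as buggy: round first, then subtract the delay.
    # The result can fall in the middle of a bucket.
    return round_down(now, interval) - delay


def max_boundary_fixed(now, interval, delay):
    # Behavior described as the fix: subtract the delay first, then round,
    # so the boundary always sits on a bucket edge.
    return round_down(now - delay, interval)


# Hypothetical values: job starts at minute 745 (12:25),
# 60-minute interval, 10-minute delay.
now, interval, delay = 745, 60, 10
old = max_boundary_old(now, interval, delay)      # 720 - 10 = 710 (11:50)
fixed = max_boundary_fixed(now, interval, delay)  # round_down(735) = 720 (12:00)
print(old, old % interval == 0)
print(fixed, fixed % interval == 0)
```

With these values the old ordering yields a boundary at 11:50, inside the 11:00 bucket, while the fixed ordering yields 12:00, which is aligned to the interval.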

93 Commits