OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	3032a7c653	Cache FieldStats This caches FieldStats at the field level. For one off requests or for few indicies this doesn't save anything, but when there are 30 indices, 5 shards, 1 replica, 100 parallel requests this is about twice as fast as not caching. I expect lots of usage won't see much benefit from this but pointing kibana to a cluster with many indexes and shards, will be faster. Closes #18717	2016-06-14 13:57:18 -04:00
Nik Everett	e392e0b1df	Create get task API that falls back to the .tasks index This adds a get task API that supports GET /_tasks/${taskId} and removes that responsibility from the list tasks API. The get task API supports wait_for_complation just as the list tasks API does but doesn't support any of the list task API's filters. In exchange, it supports falling back to the .results index when the task isn't running any more. Like any good GET API it 404s when it doesn't find the task. Then we change reindex, update-by-query, and delete-by-query to persist the task result when wait_for_completion=false. The leads to the neat behavior that, once you start a reindex with wait_for_completion=false, you can fetch the result of the task by using the get task API and see the result when it has finished. Also rename the .results index to .tasks.	2016-06-14 13:37:34 -04:00
Simon Willnauer	ee2ba13cce	Register Highlighter instances instead of classes (#18859 ) This change detaches highlighter registration from Guice. It's just a small step into the right direction.	2016-06-14 17:04:58 +02:00
Colin Goodheart-Smithe	d7e3f9e4eb	#18854 Remove size 0 options in aggregations Remove size 0 options in aggregations	2016-06-14 15:32:42 +01:00
Christoph Büscher	32f141223d	Merge pull request #18800 from cbuescher/fix-interval-rounding-uneven Fix invalid rounding value for TimeIntervalRounding close to DST transitions	2016-06-14 16:22:11 +02:00
Christoph Büscher	03f5aa8ea0	Don't throw IllegalInstantException to determine DST gap By taking the logic from DateTimeZone#convertLocalToUTC(long, boolean) we can avoid throwing the exception.	2016-06-14 15:36:00 +02:00
Simon Willnauer	4d78f280ed	Remove dead code and dead parameters (#18855 )	2016-06-14 15:25:44 +02:00
Christoph Büscher	5abe1f7bb2	Fix invalid rounding value for TimeIntervalRounding close to DST transition There are edge cases where rounding a date to a certain interval using a time zone with DST shifts can currently cause the rounded date to be bigger than the original date. This happens when rounding a date closely after a DST start and the rounded date falls into the DST gap. Here is an example for CET time zone, where local time is set forward by one hour at 2016-03-27T02:00:00+01:00 to 2016-03-27T03:00:00.000+02:00: The date 2016-03-27T03:01:00.000+02:00 (1459040460000) which is just after the DST change is first converted to local time (1459047660000). If we then apply interval rounding for a 14m interval in local time, this takes us to 1459047240000, which unfortunately falls into the DST gap. When converting this back to UTC, joda provides options to throw exceptions on illegal dates like this, or correct this by adjusting the date to the new time zone offset. We currently do the later, but this leads to converting this illegal date back to 2016-03-27T03:54:00.000+02:00 (1459043640000), giving us a date that is larger than the original date we wanted to round. This change fixes this by using the "strict" option of 'convertLocalToUTC()' to detect rounded dates that fall into the DST gap. If this happens, we can use the time of the DST change instead as the interval start. Even before this change, intervals around DST shifts like this can be shorter than the desired interval. This, for example, happens when the requested interval width doesn't completely fit into the remaining time span when the DST shift happens. For example, using a 14m interval in UTC+1 (CET before DST starts) leads to the following valid rounding values around the time where DST happens: 2016-03-27T01:30:00+01:00 2016-03-27T01:44:00+01:00 2016-03-27T01:58:00+01:00 2016-03-27T02:12:00+01:00 2016-03-27T02:26:00+01:00 ... while the rounding values in UTC+2 (CET after DST start) are placed like this around the same time: 2016-03-27T02:40:00+02:00 2016-03-27T02:54:00+02:00 2016-03-27T03:08:00+02:00 2016-03-27T03:22:00+02:00 ... From this we can see then when we switch from UTC+1 to UTC+2 at 02:00 the last rounding value in UTC+1 is at 01:58 and the first valid one in UTC+2 is at 03:08, so even if we decide to put all the dates in between into one rounding interval, it will only cover 10 minutes. With this change we choose to use the moment of DST shift as an aditional interval separator, leaving us with a 2min interval from [01:58,02:00) before the shift and an 8min interval from [03:00,03:08) after the shift. This change also adds tests for the above example and adds randomization to the existing TimeIntervalRounding tests.	2016-06-14 14:59:51 +02:00
Colin Goodheart-Smithe	bec621d46f	changes from review	2016-06-14 13:45:03 +01:00
Colin Goodheart-Smithe	cfd3356ee3	Remove size 0 options in aggregations This removes the ability to set `size: 0` in the `terms`, `significant_terms` and `geohash_grid` aggregations for the reasons described in https://github.com/elastic/elasticsearch/issues/18838 Closes #18838	2016-06-14 13:07:02 +01:00
Boaz Leskes	7a226122e3	MasterFaultDetection can leak an exception during shutdown	2016-06-14 01:16:17 +03:00
Ryan Ernst	991c2221a1	Set next version back to alpha4	2016-06-13 09:26:45 -07:00
Simon Willnauer	7379b17e61	Revert "Make random UUIDs reproducible in tests" This reverts commit `a25b8ee1bf`.	2016-06-13 11:14:30 +02:00
Christoph Büscher	f20928b146	Remove redundant parseElementst() method in RescorePhase and SuggestPhase The default implementation in SearchPhase does the same.	2016-06-13 10:20:23 +02:00
Martijn van Groningen	3b96055b23	msearch: Cap the number of searches the msearch api will concurrently execute By default the number of searches msearch executes is capped by the number of nodes multiplied with the default size of the search threadpool. This default can be overwritten by using the newly added `max_concurrent_searches` parameter. Before the msearch api would concurrently execute all searches concurrently. If many large msearch requests would be executed this could lead to some searches being rejected while other searches in the msearch request would succeed. The goal of this change is to avoid this exhausting of the search TP. Closes #17926	2016-06-13 10:13:08 +02:00
Nik Everett	387155559e	Make TimeValue Writeable instead of Streamable Writeable is better for immutable objects like TimeValue. Switch to writeZLong which takes up less space than the original writeLong in the majority of cases. Since we expect negative TimeValues we shouldn't use writeVLong.	2016-06-10 18:24:16 -04:00
Jason Tedor	86f1bedaab	Rename NettyTransportChannel#close This commit renames the NettyTransportChannel#close method to NettyTransportChannel#release to clarify the semantics.	2016-06-10 15:26:49 -04:00
Adrien Grand	44c653f5a8	Upgrade to lucene-6.1.0-snapshot-3a57bea.	2016-06-10 16:18:12 +02:00
Jason Tedor	a25b8ee1bf	Make random UUIDs reproducible in tests Today we use a random source of UUIDs for assigning allocation IDs, cluster IDs, etc. Yet, the source of randomness for this is not reproducible in tests. Since allocation IDs end up as keys in hash maps, this means allocation decisions and not reproducible in tests and this leads to non-reproducible test failures. This commit modifies the behavior of random UUIDs so that they are reproducible under tests. The behavior for production code is not changed, we still use a true source of secure randomness but under tests we just use a reproducible source of non-secure randomness. It is important to note that there is a test, UUIDTests#testThreadedRandomUUID that relies on the UUIDs being truly random. Thus, we have to modify the setup for this test to use a true source of randomness. Thus, this is one test that will never be reproducible but it is intentionally so. Relates #18808	2016-06-10 10:18:06 -04:00
Ali Beyad	43e07c0c88	Better handling of an empty shard's segments_N file When trying to restore a snapshot of an index created in a previous version of Elasticsearch, it is possible that empty shards in the snapshot have a segments_N file that has an unsupported Lucene version and a missing checksum. This leads to issues with restoring the snapshot. This commit handles this special case by avoiding a restore of a shard that has no data, since there is nothing to restore anyway. Closes #18707	2016-06-10 09:57:09 -04:00
Nik Everett	d733fb689b	Better error message when mapping configures null Closes #18803	2016-06-10 09:43:18 -04:00
Yannick Welsch	a2c506acd3	Fix sync flush total shards statistics (#18766 )	2016-06-10 13:39:47 +02:00
Yannick Welsch	6ea89004cd	Make IndicesClusterStateService unit testable (#17270 ) Testability of ICSS is achieved by introducing interfaces for IndicesService, IndexService and IndexShard. These interfaces extract all relevant methods used by ICSS (which do not deal directly with store) and give the possibility to easily mock all the store behavior away in the tests (and cuts down on dependencies).	2016-06-10 12:47:41 +02:00
Colin Goodheart-Smithe	1d76177510	Adds aggregation profiling (not including reduce phase) Add Aggregation profiling initially only be for the shard phases (i.e. the reduce phase will not be profiled in this change) This change refactors the query profiling class to extract abstract classes where it is useful for other profiler types to share code.	2016-06-10 09:02:07 +01:00
Jim Ferenczi	439b2a96e5	Add an index setting to limit the maximum number of slices allowed in a scroll request (default to 1024).	2016-06-10 09:43:32 +02:00
Daniel Mitterdorfer	7229c91289	Remove trace logging from NettyHttpRequestSizeLimitIT With this commit we revert back to normal behavior as the underlying issue has been fixed with #18627.	2016-06-10 07:46:04 +02:00
Nik Everett	e02d9f0945	Squash a race condition in RefreshListeners It presented as listeners never being called if you refresh at the same time as the listener is added. It was caught rarely by testConcurrentRefresh. mostly this is removing code and adding a comment: ``` Note that it is not safe for us to abort early if we haven't advanced the position here because we set and read lastRefreshedLocation outside of a synchronized block. We do that so that waiting for a refresh that has already passed is just a volatile read but the cost is that any check whether or not we've advanced the position will introduce a race between adding the listener and the position check. We could work around this by moving this assignment into the synchronized block below and double checking lastRefreshedLocation in addOrNotify's synchronized block but that doesn't seem worth it given that we already skip this process early if there aren't any listeners to iterate. ```	2016-06-09 13:48:41 -04:00
gfyoung	6f222b5be1	Support flags in pattern replace char filter Works just like pattern analyzer's flags param. Closes #18362.	2016-06-09 12:39:23 -04:00
Nik Everett	fb52c258fd	[test] Check if RefreshListeners was called immediately Return a boolean from RefreshListeners, true if we called the listener inline and false if we didn't, and check it in the test.	2016-06-09 12:08:36 -04:00
Nik Everett	bd276ef5f1	[test] Check for listener calling error Failing to call a refresh listener is logger at WARN but that'll cause test failure. This adds explicit assertions that there are no errors.	2016-06-09 11:26:08 -04:00
Jason Tedor	e9017f619e	Improve performance of applyDeletedShards This commit addresses a performance issue in IndicesClusterStateService#applyDeletedShards. Namely, the current implementation is O(number of indices * number of shards). This is because of an outer loop over the indices and an inner loop over the assigned shards, all to check if a shard is in the outer index. Instead, we can group the shards by index, and then just do a map lookup for each index. Testing this on a single-node with 2500 indices, each with 2 shards, creating an index before this optimization takes 0.90s and after this optimization takes 0.19s. Relates #18788	2016-06-08 16:08:00 -04:00
Simon Willnauer	9497b704bb	[TEST] Fix NodeEnvironmentTests on Windows - use Path.resolve instead of platform dependent path seperator	2016-06-08 21:40:35 +02:00
Nik Everett	4b21157906	Remove setRefresh It has been replaced with `setRefreshPolicy` which has support for waiting until refresh with `setRefreshPolicy(WAIT_FOR)`. Related to #1063	2016-06-08 13:50:59 -04:00
Lee Hinman	92349f70e2	Merge remote-tracking branch 'dakrone/igs-false2'	2016-06-08 10:49:20 -06:00
Lee Hinman	c637fea84b	Change the default of `include_global_state` from true to false for restores This changes the default value to be false only for restore operations. Resolves #18569	2016-06-08 10:48:36 -06:00
Nik Everett	5161afe5e3	Support optional ctor args in ConstructingObjectParser You declare them like ``` static { PARSER.declareInt(optionalConstructorArg(), new ParseField("animal")); } ``` Other than being optional they follow all of the rules of regular `constructorArg()`s. Parsing an object with optional constructor args is going to be slightly less efficient than parsing an object with all required args if some of the optional args aren't specified because ConstructingObjectParser isn't able to build the target before the end of the json object.	2016-06-08 12:38:40 -04:00
Simon Willnauer	bec26015b2	[TEST] add a dedicated test for empty files	2016-06-08 15:40:14 +02:00
Christoph Büscher	a2372778dd	Fix problem with TimeIntervalRounding on DST end Due to an error in our current TimeIntervalRounding, two dates can round to the same key, even when they are 1h apart when using short interval roundings (e.g. 20m) and a time zone with DST change. Here is an example for the CET time zone: On 25 October 2015, 03:00:00 clocks are turned backward 1 hour to 02:00:00 local standard time. The dates "2015-10-25T02:15:00+02:00" (1445732100000) (before DST end) and "2015-10-25T02:15:00+01:00" (1445735700000) (after DST end) are thus 1h apart, but currently they round to the same value "2015-10-25T02:00:00.000+01:00" (1445734800000). This violates an important invariant of rounding, namely that the rounded value must be less or equal to the value that is rounded. It also leads to wrong histogram bucket counts because documents in [02:00:00+02:00, 02:20:00+02:00) go to the same bucket as documents from [02:00:00+01:00, 02:20:00+01:00). The problem happens because in TimeIntervalRounding#roundKey() we need to perform the rounding operation in local time, but on converting back to UTC we don't honor the original values time zone offset. This fix changes that and adds tests both for DST start and DST end as well as a test that demonstrates what happens to bucket sizes when the dst change is not evently divisibly by the interval.	2016-06-08 13:05:52 +02:00
Jim Ferenczi	712c77264d	Fix ut: make sure that the number of slices is bigger than 1 in the SliceBuilder tests.	2016-06-08 11:51:46 +02:00
Lee Hinman	762bbdbd0c	Revert "Change the default of `include_global_state` from true to false." This reverts commit `052a62250c`.	2016-06-07 15:07:37 -06:00
Lee Hinman	052a62250c	Change the default of `include_global_state` from true to false. Resolves #18569	2016-06-07 15:06:20 -06:00
Nik Everett	a405c2ba99	Switch QueryBuilders to new MatchPhraseQueryBuilder It was doing deprecated things with MatchQueryBuilder.	2016-06-07 14:35:23 -04:00
Lee Hinman	32bd869b28	Merge remote-tracking branch 'dakrone/no-cluster-name-in-path'	2016-06-07 10:14:23 -06:00
Lee Hinman	feb244c14a	Remove cluster name from data path Previously Elasticsearch used $DATA_DIR/$CLUSTER_NAME/nodes for the path where data is stored, this commit changes that to be $DATA_DIR/nodes. On startup, if the old folder structure is detected it will be used. This behavior will be removed in Elasticsearch 6.0 Resolves #17810	2016-06-07 10:13:48 -06:00
Jim Ferenczi	43b419b230	rehash the docvalues in DocValuesSliceQuery using BitMixer.mix instead of the naive Long.hashCode.	2016-06-07 17:58:32 +02:00
Martijn van Groningen	f611f1c99e	ingest: Move processors from core to ingest-common module. Folded grok processor into ingest-common module. The rest tests have been moved to ingest-common module as well, because these tests don't run in the rest-api-spec module but in the distribution:integ-test-zip module and adding a test plugin there felt just wrong to me. I think this is ok. I left a tiny ingest rest test behind in that tests with an empty pipeline. Removed messy tests, these tests were already covered in the rest tests Added ingest test plugin in test infra so that each module testing integration with ingest doesn't need write its own plugin Moved reindex ingest tests to qa module Closes #18490	2016-06-07 17:32:52 +02:00
trangvh	c0da8e4060	Fix some typos (#18746 ) * Update java-doc of SearchResponse.getProfileResults() * Fix a trivial typo in Reference document	2016-06-07 16:41:39 +02:00
Jim Ferenczi	692c42b23a	Fix ut	2016-06-07 16:29:18 +02:00
Jim Ferenczi	b9030bf6fe	Add the ability to partition a scroll in multiple slices. API: ``` curl -XGET 'localhost:9200/twitter/tweet/_search?scroll=1m' -d '{ "slice": { "field": "_uid", <1> "id": 0, <2> "max": 10 <3> }, "query": { "match" : { "title" : "elasticsearch" } } } ``` <1> (optional) The field name used to do the slicing (_uid by default) <2> The id of the slice By default the splitting is done on the shards first and then locally on each shard using the _uid field with the following formula: `slice(doc) = floorMod(hashCode(doc._uid), max)` For instance if the number of shards is equal to 2 and the user requested 4 slices then the slices 0 and 2 are assigned to the first shard and the slices 1 and 3 are assigned to the second shard. Each scroll is independent and can be processed in parallel like any scroll request. Closes #13494	2016-06-07 16:21:53 +02:00
Jason Tedor	c3e3a6337e	Use method name in bootstrap check might fork test This commit modifies the bootstrap check invocations in the might fork tests to use the underlying test name when setting up the logging prefix when invoking the bootstrap checks. This is done to give clear logs in case of failure.	2016-06-07 09:33:17 -04:00

1 2 3 4 5 ...

5356 Commits