While adding the new settings infrastructure, the option to specify the
size of the filter cache as a percentage of the heap size was accidentally
removed. This change adds that ability back.
In addition, the `Setting` class had multiple `.byteSizeSetting` methods,
all but one of which used `ByteSizeValue.parseBytesSizeValue` to parse
the value; the remaining method used `MemorySizeValue.parseBytesSizeValueOrHeapRatio`.
This was confusing, as the way the value was parsed depended on how many
arguments were provided.
This change makes all `Setting.byteSizeSetting` methods parse the value
the same way using `ByteSizeValue.parseBytesSizeValue` and adds
`Setting.memorySizeSetting` methods to parse settings that express memory
sizes (i.e. can be absolute bytes values or percentages). Relevant settings
have been moved to use these new methods.
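To make the distinction concrete, here is a minimal, self-contained sketch of the two parsing behaviours (illustrative code only, not the actual Elasticsearch `Setting` implementation; the helper names are made up):
```java
// Illustrative sketch only -- not the actual Elasticsearch implementation.
// It mimics the difference between a plain byte-size setting and a
// "memory size" setting that may also be given as a percentage of the heap.
public final class MemorySizeParsingSketch {

    /** Parses absolute values only, e.g. "512mb" or "1gb". */
    static long parseBytes(String value) {
        String v = value.trim().toLowerCase(java.util.Locale.ROOT);
        if (v.endsWith("gb")) return Long.parseLong(v.substring(0, v.length() - 2).trim()) * 1024 * 1024 * 1024;
        if (v.endsWith("mb")) return Long.parseLong(v.substring(0, v.length() - 2).trim()) * 1024 * 1024;
        if (v.endsWith("kb")) return Long.parseLong(v.substring(0, v.length() - 2).trim()) * 1024;
        if (v.endsWith("b"))  return Long.parseLong(v.substring(0, v.length() - 1).trim());
        throw new IllegalArgumentException("failed to parse byte size [" + value + "]");
    }

    /** Additionally accepts a percentage of the heap, e.g. "10%". */
    static long parseMemorySize(String value) {
        String v = value.trim();
        if (v.endsWith("%")) {
            double percent = Double.parseDouble(v.substring(0, v.length() - 1));
            return (long) (percent / 100.0 * Runtime.getRuntime().maxMemory());
        }
        return parseBytes(v);
    }

    public static void main(String[] args) {
        System.out.println(parseBytes("512mb"));      // absolute value
        System.out.println(parseMemorySize("10%"));   // fraction of the current max heap
        System.out.println(parseMemorySize("512mb")); // absolute values still work
    }
}
```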
Closes #20330
Exposing the Lucene 6.x MinHash token filter
Generate min hash tokens from an incoming stream of tokens that can
be used to estimate document similarity.
Closes #20149
This changes DiskThresholdDecider to only factor in leaving shards when
checking if a shard can remain. Previously, leaving shards were factored
into both the `canAllocate` and `canRemain` checks; this change subtracts
the sizes of leaving shards only in the `canRemain` check.
It was possible for multiple shards relocating away from a node to
have their entire size subtracted, giving the node a chance to go over
the disk threshold (or fill the disk) because space was subtracted
that was still being used by other in-progress relocations.
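Roughly, the intended accounting is the following (a toy sketch with made-up names, not the actual DiskThresholdDecider code):
```java
// Toy illustration of the accounting change; names are hypothetical.
// Space freed by shards relocating away from a node only counts for the
// canRemain decision, not for canAllocate.
final class LeavingShardAccountingSketch {

    static boolean canAllocate(long freeBytes, long incomingShardSize, long thresholdBytes) {
        // Do NOT credit space from shards that are still relocating away:
        // their bytes remain on disk until the relocations complete.
        return freeBytes - incomingShardSize >= thresholdBytes;
    }

    static boolean canRemain(long freeBytes, long bytesRelocatingAway, long thresholdBytes) {
        // Only here do leaving shards count as space that will be reclaimed.
        return freeBytes + bytesRelocatingAway >= thresholdBytes;
    }
}
```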
** The default script language is now maintained in the `Script` class.
* Added a `script.legacy.default_lang` setting that controls the default language for scripts stored inside documents (for example percolator queries). This defaults to groovy.
** Added `QueryParseContext#getDefaultScriptLanguage()`, which manages the default scripting language. It always returns `painless`, unless a query/search request is loaded in legacy mode, in which case it returns what is configured in the `script.legacy.default_lang` setting.
** In the aggregation parsing code, added a `ParserContext` that also holds the default scripting language, like `QueryParseContext`. Most parsers don't have access to `QueryParseContext`; this is for scripts in aggregations.
* The `lang` script field is always serialized (toXContent).
Closes #20122
When Elasticsearch depended on Log4j 1, there was jar hell from the
log4j and the apache-log4j-extras jar. As these dependencies are gone,
the jar hell exemption for Log4j 1 can be removed.
Relates #20336
This commit expands on the message printed when config files are
preserved when removing a plugin to give the user an indication of the
reason the config files are preserved.
Replicated operations consist of a routing action (the original), which is in charge of sending the operation to the primary shard, a primary action, which executes the operation on the resolved primary, and replica actions, which perform the operation on a specific replica. This commit adds the targeted shard's allocation id to the primary and replica actions and makes sure that it matches the shard the actions end up executing on.
This helps prevent an extremely rare failure mode where a shard moves off a node and back to it, all between the time an action is sent and the time it is processed.
For example:
1) Primary action is sent to a relocating primary on node A.
2) The primary finishes relocating to node B and starts relocating back.
3) The relocation back reaches the phase where it opens the target engine on the original node, node A.
4) The primary action is executed on the target engine before the relocation finishes, at which point the shard copy on node B is still the official primary - i.e., it is executed on the wrong primary.
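The guard itself is conceptually simple; a hedged sketch (hypothetical names, not the actual TransportReplicationAction code) of the allocation id check:
```java
// Illustrative sketch only; names are hypothetical. The primary/replica
// request carries the allocation id of the shard copy it was routed to, and
// the receiving node verifies it still hosts that exact copy before executing.
final class AllocationIdCheckSketch {

    interface ShardCopy {
        String allocationId();
        void execute();
    }

    static void executeOnShard(String expectedAllocationId, ShardCopy localCopy) {
        if (expectedAllocationId.equals(localCopy.allocationId()) == false) {
            // The shard moved (possibly away and back) since the action was sent;
            // fail fast so the action is re-routed against the current cluster state.
            throw new IllegalStateException("expected shard with allocation id ["
                    + expectedAllocationId + "] but found [" + localCopy.allocationId() + "]");
        }
        localCopy.execute();
    }
}
```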
When removing a plugin with a config directory, we preserve the config
directory. This is because the workflow for upgrading a plugin involves
removing and then installing the plugin again and losing the plugin
config in this case would be terrible. This commit causes a message
regarding this to be printed in case the user wants to manually delete
these files.
This commit removes a line-length violation in RemovePluginCommand.java
and removes this file from the list of files for which the line-length
check is suppressed.
We have intentionally introduced leniency for ThrowableProxy from Log4j
to work around a bug there. Yet, a test for this introduced leniency was
not added. This commit introduces such a test.
Relates #20329
Previously we had an exemption for Joda-Time BaseDateTime because we
forked this class to remove the usage of a volatile field. This hack is
no longer in place, so the exemption is no longer necessary. This commit
removes that exemption.
Relates #20328
The BackgroundIndexer now randomly uses auto-generated IDs. This causes some problems
for tests that still rely on the fact that the IDs are increasing integers. This change
exposes all IDs via a Set<String> that tests can iterate over.
A warning was introduced if old log config files are present (e.g.,
logging.yml). However, this check is executed unconditionally. This can
lead to no such file exceptions when logging configs are not being
resolved, for example when installing a plugin. This commit moves this
check to only execute when logging configs are being resolved.
Some assertions in MaxMapCountCheckTests assert that certain messages
are logged. These assertions pass everywhere except Windows where the
JVM seems confused. The issue is not the javac compiler as the bytecode
produced on OS X and Windows is identical for the relevant classes so
this leaves a possible JVM bug. It is not worth investigating the
ultimate cause of this bug so instead this commit introduces a
workaround.
Log4j has a bug where it does not handle a security exception that can
be thrown when it is rendering a stack trace. This commit intentionally
introduces jar hell with the ThrowableProxy class to work around this
bug until a fix is released.
Relates #20306
This ensures we don't add documents more than once. It is mostly paranoia,
except for one case where a shard is relocated away from and back to the same node
while an initial request is in flight but has not yet finished AND is retried.
Yet, this is a possible case, and for that reason we ensure we pass on the
maxUnsafeAutoIdTimestamp when we prepare for translog recovery.
Relates to #20211
Currently it does not, because our parsers do not support big integers/decimals
(on purpose), but we do not have to ask our parser for the number type; we can
just ask the Jackson parser for a number representation of the value with the
right type.
Note that I did not add similar tests for big decimals because Jackson seems to
never return big decimals, even for decimal values that are out of the range of
values that can be represented by doubles.
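As a small illustration of the idea using Jackson directly (the real change goes through the XContent layer; this is just a standalone sketch), an integer that does not fit in a long comes back as a `BigInteger`:
```java
// Requires jackson-core on the classpath. Jackson picks a Number subtype
// that is wide enough for the value, so we can use its representation
// directly instead of asking our own parser for the number type.
import com.fasterxml.jackson.core.JsonFactory;
import com.fasterxml.jackson.core.JsonParser;

public final class NumberRepresentationSketch {
    public static void main(String[] args) throws Exception {
        JsonParser parser = new JsonFactory().createParser("92233720368547758080"); // > Long.MAX_VALUE
        parser.nextToken();
        Number number = parser.getNumberValue();
        System.out.println(number.getClass().getSimpleName() + " = " + number); // BigInteger = ...
    }
}
```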
Closes #11508
This commit configures test logging for Log4j 2. The default logger
configuration uses the console appender but at the error level, so most
tests are missing logging. Instead, this commit provides a configuration
for tests which is picked up from the classpath by Log4j 2 when it
initializes. However, this now means that we can no longer initialize
Log4j with a bare-bones configuration when tests run as doing so will
prevent Log4j 2 from attempting to configure logging via the
classpath. Consequently, we move this needed initialization (as
commented, to avoid a message about a status logger not being configured
when we are preparing to configure Log4j from properties files in the
config directory) to only run when we are explicitly configuring Log4j
from properties files.
Relates #20284
Rather than checking that those values are greater than 0, we can sum up the values obtained from all nodes and check that what is returned is that same value.
The mem section in cluster stats was buggy and was removed. It is now added back with the same structure as in node stats, containing total memory, available memory, used memory and percentages. All of the values are sums across all the nodes in the cluster (or at least the ones we were able to get the values from).
If Elasticsearch controls the ID values as well as the document
versions, we can optimize the code that adds / appends the documents
to the index. Essentially, we can skip the version lookup for all
documents unless the same document is delivered more than once.
At the Lucene level we can simply call IndexWriter#addDocument instead
of #updateDocument, but at the Engine level we need to ensure that we deoptimize
the case once we see the same document more than once.
This is done as follows:
1. Mark every request with a timestamp. This is done once on the first node that
receives a request and is fixed for this request. This can even be the
machine's local time (see why later). The important part is that retry
requests will have the same value as the original one.
2. In the engine we make sure we keep the highest seen timestamp of "retry" requests.
This is updated while the retry request has its doc id lock. Call this `maxUnsafeAutoIdTimestamp`.
3. When the engine receives an "optimized" request, it compares its timestamp with the
current `maxUnsafeAutoIdTimestamp` (but doesn't update it). If the request
timestamp is higher, it is safe to execute it as optimized (no retry request with the same
timestamp has been run before). If not, we fall back to "non-optimized" mode, run the request as a retry,
and update the `maxUnsafeAutoIdTimestamp` unless it has already been updated to a higher value.
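Put together, the check might look roughly like this (a much-simplified sketch with hypothetical names, not the actual Engine code):
```java
// Simplified sketch of the described check; field and method names are
// hypothetical. Retries force the safe (updateDocument) path and raise the
// watermark; optimized requests are allowed only above the watermark.
import java.util.concurrent.atomic.AtomicLong;

final class AutoIdOptimizationSketch {

    private final AtomicLong maxUnsafeAutoIdTimestamp = new AtomicLong(Long.MIN_VALUE);

    boolean canUseAddDocument(long requestAutoIdTimestamp, boolean isRetry) {
        if (isRetry) {
            // Remember the highest timestamp ever seen on a retry.
            maxUnsafeAutoIdTimestamp.accumulateAndGet(requestAutoIdTimestamp, Math::max);
            return false; // run as a non-optimized request
        }
        // Safe to use IndexWriter#addDocument only if no retry with the same
        // (or a later) timestamp has been processed before.
        return requestAutoIdTimestamp > maxUnsafeAutoIdTimestamp.get();
    }
}
```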
Relates to #19813
* master:
Avoid NPE in LoggingListener
Randomly use Netty 3 plugin in some tests
Skip smoke test client on JDK 9
Revert "Don't allow XContentBuilder#writeValue(TimeValue)"
[docs] Remove coming in 2.0.0
Don't allow XContentBuilder#writeValue(TimeValue)
[doc] Remove leftover from CONSOLE conversion
Parameter improvements to Cluster Health API wait for shards (#20223)
Add 2.4.0 to packaging tests list
Docs: clarify scale is applied at origin+offset (#20242)
We have specific support for writing `TimeValue`s in the form of
`XContentBuilder#timeValueField`. Writing a `TimeValue` using
`XContentBuilder#writeValue` is a bug waiting to happen.
* Params improvements to Cluster Health API wait for shards
Previously, the cluster health API used a strictly numeric value
for `wait_for_active_shards`. However, with the introduction of
ActiveShardCount and the removal of write consistency level for
replication operations, `wait_for_active_shards` is used for
write operations to represent values for ActiveShardCount. This
commit moves the cluster health API's usage of `wait_for_active_shards`
to be consistent with its usage in the write operation APIs.
This commit also changes `wait_for_relocating_shards` from a
numeric value to a simple boolean value `wait_for_no_relocating_shards`
to set whether the cluster health operation should wait for
all relocating shards to complete relocation.
* Addresses code review comments
* Don't be lenient if `wait_for_relocating_shards` is set
* master:
Increase visibility of deprecation logger
Skip transport client plugin installed on JDK 9
Explicitly disable Netty key set replacement
percolator: Fail indexing percolator queries containing either a has_child or has_parent query.
Make it possible for Ingest Processors to access AnalysisRegistry
Allow RestClient to send array-based headers
Silence rest util tests until the bogusness can be simplified
Remove unknown HttpContext-based test as it fails unpredictably on different JVMs
Tests: Improve rest suite names and generated test names for docs tests
Add support for a RestClient base path
The deprecation logger is an important way to make visible features of
Elasticsearch that are deprecated. Yet, the default logging makes the
log messages for the deprecation logger invisible. We want these log
messages to be visible, so the default logging for the deprecation
logger should enable these log messages. This commit changes the log
level of deprecation log messages to warn, and configures the deprecation
logger so that these log messages are visible out of the box.
Relates #20254
This commit enables CLI tools to have console logging. For the CLI
tools, we skip configuring the logging infrastructure via the config
file, and instead set the level only via a system property.
This commit modifies the call sites that allocate a parameterized
message to use a supplier so that allocations are avoided unless the log
level is fine enough to emit the corresponding log message.
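The pattern is the usual lazy-message idiom; a self-contained sketch (using a made-up mini logger rather than the real Log4j API) of why the supplier avoids the allocation:
```java
// Hypothetical mini logger, only to illustrate the supplier-based call sites:
// the message (and any formatting/allocation it implies) is built only when
// the level is actually enabled.
import java.util.function.Supplier;

final class LazyLoggingSketch {

    enum Level { DEBUG, INFO, WARN }

    static final Level CURRENT_LEVEL = Level.WARN;

    static void log(Level level, Supplier<String> message) {
        if (level.ordinal() >= CURRENT_LEVEL.ordinal()) {
            System.out.println(message.get()); // only evaluated here
        }
    }

    public static void main(String[] args) {
        Object expensiveState = new Object();
        log(Level.DEBUG, () -> "state is [" + expensiveState + "]"); // never built at WARN
        log(Level.WARN, () -> "something noteworthy happened");
    }
}
```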
While removing an index isn't actually an alias action, if we add
an alias action that deletes an index then we can delete an index
and add an alias with the same name as the index atomically, in
the same cluster state update.
Closes #20064
This commit adds support for exclusion filters to the response filtering (filter_path) feature. It changes the XContentBuilder APIs so that they now accept two types of filters: inclusive and exclusive. Filters are no longer String arrays but sets of Strings.
Instead of get, set and remove, we do get, remove and then set, to avoid type conflicts in IngestDocument.
If the set still fails, we try to restore the original field in the ingest document.
Closes #19892
If, when restoring files from a snapshot, the target shard already has a file of the same name in the Store
but with different content (different checksum/length), then those files are first
deleted before restoring the files in question.
This changes Elasticsearch to automatically downgrade `text` and
`keyword` fields into appropriate `string` fields when changing the
mapping of indexes imported from 2.x. This allows users to use the
modern, documented syntax against 2.x indexes. It also makes it clear
that reindexing in order to recreate the index in 5.0 is required for
any long lived indexes. This change is useful for the times when you
can't (cluster is just starting, not stable enough for reindex) or
shouldn't (index will only live 90 days or something).
Now that #20081 is merged we can check that cacheKey is consistent across equal search requests, something that wasn't true before due to ordering of map keys when using index boost.
Relates to #19986
The Netty3/4 TcpTransport implementations create thread factories with an "http_server" thread prefix, whereas they should start with "transport_server" and leave the "http_server" prefix to the HttpServerTransport implementations.
This test is periodically failing. As I suspect that the GCDisruption scheme is somehow making the wrong node block on
its cluster state update thread, I've added some more logging and a thread dump once the given assertion triggers
again.
Today we fsync in a blocking fashion, where all threads block while another
syncs. Yet, we can improve this: make use of the async infrastructure added
for `wait_for_refresh` and make fsyncing single-threaded while all other threads
can continue indexing. The syncing thread then notifies a listener once the request's
location is synced. This also allows sending docs to replicas before the location is actually fsynced,
allowing for concurrent replica processing.
This patch has a significant impact on performance on slower disks. An initial single-node benchmark
shows that on very fast SSDs there is no noticeable impact, but on a slow spinning disk this
patch shows a ~32% performance improvement.
```
NVME SSD:
336ec0ac9a (master):
Total docs/sec: 47200.9
Total docs/sec: 46440.4
23543a97e3e7f72a31e26b50e00931919784426c (async wait for translog):
Total docs/sec: 47461.6
Total docs/sec: 46188.3
-------------------------------------------------------------------
Spinning disk:
336ec0ac9a (master):
Total docs/sec: 22733.0
Total docs/sec: 24129.8
23543a97e3e7f72a31e26b50e00931919784426c (async wait for translog):
Total docs/sec: 32724.1
Total docs/sec: 32845.4
--------------------------------------------------------------------
```
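Conceptually, the single syncing thread works roughly as follows (a much-simplified sketch with hypothetical names, not the actual Translog code):
```java
// Hypothetical sketch: writers register a callback for their location and
// return immediately; one syncing thread fsyncs up to the highest pending
// location and then notifies every listener at or below it.
import java.util.Map;
import java.util.concurrent.ConcurrentSkipListMap;

final class AsyncFsyncSketch {

    private final ConcurrentSkipListMap<Long, Runnable> pending = new ConcurrentSkipListMap<>();
    private volatile long lastSyncedLocation = -1;

    /** Called by indexing threads; never blocks on the fsync itself. */
    void syncUpTo(long location, Runnable onSynced) {
        if (location <= lastSyncedLocation) {
            onSynced.run(); // already durable
        } else {
            pending.put(location, onSynced);
        }
    }

    /** Called by the single syncing thread. */
    void syncLoop() {
        while (pending.isEmpty() == false) {
            long target = pending.lastKey();
            fsyncUpTo(target); // one blocking fsync covers many writers
            lastSyncedLocation = target;
            Map<Long, Runnable> done = pending.headMap(target, true);
            done.values().forEach(Runnable::run); // notify waiting listeners
            done.clear();
        }
    }

    private void fsyncUpTo(long location) { /* channel.force(false) in a real translog */ }
}
```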
Currently, once at least one pipeline is registered, all pipelines are rebuilt on every single cluster state update, even when the update is not related to ingest metadata. This change adds a check that the ingest metadata changed before trying to rebuild all pipelines.
Adds an explicit recoverySource field to ShardRouting that characterizes the type of recovery to perform:
- fresh empty shard copy
- existing local shard copy
- recover from peer (primary)
- recover from snapshot
- recover from other local shards on same node (shrink index action)
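In code, that could be modeled along these lines (illustrative names only, not the exact ShardRouting/RecoverySource API):
```java
// Hypothetical enum: the point is that the recovery source becomes an
// explicit, self-describing field instead of being derived from other
// routing information.
enum RecoverySourceType {
    EMPTY_STORE,    // fresh empty shard copy
    EXISTING_STORE, // existing local shard copy
    PEER,           // recover from the primary on another node
    SNAPSHOT,       // recover from a snapshot
    LOCAL_SHARDS    // recover from other local shards (shrink index action)
}
```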
* Fix NPE during search with source filtering if the source is disabled.
Instead of throwing an NPE, a search response with source filtering will not contain the source if it is disabled in the mapping.
Closes #7758
* Created unit tests for FetchSourceSubPhase. Tests similar to SourceFetchingIT.
Removed SourceFetchingIT#testSourceDisabled (now covered via unit test FetchSourceSubPhaseTests#testSourceDisabled).
* Updated FetchSourceSubPhase unit tests per comments.
Renamed main unit test method.
Use assertEquals and assertNull instead of assertThat (less code).
Fixes an issue where the value for the `script.engine.<lang>.inline`
settings would be _set_ properly, but would not accurately be reflected
in the `include_defaults` output. Adds a test to ensure the default raw
setting is now correct.
Resolves #20159
* Make sure indexBoost is serialized in a consistent order
* Remove hasIndexBoost by using indexBoost size
* Make sure the phrase suggester's collateParams is serialized in a consistent
order
* Make StreamOutput serialize maps in a consistent order
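The last point boils down to writing map entries in a deterministic key order; a sketch against plain `java.io` types (hypothetical helper, not the actual StreamOutput API):
```java
// Sorting the entries by key makes equal maps produce byte-for-byte
// identical output regardless of their insertion order.
import java.io.DataOutput;
import java.io.IOException;
import java.util.Map;
import java.util.TreeMap;

final class ConsistentMapWriteSketch {
    static void writeStringLongMap(DataOutput out, Map<String, Long> map) throws IOException {
        Map<String, Long> sorted = new TreeMap<>(map); // deterministic key order
        out.writeInt(sorted.size());
        for (Map.Entry<String, Long> entry : sorted.entrySet()) {
            out.writeUTF(entry.getKey());
            out.writeLong(entry.getValue());
        }
    }
}
```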
The class Setting holds a static reference to a deprecation logger
instance. When the class initializer for Setting runs, it starts
triggering log4j initialization. There is a chain of initializations
from InternalSettingsPreparer to Environment to Setting that triggers
this initialization before log4j configuration has occurred. This commit
modifies this initialization so that initialization is not done eagerly.
Relates #20170
This makes GET operations more consistent with `_search` operations, which expect
`(stored_)fields` to work on stored fields and source filtering to work on the
`_source` field. This is now possible thanks to the fact that GET operations
no longer read from the translog (#20102), and it also allows us to get rid of
`FieldMapper#isGenerated`.
The `_termvectors` API (and thus more_like_this too) relied on the fact
that GET operations would extract fields from either the stored fields or the source,
so the logic that used to do this in `ShardGetService` has been moved
to `TermVectorsService`. It would be nice if term vectors did not rely on this,
but it does not seem to be low-hanging fruit.
Non-stale shard copies are currently tracked using their allocation ids in the cluster state. When a node leaves the cluster, shard copies of that node are marked as stale by removing their allocation ids from the active set in the cluster. For full cluster restarts, this can have the unwanted effect that only the last node holding a copy of the shard will be seen as non-stale. The other shard copies are not really stale though as long as no writes have happened on this shard copy. Shard copies should thus only be marked as stale (by the master in the cluster state) if other active shards have received writes.
This commit implements the above logic and also renames the persistent structure used to track non-stale shard copies from "active_allocations" to "in_sync_allocations" as we now also support tracking non-stale shard copies that have no active routing entries in the cluster state.
We need to get the string representation of numbers in order to include them in
`_all`. However, this has a cost, and disabling `_all` is rather common, so we
should look into skipping it.
The network types in use on a cluster can be useful information to have,
so this commit adds aggregate metrics for the network types in use in a
cluster to the cluster stats.
Relates #20144
This moves the Writer interface from StreamOutput into Writeable, as a peer of its inner Reader interface. This should hopefully help to avoid random functional interfaces being created for the same purpose.
It also makes use of the moved class by updating writeMapOfLists and readMapOfLists.
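The rough shape of the pairing, written here against `java.io` types so it compiles standalone (the real interfaces take StreamInput/StreamOutput):
```java
// Simplified stand-ins for Writeable.Reader / Writeable.Writer. A method such
// as writeMap(Map<K, V>, Writer<K>, Writer<V>) can then accept method
// references (e.g. DataOutput::writeUTF for a Writer<String>) instead of
// one-off functional interfaces.
import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

@FunctionalInterface
interface Writer<V> {
    void write(DataOutput out, V value) throws IOException;
}

@FunctionalInterface
interface Reader<V> {
    V read(DataInput in) throws IOException;
}
```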
Previous to this change the nesting of aggregation profiling results
would be incorrect when the request contains a terms aggregation and the
collect mode is (implicitly or explicitly) set to `breadth_first`. This
was because the aggregation profiling has to make the assumption that
the `preCollection()` method of children aggregations is always called in
the `preCollection()` method of their parent aggregation. When the collect
mode is `breadth_first` the `preCollection` of the children aggregations
was delayed until the documents were replayed.
This change moves the `preCollection()` of deferred aggregations to run
during the `preCollection()` of the parent aggregation. This should have
no adverse impact on the breadth_first mode as there is no allocation of
memory in any of the aggregations.
We also apply the same logic to the diversified sampler aggregation as
we did to the terms aggregation to move the `preCollection()` of the
child aggregations method to be called during the `preCollection()` of
the parent aggregation.
This commit also includes a fix so that the `ProfilingLeafBucketCollector`
propagates the scorer to its delegate so the diversified sampler agg works
when profiling is enabled.
If they are specified by a mapping update, these properties are currently
ignored. This commit also fixes the handling of `dynamic_templates` so that it
is possible to remove templates (and so that it works more similarly to all
other mapping properties).
Closes #20111
This is a house cleaning commit that refactors GeoPointFieldMapperLegacy to LegacyGeoPointFieldMapper for consistency with Legacy Numerics and IP field mappers.
IndexedGeoBoundingBoxQuery and InMemoryGeoBoundingBoxQuery are also deprecated and refactored as Legacy classes.
This change adds a special field named `_none_` that makes it possible to disable the retrieval of stored fields in a search request or in a TopHitsAggregation.
To completely disable stored-fields retrieval (including metadata fields such as _id or _type), use `_none_` like this:
````
POST _search
{
  "stored_fields": "_none_"
}
````
Today we do a lot of accounting inside the engine to maintain the locations
of documents inside the transaction log. This is only needed so that we
can return a document's source from the engine if it hasn't been refreshed yet.
Aside from the added complexity of being able to read from the currently writing translog
and of maintaining pointers into the translog, this also caused inconsistencies, like different values
of the `_ttl` field depending on whether it was read from the translog or not. Term vectors are totally different if
the document is fetched from the translog, since copy fields are ignored, etc.
This change simply calls `refresh` if the document's latest version is not in the index. This
streamlines the semantics of the `_get` API and allows for more optimizations inside the engine
and on the transaction log. Note: `_refresh` is only called if the requested document has not been refreshed
yet but has recently been updated or added.
Relates to #19787
I was writing tests for RAM usage estimation of LiveVersionMap and found a
couple of issues:
- The BytesRef objects used as uids were oversized, since they were created
via `new BytesRef(CharSequence)`, which creates a `byte[]` whose size is 3x
the length of the provided char sequence. Given that our uids are most of the
time ASCII sequences, this is a waste of memory.
- `VersionValue` was using `translogLocation.size` instead of
`translogLocation.ramBytesUsed()` for RAM estimation, which is completely
unrelated to the memory footprint of the `Translog.Location` object.
In particular, the latter issue could cause RAM usage to be
significantly overestimated, especially on large documents.
I also added tests for RAM accounting.
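For the first issue, the size difference is easy to see with Lucene directly (this only illustrates the sizing behaviour; it is not necessarily the exact fix applied):
```java
// Requires lucene-core on the classpath. new BytesRef(CharSequence) sizes its
// backing byte[] for the worst case of 3 bytes per char, while building the
// ref and copying it keeps only the bytes actually needed -- about a third of
// the size for ASCII uids.
import org.apache.lucene.util.BytesRef;
import org.apache.lucene.util.BytesRefBuilder;

public final class UidBytesSketch {
    public static void main(String[] args) {
        String uid = "AVdwUtSFvEUCRvRSvcvF"; // a typical ASCII-only auto-generated id

        BytesRef oversized = new BytesRef(uid);    // backing array ~3x the text length
        BytesRefBuilder builder = new BytesRefBuilder();
        builder.copyChars(uid);
        BytesRef exact = builder.toBytesRef();     // backing array trimmed to the encoded length

        System.out.println(oversized.bytes.length + " vs " + exact.bytes.length);
    }
}
```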
Deprecates the optimize_bbox parameter on geodistance queries. This has not been needed since version 2.2, because Lucene geo distance queries (postings and LatLonPoint) already optimize by bounding box.
Catching and suppressing AlreadyClosedException from Lucene is dangerous because it can mean there is a bug in ES since ES should normally guard against invoking Lucene classes after they were closed.
I reviewed the cases where we catch AlreadyClosedException from Lucene and removed the ones that I believe are not needed, or improved comments explaining why ACE is OK in that case.
I think (@s1monw can you confirm?) that holding the engine's readLock means IW will not be closed, except if disaster strikes (failEngine) at which point I think it's fine to see the original ACE in the logs?
Closes #19861
When a SearchContext is closed, its reader / searcher reference is closed too.
If this happens while a search is accessing its reader reference, it can lead
to an unexpected `AlreadyClosedException` or, in the worst case, an already closed MMapDirectory
is accessed, causing a `SIGSEGV` like in #20008 (even though the window for this is very small).
A SearchContext can be closed concurrently if:
* an index is deleted / removed from the node
* a search context is idle for too long and is cleaned up by the reaper
* an explicit freeContext message is received
This change adds reference counting to the SearchContext base class, and it is used
inside SearchService each time the context is accessed.
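A minimal sketch of the reference-counting pattern (hypothetical names, not the actual SearchContext code): the context is only really closed once every in-flight access has released its reference.
```java
// The owner holds the initial reference; every access does tryIncRef/decRef.
// Resources are released only when the count drops to zero.
import java.util.concurrent.atomic.AtomicInteger;

abstract class RefCountedContextSketch {

    private final AtomicInteger refCount = new AtomicInteger(1); // the "open" reference

    /** Returns false if the context is already closed; callers must then bail out. */
    boolean tryIncRef() {
        int current;
        do {
            current = refCount.get();
            if (current <= 0) {
                return false;
            }
        } while (refCount.compareAndSet(current, current + 1) == false);
        return true;
    }

    void decRef() {
        if (refCount.decrementAndGet() == 0) {
            doClose(); // last reference gone: safe to release reader/searcher
        }
    }

    /** Called once by the owner; the actual close happens when the count reaches zero. */
    void close() {
        decRef();
    }

    abstract void doClose();
}
```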
Closes #20008
This includes:
- All regular numeric types such as int, long, scaled-float, double, etc
- IP addresses
- Dates
- Geopoints and Geoshapes
Relates to #19784
Adds ignoreUnavailable to the snapshot status API to be consistent
with the get snapshots API which has a similar parameter. If
ignoreUnavailable is set to true, then the snapshot status request
will ignore any snapshots that were not found in the repository,
instead of throwing a SnapshotMissingException.
Closes #18522
StartupException overrides Throwable#printStackTrace(PrintStream) but
not Throwable#printStackTrace(PrintWriter). The former override is used
when the JVM terminates with an exception, but the latter override can
be used in some logging frameworks when rendering an exception (e.g.,
log4j). This commit adds an override for the latter, with the behavior
for the two overrides being the same.
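The shape of the fix is simply to funnel both overrides into the same rendering logic; a simplified, hypothetical sketch:
```java
// Hypothetical sketch, not the actual StartupException source: both the
// PrintStream and PrintWriter overrides delegate to one shared method, so a
// logging framework that hands us a PrintWriter sees the same output as the
// JVM's PrintStream path.
import java.io.PrintStream;
import java.io.PrintWriter;
import java.util.function.Consumer;

class StartupExceptionSketch extends Exception {

    StartupExceptionSketch(Throwable cause) {
        super(cause);
    }

    @Override
    public void printStackTrace(PrintStream s) {
        printStackTrace(s::println);
    }

    @Override
    public void printStackTrace(PrintWriter s) {
        printStackTrace(s::println);
    }

    private void printStackTrace(Consumer<String> consumer) {
        // Shared (simplified) rendering of the cause chain.
        consumer.accept(getCause() == null ? toString() : getCause().toString());
    }
}
```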
This commit renames StartupError to StartupException. This rename is due
to the fact that this class inherits from Exception not Error in the
Throwable class hierarchy.
This commit removes the minimum master nodes bootstrap check. The
motivation for this check was to raise awareness of the minimum master
nodes setting but this check gives a false sense of security because
it's too easy to set the setting to one when first standing up a cluster
and never update it when adding master-eligible nodes, or have it out of
sync on various nodes and still pass this check. Since this check does
not have the security that other bootstrap checks provide, it should be
removed in favor of a stronger guarantee in the future. We do log a
warning if an election occurs with minimum master nodes less than a
quorum of master-eligible nodes that participated in an election and
this is the best that we can do right now.
Relates #20082