OpenSearch

Commit Graph

Author	SHA1	Message	Date
javanna	84b8c9de19	PluginInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
javanna	555db744f1	use read/writeOptionalWriteable in NodeInfo serialization code	2016-09-02 10:23:05 +02:00
javanna	e98e37295a	PluginsAndModules to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
javanna	2b2fb8daed	TransportInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
javanna	536d13ff11	ProcessInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
javanna	2370c25fa4	ThreadPoolInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
javanna	27e7fc734c	HttpInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
javanna	279f8b27e3	JvmInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
javanna	bea863c660	OsInfo to implement Writeable rather than Streamable This allows to make all instance members final. Also added serialization tests and sorted out inizialization that was scattered in two places.	2016-09-02 10:23:05 +02:00
javanna	f6ab4e1078	ByteSizeValue to implement Writeable rather than Streamable With this we can make ByteSizeValue immutable for real.	2016-09-02 10:23:05 +02:00
Luca Cavanna	faa03ad9fa	Merge pull request #20255 from javanna/enhancement/cluster_stats_available_memory Add mem section back to cluster stats	2016-09-02 10:19:51 +02:00
Simon Willnauer	724e8ec39c	[TEST] Fix settings keys to be the actual keys rather than the toString() of the Setting	2016-09-02 10:00:31 +02:00
Adrien Grand	5bfab76c96	Source filtering should keep working when the source contains numbers greater than `Long.MAX_VALUE`. #20278 Currently it does not because our parsers do not support big integers/decimals (on purpose) but we do not have to ask our parser for the number type, we can just ask the jackson parser for a number representation of the value with the right type. Note that I did not add similar tests for big decimals because Jackson seems to never return big decimals, even for decimal values that are out of the range of values that can be represented by doubles. Closes #11508	2016-09-02 08:56:04 +02:00
Jun Ohtani	aef2e5d90e	Remove `token_filter` in _analyze API Fix wording in docs Refactoring RestAnalyzeActionTests using expectThrows() Closes #20283	2016-09-02 15:08:28 +09:00
Jason Tedor	1e80adbfbe	Configure test logging with Log4j 2 This commit configures test logging for Log4j 2. The default logger configuration uses the console appender but at the error level, so most tests are missing logging. Instead, this commit provides a configuration for tests which is picked up from the classpath by Log4j 2 when it initializes. However, this now means that we can no longer initialize Log4j with a bare-bones configuration when tests run as doing so will prevent Log4j 2 from attempting to configure logging via the classpath. Consequently, we move this needed initialization (as commented, to avoid a message about a status logger not being configured when we are preparing to configure Log4j from properties files in the config directory) to only run when we are explicitly configuring Log4j from properties files. Relates #20284	2016-09-01 14:00:47 -04:00
javanna	186a5d74b8	[TEST] improve ClusterStatsIT to better check mem values returned Rather than checking that those values are greater than 0, we can sum up the values gotten from all nodes and check that what is returned is that same value.	2016-09-01 19:22:13 +02:00
Jun Ohtani	3d9f8ed764	Remove `token_filter` in _analyze API Remove the param and change docs Closes #20283	2016-09-02 01:36:45 +09:00
Clinton Gormley	0e8a43e826	Elasticsearch 2.4.0 uses Lucene 5.5.2	2016-09-01 12:52:01 +02:00
Martijn van Groningen	a110498ad8	settings: Make `action.auto_create_index` setting a dynamic cluster setting. Closes #7513	2016-09-01 12:33:30 +02:00
Clinton Gormley	e5ff3da802	Added version 2.4.0 with bwc indices	2016-09-01 11:36:49 +02:00
javanna	042675432e	make sure that mem, cpu and swap are never null in OsStats	2016-09-01 11:26:03 +02:00
javanna	5f299ff46f	add mem section back to cluster stats The mem section was buggy in cluster stats and removed. It is now added back with the same structure as in node stats, containing total memory, available memory, used memory and percentages. All the values are the sum of all the nodes across the cluster (or at least the ones that we were able to get the values from).	2016-09-01 11:26:03 +02:00
javanna	5211b6b4bc	OsStats.Cpu, OsStats.Mem & OsStats.Swap to implement ToXContent	2016-09-01 11:24:56 +02:00
javanna	0a7a52a31e	OsStats and subobjects to implement Writeable rather than Streamable We can now have final instance members, also drop some optional values and related null checks that weren't needed.	2016-09-01 11:24:56 +02:00
Adrien Grand	34aaea641d	Fix NPE when running a range query on a `scaled_float` with no upper bound. #20253 The null check was there, but on the wrong variable.	2016-09-01 11:23:32 +02:00
Simon Willnauer	a0becd26b1	Optimize indexing for the autogenerated ID append-only case (#20211 ) If elasticsearch controls the ID values as well as the documents version we can optimize the code that adds / appends the documents to the index. Essentially we an skip the version lookup for all documents unless the same document is delivered more than once. On the lucene level we can simply call IndexWriter#addDocument instead of #updateDocument but on the Engine level we need to ensure that we deoptimize the case once we see the same document more than once. This is done as follows: 1. Mark every request with a timestamp. This is done once on the first node that receives a request and is fixed for this request. This can be even the machine local time (see why later). The important part is that retry requests will have the same value as the original one. 2. In the engine we make sure we keep the highest seen time stamp of "retry" requests. This is updated while the retry request has its doc id lock. Call this `maxUnsafeAutoIdTimestamp` 3. When the engine runs an "optimized" request comes, it compares it's timestamp with the current `maxUnsafeAutoIdTimestamp` (but doesn't update it). If the the request timestamp is higher it is safe to execute it as optimized (no retry request with the same timestamp has been run before). If not we fall back to "non-optimzed" mode and run the request as a retry one and update the `maxUnsafeAutoIdTimestamp` unless it's been updated already to a higher value Relates to #19813	2016-09-01 10:39:40 +02:00
Simon Willnauer	419627c460	Ensure ESTestCase is initialized before we run tests	2016-09-01 09:39:44 +02:00
Jason Tedor	d9064f454e	Fix additional exception logging calls This commit modifies a pair of exception logging calls to use parameterized messages from Log4j.	2016-08-31 23:14:13 -04:00
Jason Tedor	76ab02e002	Merge branch 'master' into log4j2 * master: Avoid NPE in LoggingListener Randomly use Netty 3 plugin in some tests Skip smoke test client on JDK 9 Revert "Don't allow XContentBuilder#writeValue(TimeValue)" [docs] Remove coming in 2.0.0 Don't allow XContentBuilder#writeValue(TimeValue) [doc] Remove leftover from CONSOLE conversion Parameter improvements to Cluster Health API wait for shards (#20223) Add 2.4.0 to packaging tests list Docs: clarify scale is applied at origin+offest (#20242)	2016-08-31 16:37:55 -04:00
Jason Tedor	487ffe8375	Remove code references to logging.yml This commit removes code references to logging.yml in TranslogToolCli and PluginCli.	2016-08-31 15:50:45 -04:00
Nik Everett	bd93c7054c	Revert "Don't allow XContentBuilder#writeValue(TimeValue)" This reverts commit `7f70c00dad`.	2016-08-31 14:45:03 -04:00
Nik Everett	7f70c00dad	Don't allow XContentBuilder#writeValue(TimeValue) We have specific support for writing `TimeValue`s in the form of `XContentBuilder#timeValueField`. Writing a `TimeValue` using `XContentBuilder#writeValue` is a bug waiting to happen.	2016-08-31 13:23:38 -04:00
Ali Beyad	4641254ea6	Parameter improvements to Cluster Health API wait for shards (#20223 ) * Params improvements to Cluster Health API wait for shards Previously, the cluster health API used a strictly numeric value for `wait_for_active_shards`. However, with the introduction of ActiveShardCount and the removal of write consistency level for replication operations, `wait_for_active_shards` is used for write operations to represent values for ActiveShardCount. This commit moves the cluster health API's usage of `wait_for_active_shards` to be consistent with its usage in the write operation APIs. This commit also changes `wait_for_relocating_shards` from a numeric value to a simple boolean value `wait_for_no_relocating_shards` to set whether the cluster health operation should wait for all relocating shards to complete relocation. * Addresses code review comments * Don't be lenient if `wait_for_relocating_shards` is set	2016-08-31 11:58:19 -04:00
Jason Tedor	e166459bbe	Merge branch 'master' into log4j2 * master: Increase visibility of deprecation logger Skip transport client plugin installed on JDK 9 Explicitly disable Netty key set replacement percolator: Fail indexing percolator queries containing either a has_child or has_parent query. Make it possible for Ingest Processors to access AnalysisRegistry Allow RestClient to send array-based headers Silence rest util tests until the bogusness can be simplified Remove unknown HttpContext-based test as it fails unpredictably on different JVMs Tests: Improve rest suite names and generated test names for docs tests Add support for a RestClient base path	2016-08-31 10:59:27 -04:00
Jason Tedor	1a805bb675	Increase visibility of deprecation logger The deprecation logger is an important way to make visible features of Elasticsearch that are deprecated. Yet, the default logging makes the log messages for the deprecation logger invisible. We want these log messages to be visible, so the default logging for the deprecation logger should enable these log messages. This commit changes the log level of deprecation log message to warn, and configures the deprecation logger so that these log messages are visible out of the box. Relates #20254	2016-08-31 10:51:17 -04:00
Jason Tedor	ac8c2e98ab	Enable console logging for CLI tools This commit enables CLI tools to have console logging. For the CLI tools, we skip configuring the logging infrastructure via the config file, and instead set the level only via a system property.	2016-08-31 09:05:26 -04:00
Jason Tedor	0fdc5ca587	Remove logger getter from DeprecationLogger This commit removes an unused getter for the logger field from the DeprecationLogger.	2016-08-30 21:19:16 -04:00
Igor Motov	a68083f5cb	Make it possible for Ingest Processors to access AnalysisRegistry The analysis registry will be used in PMML plugin ingest processor.	2016-08-30 21:09:41 -04:00
Jason Tedor	abe3efdfa9	Fix failing max map count check test This commit fixes failing max map count check test due to the use of a logging message supplier.	2016-08-30 18:49:39 -04:00
Jason Tedor	abf8a1a3f0	Avoid allocating log parameterized messages This commit modifies the call sites that allocate a parameterized message to use a supplier so that allocations are avoided unless the log level is fine enough to emit the corresponding log message.	2016-08-30 18:17:09 -04:00
Jason Tedor	7da0cdec42	Introduce Log4j 2 This commit introduces Log4j 2 to the stack.	2016-08-30 13:31:24 -04:00
Nik Everett	df73292256	Add an alias action to delete an index While removing an index isn't actually an alias action, if we add an alias action that deletes an index then we can delete and index and add an alias with the same name as the index atomically, in the same cluster state update. Closes #20064	2016-08-30 10:15:21 -04:00
Simon Willnauer	497e7d1054	User lambda instead of annoymous class in SearchPhaseController	2016-08-30 12:58:54 +02:00
Tanguy Leroux	b4245c7ad9	Add exclusion filters support to filter_path This commit adds the support for exclusion filter to the response filtering (filter_path) feature. It changes the XContentBuilder APIs so that it now accepts two types of filters: inclusive and exclusive. Filters are no more String arrays but sets of String instead.	2016-08-30 09:08:30 +02:00
Martijn van Groningen	1925813e09	ingest: Fix rename processor change rename leaf fields into branch fields Instead of get, set and remove we do get, remove and then set to avoid type conflicts in IngestDocument. If the set still fails we try to restore the original field in ingest document. Closes #19892	2016-08-30 07:38:01 +02:00
Ali Beyad	a132405642	Ensures that during the restore process, if a file in the snapshot (#20220 ) already has a file of the same name in the Store, but is different in content (different checksum/length), then those files are first deleted before restoring the files in question.	2016-08-29 17:51:35 -04:00
Ali Beyad	55b91cdc17	Removes unused test helper method to write old blob store format	2016-08-29 12:44:58 -04:00
Areek Zillur	99734ec576	Merge pull request #20034 from areek/cleanup/index_operation Set created flag in index operation	2016-08-29 12:34:24 -04:00
Nik Everett	9c3f6d58ac	Support downgrading keyword/text into string This changes Elasticsearch to automatically downgrade `text` and `keyword` fields into appropriate `string` fields when changing the mapping of indexes imported from 2.x. This allows users to use the modern, documented syntax against 2.x indexes. It also makes it clear that reindexing in order to recreate the index in 5.0 is required for any long lived indexes. This change is useful for the times when you can't (cluster is just starting, not stable enough for reindex) or shouldn't (index will only live 90 days or something).	2016-08-29 11:27:37 -04:00
javanna	d7ec2db9b0	[TEST] enable cacheKey check in ShardSearchTransportRequestTests Now that #20081 is merged we can check that cacheKey is consistent across equal search requests, something that wasn't true before due to ordering of map keys when using index boost. Relates to #19986	2016-08-29 17:20:26 +02:00
Tanguy Leroux	9727f123b9	Rename Netty TCP transports thread factories from http_* to transport_* Netty3/4 TcpTransport implementations are creating thread factories with a "http_server" thread prefix whereas it should start with "transport_server" and let the "http_server" prefix for the HttpServerTransport implementations.	2016-08-29 13:49:52 +02:00
Yannick Welsch	f070c8727b	[TEST] Add additional logging to testStaleMasterNotHijackingMajority This test is periodically failing. As I suspect that the GCDisruption scheme is somehow making the wrong node block on its cluster state update thread, I've added some more logging and a thread dump once the given assertion triggers again.	2016-08-29 13:42:13 +02:00
Martijn van Groningen	2d82bea040	fix test bug	2016-08-29 13:28:23 +02:00
Jun Ohtani	2a00c9dc46	Merge pull request #19860 from johtani/fix/validate_empty_field_name Validate blank field name	2016-08-29 11:52:18 +09:00
Simon Willnauer	62b821ccf4	[TEST] Ensure test never hangs but fails if it doesn't finish after 10 seconds waiting for threads	2016-08-27 23:20:55 +02:00
Simon Willnauer	162ad1251c	Fsync documents in an async fashion (#20145 ) today we fsync in a blocking fashion where all threads block while another syncs. Yet, we can improve this and make use of the async infrastrucutre added for `wait_for_refresh` and make fsyncing single threaded while all other threads can continue indexing. The syncing thread then notifies a listener once the requests location is synced. This also allows to send docs to replicas before its actually fsynced allowing for cocurrent replica processing. This patch has a significant impact on performance on slower discs. An initial single node benchmark shows that on very fast SSDs there is no noticable impact but on slow spinning disk this patch shows a ~32% performance improvement. ``` NVME SSD: `336ec0ac9a` (master): Total docs/sec: 47200.9 Total docs/sec: 46440.4 23543a97e3e7f72a31e26b50e00931919784426c (async wait for translog): Total docs/sec: 47461.6 Total docs/sec: 46188.3 ------------------------------------------------------------------- Spinning disk: `336ec0ac9a` (master): Total docs/sec: 22733.0 Total docs/sec: 24129.8 23543a97e3e7f72a31e26b50e00931919784426c (async wait for translog): Total docs/sec: 32724.1 Total docs/sec: 32845.4 -------------------------------------------------------------------- ```	2016-08-27 21:42:38 +02:00
Igor Motov	3d6270b5cd	Don't rebuild pipeline on every cluster state update Currently, after at least one pipeline is registered it is getting rebuilt on every single cluster state update, even when this update is not related to ingest metadata. This change adds a check that the ingest metadata changed before trying to rebuild all pipelines.	2016-08-27 10:11:51 -04:00
Yannick Welsch	1b75cb63a2	Add recovery source to ShardRouting (#19516 ) Adds an explicit recoverySource field to ShardRouting that characterizes the type of recovery to perform: - fresh empty shard copy - existing local shard copy - recover from peer (primary) - recover from snapshot - recover from other local shards on same node (shrink index action)	2016-08-27 16:11:10 +02:00
qwerty4030	9172653211	Fix NPE during search with source filtering if the source is disabled. (#20093 ) * Fix NPE during search with source filtering if the source is disabled. Instead of throwing an NPE, a search response with source filtering will not contain the source if it is disabled in the mapping. Closes #7758 * Created unit tests for FetchSourceSubPhase. Tests similar to SourceFetchingIT. Removed SourceFetchingIT#testSourceDisabled (now covered via unit test FetchSourceSubPhaseTests#testSourceDisabled). * Updated FetchSouceSubPhase unit tests per comments. Renamed main unit test method. Use assertEquals and assertNull instead of assertThat (less code).	2016-08-27 07:24:45 -04:00
Ali Beyad	230f0b514f	Fixes test to use admin client to check the cluster state instead of a random node's cluster service.	2016-08-27 01:29:29 -04:00
Ali Beyad	5fac32e699	Removed an unecessary TODO for snapshot file restoration and instead added comments explaining what happens during the restore process.	2016-08-26 17:13:14 -04:00
Lee Hinman	abdd1b6f86	Merge remote-tracking branch 'dakrone/prop-script-settings'	2016-08-26 13:53:48 -06:00
Lee Hinman	3fbfb3e7e7	Fix propagating the default value for script settings Fixes an issue where the value for the `script.engine.<lang>.inline` settings would be _set_ properly, but would not accurately be reflected in the `include_defaults` output. Adds a test to ensure the default raw setting is now correct. Resolves #20159	2016-08-26 13:03:32 -06:00
Xiang Chen	22242ec881	Fix request cache key for search * Make sure indexBoost is serialized in a consistent order * remove hasIndexBoost by using indexBoost size * Make sure phrase suggester's collateParams is serialized in consistent order * Make StreamOutput writer to serialize maps in consistent order	2016-08-26 12:03:24 -04:00
Jun Ohtani	0ad231546d	Validate blank field name Validate only 5.0 alpha 6+ index only Closes #19251	2016-08-26 20:10:33 +09:00
Jun Ohtani	450f47d5b5	Validate blank field name add validation and validate only 5.0+ Add tests before 5.0 Closes #19251	2016-08-26 20:10:33 +09:00
Jason Tedor	287cb00474	Avoid prematurely triggering logger initialization The class Setting holds a static reference to a deprecation logger instance. When the class initializer for Setting runs, it starts triggering log4j initialization. There is a chain of initializations from InternalSettingsPreparer to Environment to Setting that triggers this initialization before log4j configuration has occurred. This commit modifies this initialization so that initialization is not done eagerly. Relates #20170	2016-08-26 05:07:05 -04:00
Adrien Grand	3ed0da5a58	GET operations should not extract fields from `_source`. #20158 This makes GET operations more consistent with `_search` operations which expect `(stored_)fields` to work on stored fields and source filtering to work on the `_source` field. This is now possible thanks to the fact that GET operations do not read from the translog anymore (#20102) and also allows to get rid of `FieldMapper#isGenerated`. The `_termvectors` API (and thus more_like_this too) was relying on the fact that GET operations would extract fields from either stored fields or the source so the logic to do this that used to exist in `ShardGetService` has been moved to `TermVectorsService`. It would be nice that term vectors do not rely on this, but this does not seem to be a low hanging fruit.	2016-08-26 10:35:23 +02:00
Yannick Welsch	6fe9ae29ea	Mark shard as stale on non-replicated write, not on node shutdown (#20023 ) Non-stale shard copies are currently tracked using their allocation ids in the cluster state. When a node leaves the cluster, shard copies of that node are marked as stale by removing their allocation ids from the active set in the cluster. For full cluster restarts, this can have the unwanted effect that only the last node holding a copy of the shard will be seen as non-stale. The other shard copies are not really stale though as long as no writes have happened on this shard copy. Shard copies should thus only be marked as stale (by the master in the cluster state) if other active shards have received writes. This commit implements the above logic and also renames the persistent structure used to track non-stale shard copies from "active_allocations" to "in_sync_allocations" as we now also support tracking non-stale shard copies that have no active routing entries in the cluster state.	2016-08-26 10:09:57 +02:00
Adrien Grand	c5f8e1b64d	Do not parse numbers as both strings and numbers when not included in `_all`. #20167 We need to get the string representation of numbers in order to include in `_all`. However this has a cost and disabling `_all` is rather common so we should look into skipping it.	2016-08-26 10:00:36 +02:00
Jason Tedor	bc136a90d5	Add network types to cluster stats The network types in use on a cluster can be useful information to have, so this commit adds aggregate metrics for the network types in use in a cluster to the cluster stats. Relates #20144	2016-08-25 21:08:05 -04:00
Chris Earle	1cf694b63e	Use StringBuilder in favor of StringBuffer This removes all instances of StringBuffer that are removeable. Uncontended synchronization in Java is pretty cheap, but it's unnecessary.	2016-08-25 16:20:03 -04:00
Chris Earle	b41508a344	Make MapOfLists Generic This moves the Writer interface from StreamOutput into Writeable, as a peer of its inner Reader interface. This should hopefully help to avoid random functional interfaces being created for the same purpose. It also makes use of the moved class by updating writeMapOfLists and readMapOfLists.	2016-08-25 16:10:48 -04:00
Colin Goodheart-Smithe	f5fbb3eb8b	Fix agg profiling when using breadth_first collect mode Previous to this change the nesting of aggregation profiling results would be incorrect when the request contains a terms aggregation and the collect mode is (implicitly or explicitly) set to `breadth_first`. This was because the aggregation profiling has to make the assumption that the `preCollection()` method of children aggregations is always called in the `preCollection()` method of their parent aggregation. When the collect mode is `breadth_first` the `preCollection` of the children aggregations was delayed until the documents were replayed. This change moves the `preCollection()` of deferred aggregations to run during the `preCollection()` of the parent aggregation. This should have no adverse impact on the breadth_first mode as there is no allocation of memory in any of the aggregations. We also apply the same logic to the diversified sampler aggregation as we did to the terms aggregation to move the `preCollection()` of the child aggregations method to be called during the `preCollection()` of the parent aggregation. This commit also includes a fix so that the `ProfilingLeafBucketCollector` propagates the scorer to its delegate so the diversified sampler agg works when profiling is enabled.	2016-08-25 14:57:52 +01:00
Adrien Grand	b521638f52	Revert "Revert "Save one utf8 conversion in KeywordFieldMapper. #19867"" This reverts commit `d805266d94`.	2016-08-25 13:37:14 +02:00
Adrien Grand	f93ce94afe	The root object mapper should support updating `numeric_detection`, `date_detection` and `dynamic_date_formats`. #20119 If they are specified by a mapping update, these properties are currently ignored. This commit also fixes the handling of `dynamic_templates` so that it is possible to remove templates (and so that it works more similarly to all other mapping properties). Closes #20111	2016-08-25 12:39:38 +02:00
Mike McCandless	7a14cd4b1d	Pass baseSimilarity to super (PerFieldSimilarityWrapper)	2016-08-25 04:43:56 -04:00
Mike McCandless	5eb66e3378	Mark Scandinavian analysis components as multi term aware	2016-08-24 19:50:25 -04:00
Mike McCandless	7492300544	Remove now unused Store.renameFile, and obsolete commented out code	2016-08-24 18:20:30 -04:00
Mike McCandless	0ccfe69789	Upgrade to Lucene 6.2.0	2016-08-24 17:26:28 -04:00
Nicholas Knize	9eb63fb885	Refactor GeoPointFieldMapperLegacy and Legacy BBox query helpers This is a house cleaning commit that refactors GeoPointFieldMapperLegacy to LegacyGeoPointFieldMapper for consistency with Legacy Numerics and IP field mappers. IndexedGeoBoundingBoxQuery and InMemoryGeoBoundingBoxQuery are also deprecated and refactored as Legacy classes.	2016-08-24 14:40:25 -05:00
Jim Ferenczi	4682fc34ae	Add the ability to disable the retrieval of the stored fields entirely This change adds a special field named _none_ that allows to disable the retrieval of the stored fields in a search request or in a TopHitsAggregation. To completely disable stored fields retrieval (including disabling metadata fields retrieval such as _id or _type) use _none_ like this: ```` POST _search { "stored_fields": "_none_" } ````	2016-08-24 16:40:08 +02:00
Simon Willnauer	c499427166	Use _refresh instead of reading from Translog in the RT GET case (#20102 ) Today we do a lot of accounting inside the engine to maintain locations of documents inside the transaction log. This is only needed to ensure we can return the documents source from the engine if it hasn't been refreshed. Aside of the added complexity to be able to read from the currently writing translog, maintainance of pointers into the translog this also caused inconsistencies like different values of the `_ttl` field if it was read from the tlog or not. TermVectors are totally different if the document is fetched from the tranlog since copy fields are ignored etc. This chance will simply call `refresh` if the documents latest version is not in the index. This streamlines the semantics of the `_get` API and allows for more optimizations inside the engine and on the transaction log. Note: `_refresh` is only called iff the requested document is not refreshed yet but has recently been updated or added. #Relates to #19787	2016-08-24 15:30:08 +02:00
Simon Willnauer	1b1a1acad8	Don't index the `_version` field (#20132 ) The `_version` field doesn't allow to be searched anyway since it's set `IndexOptions#NONE` for it instead.	2016-08-24 10:04:27 +02:00
Adrien Grand	5d6c9b0745	Fix RAM usage estimation of LiveVersionMap. #20123 I was writing tests for RAM usage estimation of LiveVersionMap and found a couple issues: - The BytesRef objects used as uids were oversized since they were created via `new BytesRef(CharSequence)` which creates a `byte[]` whose size is 3x the length of the provided char sequence. Given that our uids are most of times ASCII sequences, this is a waste of memory. - `VersionValue` was using `translogLocation.size` instead of `translogLocation.ramBytesUsed()` for RAM estimation, which is completely unrelated to the memory footprint of the `Translog.Location` object. In particular, the latter issue could cause RAM usage estimation to be significantly overestimated, especially on large documents. I also added tests for ram accounting.	2016-08-24 09:54:06 +02:00
Lee Hinman	3298a4ed38	Revert "Merge remote-tracking branch 'dakrone/exclude-numerics-from-all'" This reverts commit `514585290c`, reversing changes made to `8563c8d897`.	2016-08-23 09:24:33 -06:00
Nicholas Knize	8234fad9ca	Deprecate geohash parameters for geo_point parser This commit deprecates all geohash parameters in the geo_point field parser.	2016-08-23 09:19:21 -05:00
Nicholas Knize	28ed0e7abf	Deprecate optimize_bbox on geodistance queries Deprecates the optimize_bbox parameter on geodistance queries. This has no longer been needed since version 2.2 because lucene geo distance queries (postings and LatLonPoint) already optimize by bounding box.	2016-08-23 09:14:54 -05:00
Michael McCandless	668dac722a	Don't suppress AlreadyClosedException (#19975 ) Catching and suppressing AlreadyClosedException from Lucene is dangerous because it can mean there is a bug in ES since ES should normally guard against invoking Lucene classes after they were closed. I reviewed the cases where we catch AlreadyClosedException from Lucene and removed the ones that I believe are not needed, or improved comments explaining why ACE is OK in that case. I think (@s1monw can you confirm?) that holding the engine's readLock means IW will not be closed, except if disaster strikes (failEngine) at which point I think it's fine to see the original ACE in the logs? Closes #19861	2016-08-23 12:37:38 +02:00
Masaru Hasegawa	f3cddef61e	Merge pull request #20046 from masaruh/same_shard_host_setting Move cluster.routing.allocation.same_shard.host setting to new settings infrastructure	2016-08-23 11:34:59 +09:00
Jack Conradson	131e370a16	Make Painless the default scripting language. Closes #20017	2016-08-22 17:38:02 -07:00
Lee Hinman	514585290c	Merge remote-tracking branch 'dakrone/exclude-numerics-from-all'	2016-08-22 12:36:25 -06:00
Thiago Souza	8563c8d897	Merge pull request #20042 from tsouza/fix/issue-19364 Use internal from/to when creating InternalDateRange.Bucket	2016-08-22 14:38:13 -03:00
Simon Willnauer	29336b231b	Add ref-counting to SearchContext to prevent accessing already closed readers (#20095 ) When a SearchContext is closed it's reader / searcher reference is closed too. If this happens while a search is accessing it's reader reference it can lead to an unexpected `AlreadyClosedException` or worst case, an already closed MMapDirectory is access causing a `SIGSEV` like in #20008 (even though the window for this is very small). SearchContext can be closed concurrently if: * an index is deleted / removed from the node * a search context is idle for too long and is cleaned by the reaper * an explicit freeContext message is received This change adds reference counting to the SearchContext base class and it's used inside SearchService each time the context is accessed. Closes #20008	2016-08-22 15:41:05 +02:00
Masaru Hasegawa	c7e36536f6	Move cluster.routing.allocation.same_shard.host setting to new settings infrastructure Fixes #20045	2016-08-22 11:07:42 +09:00
Ryan Ernst	e7393529b1	Merge branch 'master' into remove_index_template_filter	2016-08-19 21:14:12 -07:00
Ryan Ernst	1a7a9d3c62	Merge pull request #20071 from rjernst/pull_shards_allocator Plugins: Switch custom ShardsAllocators to pull based model	2016-08-19 20:55:31 -07:00
Ryan Ernst	3a9055b55d	Merge pull request #20073 from rjernst/deguice_indices_service Deguice IndicesService	2016-08-19 20:47:07 -07:00
Lee Hinman	d7e516c0b4	Default `include_in_all` for numeric-like types to false This includes: - All regular numeric types such as int, long, scaled-float, double, etc - IP addresses - Dates - Geopoints and Geoshapes Relates to #19784	2016-08-19 15:50:38 -06:00
Jason Tedor	6cda12871c	Merge pull request #20083 from jasontedor/improve-startup-exception Improve startup exception	2016-08-19 16:44:41 -04:00
Ali Beyad	1c9b64e09a	Adds ignoreUnavailable option to the snapshot status API (#20066 ) Adds ignoreUnavailable to the snapshot status API to be consistent with the get snapshots API which has a similar parameter. If ignoreUnavailable is set to true, then the snapshot status request will ignore any snapshots that were not found in the repository, instead of throwing a SnapshotMissingException. Closes #18522	2016-08-19 16:19:56 -04:00
Jason Tedor	c3849d9e7d	Add print stack trace override to StartupException StartupException overrides Throwable#printStackTrace(PrintStream) but not Throwable#printStackTrace(PrintWriter). The former override is used when the JVM terminates with an exception, but the latter override can be used in some logging frameworks when rendering an exception (e.g., log4j). This commit adds an override for the latter, with the behavior for the two overrides being the same.	2016-08-19 15:10:54 -04:00
Jason Tedor	3a6f7eb07a	Rename StartupError to StartupException This commit renames StartupError to StartupException. This rename is due to the fact that this class inherits from Exception not Error in the Throwable class hierarchy.	2016-08-19 14:53:08 -04:00
Ali Beyad	cf32f8de34	Fixes tests so allocation ids in IndexMetaData is in sync with what is in the RoutingTable	2016-08-19 14:42:02 -04:00
Jason Tedor	069fc22696	Remove minimum master nodes bootstrap check This commit removes the minimum master nodes bootstrap check. The motivation for this check was to raise awareness of the minimum master nodes setting but this check gives a false sense of security because it's too easy to set the setting to one when first standing up a cluster and never update it when adding master-eligible nodes, or have it out of sync on various nodes and still pass this check. Since this check does not have the security that other bootstrap checks provide, it should be removed in favor of a stronger guarantee in the future. We do log a warning if an election occurs with minimum master nodes less than a quorum of master-eligible nodes that participated in an election and this is the best that we can do right now. Relates #20082	2016-08-19 14:21:17 -04:00
Thiago Souza	9ea3f4ace3	Use supported random methods instead of DateTime.now()	2016-08-19 14:09:15 -03:00
Thiago Souza	2ba508a761	Use a better name for unit test method	2016-08-19 13:53:15 -03:00
Yannick Welsch	57c3dcb7d7	Merge pull request #20075 from ywelsch/fix/update-cs-with-routingresult Some time ago, AllocationService.reroute was changed to not only return updates to the routing table but also to the metadata (which contain primary terms and in-sync allocation ids). A lot of test code still only updates the routing table though, which is fixed by this PR.	2016-08-19 18:18:30 +02:00
Yannick Welsch	771668f380	Use routingResult method to update cluster state after reroute This ensures that the routing table as well as the metadata (with the primary terms and in-sync allocation ids) is updated.	2016-08-19 17:15:02 +02:00
Adrien Grand	b586465a4c	Make generics explicit to please ECJ.	2016-08-19 15:55:24 +02:00
Yannick Welsch	a74f77b632	Check that all active shards have their allocation id in the in-sync set	2016-08-19 10:41:11 +02:00
Ryan Ernst	59636a0844	Internal: Deguice IndicesService Almost all the dependencies of indices service are already created outside of guice. This change deguices MetaStateService, and then IndicesService.	2016-08-19 00:27:37 -07:00
Adrien Grand	a4ea7e7223	Switch indices.exists_type from `{index}/{type}` to `{index}/_mapping/{type}`. #20055 This will help remove types as we will need `{index}/{id}` to tell whether a document exists. Relates #15613	2016-08-19 09:18:24 +02:00
Ryan Ernst	207d3a60e7	Fix staging url for official plugins This was incorrectly setup in #19996, without the version in the staging build id.	2016-08-18 23:06:14 -07:00
Ryan Ernst	00c123b59f	Plugins: Remove IndexTemplateFilter How index templates match is currently controlled by the IndexTemplateFilter interface. It is pluggable, to add additional filter implementations to the default glob matcher. This change removes the IndexTemplateFilter interface completely. This is a very esoteric extension point, and not worth maintaining. Instead, any improvements should be made to all of our glob matching.	2016-08-18 22:41:25 -07:00
Ryan Ernst	ab404d90ed	Plugins: Switch custom ShardsAllocators to pull based model This change moves custom ShardsAllocators from registration on ClusterModule, to implementing getShardsAllocators() in ClusterPlugin. It also removes the legacy alias "even_shard" for the balanced allocator which was removed in 2.0.	2016-08-18 22:18:33 -07:00
Thiago Souza	8281a3ce79	Merge pull request #20041 from tsouza/fix/issue-19142 Make exception message more descriptive	2016-08-18 17:31:16 -03:00
Ryan Ernst	165565a817	Merge pull request #20040 from rjernst/pull_allocation_deciders Make custom allocation deciders use pull based extensions	2016-08-18 12:07:09 -07:00
Ryan Ernst	45144edd73	Fix cat allocation test line length violations	2016-08-18 10:51:59 -07:00
Adrien Grand	8f8ae8f577	Mapping updates on objects should propagate `include_an_all`. #20051 Today you can't update `include_an_all` on an existing object. The bug affects 2.x too.	2016-08-18 12:45:28 +02:00
Martijn van Groningen	825edd8dba	tests for Script parsing and serialization	2016-08-18 12:19:43 +02:00
Adrien Grand	d805266d94	Revert "Save one utf8 conversion in KeywordFieldMapper. #19867" This reverts commit `c44679d952`. Conflicts: core/src/main/java/org/elasticsearch/index/mapper/BaseGeoPointFieldMapper.java core/src/main/java/org/elasticsearch/index/mapper/GeoPointFieldMapperLegacy.java core/src/test/java/org/elasticsearch/index/mapper/GeoPointFieldMapperTests.java	2016-08-18 08:17:28 +02:00
Adrien Grand	a7a7123d74	Simplify inclusion in `_all`. #20028 Currently, when you set `include_in_all` on an object, it will propagate the information to its sub mappers immediately. This is annoying because this is done using a different mechanism than regular mapping updates. This PR changes object fields to propagate the information at document parsing time rather than when `include_an_all` is updated. While moving this cost to document parsing time rather than mapping update time is probably a bad trade-off, I am confident that this cost is very low and think this new way makes things simpler.	2016-08-18 08:13:55 +02:00
Thiago Souza	d9bc2693a3	Use internal from/to when creating InternalDateRange.Bucket InternalDateRange.Factory.createBucket should use prototype's internal from/to Fixes https://github.com/elastic/elasticsearch/issues/19364	2016-08-18 00:26:37 -03:00
Ryan Ernst	1ff348ed7f	Plugins: Make custom allocation deciders use pull based extensions This change converts AllocationDecider registration from push based on ClusterModule to implementing with a new ClusterPlugin interface. AllocationDecider instances are allowed to use only Settings and ClusterSettings.	2016-08-17 15:55:31 -07:00
Thiago Souza	8e8614483b	Make exception message more descriptive Exception message should be more descriptive about what to do when inner_hit names colides. Fixes https://github.com/elastic/elasticsearch/issues/19142	2016-08-17 19:54:42 -03:00
Lee Hinman	f6b166f19e	Merge remote-tracking branch 'dakrone/forbid-simpleregex-in-index-name'	2016-08-17 16:01:09 -06:00
Lee Hinman	6030acb43b	Disallow creating indices starting with '-' or '+' Previously this was possible, which was problematic when issuing a request like `DELETE /-myindex`, which was interpretted as "delete everything except for myindex". Resolves #19800	2016-08-17 15:13:03 -06:00
Areek Zillur	fe5cdd30d5	Set created flag in index operation Now document created flag is set in the index operation instead of being returned from engine operation. This change makes the engine index and delete operations have the same signature.	2016-08-17 17:09:34 -04:00
Ryan Ernst	2ea50bc162	Merge pull request #20018 from rjernst/split_disk_threshold Internal: Split disk threshold monitoring from decider	2016-08-17 07:57:50 -07:00
Ryan Ernst	efd8d837e8	Make disk threshold settings final	2016-08-17 07:58:27 -07:00
Yannick Welsch	27a760f9c1	Add routing changes API to RoutingAllocation (#19992 ) Adds a class that records changes made to RoutingAllocation, so that at the end of the allocation round other values can be more easily derived based on these changes. Most notably, it: - replaces the explicit boolean flag that is passed around everywhere to denote changes to the routing table. The boolean flag is automatically updated now when changes actually occur, preventing issues where it got out of sync with actual changes to the routing table. - records actual changes made to RoutingNodes so that primary term and in-sync allocation ids, which are part of index metadata, can be efficiently updated just by looking at the shards that were actually changed.	2016-08-17 10:46:59 +02:00
Adrien Grand	d894db1590	Only use `PUT` for index creation, not POST. #20001 Currently both `PUT` and `POST` can be used to create indices. This commit removes support for `POST index_name` so that we can use it to index documents with auto-generated ids once types are removed. Relates #15613	2016-08-17 10:15:42 +02:00
Adrien Grand	ffee9e8833	Automatically upgrade analyzed string fields that have `index_options` or `position_increment_gap` set. #20002 Closes #19974	2016-08-17 10:14:25 +02:00
Ryan Ernst	b2c0f2d08f	Internal: Split disk threshold monitoring from decider In addition to be an allocation decider, DiskThresholdDecider also monitors the used disk in order to trigger a reroute when the thresholds are crossed. This change splits out the settings for disk thresholds into DiskThresholdSettings, and moves the monitoring to a new DiskThresholdMonitor. DiskThresholdDecider is then in line with other allocation deciders, needing only Settings and ClusterSettings for construction, which will allow deguicing allocation deciders.	2016-08-17 00:22:16 -07:00
Lee Hinman	1825d8060c	Merge remote-tracking branch 'dakrone/lockobtainfailed-replacement'	2016-08-16 14:41:27 -06:00
Lee Hinman	1de3388fa3	Switching LockObtainFailedException over to ShardLockObtainFailedException `LobObtainFailedException` should be reserved for on-disk locks that Lucene attempts (like `write.lock`). This switches our in-memory semaphore locks for shards to use a different exception. Additionally, ShardLockObtainFailedException no longer subclasses IOException, since no IO is being done is this case. Resolves #19978	2016-08-16 14:37:36 -06:00
Areek Zillur	75d4a9f6e4	Allow plugins to upgrade global custom metadata on startup Currently plugins can not inspect or upgrade custom meta data on startup. This commit allow plugins to check and/or upgrade global custom meta data on startup. Plugins can stop a node if any custom meta data is not supported.	2016-08-16 16:24:43 -04:00
Ryan Ernst	743d9fd008	Merge branch 'master' into search_parser	2016-08-16 11:28:59 -07:00
Ryan Ernst	f716a86f40	Add comment about making parser members private instead of public	2016-08-16 11:25:34 -07:00
Nik Everett	fdd50612ae	Fix reindex under the transport client The big change here is cleaning up the `TaskListResponse` so it doesn't have a breaky `toString` implementation. That was causing the reindex tests to break. Also removed `NetworkModule#registerTaskStatus` which is part of the Plugin API. Use `Plugin#getNamedWriteables` instead.	2016-08-16 12:15:15 -04:00
Ali Beyad	88aff40eef	Primary shard allocator observes limits in forcing allocation (#19811 ) Primary shard allocation observes limits in forcing allocation Previously, during primary shards allocation of shards with prior allocation IDs, if all nodes returned a NO decision for allocation (e.g. the settings blocked allocation on that node), we would chose one of those nodes and force the primary shard to be allocated to it. However, this meant that primary shard allocation would not adhere to the decision of the MaxRetryAllocationDecider, which would lead to attempting to allocate a shard which has failed N number of times already (presumably due to some configuration issue). This commit solves this issue by introducing the notion of force allocating a primary shard to a node and each decider implementation must implement whether this is allowed or not. In the case of MaxRetryAllocationDecider, it just forwards the request to canAllocate. Closes #19446	2016-08-16 11:25:45 -04:00
Nik Everett	46bf8baf2e	Switch aggregation registration for push to pull Adds `getAggregations` to `SearchPlugin` which can be used to register aggregations. Fixup MockNode which wasn't createing MockBigArrays.	2016-08-16 09:08:36 -04:00
Ryan Ernst	7fde410586	Internal: Consolidate search parser registries Parsing a search request is currently split up among a number of classes, using multiple public static methods, which take multiple regstries of elements that may appear in the search request like query parsers and aggregations. This change begins consolidating all this code by collapsing the registries normally used for parsing search requests into a single SearchRequestParsers class. It is also made available to plugin services to enable templating of search requests. Eventually all of the actual parsing logic should move to the class, and the registries should be hidden, but for now they are at least co-located to reduce the number of objects that must be passed around.	2016-08-16 01:59:24 -07:00
Ryan Ernst	0996ae03a4	Merge pull request #19996 from rjernst/plugin_location Plugins: Update official plugin location with unified release	2016-08-15 20:36:01 -07:00
Nik Everett	1452ab4b9f	Squash the rest of o.e.rest.action Squashes all the subpackages of `org.elasticsearch.rest.action` down to the following: * `o.e.rest.action.admin` - Administrative actions * `o.e.rest.action.cat` - Actions that make tables for `grep`ing * `o.e.rest.action.document` - Actions that act on documents * `o.e.rest.action.ingest` - Actions that act on ingest pipelines * `o.e.rest.action.search` - Actions that search I'm tempted to merge `search` into `document` but the `document` package feels fairly complete as is and `Suggest` isn't actually always about documents either.... I'm also tempted to merge `ingest` into `admin.cluster` because the latter contains the actions for dealing with stored scripts. I've moved the `o.e.rest.action.support` into `o.e.rest.action`. I've also added `package-info.java`s to all packges in `o.e.rest`. I figure if the package is too small to deserve a `package-info.java` file then it is too small to deserve to be a package.... Also fixes checkstyle in all moved classes.	2016-08-15 21:06:32 -04:00
chengpohi	2adc2a1971	Enable BoostingQuery with FVH highlighter (#19984 ) * Enable BoostingQuery with FVH highlighter * apply boost with negativeBoost * flatten boosting query with its own boost and update boost query to a single layer	2016-08-15 21:00:16 -04:00
Nik Everett	4f262ce11e	Clear some more static state in tests This was causing CI build failures that didn't reproduce consistently locally. Hopefully this will fix the error on CI.	2016-08-15 18:51:17 -04:00
Nik Everett	eb9b84e6c3	Fix broken test Randomized testing requires that we clean all the static state in test classess.	2016-08-15 17:27:01 -04:00
Luca Cavanna	8804035205	Restore assignment of time value when deserializing a scroll instance (#19977 ) * Assign scroll keepAlive when deserializing The scroll time value was never assign when deserializing from the transport layer, meaning that it would always be null when received from another node, although the originating search request might have it set to some value. * add tests for SearchRequest serialization and fail fast with illegal arguments To ease testing, also introduced equals, hashcode and toString methods in SearchRequest and Scroll. The serialization test brought up a few wrong assumptions about non null instance members, for which some null checks were needed to avoid NPEs when serializing. * make Scroll implement Writeable rather than Streamable * [TEST] add serialization test for ShardSearchTransportRequest This also covers ShardSearchLocalRequest implicitly as most of the serialization code is in it.	2016-08-15 17:26:48 -04:00

1 2 3 4 5 ...

6266 Commits