OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	f0020caf7c	Merge pull request #18589 : Improve TimeUnitRounding for edge cases and DST transitions	2016-05-27 15:37:20 +02:00
Simon Willnauer	98fd45dc02	Move DocStats under Engine to get more accurate numbers (#18587 ) Today we pull doc stats from an index reader which might not reflect reality. IndexWriter might have merged all deletes away but due to a missing refresh the stats are completely off. This change pulls doc stats from the IndexWriter directly instead of relying on refreshes to run regularly. Note: Buffered deletes are still not visible until the segments are flushed to disk.	2016-05-27 15:31:40 +02:00
Yannick Welsch	2b47a2643c	Only fail relocation target shard if failing source shard is a primary (#18574 ) If the relocation source fails during the relocation of a shard from one node to another, the relocation target is currently failed as well. For replica shards this is not necessary, however, as the actual shard recovery of the relocation target is done via the primary shard.	2016-05-27 15:28:57 +02:00
Christoph Büscher	29c970958c	Adding tests for derivatives on date histogram aggregation with time zones	2016-05-27 15:18:11 +02:00
Christoph Büscher	4439a86cb8	Improve TimeUnitRounding for edge cases and DST transitions Our current testing for TimeUnitRoundings rounding() and nextRoundingValue() methods that are used especially for date histograms lacked proper randomization for time zones. We only did randomized tests for fixed offset time zones (e.g. +01:00, -05:00) but didn't account for real world time zones with DST transitions. Adding those tests revealed a couple of problems with our current rounding logic. In some cases, usually happening around those transitions, rounding a value down could land on a value that itself is not a proper rounded value. Also sometimes the nextRoundingValue would not line up properly with the rounded value of all dates in the next unit interval. This change improves the current rounding logic in TimeUnitRounding in two ways: it makes sure that calling round(date) always returns a date that when rounded again won't change (making round() idempotent) by handling special cases happening during dst transitions by performing a second rounding. It also changes the nextRoundingValue() method to itself rely on the round method to make sure we always return rounded values for the unit interval boundaries. Also adding tests for randomized TimeUnitRounding that assert important basic properties the rounding values should have. For better understanding and readability a few of the pathological edge cases are also added as a special test case.	2016-05-27 15:18:11 +02:00
Jason Tedor	41710f1028	Upgrade joda-time to 2.9.4 This commit upgrades joda-time to version 2.9.4 to integrate a bug fix there into Elasticsearch. Relates #18609	2016-05-27 08:51:19 -04:00
Martijn van Groningen	0e9f3addd2	Nested inner hits shouldn't use relative paths Like on other places in the query dsl the full field name should be used. Before this change this wasn't the case for nested inner hits when source filtering was used. Highlighting has a workaround, which is now removed as the source of nested inner hits can only be refered by the full name. Closes #16653	2016-05-27 13:41:45 +02:00
Jason Tedor	cebbf0de41	Do not replay into translog on local recovery When performing a local recovery, the engine replays operations recovered from the translog into the translog again. These duplicated operations are eventually cleared after successful recovery on flush, but there is no need to play these operations into the translog at all. This commit modifies the local recovery process so as to not replay these operations into the translog. Relates #18547	2016-05-27 06:04:11 -04:00
Boaz Leskes	318a4e3ef6	Introduce dedicated master nodes in testing infrastructure (#18514 ) This PR changes the InternalTestCluster to support dedicated master nodes. The creation of dedicated master nodes can be controlled using a new `supportsMasterNodes` parameter to the ClusterScope annotation. If set to true (the default), dedicated master nodes will randomly be used. If set to false, no master nodes will be created and data nodes will also be allowed to become masters. If active, test runs will either have 1 or 3 masternodes	2016-05-27 08:44:20 +02:00
Igor Motov	fb763c1e8e	Add ability to store results for long running tasks The results of the tasks are stored in a special index .results	2016-05-26 19:49:13 -04:00
Robert Muir	3f06d9f3b8	Merge pull request #18600 from rmuir/new_script_exception replace ScriptException with a better one	2016-05-26 17:51:34 -04:00
Robert Muir	76ca4af561	move instanceof to catch block	2016-05-26 15:03:13 -04:00
Jason Tedor	d23db39445	Merge pull request #18594 from jasontedor/plugins-cleanup Plugins cleanup	2016-05-26 14:46:09 -04:00
Jason Tedor	f16f65741e	Fix when plugins directory is symlink This commit fixes an issue with the plugins directory being a symbolic link. Namely, the install plugins command attempts to always create the plugins directory just in case it does not exist. The JDK method used here guarantees that the directory is created, and an exception is not thrown if the directory could not be created because it already exists. The problem is that this JDK method does not respect symlinks so its internal existence checks fails, it proceeds to attempt to create the directory, but the directory creation fails because the symlink exists. This is documented as being not an issue. We work around this by checking if there is a symlink where we expect the plugins directory to be, and only attempt to create if not. We add a unit test that plugin installation to a symlinked plugins directory works as expected.	2016-05-26 14:10:32 -04:00
Yannick Welsch	f98ca5310c	Fix ReplicaShardAllocatorTests when unassigned reason is ALLOCATION_FAILED When mocking unassigned shards which have failed with reason ALLOCATION_FAILED we have to ensure that the failed allocation counter is strictly positive.	2016-05-26 19:01:43 +02:00
Ryan Ernst	a8a38c282a	Remove extra mostly duplicate readme file It looks like the readme was duplicated when plugins were merged back into the repo. We removed all these extra files from the plugins, this removes the remaining duplicate from core. closes #18597	2016-05-26 08:54:35 -07:00
Robert Muir	f037807117	replace ScriptException with a better one	2016-05-26 11:43:29 -04:00
Jason Tedor	d29844e597	Remove custom plugins path This commit removes the ability to specify a custom plugins path. Instead, the plugins path will always be a subdirectory called "plugins" off of the home directory.	2016-05-26 10:16:25 -04:00
Jason Tedor	0f529e10a8	Fix plugin command name in remove plugin command This commit fixes the name of the plugin command that is output when a user attempts to remove a plugin that does not exist.	2016-05-26 10:14:39 -04:00
Yannick Welsch	31b0777c91	Simplify delayed shard allocation (#18351 ) This commit simplifies the delayed shard allocation implementation by assigning clear responsibilities to the various components that are affected by delayed shard allocation: - UnassignedInfo gets a boolean flag delayed which determines whether assignment of the shard should be delayed. The flag gets persisted in the cluster state and is thus available across nodes, i.e. each node knows whether a shard was delayed-unassigned in a specific cluster state. Before, nodes other than the current master were unaware of that information. - This flag is initially set as true if the shard becomes unassigned due to a node leaving and the index setting index.unassigned.node_left.delayed_timeout being strictly positive. From then on, unassigned shards can only transition from delayed to non-delayed, never in the other direction. - The reroute step is in charge of removing the delay marker (comparing timestamp when node left to current timestamp). - A dedicated service DelayedAllocationService, reacting to cluster change events, has the responsibility to schedule reroutes to remove the delay marker. Closes #18293	2016-05-26 13:39:55 +02:00
Ryan Ernst	16d029aff7	Merge branch 'master' into staging_plugins	2016-05-25 14:47:12 -07:00
Ryan Ernst	9c15e0c56d	Merge pull request #18583 from rjernst/official_plugins Use resource files for list of modules and plugins	2016-05-25 14:42:24 -07:00
Ryan Ernst	45adab0cb8	Add test that x-pack is in official plugins list	2016-05-25 14:23:57 -07:00
Jason Tedor	9d39b05845	Remove deprecation suppression Failing the build on deprecation warnings was removed in `19b3ec88af`. This commit removes the suppressed deprecation warnings so that their use is surfaced in the build now. Relates #18582	2016-05-25 17:15:36 -04:00
Ryan Ernst	a9e7bdc54c	Plugins: use resource files for list of modules and plugins This adds modules.txt and plugins.txt to the core jar resource files, which the install plugin command statically loads, in place of the previously hardcoded lists (which have often gone out of date).	2016-05-25 13:42:24 -07:00
Ryan Ernst	478877edf7	Plugins: Changing staging property to be the hash instead of a boolean With the unified release, we will have staged releases based on a unified hash (hash of all the hashes), so using the elasticsearch hash for plugins staging will no longer work. This change makes the `es.plugins.staging` property take the staging hash it should use.	2016-05-25 12:36:31 -07:00
Jim Ferenczi	6d62f33702	Make doc_values accessible for _type `doc_values` for _type field are created but any attempt to load them throws an IAE. This PR re-enables `doc_values` loading for _type, it also enables `fielddata` loading for indices created between 2.0 and 2.1 since doc_values were disabled during that period. It also restores the old docs that gives example on how to sort or aggregate on _type field.	2016-05-25 18:56:13 +02:00
Yannick Welsch	1c59c7e349	Log warning if minimum_master_nodes is set to less than a quorum of master-eligible nodes (#15625 ) The setting minimum_master_nodes is crucial to avoid split brains in a cluster. In order to avoid data loss, it should always be configured to at least a quorum (majority) of master-eligible nodes. This commit adds a warning to the logs on the master node if the value is set to less than quorum of master-eligible nodes.	2016-05-25 16:49:08 +02:00
Tanguy Leroux	bdee8c2632	Disable XContent auto closing of object and arrays	2016-05-25 16:46:09 +02:00
Chris Earle	de6b4f35b1	Remove inaccurate Javadoc on Setting constructor The 'Setting' constructor has some outdated Javadoc that suggested that it would automatically apply 'Property.NodeScope' if no scope is supplied, but no scope is added in that case.	2016-05-25 09:28:34 -05:00
Simon Willnauer	eab3113204	Drop 1.x BWC and cut over to Writeable for Translog.Operation (#18565 ) We still maintain BWC for the translog operations back to 1.1 which is not supported in the current version anyway. This commit drops the bwc and moves the operations to the Writeable interface enforcing immutability.	2016-05-25 11:51:28 +02:00
Britta Weber	6862c48791	Merge pull request #18495 from brwe/geo-query-highlight-II skip all geo point queries in plain highlighter	2016-05-25 11:35:59 +02:00
Yannick Welsch	dee34c916c	Expand wildcards to closed indices in /_cat/indices (#18545 ) Closed indices are already displayed when no indices are explicitly selected. This commit ensures that closed indices are also shown when wildcard filtering is used. It also addresses another issue that is caused by the fact that the cat action is based internally on 3 different cluster states (one when we query the cluster state to get all indices, one when we query cluster health, and one when we query indices stats). We currently fail the cat request when the user specifies a concrete index as parameter that does not exist. The implementation works as intended in that regard. It checks this not only for the first cluster state request, but also the subsequent indices stats one. This means that if the index is deleted before the cat action has queried the indices stats, it rightfully fails. In case the user provides wildcards (or no parameter at all), however, we fail the indices stats as we pass the resolved concrete indices to the indices stats request and fail to distinguish whether these indices have been resolved by wildcards or explicitly requested by the user. This means that if an index has been deleted before the indices stats request gets to execute, we fail the overall cat request. The fix is to let the indices stats request do the resolving again and not pass the concrete indices. Closes #16419 Closes #17395	2016-05-25 10:02:14 +02:00
Tal Levy	0fa67e1538	Expose underlying processor to blame for thrown exception within CompoundProcessor (#18342 ) Fixes #17823	2016-05-24 14:25:40 -07:00
Jason Tedor	568c26a76c	Log OS and JVM on startup This commit adds the OS and JVM to the initial logline on startup. Relates #18557	2016-05-24 16:55:55 -04:00
Ali Beyad	105aee08b3	Removes multiple toXContent entry points for SnapshotInfo SnapshotInfo had a toXContent and an externalToXContent, the former for writing snapshot info to the snapshot blob and the latter for writing the snapshot info to the APIs. This commit unifies writing x-content to one method, toXContent, which distinguishes which format to write the snapshot info in based on the Params parameter. In addition, it makes use of the already existing snapshot specific params found in the BlobStoreFormat. Closes #18494	2016-05-24 15:44:22 -04:00
Britta Weber	b3a8c54928	Test for deadlock when relocating and indexing concurrently see #18553 Reproduces with gradle :core:integTest -Dtests.class=org.elasticsearch.recovery.RelocationIT -Dtests.method="testIndexAndRelocateConcurrently" -Dtests.iters=20 -Dtests.failfast=true -Dtests.logger.level=DEBUG -Dtests.seed=9BE7064B819064FA:C44B70F31707D081 -Dtests.timeoutSuite=60000!	2016-05-24 20:06:21 +02:00
Nik Everett	a93f578bf6	Move parsing of allocation commands into REST Port them to the ObjectParser. Don't let plugins register custom allocation commands	2016-05-24 11:59:05 -04:00
Jason Tedor	f210605af8	Add test for parsing fractional seconds This commit adds a test that ensures that strings containing a fractional number of seconds are correctly parsed into milliseconds. Relates #18548	2016-05-24 11:26:36 -04:00
Clinton Gormley	9c9bea9258	Set version to 5.0.0-alpha3 (#18550 ) * Set version to 5.0.0-alpha3 * Updated version in qa/backwards tests too	2016-05-24 16:46:05 +02:00
Tanguy Leroux	1f011f9dea	Remove Delete-By-Query plugin closes #18469	2016-05-24 13:28:20 +02:00
Martijn van Groningen	27cc2fe4dc	Moved the percolator from core to its own module Significant changes: * AbstractQueryTestCase has moved to the test framework module, in order for query builder tests in modules and plugins * Added support to AbstractQueryTestCase to register plugins * Lift the restriction that only one percolator could be added per index. This validation existed in MapperService, but because the percolator moved to a module it could no longer exist there. Instead of bringing it back it was removed. This validation existed since the percolator cache only supported one percolator query per document, since the percolator cache has been removed this restriction could removed as well. * While moving percolator tests to the new module, also removed a couple of tests for the deprecated percolate and mpercolate api. These APIs are now sugar APIs for bwc and rediect to the searvh and msearvh APIs. Some tests were still testing as if percolate and mpercolate API did the percolation, but this no longer the case and these tests could be removed.	2016-05-24 11:01:57 +02:00
Ryan Ernst	ef435d0099	Remove unnecessary boxing and use of deprecated Double ctors	2016-05-23 13:44:59 -07:00
Lee Hinman	bfce901edf	Merge remote-tracking branch 'dakrone/explain-add-fetch-in-progress'	2016-05-23 09:43:16 -06:00
Lee Hinman	8040ed0c16	Add whether the shard state fetch is pending to the allocation explain API If the shard state fetch is still pending, this will now return a message like: ```json { "shard" : { "index" : "i", "index_uuid" : "de1W1374T4qgvUP4a9Ieaw", "id" : 0, "primary" : false }, "assigned" : false, "shard_state_fetch_pending": true, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-04-26T16:34:53.227Z" }, "allocation_delay_ms" : 0, "remaining_delay_ms" : 0, "nodes" : { "z-CbkiELT-SoWT91HIszLA" : { "node_name" : "Brain Cell", "node_attributes" : { "testattr" : "test" }, "store" : { "shard_copy" : "NONE" }, "final_decision" : "NO", "final_explanation" : "the shard state fetch is pending", "weight" : 5.0, "decisions" : [ ] } } } ``` Adds the `shard_state_fetch_pending` field and uses the state to influence the final decision and final explanation. Relates to #17372	2016-05-23 09:42:57 -06:00
Adrien Grand	31e4c16ec3	Merge pull request #18509 from terradatum/epoch Support full range of Java Long for epoch DateTime	2016-05-23 12:27:38 +02:00
Adrien Grand	459916f5dd	Remove custom Base64 implementation. #18413 This replaces o.e.common.Base64 with java.util.Base64.	2016-05-23 11:32:42 +02:00
Tanguy Leroux	e7eb664c78	Change BlobPath.buildAsString() method	2016-05-23 10:50:40 +02:00
Adrien Grand	c5a9edf1c7	Add `Character.MODIFIER_SYMBOL` to the list of symbol categories. #18402 Closes #18388	2016-05-23 10:11:35 +02:00
Jim Ferenczi	238d390637	Fixes for _only_nodes preferences: * Handle multiple attributes/name (coma separated). * Shuffle the nodes that match the preferences. Fix #12546 Fix #12700	2016-05-23 09:44:52 +02:00
Adrien Grand	cb2cfdd9c0	Speed up named queries. #18470 Named queries have a performance bug when they are used with expensive queries that need to perform a lot of work up-front like fuzzy or range queries (including with points). The reason is that they currently re-create the weight and scorer for every hit. Instead we should create weights exactly once and use a single Scorer for all documents that are on the same segment.	2016-05-23 08:56:40 +02:00
Martijn van Groningen	c1a0929123	percolator: Add support dor MatchNoDocsQuery in query terms extract service Before the query extraction would have been aborted and the percolator query would be marked as unknown. This resulted in a situation that these queries always need to be evaluated by the memory index at search time. By adding support for this query many more percolator query candidate hits can skip the expensive memory index verification step. For example the `match` query parser returns a MatchNoDocsQuery if the query terms are removed by text analysis (lets query text only contained stop words).	2016-05-22 22:42:19 +02:00
G. Richard Bellamy	cf54903580	Support full range of Java Long for epoch DateTime Remove the arbitrary limit on epoch_millis and epoch_seconds of 13 and 10 characters, respectively. Instead allow any character combination that can be converted to a Java Long. Update the docs to reflect this change.	2016-05-22 13:08:20 -07:00
Ryan Ernst	37d36f2f4c	Merge branch 'master' into java9	2016-05-21 14:19:58 -07:00
Ryan Ernst	f01f15d3b8	Document the hack	2016-05-21 14:14:12 -07:00
Jason Tedor	ba14aca218	Refactor property placeholder use of env. vars This commit is a slight refactoring of the use of environment variables in replacing property placeholders. In commit `115f983827` the constructor for Settings.Builder was made package visible to provide a hook for tests to mock obtaining environment variables. But we do not need to go that far and can instead provide a small hook for this for tests without opening up the constructor. Thus, in this commit we refactor Settings.Builder#replacePropertyPlaceholders to a package-visible method that accepts a function providing environment variables by names. The public-visible method just delegates to this method passing in System::getenv and tests can use the package-visible method to mock the behavior they need without relying on external environment variables.	2016-05-21 17:06:56 -04:00
Ryan Ernst	41a5c0cfa1	Force java9 log4j hack in testing	2016-05-21 13:41:38 -07:00
Ryan Ernst	b26f5711d6	Fix log4j buggy java version detection	2016-05-21 13:18:12 -07:00
Ryan Ernst	1d40c4bbc1	Make java9 work again This change makes ES compile with java9 again, build 118. * There are a handful of changes due to failure to determine types during compile. * The attachment plugins which use tika needed to have tika upgraded in order to pickup fixes there for java 9. * azure discovery and s3 repository indirectly depend on jaxb, which is no longer in the default modules. They now add a jaxb dependency externally, and make JarHell allow for this package.	2016-05-21 09:41:51 -07:00
Lee Hinman	1d8441b681	Merge remote-tracking branch 'dakrone/remove-script-mode'	2016-05-20 15:22:14 -06:00
Jason Tedor	115f983827	Fix env. var placeholder test so it's reproducible This commit modifies the settings test for environment variables placeholders so that it is reproducible. The underlying issue is that the set of environment variables from system to system can vary, and this means that's we can never be sure that a failing test will be reproducible. This commit simplifies this test to not rely on external forces that could influence reproducibility. Relates #18501	2016-05-20 17:11:37 -04:00
Lee Hinman	fdfd2a2f18	Remove ScriptMode class in favor of boolean true/false This removes the ScriptMode class entirely, which was an enum with two options (ON and OFF) which essentially boiled down to true and false. Now the boolean values are used instead.	2016-05-20 15:01:30 -06:00
Jason Tedor	4c7993ea71	Netty request/response tracer should wait for send We write to Netty channels in an async fashion, but notify listeners via a transport service adapter before we are certain that the channel write succeeded. In particular, the tracer logs are implemented via a transport service adapter and this means that we can write tracer logs before a write was successful and in some cases the write might fail leading to misleading logs. This commit attaches the transport service adapters to channel writes as a listener so that the notification occurs only after a successful write. Relates #18500	2016-05-20 16:26:46 -04:00
Ali Beyad	923d90d434	Remove use of a Fields class in snapshot responses that contains x-content keys, in favor of declaring/using the keys directly. Closes #18497	2016-05-20 15:07:41 -04:00
Simon Willnauer	35e705877b	Limit retries of failed allocations per index (#18467 ) Today if a shard fails during initialization phase due to misconfiguration, broken disks, missing analyzers, not installed plugins etc. elasticsaerch keeps on trying to initialize or rather allocate that shard. Yet, in the worst case scenario this ends in an endless allocation loop. To prevent this loop and all it's sideeffects like spamming log files over and over again this commit adds an allocation decider that stops allocating a shard that failed more than N times in a row to allocate. The number or retries can be configured via `index.allocation.max_retry` and it's default is set to `5`. Once the setting is updated shards with less failures than the number set per index will be allowed to allocate again. Internally we maintain a counter on the UnassignedInfo that is reset to `0` once the shards has been started. Relates to #18417	2016-05-20 20:37:45 +02:00
Britta Weber	b493f0defe	skip all geo point queries in plain highlighter Geo queries and plain highlighter do not seem to work well together (https://issues.apache.org/jira/browse/LUCENE-7293) so we need to skip all geo related queries when we highlight. closes #17537	2016-05-20 20:07:10 +02:00
Jason Tedor	61f40156d3	Do not decode path when sending error Today when sending a REST error to a client, we send the decoded path. But decoding that path can already be the cause of the error in which case decoding it again will just throw an exception leading to us never sending an error back to the client. It would be better to send the entire raw path to the client and that is what we do in this commit. Relates #18477	2016-05-20 12:15:30 -04:00
Igor Motov	ce88d7f9ab	Fix race condition in snapshot initialization When a snapshot initialization fails, the create snapshot method may return before the snapshot metadata in the cluster state is removed. This can cause follow up snapshot-API related calls to fail due to a snapshot still running. This is causing CI failures when we try to delete indices that were participating in failed snapshot to a read-only repository. Closes #18121	2016-05-20 10:52:08 -04:00
Daniel Mitterdorfer	8b962bb234	Increase log level for NettyHttpRequestSizeLimitIT This test fails spuriosly in CI and is not reproducible locally. With this commit we temporarily increase the log level in a few packages that are suspected to reveal the cause.	2016-05-20 15:12:52 +02:00
Jason Tedor	140f9dfe5f	Fix scaling thread pool test bug This commit fixes a test bug in the scaling thread pool configuration test. In particular, the test randomization could select min and max for a thread pool configuration where both are equal to zero. This is a violation of the requirements of the ThreadPoolExecutor. With this commit, we now ensure that the max is bounded below by one.	2016-05-20 08:58:08 -04:00
Martijn van Groningen	80fee8666f	percolator: Removed percolator cache Before 5.0 for it was required that the percolator queries were cached in jvm heap as Lucene queries for two reasons: 1) Performance. The percolator evaluated all percolator queries all the time. There was no pre-selecting queries that are likely to match like we have today. 2) Updates made to percolator queries were visible in realtime, Today these changes are visible in near realtime. So updating no longer requires the percolator to have the queries in jvm heap. So having the percolator queries in jvm heap via the percolator cache is now less attractive. Especially when there are many percolator queries then these queries can consume many GBs of jvm heap. Removing the percolator cache does make the percolate query slower compared to how the execution time in 5.0.0-alpha1 and alpha2, but it is still faster compared to 2.x and before.	2016-05-20 14:52:16 +02:00
Christoph Büscher	d3fe22c990	Improve adding clauses to `span_near` and `span_or` query Currently the query builders expose the clauses of the span query as a modifiable list. Instead we should make the that getter return an unmodifiable list. Also renaming the method used to add a clause from `clause(spanQuery)` to `addClause(spanQuery)`.	2016-05-20 13:36:55 +02:00
Boaz Leskes	34ef5306d2	Snapshotting and sync could cause a dead lock TranslogWriter (#18481 ) #18360 introduced an extra lock in order to allow writes while syncing the translog. This caused a potential deadlock with snapshotting code where we first acquire the instance lock, followed by a sync (which acquires the syncLock). However, the sync logic acquires the syncLock first, followed by the instance lock. I considered solving this by not syncing the translog on snapshot - I think we can get away with just flushing it. That however will create subtleties around snapshoting and whether operations in them are persisted. I opted instead to have slightly uglier code with nest synchronized, where the scope of the change is contained to the TranslogWriter class alone.	2016-05-20 12:56:24 +02:00
Jason Tedor	c257e2c51f	Remove settings and system properties entanglement Today when parsing settings during bootstrap, we add a system property for every Elasticsearch setting. Additionally, settings can be set via system properties. This commit simplifies this situation. - settings are no longer propogated to system properties - system properties can not be used to set settings - the "es." prefix on settings is no longer required (nor permitted) - test logging has a dedicated system property (tests.logger.level) Relates #18198	2016-05-19 14:08:08 -04:00
Clinton Gormley	dc33a83231	Remove the preserve_original option from the FingerprintAnalyzer (#18471 ) The preserve_original option to the ASCIIFoldingFilter doesn't play well with the FingerprintFilter, as it ends up producing fingerprints like: "and consistent godel gödel is said sentence this yes" The goal of the OpenRefine algorithm is to product a small normalized ASCII fingerprint. There's no need to expose preserve_original.	2016-05-19 19:37:13 +02:00
Christoph Büscher	d2515727d0	Improve random DateTimeZone creation in tests We often require a random joda DateTimeZone in our tests. Currently there are a few options for generating such a random DateTimeZone from the set of available ids. Currently most random picks are not really reproducable across different jvms because they rely on order in the ids set implementation. The helper in DateProcessorFactoryTests thus performs a sort on the set of ids before random picking from the result, so I moved this to ESTestCase to make it publicly available and changed all other tests to use that method.	2016-05-19 18:12:48 +02:00
Boaz Leskes	4d6887075f	Log IndexShard.refresh logs under trace (#18435 ) We log them every second...	2016-05-19 17:12:37 +02:00
Christoph Büscher	757ccf00b2	Enforce MatchQueryBuilder#maxExpansions() to be strictly positive	2016-05-19 16:59:37 +02:00
Jeff Evans	3cf4214255	Add better error message when analyzer created without tokenizer or analyzer type (#18455 ) Closes #15492	2016-05-19 15:47:07 +02:00
Ali Beyad	fc6df23fea	Rename AggregatorBuilder and all of its subclasses to AggregationBuilder, in keeping consistent with the Java APIs. Closes #18377 Closes #18367	2016-05-19 09:25:29 -04:00
Tanguy Leroux	35d3bdab84	Add Google Cloud Storage repository plugin Closes #12880	2016-05-19 13:26:23 +02:00
Martijn van Groningen	e2691d7e5c	test: Don't generate a value of 0, because FuzzyQuery constructor does't allow that	2016-05-19 13:08:30 +02:00
Martijn van Groningen	050145f61b	parent/child: Allow adding additional child types that point to an existing parent type From 2.0 adding child types to existing types was forbidden because the`_parent` field stores the join between parent and child at index time. This is to protect from the fact that types that weren't a parent before become a parent while previously indexed documents would not have a join field. This would break the parent/child queries. The restriction was a bit too strict in the sense that also if a type was a parent type the restriction would forbid adding child types that point to a parent type (so child points already point to it). This change make sure that the restriction only applies if that type isn't a parent type already. Closes #17956	2016-05-19 11:05:17 +02:00
Simon Willnauer	d77c299cb9	Register `indices.query.bool.max_clause_count` setting (#18341 ) * Register `indices.query.bool.max_clause_count` setting This commit registers `indices.query.bool.max_clause_count` as a node level setting and removes support for its synonym setting `index.query.bool.max_clause_count`. Closes #18336	2016-05-19 10:42:35 +02:00
Simon Willnauer	2b972f1f75	FSync translog outside of the writers global lock (#18360 ) FSync translog outside of the writers global lock Today we aquire a write global lock that blocks all modification to the translog file while we fsync / checkpoint the file. Yet, we don't necessarily needt to block concurrent operations here. This can lead to a lot of blocked threads if the machine has high concurrency (lot os CPUs) but uses slow disks (spinning disks) which is absolutely unnecessary. We just need to protect from fsyncing / checkpointing concurrently but we can fill buffers and write to the underlying file in a concurrent fashion. This change introduces an additional lock that we hold while fsyncing but moves the checkpointing code outside of the writers global lock.	2016-05-19 09:40:10 +02:00
Simon Willnauer	9a9301f7d8	Remove dead BloomFilter code We don't use this class for a quite a while. lets trash it.	2016-05-18 23:00:57 +02:00
Clinton Gormley	cec9a94b96	Added version 2.3.3 with bwc indices	2016-05-18 17:33:21 +02:00
Christoph Büscher	7c665a010b	Fix TimeZoneRounding#nextRoundingValue for hour, minute and second units Currently rounding intervals obtained by nextRoundingValue() for hour, minute and second units can include an extra hour when happening at DST transitions that add an extra hour (eg CEST -> CET). This changes the rounding logic for time units smaller or equal to an hour to fix this. Closes #18326	2016-05-18 17:29:02 +02:00
markharwood	a846ff93e9	Aggregations fix: support include/exclude strings formatted for IP and date fields in terms and significant_terms aggregations. Closes #17705	2016-05-18 16:21:55 +01:00
Tanguy Leroux	27b65e90ca	Merge pull request #18443 from tlrx/fix-18433 Add missing builder.endObject() in FsInfo	2016-05-18 15:40:55 +02:00
Jason Tedor	cad0608cdb	Add GC overhead logging This commit adds simple GC overhead logging. This logging captures intervals where the JVM is spending a lot of time performing GC but it is not necessarily the case that each GC is large. For a start, this logging is simple and does not attempt to incorporate whether or not the collections were efficient (that is, we are only capturing that a lot of GC is happening, not that a lot of useless GC is happening). Relates #18419	2016-05-18 09:31:28 -04:00
Daniel Mitterdorfer	3954306af2	Merge pull request #18432 from danielmitterdorfer/fix-circuit-breaker-it Clear all caches after testing parent breaker	2016-05-18 15:23:15 +02:00
Tanguy Leroux	d7a31c8cf7	Add missing builder.endObject() in FsInfo closes #18433	2016-05-18 15:19:30 +02:00
Christoph Büscher	808ef6cec7	Fix parsing single `rescore` element in SearchSourceBuilder We are currently only parsing the array-syntax for the rescore part in SearchSourceBuilder ("rescore" : [ {...}, {...} ]) . We also need to support "rescore" : {...} Closes #18439	2016-05-18 15:08:28 +02:00
Clinton Gormley	c03dd8a290	Make the index-too-old exception more explicit (#18438 ) Closes #18418	2016-05-18 13:33:25 +02:00
Daniel Mitterdorfer	de3e7d161f	Add tests for null precondition check in BulkRequest Relates #18347 Checked with @javanna	2016-05-18 12:10:13 +02:00
Yannick Welsch	6dacac49b3	Simplify recovery logic in IndicesClusterStateService (#18405 ) - Moves recovery logic into IndexShard - Simplifies logic to cancel peer recovery of shard where recovery source node changed - Ensures routing entry is set on initialization of IndexShard	2016-05-18 10:51:57 +02:00
Daniel Mitterdorfer	c13df3b6c5	Clear all caches after testing parent breaker With this commit we clear all caches after testing the parent circuit breaker. This is necessary as caches hold on to circuit breakers internally. Additionally, due to usage of CircuitBreaker#addWithoutBreaking() in caches, it's even possible to go above the limit. As a consequence, all subsequent requests fall victim to the limit. Hence, right after the parent circuit breaker tripped, we clear all caches to reduce these circuit breakers to 0 again. We also exclude the clear caches transport request from limit check in order to ensure it will succeed. As this is typically a very small and low-volume request, it is deemed ok to exclude it. Closes #18325	2016-05-18 09:31:35 +02:00
Jason Tedor	ecce53f0df	Add I/O statistics on Linux This commit adds a variety of real disk metrics for the block devices that back Elasticsearch data paths. A collection of statistics are read from /proc/diskstats and are used to report the raw metrics for operations and read/write bytes. Relates #15915	2016-05-17 16:16:39 -04:00
Jason Tedor	584be0b3f8	Refactor JvmGcMonitorService for testing This commit refactors the JvmGcMonitorService so that it can be tested. In particular, hooks are added to verify that the JvmMonitorService correctly observes slow GC events, and that the JvmGcMonitorService logs the correct messages. Relates #18378	2016-05-17 13:05:36 -04:00
Yannick Welsch	9ba554dfd2	Expose previous cluster state only in RoutingAllocation (#18390 ) Instead of re-exposing index metadata and blocks in RoutingNodes (which is part of the cluster state before rerouting), expose it as part of the RoutingAllocation which is known to be only temporarily used during reroute.	2016-05-17 19:02:28 +02:00
$polyfractal$ polyfractal	c755a77022	[TEST] Use a reproducible source of randomness in shuffle	2016-05-17 12:55:07 -04:00
Zachary Tong	7c46b57ff2	Add a Sort ingest processor Sorts an array of values in ascending or descending order. If all elements are numerics, they will be sorted numerically. If values are strings, or mixtures of strings/numbers, the elements will be sorted lexicographically.	2016-05-17 12:06:48 -04:00
Colin Goodheart-Smithe	8c9ca8b518	Moves query profiler classes into their own package The change also renames fields and methods in the Profilers class. Note that I had to make ProfileResult a public class (it was package private before) because now classes that call it are in a different package.	2016-05-17 14:20:05 +01:00
Ali Beyad	3764789d96	Removed unused AllocationService member in TransportClusterAllocationExplainAction Closes #18381	2016-05-16 18:41:36 -04:00
Robert Muir	8d4c1befe5	Merge pull request #18364 from rmuir/nukeRunAsFloat Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-16 17:08:25 -04:00
Adrien Grand	864ed04059	Lessen leniency of the query dsl. #18276 This change does the following: - Queries that are currently unsupported such as prefix queries on numeric fields or term queries on geo fields now throw an error rather than returning a query that does not match anything. - Fuzzy queries on numeric, date and ip fields are now unsupported: they used to create range queries, we now expect users to use range queries directly. Fuzzy, regexp and prefix queries are now only supported on text/keyword fields (including `_all`). - The `_uid` and `_id` fields do not support prefix or range queries anymore as it would prevent us to store them more efficiently in the future, eg. by using a binary encoding. Note that it is still possible to ignore these errors by using the `lenient` option of the `match` or `query_string` queries.	2016-05-16 17:37:00 +02:00
Colin Goodheart-Smithe	e37e8af5e2	Refactor of query profile classes to make way for other profile implementations	2016-05-16 16:15:50 +01:00
Colin Goodheart-Smithe	6eda9f5df6	more tests following review	2016-05-16 09:07:22 +01:00
Colin Goodheart-Smithe	0c449fee4a	small fix following rebase on master	2016-05-16 09:07:22 +01:00
Colin Goodheart-Smithe	66d0bdab0c	review comments	2016-05-16 09:07:22 +01:00
Colin Goodheart-Smithe	ab3121c871	Adds a methods to find (and dynamically create) the mappers for the parents of a field with dots in the field name	2016-05-16 09:07:22 +01:00
Robert Muir	8edf213492	Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-15 22:59:28 -04:00
Michael McCandless	0d570352dd	Merge pull request #18355 from mikemccand/iterables_flatten Iterables.flatten should not pre-cache the first iterator	2016-05-15 10:21:35 -04:00
Mike McCandless	8d7db7fd7a	remove whitespace	2016-05-14 18:50:10 -04:00
Mike McCandless	ded8b400b0	Fix concurrency bug in IMC that could lead to negative total indexing bytes	2016-05-14 18:47:26 -04:00
Mike McCandless	48dca45564	leave Iterables.flatten pre-caching the outer Iterable	2016-05-14 17:10:17 -04:00
Mike McCandless	53c2f8b4b6	improve javadocs	2016-05-14 13:46:34 -04:00
Mike McCandless	cf2af8961b	Iterables.flatten should not pre-cache the first iterator	2016-05-14 13:39:48 -04:00
Ali Beyad	d3d57da89f	Removes unused methods in the o/e/common/Strings class. Closes #18346	2016-05-14 08:08:30 -04:00
Daniel Mitterdorfer	009cf434a2	Merge pull request #18347 from danielmitterdorfer/bulk-req-precondition-check Add not-null precondition check in BulkRequest	2016-05-14 11:25:02 +02:00
Daniel Mitterdorfer	a2381640da	Add not-null precondition check in BulkRequest With this commit we add a precondition check to BulkRequest so we fail early if users pass `null` for the request object. For a more detailed discussion, see #12038. This supersedes #12038. Relates #12038.	2016-05-14 09:59:53 +02:00
Robert Muir	2028691e66	painless: improve exception stacktraces closes #18319	2016-05-13 15:40:45 -04:00
Lee Hinman	864ba8dac1	Merge remote-tracking branch 'dakrone/there-can-be-only-one2'	2016-05-13 10:28:41 -06:00
Adrien Grand	b4dec0ddbe	Remove dead code.	2016-05-13 18:27:12 +02:00
Jason Tedor	81898a2e3e	Avoid race while retiring executors Today, a race condition exists when retiring executors. Namely, if an executor is retired and then the thread pool is terminated, the retiring of the executor and the termination of the thread pool can race to remove the retired executor from the queue of retired executors. More precisely, when the executor is initially retired, it is placed on a queue of retired executors, and then removed when it is successfully shutdown. When the pool is terminated, it will also drain the queue of retired executors. This leads to a time-of-check-time-of-use race where the draining can see a retired executor on the queue but that retired executor can be removed upon successful shutdown of that executor. This leads to the draining attempting to remove an element from the queue when there is none. This commit addresses this race condition by instead safely polling the queue. Relates #18333	2016-05-13 12:26:18 -04:00
Lee Hinman	9bcdafedda	Allow only a single extension for a scripting engine Previously multiple extensions could be provided, however, this can lead to confusion with on-disk scripts (ie, "foo.js" and "foo.javascript") having different content. Only a single extension is now supported. The only language currently supporting multiple extensions was the Javascript engine ("js" and "javascript"). It now only supports the `.js` extension. Relates to #10598	2016-05-13 09:54:31 -06:00
Lee Hinman	d5b75491dc	Merge remote-tracking branch 'dakrone/remove-script-sandbox'	2016-05-13 09:50:39 -06:00
Britta Weber	e7c17fc9fb	[TEST] increase logger level until we know what is going on We have an issue for it too: https://github.com/elastic/elasticsearch/issues/18121	2016-05-13 17:37:17 +02:00
Christoph Büscher	a40c397c67	Don't allow `fuzziness` for `multi_match` types cross_fields, phrase and phrase_prefix Currently `fuzziness` is not supported for the `cross_fields` type of the `multi_match` query since it complicates the logic that blends the term queries that cross_fields uses internally. At the moment using this combination is silently ignored, which can lead to confusions. Instead we should throw an exception in this case. The same is true for phrase and phrase_prefix type. Closes #7764	2016-05-13 17:32:14 +02:00
Jason Tedor	786a6a00d9	Add test for fixed executor rejected count This commit adds a test that a fixed executors rejected count behaves as expected. In particular, we test that if we consume the executor, then stuff the executor queue, further tasks will be rejected and the rejected stats are updated appropriately. This test also asserts that if we resize the queue the rejected count is reset to zero. Relates #18301	2016-05-13 11:27:12 -04:00
Lee Hinman	efff3918d8	Remove support for mulitple languages per scripting engine	2016-05-13 09:24:31 -06:00
Lee Hinman	a4060f7436	Remove vestiges of script engine sandboxing This removes all the mentions of the sandbox from the script engine services and permissions model. This means that the following settings are no longer supported: ```yaml script.inline: sandbox script.stored: sandbox ``` Instead, only a `true` or `false` value can be specified. Since this would otherwise break the default-allow parameter for languages like expressions, painless, and mustache, all script engines have been updated to have individual settings, for instance: ```yaml script.engine.groovy.inline: true ``` Would enable all inline scripts for groovy. (they can still be overridden on a per-operation basis). Expressions, Painless, and Mustache all default to `true` for inline, file, and stored scripts to preserve the old scripting behavior. Resolves #17114	2016-05-13 09:24:31 -06:00
Adrien Grand	638da06c1d	Add back support for `ip` range aggregations. #17859 This commit adds support for range aggregations on `ip` fields. However it will only work on 5.x indices. Closes #17700	2016-05-13 17:22:01 +02:00
Clinton Gormley	f2797dbccb	Fixed grammar in index-too-old exception (#18327 )	2016-05-13 15:08:15 +02:00
Adrien Grand	61b1f4ad0b	Fix xcontent rendering of ip terms aggs. #18003 Currently terms on an ip address try to put their binary representation in the json response. With this commit, they would return a formatted ip address: ``` "buckets": [ { "key": "192.168.1.7", "doc_count": 1 } ] ```	2016-05-13 14:59:36 +02:00
Daniel Mitterdorfer	ddbfda2c68	Exclude specific transport actions from request size limit check We add support to explicitly exclude specific transport actions from the request size limit check. We also exclude the following request types currently: MasterPingRequest PingRequest	2016-05-13 14:21:24 +02:00
Britta Weber	d3efe37814	[TEST] mute test for now, we have an issue for it https://github.com/elastic/elasticsearch/issues/18325	2016-05-13 14:08:17 +02:00
Britta Weber	0d5a2f25d3	[TEST] muste test, we have an issue for it https://github.com/elastic/elasticsearch/issues/18293	2016-05-13 12:09:30 +02:00
Olivier Bourgain	df43230844	Add index name and uuid in IndexAlreadyExistsException default message Relates #18274	2016-05-12 14:32:30 -04:00
Jason Tedor	0830bd4885	Remove period in min master node check log message As most of our log messages are not sentences and do not end with periods, this commit removes a period from the end of the min master node bootstrap check log message.	2016-05-12 12:48:58 -04:00
Zachary Tong	5ee5cc25cc	Move AsciiFolding earlier in FingerprintAnalyzer filter chain Rearranges the FingerprintAnalyzer so that AsciiFolding comes earlier in the chain (after lowercasing, before stop removal, for maximum deduping power) Closes #18266	2016-05-12 09:34:15 -04:00
Robert Muir	3b66d40f7c	Merge pull request #18284 from rmuir/painless_value_aggregations _value support in painess?	2016-05-11 20:37:35 -04:00
Robert Muir	6b4e47bf96	this makes aggregations per-document _value fast (bypass hash put, hash get, etc) for painless. but i have no clue how to test it, it seems this feature never worked via REST? Should we drop the feature instead?	2016-05-11 15:39:00 -04:00
Ali Beyad	189341da10	CORS handling triggered whether User-Agent is a browser or not This commit ensures that if CORS is enabled, then Origin headers are checked regardless of whether the request came from a browser or not. In the past, we only proceeded with CORS checks if the User-Agent was a browser.	2016-05-11 15:30:15 -04:00
Ali Beyad	fced8dac72	When CORS is enabled, permit requests from the same origin as the request host, as the request is not a cross origin. Relates #18256	2016-05-11 15:24:36 -04:00
Ali Beyad	5189eb41c7	Dangling indices are not imported if a tombstone for the same index (same name and UUID) exists in the cluster state. This resolves a situation where if an index data folder was copied into a node's data directory while the node is running and that index had a tombstone in the cluster state, the index would still get imported. Closes #18250 Closes #18249	2016-05-11 12:56:19 -04:00
Adrien Grand	ce4af4be42	Remove dead code.	2016-05-11 18:38:07 +02:00
Adrien Grand	866a5459f0	Make significant terms work on fields that are indexed with points. #18031 It will keep using the caching terms enum for keyword/text fields and falls back to IndexSearcher.count for fields that do not use the inverted index for searching (such as numbers and ip addresses). Note that this probably means that significant terms aggregations on these fields will be less efficient than they used to be. It should be ok under a sampler aggregation though. This moves tests back to the state they were in before numbers started using points, and also adds a new test that significant terms aggs fail if a field is not indexed. In the long term, we might want to follow the approach that Robert initially proposed that consists in collecting all documents from the background filter in order to compute frequencies using doc values. This would also mean that significant terms aggregations do not require fields to be indexed anymore.	2016-05-11 16:52:58 +02:00
Jason Tedor	d0edd13f7b	Log setting key not setting object in IMC This commit modifies two logging statements in the IndexingMemoryController to log the key for the setting indices.memory.index_buffer_size instead of the object. Relates #18191	2016-05-11 10:37:23 -04:00

1 2 3 4 5 ...

5336 Commits