Before 5.0 it was required that percolator queries were cached in JVM heap as Lucene queries, for two reasons:
1) Performance. The percolator evaluated all percolator queries on every request; there was no pre-selection of queries that are likely to match like we have today.
2) Updates made to percolator queries were visible in real time. Today these changes are visible in near real time, so updating no longer requires the percolator to keep the queries in JVM heap.
Keeping the percolator queries in JVM heap via the percolator cache is therefore less attractive, especially when there are many percolator queries, which can consume many GBs of JVM heap.
Removing the percolator cache does make the percolate query slower compared to its execution time in 5.0.0-alpha1 and alpha2, but it is still faster than in 2.x and before.
Today when parsing settings during bootstrap, we add a system property
for every Elasticsearch setting. Additionally, settings can be set via
system properties. This commit simplifies this situation.
- settings are no longer propagated to system properties
- system properties cannot be used to set settings
- the "es." prefix on settings is no longer required (nor permitted)
- test logging has a dedicated system property (tests.logger.level)
Relates #18198
We often require a random joda DateTimeZone in our tests. There are currently
a few options for generating such a random DateTimeZone from the set of
available ids, but most random picks are not really reproducible across
different JVMs because they rely on the iteration order of the ids set
implementation. The helper in DateProcessorFactoryTests therefore performs a
sort on the set of ids before picking randomly from the result, so I moved this
helper to ESTestCase to make it publicly available and changed all other tests
to use that method.
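A minimal sketch of the idea, assuming a hypothetical `randomTimeZone(Random)` helper rather than the exact ESTestCase signature: sorting the available ids first makes the pick depend only on the test's random seed, not on set iteration order.

```java
import org.joda.time.DateTimeZone;

import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

public class RandomTimeZoneSketch {
    // Pick a DateTimeZone reproducibly: sort the available ids so the result
    // depends only on the seed of the provided Random, not on set ordering.
    public static DateTimeZone randomTimeZone(Random random) {
        List<String> ids = new ArrayList<>(DateTimeZone.getAvailableIDs());
        Collections.sort(ids);
        return DateTimeZone.forID(ids.get(random.nextInt(ids.size())));
    }

    public static void main(String[] args) {
        // The same seed yields the same time zone on any JVM.
        System.out.println(randomTimeZone(new Random(42)));
    }
}
```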
This commit adds a variety of real disk metrics for the block devices
that back Elasticsearch data paths. A collection of statistics is read
from /proc/diskstats and used to report the raw metrics for
operations and read/write bytes.
Relates #15915
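As an illustration of where these numbers come from (a sketch, not the actual Elasticsearch implementation), the following reads /proc/diskstats and derives per-device operation counts and bytes, assuming the conventional 512-byte sector unit the kernel uses for these counters:

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

public class DiskStatsSketch {
    public static void main(String[] args) throws IOException {
        List<String> lines = Files.readAllLines(Paths.get("/proc/diskstats"));
        for (String line : lines) {
            // Fields: major minor device reads-completed reads-merged sectors-read
            // time-reading writes-completed writes-merged sectors-written time-writing ...
            String[] fields = line.trim().split("\\s+");
            if (fields.length < 11) {
                continue;
            }
            String device = fields[2];
            long readOps = Long.parseLong(fields[3]);
            long readBytes = Long.parseLong(fields[5]) * 512;   // sectors are 512 bytes
            long writeOps = Long.parseLong(fields[7]);
            long writeBytes = Long.parseLong(fields[9]) * 512;
            System.out.printf("%s: %d reads (%d bytes), %d writes (%d bytes)%n",
                    device, readOps, readBytes, writeOps, writeBytes);
        }
    }
}
```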
This change does the following:
- Queries that are currently unsupported, such as prefix queries on numeric
fields or term queries on geo fields, now throw an error rather than returning
a query that does not match anything.
- Fuzzy queries on numeric, date and ip fields are now unsupported: they used
to create range queries; we now expect users to use range queries directly.
Fuzzy, regexp and prefix queries are now only supported on text/keyword
fields (including `_all`).
- The `_uid` and `_id` fields do not support prefix or range queries anymore, as
this would prevent us from storing them more efficiently in the future, e.g. by
using a binary encoding.
Note that it is still possible to ignore these errors by using the `lenient`
option of the `match` or `query_string` queries.
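For example, a lenient match query built with the Java API swallows such errors instead of failing the request (a sketch; the field name is hypothetical):

```java
import org.elasticsearch.index.query.QueryBuilder;
import org.elasticsearch.index.query.QueryBuilders;

public class LenientMatchSketch {
    // "my_numeric_field" is a placeholder. With lenient(true) a format error on a
    // field (e.g. text queried against a numeric field) is ignored instead of
    // failing the whole query.
    static QueryBuilder lenientMatch() {
        return QueryBuilders.matchQuery("my_numeric_field", "not-a-number")
                .lenient(true);
    }
}
```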
Previously, multiple extensions could be provided; however, this could lead
to confusion with on-disk scripts (i.e., "foo.js" and "foo.javascript")
having different content. Only a single extension is now supported.
The only language that supported multiple extensions was the
JavaScript engine ("js" and "javascript"). It now only supports the
`.js` extension.
Relates to #10598
This removes all the mentions of the sandbox from the script engine
services and permissions model. This means that the following settings
are no longer supported:
```yaml
script.inline: sandbox
script.stored: sandbox
```
Instead, only a `true` or `false` value can be specified.
Since this would otherwise break the default-allow parameter for
languages like expressions, painless, and mustache, all script engines
have been updated to have individual settings, for instance:
```yaml
script.engine.groovy.inline: true
```
would enable all inline scripts for Groovy (they can still be
overridden on a per-operation basis).
Expressions, Painless, and Mustache all default to `true` for inline,
file, and stored scripts to preserve the old scripting behavior.
Resolves #17114
This makes it much easier to apply to other projects.
Fixes to doc tests infrastructure:
* Fix comparing lists. Was totally broken.
* Fix order of actual vs expected parameters.
* Allow multiple `// TESTRESPONSE` lines with substitutions to join
into one big list of substitutions. This lets the docs look
tidier.
* Exclude build from snippet scanning
* Allow subclasses of ESRestTestCase access to the admin execution context
Most of the current implementations of BaseNodesResponse (plural Nodes) ignore FailedNodeExceptions.
- This adds a helper function to do the grouping to TransportNodesAction
- Requires a non-null array of FailedNodeExceptions within the BaseNodesResponse constructor
- Reads/writes the array to output
- Also adds StreamInput and StreamOutput methods for generically reading and writing arrays
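As an illustration of the length-prefixed pattern such generic array read/write methods typically follow (a sketch using plain java.io streams, not the actual StreamInput/StreamOutput API):

```java
import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

public class ArrayStreamSketch {
    // Write the array length first, then each element, so the reader knows
    // exactly how many elements to expect.
    static void writeStringArray(DataOutput out, String[] values) throws IOException {
        out.writeInt(values.length);
        for (String value : values) {
            out.writeUTF(value);
        }
    }

    static String[] readStringArray(DataInput in) throws IOException {
        int length = in.readInt();
        String[] values = new String[length];
        for (int i = 0; i < length; i++) {
            values[i] = in.readUTF();
        }
        return values;
    }
}
```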
The `ip` field uses a binary representation internally. This breaks when
rendering sort values in search responses since Elasticsearch tries to write a
binary byte[] as a UTF-8 JSON string. This commit extends the `DocValueFormat`
API in order to give fields a chance to choose how to render values.
Closes #6077
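For illustration, the core of such a formatter is decoding the stored bytes back into a printable address instead of treating them as UTF-8 text (a sketch using the JDK, not the actual DocValueFormat implementation):

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

public class IpFormatSketch {
    // Turn the binary form of an address back into its usual string notation.
    static String format(byte[] encoded) throws UnknownHostException {
        return InetAddress.getByAddress(encoded).getHostAddress();
    }

    public static void main(String[] args) throws UnknownHostException {
        byte[] ipv4 = new byte[] {(byte) 192, (byte) 168, 0, 1};
        System.out.println(format(ipv4)); // prints 192.168.0.1
    }
}
```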
In preparation for a unified release process, we need to be able to
generate the pom files independently of trying to actually publish. This
change adds back the maven-publish plugin just for that purpose. The
nexus plugin still exists for now, so that we do not break snapshots,
but that can be removed at a later time once snapshots are happening
through the unified tools. Note that I also changed the directory that jars are
written into so that all our artifacts are under build/distributions.
Adds infrastructure so `gradle :docs:check` will extract tests from
snippets in the documentation and execute the tests. This is included
in `gradle check` so it should happen on CI and during a normal build.
By default each `// AUTOSENSE` snippet creates a unique REST test. These
tests are executed in a random order and the cluster is wiped between
each one. If multiple snippets chain together into a single test, you can annotate
all snippets after the first with `// TEST[continued]` so that the
generated tests for those snippets are joined.
Snippets marked as `// TESTRESPONSE` are checked against the response
of the last action.
See docs/README.asciidoc for lots more.
Closes #12583. That issue is about catching bugs in the docs during build.
This catches *some* bugs in the docs during build which is a good start.
This commit introduces a handshake when initiating a light
connection. During this handshake, node information, cluster name, and
version are received from the target node of the connection. This
information can be used to immediately validate that the target node is
a member of the same cluster, and used to set the version on the
stream. This will allow us to extend APIs that are used during initial
cluster recovery without a major version change.
Relates #15971
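A rough sketch of the validation step this enables (illustrative only; names like `HandshakeResponse` here are hypothetical, not the actual transport classes):

```java
public class HandshakeSketch {
    // Hypothetical response carrying what the target node reports during the handshake.
    static class HandshakeResponse {
        final String clusterName;
        final int streamVersion;

        HandshakeResponse(String clusterName, int streamVersion) {
            this.clusterName = clusterName;
            this.streamVersion = streamVersion;
        }
    }

    // Reject the connection immediately if the remote node belongs to a different
    // cluster; otherwise remember the version to use for the stream.
    static int validate(HandshakeResponse response, String localClusterName) {
        if (!localClusterName.equals(response.clusterName)) {
            throw new IllegalStateException("handshake failed: remote cluster ["
                    + response.clusterName + "] does not match [" + localClusterName + "]");
        }
        return response.streamVersion;
    }
}
```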
This commit removes the method Strings#splitStringToArray and replaces
the call sites with invocations to String#split. There are only two
explanations for the existence of this method. The first is that
String#split is slightly tricky in that it accepts a regular expression
rather than a character to split on. This means that if s is a string,
s.split(".") does not split on the character '.', but rather splits on
the regular expression '.' which splits on every character (of course,
this is easily fixed by invoking s.split("\\.") instead). The second
possible explanation is that (again) String#split accepts a regular
expression. This means that there could be a performance concern
compared to just splitting on a single character. However, it turns out
that String#split has a fast path for the case of splitting on a single
character and microbenchmarks show that String#split has 1.5x--2x the
throughput of Strings#splitStringToArray. There is a slight behavior
difference between Strings#splitStringToArray and String#split: namely,
the former would return an empty array when the input string
was null or empty, whereas String#split NPEs at the call site on
null and returns a one-element array containing the empty string when the
input string is empty. There was only one place relying on this behavior
and the call site has been modified accordingly.
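A small demonstration of the regex pitfall and the behavior difference described above:

```java
import java.util.Arrays;

public class SplitDemo {
    public static void main(String[] args) {
        // "." is a regex matching every character, so every token is empty and
        // trailing empty strings are removed: the result is an empty array.
        System.out.println("a.b.c".split(".").length);               // 0
        // Escaping the dot splits on the literal character.
        System.out.println(Arrays.toString("a.b.c".split("\\.")));   // [a, b, c]
        // An empty input yields a one-element array containing the empty string.
        System.out.println("".split("\\.").length);                  // 1
        // A null receiver would simply throw a NullPointerException at the call site.
    }
}
```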
With this commit we compress HTTP responses provided the client
supports it (as indicated by the HTTP header 'Accept-Encoding').
We're also able to process compressed HTTP requests if needed.
The default compression level is lowered from 6 to 3 as benchmarks
have indicated that this reduces query latency with a negligible
increase in network traffic.
Closes #7309
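For example, a client opts in to a compressed response simply by sending the standard header (a sketch using the JDK's HttpURLConnection; the localhost address is just a placeholder for whatever node the client talks to):

```java
import java.net.HttpURLConnection;
import java.net.URL;

public class CompressedRequestSketch {
    public static void main(String[] args) throws Exception {
        HttpURLConnection connection =
                (HttpURLConnection) new URL("http://localhost:9200/").openConnection();
        // Advertise gzip support; the server may then compress the response body.
        connection.setRequestProperty("Accept-Encoding", "gzip");
        System.out.println("Content-Encoding: " + connection.getHeaderField("Content-Encoding"));
    }
}
```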
Previously, we would determine index deletes in the cluster state by
comparing the index metadatas between the current cluster state and the
previous cluster state and decipher which ones were missing (the missing
ones are deleted indices). This led to a situation where a node that
went offline and rejoined the cluster could potentially cause dangling
indices to be imported which should have been deleted, because when a node
rejoins, its previous cluster state does not contain reliable state.
This commit introduces the notion of index tombstones in the cluster
state, where we are explicit about which indices have been deleted.
In the case where the previous cluster state is not useful for index
metadata comparisons, a node now determines which indices are to be
deleted based on these tombstones in the cluster state. There is also
functionality to purge the tombstones after they exceed a certain number.
Closes #17265 Closes #16358 Closes #17435
This commit actually bounds the size of the generic thread pool. The
generic thread pool was of type cached, a thread pool with an unbounded
number of workers and an unbounded work queue. With this commit, the
generic thread pool is now of type scaling. As such, the cached thread
pool type has been removed. By default, the generic thread pool is
constructed with a core pool size of four, a max pool size of 128 and
idle workers can be reaped after a keep-alive time of thirty seconds
expires. The work queue for this thread pool remains unbounded.
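To make the numbers concrete, here is roughly what those parameters look like with a stock JDK executor (a sketch only: with an unbounded work queue a plain ThreadPoolExecutor never grows past its core size, which is why Elasticsearch's scaling pool relies on its own executor helpers for true scaling behavior):

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

public class GenericPoolSketch {
    public static void main(String[] args) {
        // Core of 4, max of 128, 30 second keep-alive, unbounded queue -- the
        // same parameters described above. Note the caveat from the lead-in:
        // this stock executor only queues work beyond 4 threads instead of
        // growing toward 128, which is exactly why a custom queue is needed.
        ThreadPoolExecutor generic = new ThreadPoolExecutor(
                4, 128, 30, TimeUnit.SECONDS, new LinkedBlockingQueue<>());
        generic.shutdown();
    }
}
```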
When I pulled on the thread that is "Remove PROTOTYPEs from
SignificanceHeuristics" I ended up removing SignificanceHeuristicStreams
and replacing it with readNamedWriteable. That seems like a lot at once
but it made sense at the time. And it is what we want in the end, I think.
Anyway, this also converts registration of SignificanceHeuristics to
use ParseFieldRegistry to make them consistent with Queries, Aggregations
and lots of other stuff.
Adds a new and wondrous hack to support serialization checking of
NamedWriteables registered by plugins!
Related to #17085
This commit contains the following improvements/fixes:
1. Renaming method names and variables to better reflect the purpose
of the method and the semantics of the variable.
2. For deleting indexes, replace the closed parameter passed to the
delete index/store methods with obtaining the index's state from the
IndexSettings that is already passed in.
3. Added tests to the IndexWithShadowReplicaIT suite, some of which
show issues in the shadow replica delete process that are captured in
Github issue 17695.
Closes #17638
With this commit we limit the size of all in-flight requests at the
HTTP level. The size is guarded by the same circuit breaker that
is also used at the transport level. Similarly, the size that is used
is the HTTP content length.
Relates #16011
With this commit we limit the size of all in-flight requests at the
transport level. The size is guarded by a circuit breaker and is
based on the content size of each request.
By default we use 100% of the available heap, meaning that the parent
circuit breaker will limit the maximum available size. This value
can be changed by adjusting the setting
`network.breaker.inflight_requests.limit`.
Relates #16011
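For instance, the limit could be lowered when building node settings (a sketch; the 75% value is just an illustration):

```java
import org.elasticsearch.common.settings.Settings;

public class InflightBreakerSketch {
    // Build settings that cap in-flight request size at 75% of the heap instead
    // of the 100% default mentioned above.
    static Settings loweredLimit() {
        return Settings.builder()
                .put("network.breaker.inflight_requests.limit", "75%")
                .build();
    }
}
```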
We have a couple places in the code base that assume that search is always done
on the inverted index. However with the new points API in Lucene 6, this is not
true anymore. This commit makes MappedFieldType.indexedValueForSearch protected
and fixes call sites to keep working for field types that use the inverted
index and either work differently or throw an exception otherwise. For instance,
it will still be possible to run cross_fields multi match queries on numeric
fields, but the score contributions will not be blended as well as before, and
significant terms aggregations on long terms will not be possible anymore since
points do not record document frequencies.
CBOR is natively supported in Elasticsearch and allows for byte arrays.
This means that by using CBOR the user can avoid base64 conversions
for the data being sent back and forth.
This PR adds support for extracting data from a byte array in addition to
a string. This also required adding a ByteArrayValueSource class.
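To illustrate the difference, Jackson (which Elasticsearch uses under the hood for its content types) writes a byte[] as raw binary in CBOR, whereas the JSON mapper has to base64-encode it (a standalone sketch, not Elasticsearch code):

```java
import com.fasterxml.jackson.databind.ObjectMapper;
import com.fasterxml.jackson.dataformat.cbor.CBORFactory;

import java.util.Collections;
import java.util.Map;

public class CborSketch {
    public static void main(String[] args) throws Exception {
        Map<String, byte[]> doc = Collections.singletonMap("data", new byte[] {1, 2, 3, 4});

        // JSON has no binary type, so the bytes end up base64-encoded as a string.
        String json = new ObjectMapper().writeValueAsString(doc);
        System.out.println(json); // {"data":"AQIDBA=="}

        // CBOR has a native byte-string type, so no base64 round trip is needed.
        byte[] cbor = new ObjectMapper(new CBORFactory()).writeValueAsBytes(doc);
        System.out.println(cbor.length + " bytes of CBOR");
    }
}
```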
Sometimes we get a test failure caused by search contexts left open.
The tests include a stack trace of the call that opened the context
but nothing else about the context. This adds more information about
the context that has been left open, such as what query it was running,
what shard it targeted, and whether or not it was a scroll.
Relates to #17582
We have both `Settings.settingsBuilder` and `Settings.builder` that do exactly
the same thing, so we should keep only one. I kept `Settings.builder` since it
is my preference, and it is also the one that we use in examples of the Java API.
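For reference, the surviving builder is used like this (the setting values are arbitrary examples):

```java
import org.elasticsearch.common.settings.Settings;

public class SettingsBuilderSketch {
    public static void main(String[] args) {
        // The single remaining entry point for programmatically building settings.
        Settings settings = Settings.builder()
                .put("cluster.name", "my-cluster")
                .put("node.name", "node-1")
                .build();
        System.out.println(settings.get("cluster.name"));
    }
}
```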
This removes the inconsistent output of IP addresses. The format was parsing-unfriendly and made it hard
to reason about API responses, such as those from _nodes.
With this change in place, the hostname is never printed as part of the default format, which has the
added benefit that the format can be used consistently for URIs, which was not the case when the hostname
could appear at the front as "hostname/ip:port".
Aggregations need to perform instanceof calls on MappedFieldType instances in
order to know how they should be parsed or formatted. Instead, we should let
the field types provide a formatter/parser that can be used.
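A rough sketch of the design change (all names here are hypothetical, just to contrast the two approaches):

```java
public class FormatterSketch {
    // Instead of aggregations doing instanceof checks against concrete field
    // types, each field type hands back an object that knows how to parse and
    // format its own values.
    interface ValueFormatter {
        String format(Object value);

        Object parse(String value);
    }

    abstract static class FieldType {
        // Every field type chooses its own formatter; callers never need to
        // know the concrete field type class.
        abstract ValueFormatter valueFormatter();
    }
}
```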