OpenSearch

Commit Graph

Author	SHA1	Message	Date
Mark Vieira	ccf656a9d0	Repository plugin test cacheability fixes (#46572 )	2019-09-11 08:24:55 -07:00
Jim Ferenczi	23bf310c84	Replace the SearchContext with QueryShardContext when building aggregator factories (#46527 ) This commit replaces the `SearchContext` with the `QueryShardContext` when building aggregator factories. Aggregator factories are part of the `SearchContext` so they shouldn't require a `SearchContext` to create them. The main changes here are the signatures of `AggregationBuilder#build` that now takes a `QueryShardContext` and `AggregatorFactory#createInternal` that passes the `SearchContext` to build the `Aggregator`. Relates #46523	2019-09-11 16:43:30 +02:00
Christoph Büscher	aa0c586b73	Deprecate `_field_names` disabling (#42854 ) Currently we allow `_field_names` fields to be disabled explicitely, but since the overhead is negligible now we decided to keep it turned on by default and deprecate the `enable` option on the field type. This change adds a deprecation warning whenever this setting is used, going forward we want to ignore and finally remove it. Closes #27239	2019-09-11 14:58:08 +02:00
Jim Ferenczi	425b1a77e8	Add more context to QueryShardContext (#46584 ) This change adds an IndexSearcher and the node's BigArrays in the QueryShardContext. It's a spin off of #46527 as this change is required to allow aggregation builder to solely use the query shard context. Relates #46523	2019-09-11 12:24:51 +02:00
Mayya Sharipova	2c5f9b558b	Fix highlighting for script_score query (#46507 )	2019-09-10 08:26:47 -04:00
Alexander Reelsen	0915bd7c6a	Update mustache dependency to 0.9.6 (#46243 )	2019-09-09 13:42:03 +02:00
Ryan Ernst	a078bb4b92	Add test tasks for unpooled and direct buffer pooling to netty (#46049 ) Some netty behavior is controlled by system properties. While we want to test with the defaults for Elasticsearch for most tests, within netty we want to ensure these netty settings exhibit correct behavior. This commit adds variants of test and integTest tasks for netty which set the unpooled and direct buffer pooled allocators. relates #45881	2019-08-30 11:37:45 -07:00
Igor Motov	28006fe19f	Fix GeoIpProcessorFactoryTests on windows (#45668 ) Switches windows build to use geoip database loaded on heap instead of memory mapping it. Closes #44552	2019-08-28 18:02:25 -04:00
Mark Tozzi	aec125faff	Support Range Fields in Histogram and Date Histogram (#46012 ) Backport of 1a0dddf4ad24b3f2c751a1fe0e024fdbf8754f94 (AKA #445395) * Add support for a Range field ValuesSource, including decode logic for range doc values and exposing RangeType as a first class enum * Provide hooks in ValuesSourceConfig for aggregations to control ValuesSource class selection on missing & script values * Branch aggregator creation in Histogram and DateHistogram based on ValuesSource class, to enable specialization based on type. This is similar to how Terms aggregator works. * Prioritize field type when available for selecting the ValuesSource class type to use for an aggregation	2019-08-28 09:06:09 -04:00
Tim Brooks	956df7be92	Reindex task state initialized before reindex (#46043 ) Currently the process to execute a reindex process is tightly coupled to step of initializing the task state. This creates problems when this process is asynchronous. It is possible that the task state has not been initialized which prevents follow-up actions such as rethrottle. This commit separates the task initialization so that it can be executed as a first step in the persistent reindex process.	2019-08-27 15:28:04 -05:00
Tim Brooks	07f3ddb549	Extract reindexing logic from transport action (#46033 ) This commit extracts the reindexing logic from the transport action so that it can be incorporated into the persistent reindex work without requiring the usage of the client.	2019-08-27 12:28:37 -05:00
Tim Brooks	ad233e3e38	Add test for CopyBytesSocketChannel (#46031 ) Currently we use a custom CopyBytesSocketChannel for interfacing with netty. We have integration tests that use this channel, however we never verify the read and write behavior in the face of potential partial writes. This commit adds a test for this behavior.	2019-08-27 11:25:22 -05:00
Jason Tedor	3d64605075	Remove node settings from blob store repositories (#45991 ) This commit starts from the simple premise that the use of node settings in blob store repositories is a mistake. Here we see that the node settings are used to get default settings for store and restore throttle rates. Yet, since there are not any node settings registered to this effect, there can never be a default setting to fall back to there, and so we always end up falling back to the default rate. Since this was the only use of node settings in blob store repository, we move them. From this, several places fall out where we were chaining settings through only to get them to the blob store repository, so we clean these up as well. That leaves us with the changeset in this commit.	2019-08-26 16:26:13 -04:00
Jack Conradson	45ad01ab1c	Fix bugs in Painless SCatch node (#45880 ) This fixes two bugs: - A recently introduced bug where an NPE will be thrown if a catch block is empty. - A long-time bug where an NPE will be thrown if multiple catch blocks in a row are empty for the same try block.	2019-08-23 08:08:02 -07:00
Jason Tedor	de6b6fd338	Add node.processors setting in favor of processors (#45885 ) This commit namespaces the existing processors setting under the "node" namespace. In doing so, we deprecate the existing processors setting in favor of node.processors.	2019-08-22 22:18:37 -04:00
Henning Andersen	4afa413a01	Fix update-by-query script examples (#43907 ) Two examples had swapped the order of lang and code when creating a script. Relates #43884	2019-08-22 22:03:54 +02:00
Jack Conradson	a1b88ca009	Move regex error to node (#45813 )	2019-08-22 07:12:54 -07:00
Armin Braun	6aaee8aa0a	Repository Cleanup Endpoint (#43900 ) (#45780 ) * Repository Cleanup Endpoint (#43900) * Snapshot cleanup functionality via transport/REST endpoint. * Added all the infrastructure for this with the HLRC and node client * Made use of it in tests and resolved relevant TODO * Added new `Custom` CS element that tracks the cleanup logic. Kept it similar to the delete and in progress classes and gave it some (for now) redundant way of handling multiple cleanups but only allow one * Use the exact same mechanism used by deletes to have the combination of CS entry and increment in repository state ID provide some concurrency safety (the initial approach of just an entry in the CS was not enough, we must increment the repository state ID to be safe against concurrent modifications, otherwise we run the risk of "cleaning up" blobs that just got created without noticing) * Isolated the logic to the transport action class as much as I could. It's not ideal, but we don't need to keep any state and do the same for other repository operations (like getting the detailed snapshot shard status)	2019-08-21 17:59:49 +02:00
Andrey Ershov	dbc90653dc	transport.publish_address should contain CNAME (#45626 ) This commit adds CNAME reporting for transport.publish_address same way it's done for http.publish_address. Relates #32806 Relates #39970 (cherry picked from commit e0a2558a4c3a6b6fbfc6cd17ed34a6f6ef7b15a9)	2019-08-16 17:42:00 +02:00
Luca Cavanna	c31cddf27e	Update the schema for the REST API specification (#42346 ) * Update the REST API specification This patch updates the REST API spefication in JSON files to better encode deprecated entities, to improve specification of URL paths, and to open up the schema for future extensions. Notably, it changes the `paths` from a list of strings to a list of objects, where each particular object encodes all the information for this particular path: the `parts` and the `methods`. Among the benefits of this approach is eg. encoding the difference between using the `PUT` and `POST` methods in the Index API, to either use a specific document ID, or let Elasticsearch generate one. Also `documentation` becomes an object that supports an `url` and also a `description` which is a new field. * Adapt YAML runner to new REST API specification format The logic for choosing the path to use when running tests has been simplified, as a consequence of the path parts being listed under each path in the spec. The special case for create and index has been removed. Also the parsing code has been hardened so that errors are thrown earlier when the structure of the spec differs from what expected, and their error messages should be more helpful.	2019-08-16 14:40:00 +02:00
Armin Braun	de58353722	Lower Painless Static Memory Footprint (#45487 ) (#45619 ) * Painless generates a ton of duplicate strings and empty `Hashmap` instances wrapped as unmodifiable * This change brings down the static footprint of Painless on an idle node by 20MB (after running the PMC benchmark against said node) * Since we were looking into ways of optimizing for smaller node sizes I think this is a worthwhile optimization	2019-08-15 19:41:45 +02:00
Jim Ferenczi	79a1390935	Add mapper-extras and the RankFeatureQuery in the hlrc (#43713 ) This change adds the support for the RankFeatureQuery in the HLRC by providing an extra dependency on mapper-extras-client. It also removes the dependency on lang-painless in mapper-extras which is not needed anymore since the move of the vector field into a dedicated module. Closes #43634	2019-08-14 18:41:39 +02:00
Jack Conradson	7f550f2b29	Complete decoupling ANTLR AST from Painless AST (#45366 ) This change removes the Reserved class used to track variables usages within the ANTLR grammar. That task is now performed by an existing pass "extractVariables" in the Painless AST. The Painless AST no longer has any dependencies on the ANTLR AST for state outside of the tree being built. This will simplify future refactoring and opens the possibility of alternate grammars.	2019-08-13 08:02:10 -07:00
Tim Brooks	ae06a9399a	Fix bug in copying bytes for socket write (#45463 ) Currently we take the array of nio buffers from the netty channel outbound buffer and copy their bytes to a direct buffer. In the process we mutate the nio buffer positions. It seems like netty will continue to reuse these buffers. This means than any data that is not flushed in a call is lost. This commit fixes this by incrementing the positions after the flush has completed. This is similar to the behavior that SocketChannel would have provided and netty relied upon. Fixes #45444.	2019-08-12 15:59:26 -06:00
Armin Braun	a9e1402189	Remove Settings from BaseRestRequest Constructor (#45418 ) (#45429 ) * Resolving the todo, cleaning up the unused `settings` parameter * Cleaning up some other minor dead code in affected classes	2019-08-12 05:14:45 +02:00
Armin Braun	a501d68f23	Upgrade to Netty 4.1.38 (#45132 ) (#45364 ) * A number of fixes to buffer handling in the .37 and .38 -> we should stay up to date	2019-08-09 03:38:14 +02:00
Tim Brooks	af908efa41	Disable netty direct buffer pooling by default (#44837 ) Elasticsearch does not grant Netty reflection access to get Unsafe. The only mechanism that currently exists to free direct buffers in a timely manner is to use Unsafe. This leads to the occasional scenario, under heavy network load, that direct byte buffers can slowly build up without being freed. This commit disables Netty direct buffer pooling and moves to a strategy of using a single thread-local direct buffer for interfacing with sockets. This will reduce the memory usage from networking. Elasticsearch currently derives very little value from direct buffer usage (TLS, compression, Lucene, Elasticsearch handling, etc all use heap bytes). So this seems like the correct trade-off until that changes.	2019-08-08 15:10:31 -06:00
Henning Andersen	d139896b66	Reindex share retry between hit sources (#44203 ) (#45348 ) The client and remote hit sources had each their own retry mechanism, which would do the same. Supporting resiliency we would have to expand on the retry mechanisms and as a preparation for that, the retry mechanism is now shared such that each sub class is only responsible for sending requests and converting responses/failures to common format. Part of #42612	2019-08-08 22:01:29 +02:00
Jack Conradson	b716b840d3	Remove loop counter from Reserved in Painless AST. (#45298 ) This change adds a compiler pass to give each node the chance to store settings necessary for analysis and writing. This removes the need to pass this in a somewhat convoluted way through an additional class called Reserved, and also removes the need to have the Walker set values for settings on reserved. This is next step in decoupling the Painless grammar from the Painless AST.	2019-08-08 09:34:51 -07:00
Michael Basnight	89861d0884	Add ingest processor existence helper method (#45156 ) This commit adds a helper method to the ingest service allowing it to inspect a pipeline by id and verify the existence of a processor in the pipeline. This work exposed a potential bug in that some processors contain inner processors that are passed in at instantiation. These processors needed a common way to expose their inner processors, so the WrappingProcessor was created in order to expose the inner processor.	2019-08-07 11:19:04 -05:00
Jason Tedor	bd59ee6c72	Fix clock used in update requests (#45262 ) We accidentally switched to using the relative time provider here. This commit fixes this by switching to the appropriate absolute clock.	2019-08-06 21:15:21 -04:00
Jack Conradson	fc8a6fc9d0	Decouple Painless AST Lambda generation from the grammar (#45111 ) This is the first step in decoupling the Painless AST from the grammar. The Painless AST should be able to generate classes independently of how the AST is generated from a grammar. (If I were to build a Painless AST by hand in code this should be all that's necessary.) This change removes Lambda name generation from the ANTLR grammar tree walker. It also removes unnecessary node generation of new array function references from the tree walker as well.	2019-08-06 10:08:19 -07:00
Yannick Welsch	7aeb2fe73c	Add per-socket keepalive options (#44055 ) Uses JDK 11's per-socket configuration of TCP keepalive (supported on Linux and Mac), see https://bugs.openjdk.java.net/browse/JDK-8194298, and exposes these as transport settings. By default, these options are disabled for now (i.e. fall-back to OS behavior), but we would like to explore whether we can enable them by default, in particular to force keepalive configurations that are better tuned for running ES.	2019-08-06 10:45:44 +02:00
Zachary Tong	3df1c76f9b	Allow pipeline aggs to select specific buckets from multi-bucket aggs (#44179 ) This adjusts the `buckets_path` parser so that pipeline aggs can select specific buckets (via their bucket keys) instead of fetching the entire set of buckets. This is useful for bucket_script in particular, which might want specific buckets for calculations. It's possible to workaround this with `filter` aggs, but the workaround is hacky and probably less performant. - Adjusts documentation - Adds a barebones AggregatorTestCase for bucket_script - Tweaks AggTestCase to use getMockScriptService() for reductions and pipelines. Previously pipelines could just pass in a script service for testing, but this didnt work for regular aggs. The new getMockScriptService() method fixes that issue, but needs to be used for pipelines too. This had a knock-on effect of touching MovFn, AvgBucket and ScriptedMetric	2019-08-05 12:18:40 -04:00
Tim Brooks	984ba82251	Move nio channel initialization to event loop (#45155 ) Currently in the transport-nio work we connect and bind channels on the a thread before the channel is registered with a selector. Additionally, it is at this point that we set all the socket options. This commit moves these operations onto the event-loop after the channel has been registered with a selector. It attempts to set the socket options for a non-server channel at registration time. If that fails, it will attempt to set the options after the channel is connected. This should fix #41071.	2019-08-02 17:31:31 -04:00
Jack Conradson	54552edaf6	Whitelist randomUUID in Painless (#45148 ) This whitelists randomUUID with the understanding that it's possible for /dev/random to cause blocking on *nix systems. Users that need randomUUID should switch their random generator source to /dev/urandom if this is a concern for them.	2019-08-02 11:53:56 -07:00
Christoph Büscher	3366726ad1	Enable reloading of synonym_graph filters (#45135 ) Reloading of synonym_graph filter doesn't work currently because the search time AnalysisMode doesn't get propagated to the TokenFilterFactory emitted by the graph filters getChainAwareTokenFilterFactory() method. This change fixes that. Closes #45127	2019-08-02 15:33:42 +02:00
Armin Braun	9450505d5b	Stop Passing Around REST Request in Multiple Spots (#44949 ) (#45109 ) * Stop Passing Around REST Request in Multiple Spots * Motivated by #44564 * We are currently passing the REST request object around to a large number of places. This works fine since we simply copy the full request content before we handle the rest itself which is needlessly hard on GC and heap. * This PR removes a number of spots where the request is passed around needlessly. There are many more spots to optimize in follow-ups to this, but this one would already enable bypassing the request copying for some error paths in a follow up.	2019-08-02 07:31:38 +02:00
Tim Brooks	aff66e3ac5	Add Cors integration tests (#44361 ) This commit adds integration tests to ensure that the basic cors functionality works for the netty and nio transports.	2019-07-31 14:24:23 -06:00
Jack Conradson	5202d2624e	Add several context examples for Painless date documentation (#44985 )	2019-07-31 08:23:17 -07:00
Armin Braun	ac11073183	Optimize Netty Frame Decoding (#44664 ) (#45001 ) * We should not create a new wrapper object if there's no bytes in the `ByteBuf` * We should not create a new wrapped `ByteBuf` if it can't contain a message anyway because it doesn't even have enough bytes for a header left	2019-07-30 15:25:52 +02:00
Armin Braun	4495140d1f	Release Pooled Buffers Earlier for HTTP Requests (#44952 ) (#44991 ) * We should release the buffers right after copying and not only do so after we did all the request handling on the copy * Relates #44564	2019-07-30 10:30:01 +02:00
Jack Conradson	1a21682ed0	Fix JodaCompatibleZonedDateTime casts in Painless (#44874 ) This is a temporary fix during the Joda to Java datetime transition. This will implicitly cast a JodaCompatibleZonedDateTime to a ZonedDateTime for both def and static types. This is necessary to insulate users from needing to know about JodaCompatibleZonedDateTime explicitly.	2019-07-29 12:05:26 -07:00
Ignacio Vera	821f6f893b	Upgrade to Lucene 8.2.0 release (#44859 ) (#44892 )	2019-07-26 08:14:59 +02:00
Nhat Nguyen	d128188c28	Return seq_no and primary_term in noop update (#44603 ) With this change, we will return primary_term and seq_no of the current document if an update is detected as a noop. We already return the version; hence we should also return seq_no and primary_term. Relates #42497	2019-07-25 19:16:56 -04:00
Ryan Ernst	03dd22b56c	Add missing ZonedDateTime methods for joda compat layer (#44829 ) While joda no longer exists in the apis for 7.x, the compatibility layer still exists with helper methods mimicking the behavior of joda for ZonedDateTime objects returned for date fields in scripts. This layer was originally intended to be removed in 7.0, but is now likely to exist for the lifetime of 7.x. This commit adds missing methods from ChronoZonedDateTime to the compat class. These methods were not part of joda, but are needed to act like a real ZonedDateTime. relates #44411	2019-07-25 11:45:57 -07:00
Jason Tedor	c329b454d9	Mark fields in SystemdPluginTests as final These fields can be final, since they are set at construction, and changing them after that could lead to some confusing test cases. This commit allows the compiler to enforce that we never modify these values during tests.	2019-07-24 17:16:50 +09:00
Jason Tedor	58a4bad12f	Align assertion and enable check in systemd plugin This commit more closely aligns the assertion that we are running in a package distribution with disabling the systemd integration if somehow we running on not a package distribution. This is, previously we had an assertion that we are in a package distribution (RPM or Debian package) but would disable the systemd integration if we are not on Linux. Instead, we should disable the systemd integration if we are not running in a package distribution. Because of our assertion, we expect this to never hold, but we need a fallback for when this assertion is violated and assertions are not enabled.	2019-07-24 16:34:42 +09:00
Jason Tedor	1e9c505e95	Avoid dumping the heap in Painless tests (#44782 ) Well, we have a test here that intentionally causes an OutOfMemoryError, to ensure that Painless handles it (I still strongly disagree with doing this). This causes two things to happen: an OutOfMemoryError to be dumped to the console, and the heap to be dumped to disk. This makes it look like we had an OutOfMemoryError while running tests, and the tests did not fail properly. This commit changes the tests configuration so that we suppress the heap dump, which also causes the OutOfMemoryError to no longer be dumped to the console.	2019-07-24 16:04:19 +09:00
Jason Tedor	659ebf6cfb	Notify systemd when Elasticsearch is ready (#44673 ) Today our systemd service defaults to a service type of simple. This means that systemd assumes Elasticsearch is ready as soon as the ExecStart (bin/elasticsearch) process is forked off. This means that the service appears ready long before it actually is, so before it is ready to receive requests. It also means that services that want to depend on Elasticsearch being ready to start can not as there is not a reliable mechanism to determine this. This commit changes the service type to notify. This requires that Elasticsearch sends a notification message via libsystemd sd_notify method. This commit does that by using JNA to invoke this native method. Additionally, we use this integration to also notify systemd when we are stopping.	2019-07-24 14:04:36 +09:00

1 2 3 4 5 ...

5277 Commits