OpenSearch

Commit Graph

Author	SHA1	Message	Date
Alan Woodward	e2af849f70	Move ObjectPath and XContentUtils to libs/x-content (#34803 ) These are generally useful utility classes that do not need to live in the Watcher code	2018-11-02 15:12:09 +00:00
Nik Everett	3cde1356c1	XContent: Check for bad parsers (#34561 ) Adds checks for misbehaving parsers. The checks aren't perfect at all but they are simple and fast enough that we can do them all the time so they'll catch most badly behaving parsers. Closes #34351	2018-10-25 17:03:42 -04:00
Jay Modi	d824cbe992	Test: ensure char[] doesn't being with prefix (#34816 ) The testCharsBeginsWith test has a check that a random prefix of length 2 is not the prefix of a char[]. However, there is no check that the char[] is not randomly generated with the same two characters as the prefix. This change ensures that the char[] does not begin with the prefix. Closes #34765	2018-10-25 08:58:21 -06:00
Julie Tibshirani	5a4866f67d	Mute CharArraysTests#testCharsBeginsWith while we await a fix.	2018-10-23 11:37:54 -07:00
Alpar Torok	0536635c44	Upgrade forbiddenapis to 2.6 (#33809 ) * Upgrade forbiddenapis to 2.6 Closes #33759 * Switch forbiddenApis back to official plugin * Remove CLI based task * Fix forbiddenApisJava9	2018-10-23 12:06:46 +03:00
Daniel Mitterdorfer	dbb6fe58fa	Remove hand-coded XContent duplicate checks With this commit we cleanup hand-coded duplicate checks in XContent parsing. They were necessary previously but since we reconfigured the underlying parser in #22073 and #22225, these checks are obsolete and were also ineffective unless an undocumented system property has been set. As we also remove this escape hatch, we can remove the additional checks as well. Closes #22253 Relates #34588	2018-10-19 10:13:13 +02:00
Daniel Mitterdorfer	92b2e1a209	Remove lenient boolean handling With this commit we remove some leftovers from #26389 which cleaned up lenient boolean handling. Relates #26389 Relates #22298 Relates #34467	2018-10-16 06:30:00 +02:00
Mayya Sharipova	80c5d30f30	XContentBuilder to handle BigInteger and BigDecimal (#32888 ) Although we allow to index BigInteger and BigDecimal into a keyword field, source filtering on these fields would fail as XContentBuilder was not able to deserialize BigInteger and BigDecimal to json. This modifies XContentBuilder to allow to handle BigInteger and BigDecimal. Closes #32395	2018-09-26 14:24:31 -04:00
Christoph Büscher	ba3ceeaccf	Clean up "unused variable" warnings (#31876 ) This change cleans up "unused variable" warnings. There are several cases were we most likely want to suppress the warnings (especially in the client documentation test where the snippets contain many unused variables). In a lot of cases the unused variables can just be deleted though.	2018-09-26 14:09:32 +02:00
Vladimir Dolzhenko	a3e8b831ee	add elasticsearch-shard tool (#32281 ) Relates #31389	2018-09-19 10:28:22 +02:00
Simon Willnauer	c783488e97	Add `_source`-only snapshot repository (#32844 ) This change adds a `_source` only snapshot repository that allows to wrap any existing repository as a _backend_ to snapshot only the `_source` part including live docs markers. Snapshots taken with the `source` repository won't include any indices, doc-values or points. The snapshot will be reduced in size and functionality such that it requires full re-indexing after it's successfully restored. The restore process will copy the `_source` data locally starts a special shard and engine to allow `match_all` scrolls and searches. Any other query, or get call will fail with and unsupported operation exception. The restored index is also marked as read-only. This feature aims mainly for disaster recovery use-cases where snapshot size is a concern or where time to restore is less of an issue. NOTE: The snapshot produced by this repository is still a valid lucene index. This change doesn't allow for any longer retention policies which is out of scope for this change.	2018-09-12 17:47:10 +02:00
Alpar Torok	44ed5f6306	Enable forbiddenapis server java9 (#33245 )	2018-08-31 09:31:55 +03:00
Alpar Torok	5cf6e0d4bc	Ignore module-info in jar hell checks (#33011 ) * Ignore module-info in JarHell checks * Add unit test * integration test to test that jarhell is ran with precommit	2018-08-30 11:41:39 +03:00
Alpar Torok	82d10b484a	Run forbidden api checks with runtimeJavaVersion (#32947 ) Run forbidden APIs checks with runtime hava version	2018-08-22 09:05:22 +03:00
Adrien Grand	039babddf5	CharArraysTests: Fix test bug.	2018-08-16 11:54:39 +02:00
Jay Modi	1a45b27d8b	Move CharArrays to core lib (#32851 ) This change cleans up some methods in the CharArrays class from x-pack, which includes the unification of char[] to utf8 and utf8 to char[] conversions that intentionally do not use strings. There was previously an implementation in x-pack and in the reloading of secure settings. The method from the reloading of secure settings was adopted as it handled more scenarios related to the backing byte and char buffers that were used to perform the conversions. The cleaned up class is moved into libs/core to allow it to be used by requests that will be migrated to the high level rest client. Relates #32332	2018-08-15 15:26:00 -06:00
Jake Landis	be62092060	Introduce the dissect library (#32297 ) The dissect library will be used for the ingest node as an alternative to Grok to split a string based on a pattern. Dissect differs from Grok such that regular expressions are not used to split the string. Note - Regular expressions are used during construction of the objects, but not in the hot path. A dissect pattern takes the form of: '%{a} %{b},%{c}' which is composed of 3 keys (a,b,c) and two delimiters (space and comma). This dissect pattern will match a string of the form: 'foo bar,baz' and will result a key/value pairing of 'a=foo, b=bar, and c=baz'. See the comments in DissectParser for a full explanation. This commit does not include the ingest node processor that will consume it. However, the consumption should be a trivial mapping between the key/value pairing returned by the parser and the key/value pairing needed for the IngestDocument.	2018-08-14 17:08:55 -07:00
Armin Braun	580d59e2d7	CORE: Upgrade to Jackson 2.8.11 (#32670 ) * closes #30352	2018-08-08 12:04:25 +02:00
Jason Tedor	3fb0923182	Fix content type detection with leading whitespace (#32632 ) Today content type detection on an input stream works by peeking up to twenty bytes into the stream. If the stream is headed by more whitespace than twenty bytes, we might fail to detect the content type. We should be ignoring this whitespace before attempting to detect the content type. This commit does that by ignoring all leading whitespace in an input stream before attempting to guess the content type.	2018-08-06 18:07:46 -04:00
Armin Braun	4dda5a990b	INGEST: Fix ThreadWatchDog Throwing on Shutdown (#32578 ) * INGEST: Fix ThreadWatchDog Throwing on Shutdown * #32539 is caused by the fact that ThreadWatchDog.Default could throw on shutdown if the ThreadPool is interrupted while `interruptLongRunningExecutions` is in progress. This is a result of the watchdog not having a lifecycle of its own (normally it terminates when the threadpool terminates). * We can't easily use `org.elasticsearch.common.util.concurrent.EsRejectedExecutionException#isExecutorShutdown` to catch this state the same way other components do since thatwould require adding the core lib to Grok as a dependency * Since we have no knowledge of the lifecycle in this compontent since we're only passed the scheduler `BiFunction` I fixed this by only scheduling the watchdog when there's actually registered threads in it. * I think using the patter of locking via two `Atomic` values should not be much of a performance concern here under load since either the integer will likely be > 0 in this case (because we have multiple Grok in parallel) or the running state will be true because there likely was at least one thread registered when the watchdog ran and so the enqueing of the watchdog task during `register` will happen very rarely here (in the worst case scenario of only a single Grok thread it will happen less frequently than once every `ingest.grok.watchdog.interval`). The atomic update on the count should not be relevant relative to the cost of adding a new node to the CHM either. Fixes #32539 * Also fixes the watchdog to run if it doens't have to in general.	2018-08-06 22:46:26 +02:00
Christoph Büscher	ff87b7aba4	Remove unnecessary warning supressions (#32250 )	2018-07-23 11:31:04 +02:00
Alpar Torok	38e2e1d553	Detect and prevent configuration that triggers a Gradle bug (#31912 ) * Detect and prevent configuration that triggers a Gradle bug As we found in #31862, this can lead to a lot of wasted time as it's not immediatly obvius what's going on. Givent how many projects we have it's getting increasingly easier to run into gradle/gradle#847.	2018-07-19 06:46:58 +00:00
Tim Brooks	c375d5ab23	Add nio transport to security plugin (#31942 ) This is related to #27260. It adds the SecurityNioTransport to the security plugin. Additionally, it adds support for ip filtering. And it randomly uses the nio transport in security integration tests.	2018-07-12 11:55:38 -06:00
Christoph Büscher	4ae4ac08d5	Add Expected Reciprocal Rank metric (#31891 ) This change adds Expected Reciprocal Rank (ERR) as a ranking evaluation metric as descriped in: Chapelle, O., Metlzer, D., Zhang, Y., & Grinspan, P. (2009). Expected reciprocal rank for graded relevance. Proceeding of the 18th ACM Conference on Information and Knowledge Management. https://doi.org/10.1145/1645953.1646033 ERR is an extension of the classical reciprocal rank to the graded relevance case and assumes a cascade browsing model. It quantifies the usefulness of a document at rank `i` conditioned on the degree of relevance of the items at ranks less than `i`. ERR seems to be gain traction as an alternative to (n)DCG, so it seems like a good metric to support. Also ERR seems to be the default optimization metric used for training in RankLib, a widely used learning to rank library. Relates to #29653	2018-07-12 15:50:58 +02:00
Nik Everett	fb27f3e7f0	HLREST: Add x-pack-info API (#31870 ) This is the first x-pack API we're adding to the high level REST client so there is a lot to talk about here! = Open source The client for these APIs is open source. We're taking the previously Elastic licensed files used for the `Request` and `Response` objects and relicensing them under the Apache 2 license. The implementation of these features is staying under the Elastic license. This lines up with how the rest of the Elasticsearch language clients work. = Location of the new files We're moving all of the `Request` and `Response` objects that we're relicensing to the `x-pack/protocol` directory. We're adding a copy of the Apache 2 license to the root fo the `x-pack/protocol` directory to line up with the language in the root `LICENSE.txt` file. All files in this directory will have the Apache 2 license header as well. We don't want there to be any confusion. Even though the files are under the `x-pack` directory, they are Apache 2 licensed. We chose this particular directory layout because it keeps the X-Pack stuff together and easier to think about. = Location of the API in the REST client We've been following the layout of the rest-api-spec files for other APIs and we plan to do this for the X-Pack APIs with one exception: we're dropping the `xpack` from the name of most of the APIs. So `xpack.graph.explore` will become `graph().explore()` and `xpack.license.get` will become `license().get()`. `xpack.info` and `xpack.usage` are special here though because they don't belong to any proper category. For now I'm just calling `xpack.info` `xPackInfo()` and intend to call usage `xPackUsage` though I'm not convinced that this is the final name for them. But it does get us started. = Jars, jars everywhere! This change makes the `xpack:protocol` project a `compile` scoped dependency of the `x-pack:plugin:core` and `client:rest-high-level` projects. I intend to keep it a compile scoped dependency of `x-pack:plugin:core` but I intend to bundle the contents of the protocol jar into the `client:rest-high-level` jar in a follow up. This change has grown large enough at this point. In that followup I'll address javadoc issues as well. = Breaking-Java This breaks that transport client by a few classes around. We've traditionally been ok with doing this to the transport client.	2018-07-08 11:03:56 -04:00
Armin Braun	b7b413e55e	Extend allowed characters for grok field names (#21745 ) (#31653 )	2018-06-29 09:12:47 +02:00
Tim Brooks	86423f9563	Ensure local addresses aren't null (#31440 ) Currently we set local addresses on the creation time of a NioChannel. However, this may return null as the local address may not have been set yet. An example is the local address has not been set on a client channel as the connection process is not yet complete. This PR modifies the getter to set the local field if it is currently null.	2018-06-20 19:50:14 -06:00
Tim Brooks	ffba20b748	Do not preallocate bytes for channel buffer (#31400 ) Currently, when we open a new channel, we pass it an InboundChannelBuffer. The channel buffer is preallocated a single 16kb page. However, there is no guarantee that this channel will be read from anytime soon. Instead, this commit does not preallocate that page. That page will be allocated when we receive a read event.	2018-06-19 09:36:12 -06:00
Tim Brooks	a705e1a9e3	Add byte array pooling to nio http transport (#31349 ) This is related to #28898. This PR implements pooling of bytes arrays when reading from the wire in the http server transport. In order to do this, we must integrate with netty reference counting. That manner in which this PR implements this is making Pages in InboundChannelBuffer reference counted. When we accessing the underlying page to pass to netty, we retain the page. When netty releases its bytebuf, it releases the underlying pages we have passed to it.	2018-06-15 14:01:03 -06:00
Tim Brooks	700357d04e	Immediately flush channel after writing to buffer (#31301 ) This is related to #27260. Currently when we queue a write with a channel we set OP_WRITE and wait until the next selection loop to flush the write. However, if the channel does not have a pending write, it is probably ready to flush. This PR implements an optimistic flush logic that will attempt this flush.	2018-06-13 15:32:13 -06:00
Martijn van Groningen	6030d4be1e	[INGEST] Interrupt the current thread if evaluation grok expressions take too long (#31024 ) This adds a thread interrupter that allows us to encapsulate calls to org.joni.Matcher#search() This method can hang forever if the regex expression is too complex. The thread interrupter in the background checks every 3 seconds whether there are threads execution the org.joni.Matcher#search() method for longer than 5 seconds and if so interrupts these threads. Joni has checks that that for every 30k iterations it checks if the current thread is interrupted and if so returns org.joni.Matcher#INTERRUPTED Closes #28731	2018-06-12 07:49:03 +02:00
Tanguy Leroux	bf58660482	Remove all unused imports and fix CRLF (#31207 ) The X-Pack opening and the recent other refactorings left a lot of unused imports in the codebase. This commit removes them all.	2018-06-11 15:12:12 +02:00
Jason Tedor	5296c11e4f	Rename elasticsearch-nio to nio (#31186 ) This commit renames :libs:elasticsearch-nio to :libs:nio.	2018-06-07 17:00:00 -04:00
Jason Tedor	94be9b471f	Rename elasticsearch-core to core (#31185 ) This commit renames :libs:elasticsearch-core to :libs:core.	2018-06-07 16:50:21 -04:00
Jason Tedor	b32cbc1baa	Move cli sub-project out of server to libs (#31184 ) This commit moves the cli sub-project out of server to libs where it makes more sense.	2018-06-07 16:35:34 -04:00
Tim Brooks	4158387554	Cleanup nio http thread names (#31148 ) This is related to #28898. This commit adds the acceptor thread name to the method checking if this thread is a transport thread. Additionally, it modifies the nio http transport to use the same worker name as the netty4 http server transport.	2018-06-06 15:36:13 -06:00
Tim Brooks	67e73b4df4	Combine accepting selector and socket selector (#31115 ) This is related to #27260. This commit combines the AcceptingSelector and SocketSelector classes into a single NioSelector. This change allows the same selector to handle both server and socket channels. This is valuable as we do not necessarily want a dedicated thread running for accepting channels. With this change, this commit removes the configuration for dedicated accepting selectors for the normal transport class. The accepting workload for new node connections is likely low, meaning that there is no need to dedicate a thread to this process.	2018-06-06 11:59:54 -06:00
Lee Hinman	b22a055bcf	Add get mappings support to high-level rest client (#30889 ) This adds support for the get mappings API to the high level rest client. Relates to #27205	2018-06-04 14:31:08 -06:00
Christoph Büscher	3f87c79500	Change ObjectParser exception (#31030 ) ObjectParser should throw XContentParseExceptions, not IAE. A dedicated parsing exception can includes the place where the error occurred. Closes #30605	2018-06-04 20:20:37 +02:00
Tim Brooks	e8b70273c1	Remove Throwable usage from transport modules (#30845 ) Currently nio and netty modules use the CompletableFuture class for managing listeners. This is unfortunate as that class accepts Throwable. This commit adds a class CompletableContext that wraps the CompletableFuture but does not accept Throwable. This allows the modification of netty and nio logic to no longer handle Throwable.	2018-05-24 17:33:29 -06:00
Tim Brooks	abf8c56a37	Remove logging from elasticsearch-nio jar (#30761 ) This is related to #27260. The elasticsearch-nio jar is supposed to be a library opposed to a framework. Currently it internally logs certain exceptions. This commit modifies it to not rely on logging. Instead exception handlers are passed by the applications that use the jar.	2018-05-21 20:18:12 -06:00
Tim Brooks	99b9ab58e2	Add nio http server transport (#29587 ) This commit is related to #28898. It adds an nio driven http server transport. Currently it only supports basic http features. Cors, pipeling, and read timeouts will need to be added in future PRs.	2018-05-15 16:37:14 -06:00
Nik Everett	50945051b6	HTML5ify Javadoc for core and test framework (#30234 ) `javadoc` will switch from detaulting to html4 to html5 in "a future release". We should get ahead of it so we're not surprised. Also, HTML5 is the future! Er, the present. Anyway, this follows up from #30220 to make the Javadoc for two of the four remaining projects HTML5 compatible.	2018-04-30 09:39:50 -04:00
Adrien Grand	a8c2cc6ce7	Fix dependency checks on libs when generating Eclipse configuration. (#29550 ) Currently this fails because the Eclipse configuration splits the main and test folders into separate projects to avoid circular dependencies. Relates #29336	2018-04-17 17:11:12 +02:00
Nik Everett	69aabb7e40	Build: Fail if any libs depend on non-core libs (#29336 ) Fails the build if any subprojects of `:libs` have dependencies in `:libs` except for `:libs:elasticsearch-core`. Since we now have three places where we resolve project substitutions I've added `dependencyToProject` to `project.ext` in all projects. It resolves both `project` style dependencies and "external" style (like "org.elasticsearch:elasticsearch-core:${version}") dependencies to `Project`s using the `projectSubstitutions`. I use this new function all three places where resovle project substitutions. Finally this pulls `apply plugin: 'elasticsearch.build'` out of `libs/*/build.gradle` and into a subprojects clause in `libs/build.gradle`. I do this entirely so that I can call `tasks.precommit.dependsOn checkDependencies` without waiting for the subprojects to be evaluated or worrying about whether or not they have `precommit` set up in a normal way.	2018-04-16 11:49:27 -04:00
Lee Hinman	14097359a4	Move TimeValue into elasticsearch-core project (#29486 ) This commit moves the `TimeValue` class into the elasticsearch-core project. This allows us to use this class in many of our other projects without relying on the entire `server` jar. Relates to #28504	2018-04-12 10:24:58 -06:00
Lee Hinman	a07ba9e400	Move Streams.copy into elasticsearch-core and make a multi-release jar (#29322 ) * Move Streams.copy into elasticsearch-core and make a multi-release jar This moves the method `Streams.copy(InputStream in, OutputStream out)` into the `elasticsearch-core` project (inside the `o.e.core.internal.io` package). It also makes this class into a multi-release class where the Java 9 equivalent uses `InputStream#transferTo`. This is a followup from https://github.com/elastic/elasticsearch/pull/29300#discussion_r178147495	2018-04-06 11:07:20 -06:00
Lee Hinman	a93c942927	Move ObjectParser into the x-content lib (#29373 ) * Move ObjectParser into the x-content lib This moves `ObjectParser`, `AbstractObjectParser`, and `ConstructingObjectParser` into the libs/x-content dependency. This decoupling allows them to be used for parsing for projects that don't want to depend on the entire Elasticsearch jar. Relates to #28504	2018-04-06 09:41:14 -06:00
Lee Hinman	160d25fcdb	Move Tuple into elasticsearch-core (#29375 ) * Move Tuple into elasticsearch-core This allows us to use Tuple from other projects that don't want to rely on the entire Elasticsearch jar. I have also added very simple tests, since there were none. Relates tangentially to #28504	2018-04-06 08:58:24 -06:00
Martijn van Groningen	9da95efa41	ingest: Don't allow circular referencing of named patterns in the grok processor. Otherwise the grok code throws a stackoverflow error. Closes #29257	2018-04-05 09:35:50 +02:00

1 2

77 Commits