OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	22f7b03430	Fix test reproducability in AbstractBuilderTestCase setup (#32403 ) Currently AbstractBuilderTestCase generates certain random values in its `beforeTest()` method annotated with @Before only the first time that a test method in the suite is run while initializing the serviceHolder that we use for the rest of the test. This changes the values of subsequent random values and has the effect that when running single methods from a test suite with "-Dtests.method=*", the random values it sees are different from when the same test method is run as part of the whole test suite. This makes it hard to use the reproduction lines logged on failure. This change runs the inialization of the serviceHolder and the randomization connected to it using the test runners master seed, so reproduction by running just one method is possible again. Closes #32400	2018-08-10 15:13:44 +02:00
Jack Conradson	293c8a2b24	Painless: Add an Ingest Script Processor Example (#32302 ) This commit adds two pieces. The first is a small set of documentation providing instructions on how to get setup to run context examples. This will require a download similar to how Kibana works for some of the examples. The second is an ingest processor example using the downloaded data. More examples will follow as ideally one per PR. This also adds a set of tests to individually test each script as a unit test.	2018-08-09 14:24:55 -07:00
Nicholas Knize	e162127ff3	Upgrade to Lucene-7.5.0-snapshot-13b9e28f9d The main feature is the inclusion of bkd backed geo_shape with INTERSECT, DISJOINT, WITHIN bounding box and polygon query support.	2018-08-09 11:15:02 -05:00
Armin Braun	79375d35bb	Scripting: Replace Update Context (#32096 ) * SCRIPTING: Move Update Scripts to their own context * Added system property for backwards compatibility of change to `ctx.params`	2018-08-09 14:32:36 +02:00
Jack Conradson	9b00f095b9	Painless: Move More Logic to PainlessLookup (#32689 ) This moves some run-time lookups for methods and fields to the PainlessLookup.	2018-08-08 16:25:14 -07:00
Armin Braun	7d641ba69b	TESTS: Explicitly Fail Http Client Timeouts (#32708 ) * Don't quietly ignore timeouts when waiting for HTTP responses * Fixes #32702	2018-08-08 15:47:51 +02:00
Luca Cavanna	5c2ef5e869	Preserve index_uuid when creating QueryShardException (#32677 ) As part of #32608 we made sure that the fully qualified index name is taken from the query shard context whenever creating a new `QueryShardException`. That change introduced a regression as instead of setting the entire `Index` object to the exception, which holds index name and index uuid, we ended up setting only the index name (including cluster alias). With this commit we make sure that the index uuid does not get lost and we try to lower the chances that a similar bug makes it in another time. That's done by making `QueryShardContext` return the fully qualified `Index` (which also holds the uuid) rather than only the fully qualified index name.	2018-08-08 09:57:11 +02:00
Jack Conradson	0b7fb4e7b9	Painless: Clean up FunctionRef (#32644 ) This change consolidates all the logic for generating a FunctionReference (renamed from FunctionRef) from several arbitrary constructors to a single static function that is used at both compile-time and run-time. This increases long-term maintainability as it is much easier to follow when and how a function reference is being generated. It moves most of the duplicated logic out of the ECapturingFuncRef, EFuncRef and ELambda nodes and Def as well.	2018-08-07 12:26:57 -07:00
Armin Braun	6fa7016bbf	SCRIPTING: Move Aggregation Scripts to their own context (#32068 ) * SCRIPTING: Move Aggregation Scripts to their own context	2018-08-04 10:37:07 +02:00
Jack Conradson	6ca24e13af	Painless: Use LocalMethod Map For Lookup at Runtime (#32599 ) This modifies Def to use a Map<String, LocalMethod> to look up user-defined methods at runtime instead of writing constant methodhandles to do the reverse lookup. This creates a consistency between how LocalMethods are looked up at compile-time and run-time. This consistency will allow this code to be more maintainable moving forward. This will also allow FunctionReference to be cleaned up in a follow up PR.	2018-08-03 15:22:30 -07:00
Jack Conradson	b938960602	Painless: Move Some Lookup Logic to PainlessLookup (#32565 ) Renames existing methods in PainlessLookup. Adds lookupPainlessClass, lookupPainlessMethod, and lookupPainlessField to PainlessLookup. This consolidates the logic necessary to look these things up into a single place and begins the clean up of some of the nodes that were looking each of these things up individually. This also has the added benefit of improved consistency in error messaging.	2018-08-02 12:33:25 -07:00
Armin Braun	be31cc642b	INGEST: Enable default pipelines (#32286 ) * INGEST: Enable default pipelines * Add `default_pipeline` index setting * `_none` is interpreted as no pipeline * closes #21101	2018-08-02 17:11:12 +02:00
Jack Conradson	2985920134	Painless: Clean Up PainlessField (#32525 ) Updates PainlessField variable names to current naming scheme and removes extraneous variables.	2018-08-01 09:28:18 -07:00
Ryan Ernst	478f6d6cf1	Scripting: Conditionally use java time api in scripting (#31441 ) This commit adds a boolean system property, `es.scripting.use_java_time`, which controls the concrete return type used by doc values within scripts. The return type of accessing doc values for a date field is changed to Object, essentially duck typing the type to allow co-existence during the transition from joda time to java time.	2018-08-01 08:58:49 -07:00
Armin Braun	4b199dde8d	NETWORKING: Fix Netty Leaks by upgrading to 4.1.28 (#32511 ) * Upgrade to `4.1.28` since the problem reported in #32487 is a bug in Netty itself (see https://github.com/netty/netty/issues/7337) * Fixed other leaks in test code that now showed up due to fixes improvements in leak reporting in the newer version * Needed to extend permissions for netty common package because it now sets a classloader at runtime after changes in `63bae0956a` * Adjusted forbidden APIs check accordingly * Closes #32487	2018-08-01 02:34:58 +02:00
Jack Conradson	09e38f2f59	Painless: Clean up PainlessMethod (#32476 ) Renames and removes variables from PainlessMethod to follow the new naming convention. Generates methodtypes at compile-time instead of using a method at run- time. Moves write method to MethodWriter.	2018-07-31 16:25:53 -07:00
Ryan Ernst	2ed9782a67	Scripting: Fix painless compiler loader to know about context classes (#32385 ) This commit fixes the painless compiler classloader to know about the classes from the script context. This fixes an issue when a custom context is used from a plugin which caused a ClassNotFoundException for the script class and its factory classes.	2018-07-31 08:28:03 -07:00
Sohaib Iftikhar	4fa92cbf49	Changed ReindexRequest to use Writeable.Reader (#32401 ) -- This is a pre-stage for adding the reindex API to the REST high-level-client -- Follows the pattern set in #26315	2018-07-31 10:11:17 -04:00
Jack Conradson	c69e62d96f	Painless: Add PainlessConstructor (#32447 ) PainlessMethod was being used as both a method and a constructor, and while there are similarities, there are also some major differences. This allows the reflection objects to be stored reducing the number of other pieces of data stored in a PainlessMethod as they are now redundant. This temporarily increases some of the code in FunctionRef and PainlessDocGenerator as they now differentiate between constructors and methods, BUT is also makes the code more maintainable because there aren't checks in several places anymore to differentiate.	2018-07-30 14:46:24 -07:00
Armin Braun	cf7489899a	INGEST: Clean up Java8 Stream Usage (#32059 ) * GrokProcessor: Rationalize the loop over the map to save allocations and indirection * IngestDocument: Rationalize way we append to `List`	2018-07-30 21:25:30 +02:00
Ryan Ernst	34d006f82a	Tests: Fix convert error tests to use fixed value (#32415 ) The error tests for hex values previously used a random string of digits, but this could be a valid hex value. This commit changes these tests to use a fixed invalid hex value. closes #32370	2018-07-30 10:00:55 -07:00
Jack Conradson	e9e1095596	Painless: Add method type to method. (#32441 ) MethodType can be computed at compile-time rather than run-time. This removes the method that collects MethodType at run-time from a PainlessMethod since is it no longer necessary.	2018-07-27 14:23:37 -07:00
javanna	83d007e7be	[TEST] Mute failing testConvertLongHexError See #32370	2018-07-27 11:50:13 +02:00
Jim Ferenczi	53ff06e621	Upgrade to Lucene-7.5.0-snapshot-608f0277b0 (#32390 ) The main highlight is the removal of the reclaim_deletes_weight in the TieredMergePolicy. The es setting index.merge.policy.reclaim_deletes_weight is deprecated in this commit and the value is ignored. The new merge policy setting setDeletesPctAllowed should be added in a follow up.	2018-07-27 08:28:51 +02:00
Jim Ferenczi	8e5f281b27	AbstractQueryTestCase should run without type less often (#28936 ) This commit changes the randomization to always create an index with a type. It also adds a way to create a query shard context that maps to an index with no type registered in order to explicitely test cases where there is no type.	2018-07-26 20:29:05 +02:00
Tim Brooks	7a56df7c98	Release requests in cors handler (#32364 ) There are two scenarios where a http request could terminate in the cors handler. If that occurs, the requests need to be released. This commit releases those requests.	2018-07-26 10:06:24 -06:00
Jack Conradson	df579f8bce	Painless: Clean Up PainlessClass Variables (#32380 ) Removes the variables name, clazz, and type as they are unnecessary. Renames staticMembers -> staticFields, members -> fields, getters -> getterMethodHandles, and setters -> setterMethodHandles.	2018-07-26 09:02:06 -07:00
Christoph Büscher	35ae87125d	Remove some dead code (#31993 ) Removing some dead code or supressing warnings where apropriate. Most of the time the variable tested for null is dereferenced earlier or never used before.	2018-07-26 17:12:51 +02:00
Christoph Büscher	bec888fa78	Rank-Eval: Reduce scope of an unchecked supression We should only supress the unchecked warnings on ConstructingObjectParser.	2018-07-26 11:16:01 +02:00
Jack Conradson	853aa0afb4	Painless: Decouple PainlessLookupBuilder and Whitelists (#32346 ) Implements a static function in PainlessLookupBuilder that contains all the logic related to Whitelist. PainlessLookupBuilder is available for use in loading from methods beyond Whitelist now.	2018-07-25 10:52:01 -07:00
Dimitris Athanasiou	de53f0123f	[TEST] Mute ConvertProcessortTests.testConvertIntHexError Relates #32370	2018-07-25 17:35:23 +01:00
Armin Braun	717df26fc3	Networking: Fix test leaking buffer (#32296 ) * Test `handler` must release buffer the same way the replaced `org.elasticsearch.http.netty4.Netty4HttpRequestHandler#channelRead0` releases it * Closes #32289	2018-07-24 23:04:22 +02:00
Jack Conradson	1690451a9f	Painless: Update More Methods to New Naming Scheme (#32305 ) This finishes the updating the methods in the PainlessLookupBuilder to the new naming scheme. Mechanical change. Methods include the ones used for copying members in the inheritance hierarchy, calculating shortcuts, and setting the functional interface.	2018-07-24 13:08:05 -07:00
Ryan Ernst	49d4b26f16	Ingest: Support integer and long hex values in convert (#32213 ) This commit adds checks for hex formatted strings in the convert processor, allowing strings like `0x1` to be parsed as integer `1`. closes #32182	2018-07-24 12:05:50 -07:00
Christoph Büscher	59cf600e03	Register ERR metric with NamedXContentRegistry (#32320 ) This adds the ERR metric to the provided xContent parsers in the module and the high level rest client registry. Also adding integration tests to make sure the metric is correctly registered and usable from the client.	2018-07-24 16:05:43 +02:00
Zachary Tong	6ba144ae31	Add WeightedAvg metric aggregation (#31037 ) Adds a new single-value metrics aggregation that computes the weighted average of numeric values that are extracted from the aggregated documents. These values can be extracted from specific numeric fields in the documents. When calculating a regular average, each datapoint has an equal "weight"; it contributes equally to the final value. In contrast, weighted averages scale each datapoint differently. The amount that each datapoint contributes to the final value is extracted from the document, or provided by a script. As a formula, a weighted average is the `∑(value * weight) / ∑(weight)` A regular average can be thought of as a weighted average where every value has an implicit weight of `1`. Closes #15731	2018-07-23 18:33:15 -04:00
Christoph Büscher	fe6bb75eb4	Rename ranking evaluation `quality_level` to `metric_score` (#32168 ) The notion of "quality" is an overloaded term in the search ranking evaluation context. Its usually used to decribe certain levels of "good" vs. "bad" of a seach result with respect to the users information need. We currently report the result of the ranking evaluation as `quality_level` which is a bit missleading. This changes the response parameter name to `metric_score` which fits better.	2018-07-23 22:25:02 +02:00
Jack Conradson	d3c4904fa3	Painless: Clean up add methods in PainlessLookup (#32258 ) This is largely mechanical change that cleans up the addConstructor, addMethod, and addFields methods in PainlessLookup. Changes include renamed variables, better error messages, and some minor code movement to make it more maintainable long term.	2018-07-23 09:12:30 -07:00
Christoph Büscher	ff87b7aba4	Remove unnecessary warning supressions (#32250 )	2018-07-23 11:31:04 +02:00
Armin Braun	7aa8a0a927	INGEST: Extend KV Processor (#31789 ) (#32232 ) * INGEST: Extend KV Processor (#31789) Added more capabilities supported by LS to the KV processor: * Stripping of brackets and quotes from values (`include_brackets` in corresponding LS filter) * Adding key prefixes * Trimming specified chars from keys and values Refactored the way the filter is configured to avoid conditionals during execution. Refactored Tests a little to not have to add more redundant getters for new parameters. Relates #31786 * Add documentation	2018-07-20 22:32:50 +02:00
Armin Braun	e21692e387	INGEST: Make a few Processors callable by Painless (#32170 ) * INGEST: Make a few Processors callable by Painless * Extracted a few stateless String processors as well as the json processor to static methods and whitelisted them in Painless * provide whitelist from processors plugin	2018-07-20 21:10:35 +02:00
Nick Peihl	ac63408655	Add region ISO code to GeoIP Ingest plugin (#31669 )	2018-07-20 11:23:29 -07:00
Christoph Büscher	5cbd9ad177	Rename ranking evaluation response section (#32166 ) Currently the ranking evaluation response contains a 'unknown_docs' section for each search use case in the evaluation set. It contains document ids for results in the search hits that currently don't have a quality rating. This change renames it to `unrated_docs`, which better reflects its purpose.	2018-07-20 11:43:46 +02:00
Jack Conradson	c7a41c501a	Painless: Simplify Naming in Lookup Package (#32177 ) This removes some extraneous naming syntax and makes clear the meaning of certain naming conventions without ambiguities (stricter) within the lookup package. Purely mechanical change. Note this does not cover a large portion of the PainlessLookupBuilder and PainlessLookup yet as there are several more follow up PRs for these incoming.	2018-07-19 16:35:03 -07:00
Julie Tibshirani	15ff3da653	Add support for field aliases. (#32172 ) * Add basic support for field aliases in index mappings. (#31287) * Allow for aliases when fetching stored fields. (#31411) * Add tests around accessing field aliases in scripts. (#31417) * Add documentation around field aliases. (#31538) * Add validation for field alias mappings. (#31518) * Return both concrete fields and aliases in DocumentFieldMappers#getMapper. (#31671) * Make sure that field-level security is enforced when using field aliases. (#31807) * Add more comprehensive tests for field aliases in queries + aggregations. (#31565) * Remove the deprecated method DocumentFieldMappers#getFieldMapper. (#32148)	2018-07-18 09:33:09 -07:00
Jack Conradson	605dc49c48	Painless: Fix caching bug and clean up addPainlessClass. (#32142 ) This change cleans up the addPainlessClass methods by doing the following things: * Rename many variable names to match the new conventions described in the JavaDocs for PainlessLookup * Decouples Whitelist.Class from adding a PainlessClass directly * Adds a second version of addPainlessClass that is intended for use to add future defaults in a follow PR This change also fixes the method and field caches by storing Classes instead of Strings since it would technically be possible now that the whitelists are extendable to have different Classes with the same name. It was convenient to add this change together since some of the new constants are shared. Note the changes are largely mechanical again where all the code behavior should remain the same.	2018-07-18 09:29:52 -07:00
Alan Woodward	cfb30144c9	Call setReferences() on custom referring tokenfilters in _analyze (#32157 ) When building custom tokenfilters without an index in the _analyze endpoint, we need to ensure that referring filters are correctly built by calling their #setReferences() method Fixes #32154	2018-07-18 14:43:20 +01:00
Martijn van Groningen	53ab470264	use before instead of onOrBefore	2018-07-18 13:33:57 +02:00
Martijn van Groningen	1924f5d07c	Add more contexts to painless execute api (#30511 ) This change adds two contexts the execute scripts against: * SEARCH_SCRIPT: Allows to run scripts in a search script context. This context is used in `function_score` query's script function, script fields, script sorting and `terms_set` query. * FILTER_SCRIPT: Allows to run scripts in a filter script context. This context is used in the `script` query. In both contexts a index name needs to be specified and a sample document. The document is needed to create an in-memory index that the script can access via the `doc[...]` and other notations. The index name is needed because a mapping is needed to index the document. Examples: ``` POST /_scripts/painless/_execute { "script": { "source": "doc['field'].value.length()" }, "context" : { "search_script": { "document": { "field": "four" }, "index": "my-index" } } } ``` Returns: ``` { "result": 4 } ``` POST /_scripts/painless/_execute { "script": { "source": "doc['field'].value.length() <= params.max_length", "params": { "max_length": 4 } }, "context" : { "filter_script": { "document": { "field": "four" }, "index": "my-index" } } } Returns: ``` { "result": true } ``` Also changed PainlessExecuteAction.TransportAction to use TransportSingleShardAction instead of HandledAction, because now in case score or filter contexts are used the request needs to be redirected to a node that has an active IndexService for the index being referenced (a node with a shard copy for that index).	2018-07-18 12:42:07 +02:00
Christoph Büscher	ef5e8d8d8a	Fix Java 11 javadoc compile problem Java 11 complains with a "type arguments not allowed here" error when types are used in javadoc links it seems. Simply removing it.	2018-07-18 10:36:31 +02:00
Jack Conradson	03c16cd0e3	Painless: Add PainlessClassBuilder (#32141 ) Several pieces of data in PainlessClass cannot be passed in at the time the PainlessClass is created so it must be "frozen" after all the data is collected. This means PainlessClass is currently serving two functions as both a builder and a set of data. This separates the two pieces into clearly distinct values. This change also removes the PainlessMethodKey in favor of a simple String. The goal is to have the painless method key be completely internal to the PainlessLookup eventually and this simplifies the way there. Note that this was added since PainlessClass and PainlessClassBuilder were already being changed instead of a follow up PR.	2018-07-17 13:54:49 -07:00
Jack Conradson	1c63eb1081	Painless: Fix Bug with Duplicate PainlessClasses (#32110 ) When building the PainlessMethods and PainlessFields they stored a reference to a PainlessClass. This reference was prior to "freezing" the PainlessClass so the data was both incomplete and mutable. This has been replaced with a target java class instead since the PainlessClass is accessible through a java class now and it requires no special modifications to get around a chicken and egg issue.	2018-07-17 10:33:38 -07:00
aptxx	efb4e97cfb	Docs: Fix missing example script quote (#32010 )	2018-07-17 17:42:24 +02:00
Armin Braun	ed3b44fb4c	Handle TokenizerFactory TODOs (#32063 ) * Don't replace Replace TokenizerFactory with Supplier, this approach was rejected in #32063 * Remove unused parameter from constructor	2018-07-17 14:14:02 +02:00
Ioannis Kakavas	9e529d9d58	Enable testing in FIPS140 JVM (#31666 ) Ensure our tests can run in a FIPS JVM JKS keystores cannot be used in a FIPS JVM as attempting to use one in order to init a KeyManagerFactory or a TrustManagerFactory is not allowed.( JKS keystore algorithms for private key encryption are not FIPS 140 approved) This commit replaces JKS keystores in our tests with the corresponding PEM encoded key and certificates both for key and trust configurations. Whenever it's not possible to refactor the test, i.e. when we are testing that we can load a JKS keystore, etc. we attempt to mute the test when we are running in FIPS 140 JVM. Testing for the JVM is naive and is based on the name of the security provider as we would control the testing infrastrtucture and so this would be reliable enough. Other cases of tests being muted are the ones that involve custom TrustStoreManagers or KeyStoreManagers, null TLS Ciphers and the SAMLAuthneticator class as we cannot sign XML documents in the way we were doing. SAMLAuthenticator tests in a FIPS JVM can be reenabled with precomputed and signed SAML messages at a later stage. IT will be covered in a subsequent PR	2018-07-17 10:54:10 +03:00
Christoph Büscher	61486680a2	Add exclusion option to `keep_types` token filter (#32012 ) Currently the `keep_types` token filter includes all token types specified using its `types` parameter. Lucenes TypeTokenFilter also provides a second mode where instead of keeping the specified tokens (include) they are filtered out (exclude). This change exposes this option as a new `mode` parameter that can either take the values `include` (the default, if not specified) or `exclude`. Closes #29277	2018-07-17 09:04:41 +02:00
Jack Conradson	15740d6229	Painless: Move and Rename Several Methods in the lookup package (#32105 )	2018-07-16 16:13:48 -07:00
Nik Everett	d596447f3d	Switch non-x-pack to new style requests (#32106 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. This changes most of the calls not in X-Pack to their new versions.	2018-07-16 17:44:19 -04:00
Jack Conradson	2a1a28f19c	Painless: Separate PainlessLookup into PainlessLookup and PainlessLookupBuilder (#32054 )	2018-07-16 11:15:29 -07:00
Armin Braun	b1479bbed8	Scripting: Remove dead code from painless module (#32064 )	2018-07-16 18:43:00 +02:00
Christoph Büscher	ca4c4f736a	Remove unused params from SSource and Walker (#31935 ) The "source" field in SSource seems unused. If removed, it can also be removed from the ctor, which in turn makes is possible to delete the sourceText in the Walker class.	2018-07-16 10:54:23 +02:00
Armin Braun	b65c586cef	Cleanup Duplication in `PainlessScriptEngine` (#31991 ) * Cleanup Duplication in `PainlessScriptEngine` * Extract duplicate building of compiler settings to method * Remove dead method params + dead constant in `ScriptProcessor`	2018-07-14 13:37:59 +02:00
Armin Braun	ccf6126410	SCRIPTING: Remove unused MultiSearchTemplateRequestBuilder (#32049 ) * Ever since `46e8d97813` this class is unused	2018-07-14 09:03:35 +02:00
Tim Brooks	305bfea9c3	Add nio http transport to security plugin (#32018 ) This is related to #27260. It adds the SecurityNioHttpServerTransport to the security plugin. It randomly uses the nio http transport in security integration tests.	2018-07-13 16:41:02 -06:00
Armin Braun	3679d00a74	Replace Ingest ScriptContext with Custom Interface (#32003 ) * Replace Ingest ScriptContext with Custom Interface * Make org.elasticsearch.ingest.common.ScriptProcessorTests#testScripting more precise * Don't mock script factory in ScriptProcessorTests * Adjust mock script plugin in IT for new API	2018-07-13 23:26:10 +02:00
Vladimir Dolzhenko	b1bf643e41	lazy snapshot repository initialization (#31606 ) lazy snapshot repository initialization	2018-07-13 20:05:49 +02:00
Alan Woodward	a01e26a39b	Correct spelling of AnalysisPlugin#requriesAnalysisSettings (#32025 ) Because this is a static method on a public API, and one that we encourage plugin authors to use, the method with the typo is deprecated in 6.x rather than just renamed.	2018-07-13 13:13:21 +01:00
Christoph Büscher	e31a877a64	Fix problematic chars in javadoc Java 11 complains about unescaped ">" characters in javadocs. Also fixed some compiler complaints about javadoc in StringFunctionUtils.	2018-07-13 11:13:24 +02:00
Christoph Büscher	4ae4ac08d5	Add Expected Reciprocal Rank metric (#31891 ) This change adds Expected Reciprocal Rank (ERR) as a ranking evaluation metric as descriped in: Chapelle, O., Metlzer, D., Zhang, Y., & Grinspan, P. (2009). Expected reciprocal rank for graded relevance. Proceeding of the 18th ACM Conference on Information and Knowledge Management. https://doi.org/10.1145/1645953.1646033 ERR is an extension of the classical reciprocal rank to the graded relevance case and assumes a cascade browsing model. It quantifies the usefulness of a document at rank `i` conditioned on the degree of relevance of the items at ranks less than `i`. ERR seems to be gain traction as an alternative to (n)DCG, so it seems like a good metric to support. Also ERR seems to be the default optimization metric used for training in RankLib, a widely used learning to rank library. Relates to #29653	2018-07-12 15:50:58 +02:00
Alexander Reelsen	ac4e0f1b1d	Tests: Remove use of joda time in some tests (#31922 ) This also extends the dateformatters test to ensure that the printers are acting the same in java time and joda time.	2018-07-12 09:55:17 +02:00
Nik Everett	b83e99a824	Switch url repository rest tests to new style requests (#31944 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. This changes all calls in the `module/repository-url` project to use the new versions.	2018-07-11 14:52:45 -04:00
Nik Everett	939983d783	Switch reindex tests to new style requests (#31941 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. This changes all calls in the `modules/reindex` project to use the new versions.	2018-07-11 14:42:55 -04:00
Jake Landis	51bb27a991	ingest: date_index_name processor template resolution (#31841 ) This change adds support for template snippet (e.g. {{foo}}) resolution in the date_index_name processor. The following configuration options will now resolve a templated value if so configured: * index_name_prefix (e.g "index_name_prefix": "myindex-{{foo}}-") * date_rounding (e.g. "date_rounding" : "{{bar}}") * index_name_format (e.g."index_name_format": "{{baz}}")	2018-07-11 10:13:41 -05:00
Armin Braun	b4087d69d2	Fix assertIngestDocument wrongfully passing (#31913 ) * Fix assertIngestDocument wrongfully passing * Previously docA being subset of docB passed because iteration was over docA's keys only * Scalars in nested fields were not compared in all cases * Assertion errors were hard to interpret (message wasn't correct since it only mentioned the class type) * In cases where two paths contained different types a ClassCastException was thrown instead of an AssertionError * Fixes #28492	2018-07-11 10:24:21 +02:00
Mayya Sharipova	5481fbc249	Handle missing values in painless (#30975 ) * Handle missing values in painless Throw an exception for `doc['field'].value` if this document is missing a value for the `field`. For 7.0: This is the default behaviour from 7.0 For 6.x: To enable this behavior from 6.x, a user can set a jvm.option: `-Des.script.exception_for_missing_value=true` on a node. If a user does not enable this behavior, a deprecation warning is logged on start up. Closes #29286	2018-07-09 11:59:49 -04:00
Armin Braun	5f5157a2dc	Ingest: Enable Templated Fieldnames in Rename (#31690 ) * Ingest: Enable Templated Fieldnames in Rename	2018-07-09 13:50:21 +02:00
Armin Braun	e46ed73379	Ingest: Add ignore_missing option to RemoveProc (#31693 ) Added `ignore_missing` setting to the RemoveProcessor to fix #23086	2018-07-09 10:24:34 +02:00
Jack Conradson	d9a92011bc	Painless: Restructure Definition/Whitelist (#31879 ) Create lookup package rename Definition to PainlessLookup and move to lookup package rename Definition.Method to PainlessMethod rename Definition.MethodKey to PainlessMethod rename Definition.Field to PainlessField rename Definition.Struct to PainlessClass rename Definition.Cast to PainlessCast rename Whitelist.Struct to WhitelistClass rename Whitelist.Constructor to WhitelistConstructor rename Whitelist.Method to WhitelistMethod rename Whitelist.Field to WhitelistField	2018-07-08 12:00:23 -07:00
Christoph Büscher	bd1c513422	Reduce more raw types warnings (#31780 ) Similar to #31523.	2018-07-05 15:38:06 +02:00
Sohaib Iftikhar	40b822c878	Scripting: Remove support for deprecated StoredScript contexts (#31394 ) Removes support for storing scripts without the usual json around the script. So You can no longer do: ``` POST _scripts/<templatename> { "query": { "match": { "title": "{{query_string}}" } } } ``` and must instead do: ``` POST _scripts/<templatename> { "script": { "lang": "mustache", "source": { "query": { "match": { "title": "{{query_string}}" } } } } } ``` This improves error reporting when you attempt to store a script but don't quite get the syntax right. Before, there was a good chance that we'd think of it as a "raw" template and just store it. Now we won't do that. Nice.	2018-07-05 09:30:08 -04:00
Alpar Torok	cf2295b408	Add JDK11 support and enable in CI (#31644 ) * Upgrade bouncycastle Required to fix `bcprov-jdk15on-1.55.jar; invalid manifest format ` on jdk 11 * Downgrade bouncycastle to avoid invalid manifest * Add checksum for new jars * Update tika permissions for jdk 11 * Mute test failing on jdk 11 * Add JDK11 to CI * Thread#stop(Throwable) was removed http://mail.openjdk.java.net/pipermail/core-libs-dev/2018-June/053536.html * Disable failing tests #31456 * Temprorarily disable doc tests To see if there are other failures on JDK11 * Only blacklist specific doc tests * Disable only failing tests in ingest attachment plugin * Mute failing HDFS tests #31498 * Mute failing lang-painless tests #31500 * Fix backwards compatability builds Fix JAVA version to 10 for ES 6.3 * Add 6.x to bwx -> java10 * Prefix out and err from buildBwcVersion for readability ``` > Task :distribution:bwc:next-bugfix-snapshot:buildBwcVersion [bwc] :buildSrc:compileJava [bwc] WARNING: An illegal reflective access operation has occurred [bwc] WARNING: Illegal reflective access by org.codehaus.groovy.reflection.CachedClass (file:/home/alpar/.gradle/wrapper/dists/gradle-4.5-all/cg9lyzfg3iwv6fa00os9gcgj4/gradle-4.5/lib/groovy-all-2.4.12.jar) to method java.lang.Object.finalize() [bwc] WARNING: Please consider reporting this to the maintainers of org.codehaus.groovy.reflection.CachedClass [bwc] WARNING: Use --illegal-access=warn to enable warnings of further illegal reflective access operations [bwc] WARNING: All illegal access operations will be denied in a future release [bwc] :buildSrc:compileGroovy [bwc] :buildSrc:writeVersionProperties [bwc] :buildSrc:processResources [bwc] :buildSrc:classes [bwc] :buildSrc:jar ``` * Also set RUNTIME_JAVA_HOME for bwcBuild So that we can make sure it's not too new for the build to understand. * Align bouncycastle dependency * fix painles array tets closes #31500 * Update jar checksums * Keep 8/10 runtime/compile untill consensus builds on 11 * Only skip failing tests if running on Java 11 * Failures are dependent of compile java version not runtime * Condition doc test exceptions on compiler java version as well * Disable hdfs tests based on runtime java * Set runtime java to minimum supported for bwc * PR review * Add comment with ticket for forbidden apis	2018-07-05 03:24:01 +00:00
Simon Willnauer	3f2a241b7f	Detach Transport from TransportService (#31727 ) Today TransportService is tightly coupled with Transport since it requires an instance of TransportService in order to receive responses and send requests. This is mainly due to the Request and Response handlers being maintained in TransportService but also because of the lack of a proper callback interface. This change moves request handler registry and response handler registration into Transport and adds all necessary methods to `TransportConnectionListener` in order to remove the `TransportService` dependency from `Transport` Transport now accepts one or more `TransportConnectionListener` instances that are executed sequentially in a blocking fashion.	2018-07-04 11:32:35 +02:00
Jack Conradson	a02e5ee740	Painless: Complete Removal of Painless Type (#31699 ) This completes the removal of Painless Type. The new data structures in the definition are a map of names (String) to Java Classes and a map of Java Classes to Painless Structs. The names to Java Classes map can contain a 2 to 1 ratio of names to classes depending on whether or not a short (imported) name is used. The Java Classes to Painless Structs is 1 to 1 always where the Java Class name must match the Painless Struct name. This should lead a significantly simpler type system in Painless moving forward since the Painless Type only held redundant information since Painless does not support generics.	2018-07-03 13:31:56 -07:00
Jake Landis	c0056cddd8	ingest: Introduction of a bytes processor (#31733 ) ingest: Introduction of a bytes processor This processor allows for human readable byte values (e.g. 1kb) to be converted to value in bytes (e.g. 1024). Internally this processor re-uses "ByteSizeValue.parseBytesSizeValue" which supports conversions up to Long.MAX_VALUE and the following units: "b", "kb", "mb", "gb", "tb", pb". This change also introduces a generic return type for the AbstractStringProcessor to allow for code reuse while supporting a String -> T conversion. (String -> Long in this case).	2018-07-03 10:40:56 -05:00
Yannick Welsch	2bb4f38371	Add writeBlob option to replace existing blob (#31729 ) Adds a new parameter to the BlobContainer#writeBlob methods to specify whether the existing file should be overridden or not. For some metadata files in the repository, we actually want to replace the current file. This is currently implemented through an explicit blob delete and then a fresh write. In case of using a cloud provider (S3, GCS, Azure), this results in 2 API requests instead of just 1. This change will therefore allow us to achieve the same functionality using less API requests.	2018-07-03 09:13:50 +02:00
Christoph Büscher	31aabe4bf9	Clean up double semicolon code typos (#31687 )	2018-07-02 15:14:44 +02:00
Nirmal Chidambaram	c827a4e8e1	has_parent builder: exception message/param fix (#31182 ) has_parent builder throws exception message that it expects a `type` while parser excepts `parent_type`	2018-06-30 11:17:37 -07:00
Tanguy Leroux	0ef22db844	[Test] Clean up some repository-s3 tests (#31601 ) This commit removes some tests in the repository-s3 plugin that have not been executed for 2+ years but have been maintained for nothing. Most of the tests in AbstractAwsTestCase were obsolete or superseded by fixture based integration tests.	2018-06-29 13:21:29 +02:00
markharwood	09dd19a403	Add MultiSearchTemplate support to High Level Rest client (#30836 ) Add MultiSearchTemplate support to High Level Rest client. Addresses part of #27205	2018-06-28 14:05:26 +01:00
Armin Braun	13e1cf6191	ingest: Add ignore_missing property to foreach filter (#22147 ) (#31578 )	2018-06-26 20:04:41 +02:00
Alpar Torok	08b8d11e30	Add support for switching distribution for all integration tests (#30874 ) * remove left-over comment * make sure of the property for plugins * skip installing modules if these exist in the distribution * Log the distrbution being ran * Don't allow running with integ-tests-zip passed externally * top level x-pack/qa can't run with oss distro * Add support for matching objects in lists Makes it possible to have a key that points to a list and assert that a certain object is present in the list. All keys have to be present and values have to match. The objects in the source list may have additional fields. example: ``` match: { 'nodes.$master.plugins': { name: ingest-attachment } } ``` * Update plugin and module tests to work with other distributions Some of the tests expected that the integration tests will always be ran with the `integ-test-zip` distribution so that there will be no other plugins loaded. With this change, we check for the presence of the plugin without assuming exclusivity. * Allow modules to run on other distros as well To match the behavior of tets.distributions * Add and use a new `contains` assertion Replaces the previus changes that caused `match` to do a partial match. * Implement PR review comments	2018-06-26 06:49:03 -07:00
Igor Motov	237650e9c0	Add x-opaque-id to search slow logs (#31539 ) Add x-opaque-id to search slow logs only. Indexing slow log and audit logs will be handled as separate PRs. Relates #31521	2018-06-25 12:20:27 -07:00
Christoph Büscher	86ab3a2d1a	Reduce number of raw types warnings (#31523 ) A first attempt to reduce the number of raw type warnings, most of the time by using the unbounded wildcard.	2018-06-25 15:59:03 +02:00
Jonathan Little	8e4768890a	Migrate scripted metric aggregation scripts to ScriptContext design (#30111 ) * Migrate scripted metric aggregation scripts to ScriptContext design #29328 * Rename new script context container class and add clarifying comments to remaining references to params._agg(s) * Misc cleanup: make mock metric agg script inner classes static * Move _score to an accessor rather than an arg for scripted metric agg scripts This causes the score to be evaluated only when it's used. * Documentation changes for params._agg -> agg * Migration doc addition for scripted metric aggs _agg object change * Rename "agg" Scripted Metric Aggregation script context variable to "state" * Rename a private base class from ...Agg to ...State that I missed in my last commit * Clean up imports after merge	2018-06-25 12:01:33 +01:00
Ryan Ernst	7a150ec06d	Core: Combine doExecute methods in TransportAction (#31517 ) TransportAction currently contains 2 doExecute methods, one which takes a the task, and one that does not. The latter is what some subclasses implement, while the first one just calls the latter, dropping the given task. This commit combines these methods, in favor of just always assuming a task is present.	2018-06-22 15:03:01 -07:00
Ryan Ernst	59e7c6411a	Core: Combine messageRecieved methods in TransportRequestHandler (#31519 ) TransportRequestHandler currently contains 2 messageReceived methods, one which takes a Task, and one that does not. The first just delegates to the second. This commit changes all existing implementors of TransportRequestHandler to implement the version which takes Task, thus allowing the class to be a functional interface, and eliminating the need to throw exceptions when a task needs to be ensured.	2018-06-22 07:36:03 -07:00
Adrien Grand	f023e95ae0	Upgrade to Lucene 7.4.0. (#31529 ) This moves Elasticsearch from a recent 7.4.0 snapshot to the GA release.	2018-06-22 16:17:17 +02:00
Ryan Ernst	4f9332ee16	Core: Remove ThreadPool from base TransportAction (#31492 ) Most transport actions don't need the node ThreadPool. This commit removes the ThreadPool as a super constructor parameter for TransportAction. The actions that do need the thread pool then have a member added to keep it from their own constructor.	2018-06-21 11:25:26 -07:00
Ryan Ernst	0a324b9943	Core: Convert TransportAction.execute uses to client calls (#31487 ) This commit converts some of the existing calls to TransportAction.execute to use the equivalent client method for the desired action.	2018-06-21 07:59:55 -07:00
Ryan Ernst	00283a61e1	Remove unused generic type for client execute method (#31444 ) This commit removes the request builder generic type for AbstractClient as it was unused.	2018-06-20 16:26:26 -07:00
Tim Brooks	9ab1325953	Introduce http and tcp server channels (#31446 ) Historically in TcpTransport server channels were represented by the same channel interface as socket channels. This was necessary as TcpTransport was parameterized by the channel type. This commit introduces TcpServerChannel and HttpServerChannel classes. Additionally, it adds the implementations for the various transports. This allows server channels to have unique functionality and not implement the methods they do not support (such as send and getRemoteAddress). Additionally, with the introduction of HttpServerChannel this commit extracts some of the storing and closing channel work to the abstract http server transport.	2018-06-20 16:34:56 -06:00
Alan Woodward	5683bc60a6	Multiplexing token filter (#31208 ) The `multiplexer` filter emits multiple tokens at the same position, each version of the token haivng been passed through a different filter chain. Identical tokens at the same position are removed. This allows users to, for example, index lowercase and original-case tokens, or stemmed and unstemmed versions, in the same field, so that they can search for a stemmed term within x positions of an unstemmed term.	2018-06-20 10:16:26 +01:00
Ryan Ernst	401800d958	Core: Remove index name resolver from base TransportAction (#31002 ) Most transport actions don't need to resolve index names. This commit removes the index name resolver as a super constructor parameter for TransportAction. The actions that do need the resolver then have a member added to keep the resolver from their own constructor.	2018-06-19 17:06:09 -07:00
Tim Brooks	529e704b11	Unify http channels and exception handling (#31379 ) This is a general cleanup of channels and exception handling in http. This commit introduces a CloseableChannel that is a superclass of TcpChannel and HttpChannel. This allows us to unify the closing logic between tcp and http transports. Additionally, the normal http channels are extracted to the abstract server transport. Finally, this commit (mostly) unifies the exception handling between nio and netty4 http server transports.	2018-06-19 11:50:03 -06:00
Ryan Ernst	e67aa96c81	Core: Combine Action and GenericAction (#31405 ) Since #30966, Action no longer has anything but a call to the GenericAction super constructor. This commit renames GenericAction into Action, thus eliminating the Action class. Additionally, this commit removes the Request generic parameter of the class, since it was unused.	2018-06-18 23:53:04 +02:00
Martijn van Groningen	47095357bc	Move language analyzers from server to analysis-common module. (#31300 ) The following analyzers were moved from server module to analysis-common module: `greek`, `hindi`, `hungarian`, `indonesian`, `irish`, `italian`, `latvian`, `lithuanian`, `norwegian`, `persian`, `portuguese`, `romanian`, `russian`, `sorani`, `spanish`, `swedish`, `turkish` and `thai`. Relates to #23658	2018-06-18 11:24:43 +02:00
Alan Woodward	8c0ec05a12	Expose lucene's RemoveDuplicatesTokenFilter (#31275 )	2018-06-18 09:46:12 +01:00
Vladimir Dolzhenko	dbc9d60260	Support for remote path in reindex api (#31290 ) Support for remote path in reindex api Closes #22913	2018-06-15 22:14:28 +02:00
Tim Brooks	a705e1a9e3	Add byte array pooling to nio http transport (#31349 ) This is related to #28898. This PR implements pooling of bytes arrays when reading from the wire in the http server transport. In order to do this, we must integrate with netty reference counting. That manner in which this PR implements this is making Pages in InboundChannelBuffer reference counted. When we accessing the underlying page to pass to netty, we retain the page. When netty releases its bytebuf, it releases the underlying pages we have passed to it.	2018-06-15 14:01:03 -06:00
Nhat Nguyen	8453ca638d	Upgrade to Lucene-7.4.0-snapshot-518d303506 (#31360 )	2018-06-15 10:58:21 -04:00
Christoph Büscher	02346c20a2	Rankeval: Fold template test project into main module (#31203 ) This change moves tests in `smoke-test-rank-eval-with-mustache` into the main ranking evaluation module by declaring that the integration testing cluster requires the `lang-mustache` plugin. This avoids having to maintain the qa project for only one basic test suite.	2018-06-15 15:55:39 +02:00
Christoph Büscher	a0d6c19e75	Add details section for dcg ranking metric (#31177 ) While the other two ranking evaluation metrics (precicion and reciprocal rank) already provide a more detailed output for how their score is calculated, the discounted cumulative gain metric (dcg) and its normalized variant are lacking this until now. Its not really clear which level of detail might be useful for debugging and understanding the final metric calculation, but this change adds a `metric_details` section to REST output that contains some information about the evaluation details.	2018-06-15 11:56:16 +02:00
Tim Brooks	1c5cec0ac7	Remove http status code maps (#31350 ) Currently we maintain a compatibility map of http status codes in both the netty4 and nio modules. These maps convert a RestStatus to a netty HttpResponseStatus. However, as these fundamentally represent integers, we can just use the netty valueOf method to convert a RestStatus to a HttpResponseStatus.	2018-06-14 20:16:40 -06:00
Jack Conradson	0324103737	Painless: Fix bug for static method calls on interfaces (#31348 ) Static method calls on interfaces were not being called correctly which was causing JVM crashes. This change fixes the issue.	2018-06-14 18:30:37 -07:00
Tim Brooks	fcf1e41e42	Extract common http logic to server (#31311 ) This is related to #28898. With the addition of the http nio transport, we now have two different modules that provide http transports. Currently most of the http logic lives at the module level. However, some of this logic can live in server. In particular, some of the setting of headers, cors, and pipelining. This commit begins this moving in that direction by introducing lower level abstraction (HttpChannel, HttpRequest, and HttpResonse) that is implemented by the modules. The higher level rest request and rest channel work can live entirely in server.	2018-06-14 15:10:02 -06:00
Tanguy Leroux	bbfe1eccc7	[Tests] Mutualize fixtures code in BaseHttpFixture (#31210 ) Many fixtures have similar code for writing the pid & ports files or for handling HTTP requests. This commit adds an AbstractHttpFixture class in the test framework that can be extended for specific testing purposes.	2018-06-14 14:09:56 +02:00
Tanguy Leroux	4d7447cb5e	Reenable Checkstyle's unused import rule (#31270 )	2018-06-14 09:52:46 +02:00
Tanguy Leroux	8b4d80ad09	Fix AntFixture waiting condition (#31272 ) The AntFixture waiting condition is evaluated to false but it should be true.	2018-06-13 12:40:22 +02:00
Martijn van Groningen	16d593b22f	Set analyzer version in PreBuiltAnalyzerProviderFactory (#31202 ) instead of lamda that creates the analyzer	2018-06-13 07:25:19 +02:00
Tim Brooks	56ffe553e5	Modify pipelining handlers to require full requests (#31280 ) Currently the http pipelining handlers seem to support chunked http content. However, this does not make sense. There is a content aggregator in the pipeline before the pipelining handler. This means the pipelining handler should only see full http messages. Additionally, the request handler immediately after the pipelining handler only supports full messages. This commit modifies both nio and netty4 pipelining handlers to assert that an inbound message is a full http message. Additionally it removes the tests for chunked content.	2018-06-12 23:15:24 -06:00
Jason Tedor	0bfd18cc8b	Revert upgrade to Netty 4.1.25.Final (#31282 ) This reverts upgrading to Netty 4.1.25.Final until we have a cleaner solution to dealing with the object cleaner thread.	2018-06-12 19:26:18 -04:00
Martijn van Groningen	6030d4be1e	[INGEST] Interrupt the current thread if evaluation grok expressions take too long (#31024 ) This adds a thread interrupter that allows us to encapsulate calls to org.joni.Matcher#search() This method can hang forever if the regex expression is too complex. The thread interrupter in the background checks every 3 seconds whether there are threads execution the org.joni.Matcher#search() method for longer than 5 seconds and if so interrupts these threads. Joni has checks that that for every 30k iterations it checks if the current thread is interrupted and if so returns org.joni.Matcher#INTERRUPTED Closes #28731	2018-06-12 07:49:03 +02:00
Jason Tedor	563141c6c9	Upgrade to Netty 4.1.25.Final (#31232 ) This commit upgrades us to Netty 4.1.25. This upgrade is more challenging than past upgrades, all because of a new object cleaner thread that they have added. This thread requires an additional security permission (set context class loader, needed to avoid leaks in certain scenarios). Additionally, there is not a clean way to shutdown this thread which means that the thread can fail thread leak control during tests. As such, we have to filter this thread from thread leak control.	2018-06-11 16:55:07 -04:00
Tanguy Leroux	bf58660482	Remove all unused imports and fix CRLF (#31207 ) The X-Pack opening and the recent other refactorings left a lot of unused imports in the codebase. This commit removes them all.	2018-06-11 15:12:12 +02:00
Tanguy Leroux	a1916658a9	[Tests] Fix self-referencing tests This commit adapts some test after #31044 has been merged.	2018-06-11 12:45:27 +02:00
rationull	85c26d682a	Call ensureNoSelfReferences() on _agg state variable after scripted metric agg script executions (#31044 ) Previously this was called for the combine script only. This change checks for self references for init, map, and reduce scripts as well, and adds unit test coverage for the init, map, and combine cases.	2018-06-11 08:39:05 +02:00
Julie Tibshirani	8f607071b6	Remove DocumentFieldMappers#smartNameFieldMapper, as it is no longer needed. (#31018 )	2018-06-08 09:24:09 -07:00
Martijn van Groningen	07a57cc131	Move number of language analyzers to analysis-common module (#31143 ) The following analyzers were moved from server module to analysis-common module: `snowball`, `arabic`, `armenian`, `basque`, `bengali`, `brazilian`, `bulgarian`, `catalan`, `chinese`, `cjk`, `czech`, `danish`, `dutch`, `english`, `finnish`, `french`, `galician` and `german`. Relates to #23658	2018-06-08 08:58:46 +02:00
Tim Brooks	237f9b8930	Add nio-transport as option for http smoke tests (#31162 ) This is related to #27260 and #28898. This commit adds the transport-nio plugin as a random option when running the http smoke tests. As part of this PR, I identified an issue where cors support was not properly enabled causing these tests to fail when using transport-nio. This commit also fixes that issue.	2018-06-07 09:46:36 -06:00
Tanguy Leroux	b5f05f676c	Remove BlobContainer.move() method (#31100 ) closes #30680	2018-06-07 10:48:31 +02:00
Adrien Grand	458bca11bc	Add a `feature_vector` field. (#31102 ) This field is similar to the `feature` field but is better suited to index sparse feature vectors. A use-case for this field could be to record topics associated with every documents alongside a metric that quantifies how well the topic is connected to this document, and then boost queries based on the topics that the logged user is interested in. Relates #27552	2018-06-07 10:05:37 +02:00
Christoph Büscher	0c9d4cb417	Fix expectation on parsing exception (#31108 ) The structure of the expected exception slightly changed, the change adapts the assertions accordingly. Closes #31104	2018-06-06 09:58:16 +02:00
Martijn van Groningen	735d0e671a	Make PreBuiltAnalyzerProviderFactory plugable via AnalysisPlugin and move `finger_print`, `pattern` and `standard_html_strip` analyzers to analysis-common module. (both AnalysisProvider and PreBuiltAnalyzerProvider) Changed PreBuiltAnalyzerProviderFactory to extend from PreConfiguredAnalysisComponent and changed to make sure that predefined analyzers are always instantiated with the current ES version and if an instance is requested for a different version then delegate to PreBuiltCache. This is similar to the behaviour that exists today in AnalysisRegistry.PreBuiltAnalysis and PreBuiltAnalyzerProviderFactory. (#31095) Relates to #23658	2018-06-06 07:40:21 +02:00
Tim Brooks	05ee0f8b6e	Add cors support to NioHttpServerTransport (#30827 ) This is related to #28898. This commit adds cors support to the nio http transport. Most of the work is copied directly from the netty module implementation. Additionally, this commit adds tests for the nio http channel.	2018-06-05 10:09:20 -06:00
Christoph Büscher	4624ba5e10	[Tests] Muting RatedRequestsTests#testXContentParsingIsNotLenient	2018-06-05 15:29:49 +02:00
Martijn van Groningen	0fad7cc99a	Take into account the return value of TcpTransport.readMessageLength(...) in Netty4SizeHeaderFrameDecoder (#31057)	2018-06-05 10:35:47 +02:00
Adrien Grand	cc55235030	Decouple MultiValueMode. (#31075 ) Currently this class takes care of moth selecting the relevant value, and replacing missing values if any. This is fine for sorting, which always needs to do both at the same time, but we also have a number of aggregations and script utils that need to retain information about missing values so this change proposes to decouple selection of the relevant value and replacement of missing values.	2018-06-05 08:51:20 +02:00
Christoph Büscher	3f87c79500	Change ObjectParser exception (#31030 ) ObjectParser should throw XContentParseExceptions, not IAE. A dedicated parsing exception can includes the place where the error occurred. Closes #30605	2018-06-04 20:20:37 +02:00
Nhat Nguyen	abe61159a8	Upgrade to Lucene-7.4.0-snapshot-0a7c3f462f (#31073 ) This snapshot includes: - LUCENE-8341: Record soft deletes in SegmentCommitInfo which will resolve #30851 - LUCENE-8335: Enforce soft-deletes field up-front	2018-06-04 14:18:46 -04:00
Jason Tedor	3670a2ae05	Adjust BWC version on client features This commit adjusts the BWC version on client features in master to 6.3.0 after the functionality was backported to the 6.3 branch.	2018-06-01 19:15:31 -04:00
Tim Brooks	f8785dda9d	Add TRACE, CONNECT, and PATCH http methods (#31035 ) This is related to #31017. That issue identified that these three http methods were treated like GET requests. This commit adds them to RestRequest. This means that these methods will be handled properly and generate 405s.	2018-06-01 17:07:54 -06:00
Jason Tedor	2401150be7	Adjust BWC version on client features This commit adjusts the BWC version on client features in master to 6.4.0 after the functionality was backported to the 6.x branch.	2018-06-01 16:33:56 -04:00
Jason Tedor	4522b57e07	Introduce client feature tracking (#31020 ) This commit introduces the ability for a client to communicate to the server features that it can support and for these features to be used in influencing the decisions that the server makes when communicating with the client. To this end we carry the features from the client to the underlying stream as we carry the version of the client today. This enables us to enhance the logic where we make protocol decisions on the basis of the version on the stream to also make protocol decisions on the basis of the features on the stream. With such functionality, the client can communicate to the server if it is a transport client, or if it has, for example, X-Pack installed. This enables us to support rolling upgrades from the OSS distribution to the default distribution without breaking client connectivity as we can now elect to serialize customs in the cluster state depending on whether or not the client reports to us using the feature capabilities that it can under these customs. This means that we would avoid sending a client pieces of the cluster state that it can not understand. However, we want to take care and always send the full cluster state during node-to-node communication as otherwise we would end up with different understanding of what is in the cluster state across nodes depending on which features they reported to have. This is why when deciding whether or not to write out a custom we always send the custom if the client is not a transport client and otherwise do not send the custom if the client is transport client that does not report to have the feature required by the custom. Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2018-06-01 11:45:35 -04:00
Julie Tibshirani	cd0a375414	Remove unused query methods from MappedFieldType. (#30987 ) * Remove MappedFieldType#nullValueQuery, as it is now unused. * Remove MappedFieldType#queryStringTermQuery, as it is never overridden.	2018-05-31 12:47:52 -07:00
Ryan Ernst	46e8d97813	Core: Remove RequestBuilder from Action (#30966 ) This commit removes the RequestBuilder generic type from Action. It was needed to be used by the newRequest method, which in turn was used by client.prepareExecute. Both of these methods are now removed, along with the existing users of prepareExecute constructing the appropriate builder directly.	2018-05-31 16:15:00 +02:00
Christoph Büscher	0a5d46ef3c	[Test] Prefer ArrayList over Vector (#30965 ) Replaces some occurances of Vector class with ArrayList in tests of the rank-eval module.	2018-05-30 21:11:49 +02:00
markharwood	facbb2b2dc	Add “took” timing info to response for _msearch/template API (#30961 ) Add “took” timing info to response for _msearch/template API Closes #30957	2018-05-30 17:57:28 +01:00
Christoph Büscher	1ea9f11b03	Change ScriptException status to 400 (bad request) (#30861 ) Currently failures to compile a script usually lead to a ScriptException, which inherits the 500 INTERNAL_SERVER_ERROR from ElasticsearchException if it does not contain another root cause. Instead, this should be a 400 Bad Request error. This PR changes this more generally for script compilation errors by changing ScriptException to return 400 (bad request) as status code. Closes #12315	2018-05-30 14:00:07 +02:00
Tim Brooks	ad0dc580c5	Fix location of AbstractHttpServerTransport (#30888 ) Currently AbstractHttpServerTransport is in a netty4 module. This is the incorrect location. This commit moves it out of netty4 module. Additionally, it moves unit tests that test AbstractHttpServerTransport logic to server.	2018-05-29 13:14:23 -06:00
Martijn van Groningen	544822c78b	Moved keyword tokenizer to analysis-common module (#30642 ) Relates to #23658	2018-05-29 19:22:28 +02:00
Nhat Nguyen	363f1e84ca	Upgrade to Lucene-7.4-snapshot-1cbadda4d3 (#30928 ) This snapshot includes LUCENE-8328 which is needed to stabilize CCR builds.	2018-05-29 12:29:52 -04:00
Sohaib Iftikhar	3c918d799c	Deprecate accepting malformed requests in stored script API (#28939 ) The stored scripts API today accepts malformed requests instead of throwing an exception. This PR deprecates accepting malformed put stored script requests (requests not using the official script format). Relates to #27612	2018-05-29 15:45:53 +02:00
Martijn van Groningen	ae2f021f1c	Move score script context from SearchScript to its own class (#30816 )	2018-05-25 07:17:50 +02:00
Tim Brooks	e8b70273c1	Remove Throwable usage from transport modules (#30845 ) Currently nio and netty modules use the CompletableFuture class for managing listeners. This is unfortunate as that class accepts Throwable. This commit adds a class CompletableContext that wraps the CompletableFuture but does not accept Throwable. This allows the modification of netty and nio logic to no longer handle Throwable.	2018-05-24 17:33:29 -06:00
Tim Brooks	d7040ad7b4	Reintroduce mandatory http pipelining support (#30820 ) This commit reintroduces `31251c9` and `63a5799`. These commits introduced a memory leak and were reverted. This commit brings those commits back and fixes the memory leak by removing unnecessary retain method calls.	2018-05-23 14:38:52 -06:00
Colin Goodheart-Smithe	4fd0a3e492	Revert "Make http pipelining support mandatory (#30695 )" (#30813 ) This reverts commit `31251c9` introduced in #30695. We suspect this commit is causing the OOME's reported in #30811 and we will use this PR to test this assertion.	2018-05-23 10:54:46 -06:00
Adrien Grand	886db84ad2	Expose Lucene's FeatureField. (#30618 ) Lucene has a new `FeatureField` which gives the ability to record numeric features as term frequencies. Its main benefit is that it allows to boost queries with the values of these features and efficiently skip non-competitive documents at the same time using block-max WAND and indexed impacts.	2018-05-23 08:55:21 +02:00
Nhat Nguyen	1918a30237	Upgrade to Lucene-7.4.0-snapshot-cc2ee23050 (#30778 ) The new snapshot includes LUCENE-8324 which fixes missing checkpoint after a fully deletes segment is dropped on flush. This snapshot should resolves failed tests in the CorruptedFileIT suite. Closes #30741 Closes #30577	2018-05-22 13:11:48 -04:00
Tim Brooks	31251c9a6d	Make http pipelining support mandatory (#30695 ) This is related to #29500 and #28898. This commit removes the abilitiy to disable http pipelining. After this commit, any elasticsearch node will support pipelined requests from a client. Additionally, it extracts some of the http pipelining work to the server module. This extracted work is used to implement pipelining for the nio plugin.	2018-05-22 09:29:31 -06:00
Itamar Syn-Hershko	5f172b6795	[Feature] Adding a char_group tokenizer (#24186 ) === Char Group Tokenizer The `char_group` tokenizer breaks text into terms whenever it encounters a character which is in a defined set. It is mostly useful for cases where a simple custom tokenization is desired, and the overhead of use of the <<analysis-pattern-tokenizer, `pattern` tokenizer>> is not acceptable. === Configuration The `char_group` tokenizer accepts one parameter: `tokenize_on_chars`:: A string containing a list of characters to tokenize the string on. Whenever a character from this list is encountered, a new token is started. Also supports escaped values like `\\n` and `\\f`, and in addition `\\s` to represent whitespace, `\\d` to represent digits and `\\w` to represent letters. Defaults to an empty list. === Example output ```The 2 QUICK Brown-Foxes jumped over the lazy dog's bone for $2``` When the configuration `\\s-:<>` is used for `tokenize_on_chars`, the above sentence would produce the following terms: ```[ The, 2, QUICK, Brown, Foxes, jumped, over, the, lazy, dog's, bone, for, $2 ]```	2018-05-22 16:26:31 +02:00
Ryan Ernst	34180f2285	Scripting: Remove getDate methods from ScriptDocValues (#30690 ) The getDate() and getDates() existed prior to 5.x on long fields in scripting. In 5.x, a new Date type for ScriptDocValues was added. The getDate() and getDates() methods were left on long fields and added to date fields to ease the transition. This commit removes those methods for 7.0.	2018-05-18 21:26:26 -07:00
Nhat Nguyen	67d8fc222d	Upgrade to Lucene-7.4.0-snapshot-59f2b7aec2 (#30726 ) This snapshot resolves issues related to ShrinkIndexIT.	2018-05-18 18:21:39 -04:00
Zachary Tong	d120fb222c	[TEST] Adjust version skips for movavg/movfn tests Since the MovFn PR was backported to 6.x, we can adjust the version skip numbers in master to correctly match 6.3.99 instead of 6.4.0	2018-05-17 18:07:52 +00:00
Christoph Büscher	b6340658f4	Deprecate `nGram` and `edgeNGram` names for ngram filters (#30209 ) The camel case name `nGram` should be removed in favour of `ngram` and similar for `edgeNGram` and `edge_ngram`. Before removal, we need to deprecate the camel case names first. This change adds deprecation warnings for indices with versions 6.4.0 and higher and logs deprecation warnings.	2018-05-17 12:52:22 +02:00
Shashwat Anand	f0da3da6b0	Reindex: Fixed typo in assertion failure message (#30619 ) Fix a typo in an assertion failure message.	2018-05-16 16:26:23 -04:00
Ke Li	d2b9a765cf	Remove version argument in RangeFieldType (#30411 ) The argument `indexVersionCreated` is not needed any more and can be removed.	2018-05-16 17:42:44 +02:00
Zachary Tong	df853c49c0	Add a MovingFunction pipeline aggregation, deprecate MovingAvg agg (#29594 ) This pipeline aggregation gives the user the ability to script functions that "move" across a window of data, instead of single data points. It is the scripted version of MovingAvg pipeline agg. Through custom script contexts, we expose a number of convenience methods: - MovingFunctions.max() - MovingFunctions.min() - MovingFunctions.sum() - MovingFunctions.unweightedAvg() - MovingFunctions.linearWeightedAvg() - MovingFunctions.ewma() - MovingFunctions.holt() - MovingFunctions.holtWinters() - MovingFunctions.stdDev() The user can also define any arbitrary logic via their own scripting, or combine with the above methods.	2018-05-16 10:57:00 -04:00
Tim Brooks	99b9ab58e2	Add nio http server transport (#29587 ) This commit is related to #28898. It adds an nio driven http server transport. Currently it only supports basic http features. Cors, pipeling, and read timeouts will need to be added in future PRs.	2018-05-15 16:37:14 -06:00
Julie Tibshirani	4f9dd37169	Add support for search templates to the high-level REST client. (#30473 )	2018-05-15 13:07:58 -07:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Christoph Büscher	cc93131318	Forbid expensive query parts in ranking evaluation (#30151 ) Currently the ranking evaluation API accepts the full query syntax for the queries specified in the evaluation set and executes them via multi search. This potentially runs costly aggregations and suggestions too. This change adds checks that forbid using aggregations, suggesters, highlighters and the explain and profile options in the queries that are run as part of the ranking evaluation since they are irrelevent in the context of this API.	2018-05-14 17:36:26 +02:00
Alpar Torok	9a5555963b	Add missing dependencies on testClasses (#30527 )	2018-05-14 16:06:56 +03:00
Martijn van Groningen	7b95470897	Moved tokenizers to analysis common module (#30538 ) The following tokenizers were moved: classic, edge_ngram, letter, lowercase, ngram, path_hierarchy, pattern, thai, uax_url_email and whitespace. Left keyword tokenizer factory in server module, because normalizers directly depend on it.This should be addressed on a follow up change. Relates to #23658	2018-05-14 07:55:01 +02:00
Daniel Mitterdorfer	09cf530f4b	Derive max composite buffers from max content len With this commit we determine the maximum number of buffers that Netty keeps while accumulating one HTTP request based on the maximum content length (default 1500 bytes, overridable with the system property `es.net.mtu`). Previously, we kept the default value of 1024 which is too small for bulk requests which leads to unnecessary copies of byte buffers internally. Relates #29448	2018-05-11 10:01:09 +02:00
Nhat Nguyen	519768b5d3	Upgrade to Lucene-7.4-snapshot-6705632810 (#30519 ) This snapshot is to include LUCENE-8298 which allows DocValues updates to reset a value. This is needed for the Lucene rollback work.	2018-05-10 12:31:45 -04:00
Nik Everett	51fa8739ea	Reindex: Fold "with all deps" project into reindex (#30154 ) This folds the `:qa:smoke-test-reindex-with-all-modules` project into `:modules:reindex` by declaring the reindex's integration testing cluster requires the `parent-join` and `lang-painless` plugins and then moving all of the integration tests that depended on parent-join and painless into reindex. It saves us one cluster start up during the build at the cost of a little of the reindex module's "purity". Since the reindex module does have unit tests that test scripting without painless I'm fairly ok with that.	2018-05-10 08:02:23 -04:00
Nik Everett	b4502dbf74	LLClient: Add setJsonEntity (#30447 ) Adds `Request#setJsonEntity(String)` which short circuits the process of sending a json string which is super common.	2018-05-09 18:33:03 -04:00
Yu	2228e6e663	BulkProcessor to retry based on status code (#29329 ) Previously `BulkProcessor` retry logic was based on the exception type of the failed response (`EsRejectedExecutionException`). This commit changes it to be based on the returned status code. This allows us to reproduce the same retry behaviour when the `BulkProcessor` is used from the high-level REST client, which was previously not the case as we cannot rebuild the same exception type when parsing back the response. This change has no effect on the transport client. Closes #28885	2018-05-09 14:27:58 +02:00
Nik Everett	ef4ecb1f1e	Reindex: Use request flavored methods (#30317 ) Use the new request flavored methods for the low level rest client introduced in #29623 in reindex.	2018-05-07 17:14:38 -04:00
Jim Ferenczi	dbd857341f	Upgrade to 7.4.0-snapshot-1ed95c097b (#30357 ) Upgrade to lucene-7.4.0-snapshot-1ed95c097b This version contains: * An Analyzer for Korean * An IntervalQuery and IntervalsSource that retrieve minimum intervals of positional queries. * A new API to retrieve matches (offsets and positions) of a query for a single document. * Support for soft deletes in the index writer. * A fixed shingle filter that handles index time synonyms. * Support for emoji sequence in ICUTokenizer (with an upgrade to icu 61.1)	2018-05-04 11:44:22 +02:00
Ryan Ernst	fb0aa562a5	Network: Remove http.enabled setting (#29601 ) This commit removes the http.enabled setting. While all real nodes (started with bin/elasticsearch) will always have an http binding, there are many tests that rely on the quickness of not actually needing to bind to 2 ports. For this case, the MockHttpTransport.TestPlugin provides a dummy http transport implementation which is used by default in ESIntegTestCase. closes #12792	2018-05-02 11:42:05 -07:00
Adrien Grand	368ddc408f	Remove MapperService#types(). (#29617 ) This isn't be necessary with a single type per index.	2018-05-02 11:35:12 +02:00
Adrien Grand	231a63fdf8	Remove useless version checks in REST tests. (#30165 ) Many tests are added with a version check so that they do not run against a version that doesn't have the feature yet. Master is 7.0, so all tests that do not run against 6.0+ can be removed and the version check can be removed on all tests that always run on 6.0+.	2018-05-02 11:34:15 +02:00
Nik Everett	0be443c5bb	REST Client: Add Request object flavored methods (#29623 ) Adds two new methods to `RestClient` that take a `Request` object. These methods will allows us to add more per-request customizable options without creating more and more and more overloads of the `performRequest` and `performRequestAsync` methods. These new methods look like: ``` Response performRequest(Request request) ``` and ``` void performRequestAsync(Request request, ResponseListener responseListener) ``` This change doesn't add any actual features but enables adding things like per request timeouts and per request node selectors. This change does rework the `HighLevelRestClient` and its tests to use these new `Request` objects and it does update the docs.	2018-05-01 14:31:23 -04:00
Nik Everett	d12e644206	Build: Log a warning if disabling reindex-from-old (#30304 ) We disable the reindex-from-old tests if we're running on windows or in a directory that contains a space. This adds a warning to the logs when we do that so that you can tell that it happened. This will be nice to have when looking at CI and will be a hint to anyone developing locally.	2018-05-01 11:23:18 -04:00
David Turner	d2ca16b4c7	Suppress reindex-from-old tests if there are spaces in the path	2018-05-01 14:32:13 +01:00
Nik Everett	9c8e015552	Build: Mostly silence warning about html4 javadoc (#30220 ) This mostly silences `javadoc`'s warning about defaulting to generating html4 files by enabling generating html5 file for the projects for which that works. It didn't work in a half dozen projects, about half of which I've fixed in this PR, entirely by replacing `<tt>thing</tt>` with `{@code thing}`. There are a few remaining projects that contain javadoc with invalid html5. I'll fix those projects in a followup.	2018-04-28 09:50:54 -04:00
Nik Everett	8401eac425	Test: Switch painless test to 1 shard We think that #28600 is caused by warnings not being collected during one of the fan out phases of search but we're not 100% sure how this is happening. This commit drops the number of shards used for the test to 1 so there isn't a fan out phase. If this makes the issue go away we'll have more information.	2018-04-27 15:01:42 -04:00
Nik Everett	912fbb2211	Reindex: Fold "from old" tests into reindex module (#30142 ) This folds the `:qa:reindex-from-old` project into the `:modules:reindex` project. This should speed up the build marginally by removing a single clsuter start up at the cost of having to wait for old versions of Elasticsearch to start up when checking reindex's integration tests. Those don't take that long so this feels worth it.	2018-04-27 14:04:37 -04:00
Tanguy Leroux	b15631ee54	[Test] Fix RenameProcessorTests.testRenameExistingFieldNullValue() (#29655 ) This test fails when the new field name already exists in the ingest document.	2018-04-26 17:26:37 +02:00
Christoph Büscher	d0f6657d90	Add tests for ranking evaluation with aliases (#29452 ) The ranking evaluation requests so far were not tested against aliases but they should run regardless of the targeted index is a real index or an alias. This change adds cases for this to the integration and rest tests.	2018-04-19 17:00:52 +02:00
Christoph Büscher	24763d881e	Deprecate use of `htmlStrip` as name for HtmlStripCharFilter (#27429 ) The camel case name `htmlStip` should be removed in favour of `html_strip`, but we need to deprecate it first. This change adds deprecation warnings for indices with version starting with 6.3.0 and logs deprecation warnings in this cases.	2018-04-19 16:48:17 +02:00
Christoph Büscher	7c56cc2624	Make ranking evaluation details accessible for client Allow high level java rest client to access details of the metric calculation by making them accessible across packages. Also renaming the inner `Breakdown` classes of the evaluation metrics to `Detail` to better communicate their use.	2018-04-19 14:39:41 +02:00
Jason Tedor	c12c2a6cc9	Rename the bulk thread pool to write thread pool (#29593 ) This commit renames the bulk thread pool to the write thread pool. This is to better reflect the fact that the underlying thread pool is used to execute any document write request (single-document index/delete/update requests, and bulk requests). With this change, we add support for fallback settings thread_pool.bulk.* which will be supported until 7.0.0. We also add a system property so that the display name of the thread pool remains as "bulk" if needed to avoid breaking users.	2018-04-19 08:18:58 -04:00
Christoph Büscher	fa1052017c	[Test] Minor changes to rank_eval tests (#29577 ) Removing an enum in favour of local constants to simplify tests and removing a few deprecated method calls and warnings.	2018-04-19 13:50:18 +02:00
Martijn van Groningen	8afa7c174f	Added painless execute api. (#29164 ) Added an api that allows to execute an arbitrary script and a result to be returned. ``` POST /_scripts/painless/_execute { "script": { "source": "params.var1 / params.var2", "params": { "var1": 1, "var2": 1 } } } ``` Relates to #27875	2018-04-19 09:33:34 +02:00
Jack Conradson	da9a6899ff	Painless: modify grammar to allow more statement delimiters (#29566 ) This allows the grammar to determine when and what delimiters statements will use by splitting up the statements into regular statements and delimited statements, those that do not require a delimiter versus those that do. This allows consumers of the statements to determine what delimiters the statements will use so that in certain cases semicolons are not necessary like when there's a closing right bracket. This change removes the need for semicolon insertion in the lexer, simplifying the existing lexer quite a bit. It also ensures that there isn't a need to track semicolons being inserted into places that aren't necessary such as array initializers.	2018-04-18 10:32:42 -07:00
Adrien Grand	ebd6b5b7ba	Deprecate filtering on `_type`. (#29468 ) As indices are only allowed to have one type now, and types are going away in the future, we should deprecate filtering by `_type`. Relates #15613	2018-04-13 09:07:51 +02:00
Jim Ferenczi	fb81e2cacf	Fix template _msearch with extra tokens This change removes the check for extra tokens when parsing a source generated by a templated _msearch request. This was added unintentionally in #29428 but the intent of this modification was to validate simple _search request only.	2018-04-11 18:04:10 +02:00
Jim Ferenczi	1b6d5e531b	Fail _search request with trailing tokens (#29428 ) This change validates that the `_search` request does not have trailing tokens after the main object and fails the request with a parsing exception otherwise. Closes #28995	2018-04-11 13:10:22 +02:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows to remove quite some code, which in-turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Adrien Grand	a091d950a7	Deprecate slicing on `_uid`. (#29353 ) Deprecate slicing on `_uid`. `_id` should be used instead on 6.x.	2018-04-10 14:28:30 +02:00
Martijn van Groningen	182cf11f37	Fixed bug when non percolator docs end up in the search hits. In the case that a document with a percolator field is matched when using the `percolate` query then the fetch phase can fail due to the fact that the percolator can't resolve any query from that document. Closes #29429	2018-04-10 13:33:31 +02:00
Martijn van Groningen	2346f7fa89	removed unused import	2018-04-10 07:44:51 +02:00
Martijn van Groningen	f4395c0c94	Fixed a msm accounting error that can occur during analyzing a percolator query. In case of a disjunction query with both range and term based clauses and msm specified, the query analyzer needs to also reduce the msn if a range based clause for the same field is encountered. This did not happen. Instead of fixing this bug the logic has been simplified to just set a percolator query's msm to 1 if a disjunction contains range clauses and msm on disjunction has been specified. The logic would otherwise just get to complex and the performance gain isn't that much for this kind of percolator queries. In case a percolator query has clauses that have duplicate terms or ranges then for disjunction clauses with a minimum should match the query extraction of the clause with the lowest msm should be used and for conjunction queries query extractions wiht duplicate terms/ranges the msn should be ignored. If this is not done then percolator queries that should match never match. Example percolator query: value1 OR value2 OR value2 OR value3 OR value3 OR value3 OR value4 OR value5 (msm set to 3) In the above example query the extracted msm would be 3 Example document1: value1 value2 value3 With the msm and extracted terms this would match and is expected behaviour Example document2: value3 This document should match too (value3 appears in 3 clauses), but with msm set to 3 and the fact that fact that only distinct values are indexed in extracted terms field this document would Also added another random duel test. Closes #29393	2018-04-10 07:25:12 +02:00
Adrien Grand	0f00277851	Simplify analysis of `bool` queries. (#29430 ) This change tries to simplify the extraction logic of boolean queries by concentrating the logic into two methods: one that merges results for conjunctions, and another one for disjunctions. Other concerns, like the impact of prohibited clauses or how an `UnsupportedQueryException` should be treated are applied on top of those two methods. This is mostly a code reorganization, it doesn't change the result of query extraction except in the case that a query both has required clauses and a minimum number of `SHOULD` clauses that is greater than 1, which we now rewrite into a pure conjunction. For instance `(+A B C)~1` is rewritten into `(+A +(B C))` prior to extraction.	2018-04-09 16:34:45 +02:00
Lee Hinman	a93c942927	Move ObjectParser into the x-content lib (#29373 ) * Move ObjectParser into the x-content lib This moves `ObjectParser`, `AbstractObjectParser`, and `ConstructingObjectParser` into the libs/x-content dependency. This decoupling allows them to be used for parsing for projects that don't want to depend on the entire Elasticsearch jar. Relates to #28504	2018-04-06 09:41:14 -06:00
Christoph Büscher	570f1d9ac7	Add indices options support to _rank_eval (#29386 ) Currently the ranking evaluation API doesn't support many of the standard parameters of the search API. Some of these make sense, like adding support for the common indices options parameters, which this change adds.	2018-04-06 16:23:19 +02:00
Tanguy Leroux	143325d858	[Test] Fix RepositoryURLClientYamlTestSuiteIT This commit fixes the test on Windows by normalizing the path as a correct URI. Closes #29399	2018-04-06 13:51:23 +02:00
Adrien Grand	85f5382a3c	Fix more query extraction bugs. (#29388 ) I found the following bugs: - The 6.0 logic for conjunctions didn't work when there were only `match_all` queries in MUST/FILTER clauses as they didn't propagate the `matchAllDocs` flag. - Some queries still had the same issue as `BooleanQuery` used to have with duplicate terms (see #28353), eg. `MultiPhraseQuery`. Closes #29376	2018-04-06 10:44:34 +02:00
Christoph Büscher	231fd4eb18	Remove `delimited_payload_filter` (#27705 ) From 7.0 on, using `delimited_payload_filter` should throw an error. It was deprecated in 6.2 in favour of `delimited_payload` (#26625). Relates to #27704	2018-04-05 18:41:04 +02:00
Alan Woodward	dccd43af47	Upgrade to lucene 7.3.0 (#29387 )	2018-04-05 10:34:44 +01:00
Tanguy Leroux	08abbdf129	Use fixture to test repository-url module (#29355 ) This commit adds a YAML integration test for the repository-url module that uses a fixture to test URL based repositories on both http:// and file:// prefixes.	2018-04-04 15:55:26 +02:00
Adrien Grand	c21057b3a2	Fix QueryAnalyzerTests. Closes #29363	2018-04-04 12:48:42 +02:00
Adrien Grand	c052e989cf	Fix HasChildQueryBuilderTests to not use the `classic` similarity. Closes #29362	2018-04-04 12:48:41 +02:00
Christoph Büscher	c1ae7e834c	Make TransportRankEvalAction members final	2018-04-04 12:06:33 +02:00
Jason Tedor	a19fd5636b	Add awaits fix for a query analyzer test The test QueryAnalyzerTests#testExactMatch_booleanQuery is failing since `8cdd950056`. This commit adds an awaits fix for it until it can be addressed.	2018-04-04 05:40:13 -04:00
Jason Tedor	4b1ed20a67	Add awaits fix for HasChildQueryBuilderTests These tests are failing since `569d0c0e89`. This commit adds an awaits fix for them until they can be addressed.	2018-04-03 23:18:51 -04:00
Adrien Grand	569d0c0e89	Improve similarity integration. (#29187 ) This improves the way similarities are plugged in in order to: - reject the classic similarity on 7.x indices and emit a deprecation warning otherwise - reject unkwown parameters on 7.x indices and emit a deprecation warning otherwise Even though this breaks the plugin API, I'd like to backport to 7.x so that users can get deprecation warnings when they are doing something that will become unsupported in the future. Closes #23208 Closes #29035	2018-04-03 16:45:25 +02:00
Adrien Grand	8cdd950056	Fix some query extraction bugs. (#29283 ) While playing with the percolator I found two bugs: - Sometimes we set a min_should_match that is greater than the number of extractions. While this doesn't cause direct trouble, it does when the query is nested into a boolean query and the boolean query tries to compute the min_should_match for the entire query based on its own min_should_match and those of the sub queries. So I changed the code to throw an exception when min_should_match is greater than the number of extractions. - Boolean queries claim matches are verified when in fact they shouldn't. This is due to the fact that boolean queries assume that they are verified if all sub clauses are verified but things are more complex than that, eg. conjunctions that are nested in a disjunction or disjunctions that are nested in a conjunction can generally not be verified without running the query.	2018-04-03 16:44:26 +02:00
Christoph Büscher	2b07f63bd5	Fix NDCG for empty search results (#29267 ) Fixes and edge case where DiscountedCumulativeGain can return NaN as result of the quality metric calculation. This can happen when the search result set is empty and normalization is used. We should return 0 in this case. Also adding related unit tests to the other two metrics.	2018-04-03 11:15:44 +02:00
Adrien Grand	3bdfc8f3fb	Upgrade to lucene-7.3.0-snapshot-98a6b3d. (#29298 ) Most notable changes include: - this release doesn't have the 7.2.1 version constant so I had to create one - spatial4j and jts were upgraded	2018-04-03 09:27:14 +02:00
Jack Conradson	782e41a67e	Painless: Remove extraneous INLINE constant. (#29340 )	2018-04-02 21:34:01 -07:00
Jason Tedor	1df43a09b7	Remove HTTP max content length leniency (#29337 ) I am not sure why we have this leniency for HTTP max content length, it has been there since the beginning (`5ac51ee93f`) with no explanation of its source. That said, our philosophy today is different than the philosophy of the past where Elasticsearch would be quite lenient in its handling of settings and today we aim for predictability for both users and us. This commit removes leniency in the parsing of http.max_content_length.	2018-04-02 20:20:01 -04:00
Lee Hinman	6b2167f462	Begin moving XContent to a separate lib/artifact (#29300 ) * Begin moving XContent to a separate lib/artifact This commit moves a large portion of the XContent code from the `server` project to the `libs/xcontent` project. For the pieces that have been moved, some helpers have been duplicated to allow them to be decoupled from ES helper classes. In addition, `Booleans` and `CheckedFunction` have been moved to the `elasticsearch-core` project. This decoupling is a move so that we can eventually make things like the high-level REST client not rely on the entire ES jar, only the parts it needs. There are some pieces that are still not decoupled, in particular some of the XContent tests still remain in the server project, this is because they test a large portion of the pluggable xcontent pieces through `XContentElasticsearchException`. They may be decoupled in future work. Additionally, there may be more piecese that we want to move to the xcontent lib in the future that are not part of this PR, this is a starting point. Relates to #28504	2018-04-02 15:58:31 -06:00
Jason Tedor	8967dbf4c6	Increase timeout on Netty client latch for tests We use a latch when sending requests during tests so that we do not hang forever waiting for replies on those requests. This commit increases the timeout on that latch to 30 seconds because sometimes 10 seconds is just not enough.	2018-03-29 18:33:35 -04:00
Jason Tedor	4ef3de40bc	Fix handling of bad requests (#29249 ) Today we have a few problems with how we handle bad requests: - handling requests with bad encoding - handling requests with invalid value for filter_path/pretty/human - handling requests with a garbage Content-Type header There are two problems: - in every case, we give an empty response to the client - in most cases, we leak the byte buffer backing the request! These problems are caused by a broader problem: poor handling preparing the request for handling, or the channel to write to when the response is ready. This commit addresses these issues by taking a unified approach to all of them that ensures that: - we respond to the client with the exception that blew us up - we do not leak the byte buffer backing the request	2018-03-28 16:25:01 -04:00
Jim Ferenczi	2aaa057387	Propagate ignore_unmapped to inner_hits (#29261 ) In 5.2 `ignore_unmapped` was added to `inner_hits` in order to ignore invalid mapping. This value was automatically set to the value defined in the parent query (`nested`, `has_child`, `has_parent`) but the refactoring of the parent/child in 5.6 removed this behavior unintentionally. This commit restores this behavior but also makes sure that we always automatically enforce this value when the query builder is used directly (previously this was only done by the XContent deserialization). Closes #29071	2018-03-27 18:55:42 +02:00
Christoph Büscher	e4b30071bb	RankEvalRequest should implement IndicesRequest (#29188 ) Change RankEvalRequest to implement IndicesRequest, so it gets treated in a similar fashion to regular search requests e.g. by security.	2018-03-22 11:58:55 +01:00
Lee Hinman	b4af451ec5	Remove BytesArray and BytesReference usage from XContentFactory (#29151 ) * Remove BytesArray and BytesReference usage from XContentFactory This removes the usage of `BytesArray` and `BytesReference` from `XContentFactory`. Instead, a regular `byte[]` should be passed. To assist with this a helper has been added to `XContentHelper` that will preserve the offset and length from the underlying BytesReference. This is part of ongoing work to separate the XContent parts from ES so they can be factored into their own jar. Relates to #28504	2018-03-20 11:52:26 -06:00
Christoph Büscher	80532229a9	Move indices field from RankEvalSpec to RankEvalRequest (#28341 ) Currently we store the indices specified in the request URL together with all the other ranking evaluation specification in RankEvalSpec. This is not ideal since e.g. the indices are not rendered to xContent and so cannot be parsed back. Instead we should keep them in RankEvalRequest.	2018-03-19 16:26:02 +01:00
Jason Tedor	6bf742dd1b	Fix EsAbortPolicy to conform to API (#29075 ) The rejected execution handler API says that rejectedExecution(Runnable, ThreadPoolExecutor) throws a RejectedExecutionException if the task must be rejected due to capacity on the executor. We do throw something that smells like a RejectedExecutionException (it is named EsRejectedExecutionException) yet we violate the API because EsRejectedExecutionException is not a RejectedExecutionException. This has caused problems before where we try to catch RejectedExecution when invoking rejectedExecution but this causes EsRejectedExecutionException to go uncaught. This commit addresses this by modifying EsRejectedExecutionException to extend RejectedExecutionException.	2018-03-16 14:34:36 -04:00
Martijn van Groningen	069a876542	Added minimal docs for reindex api in java-api docs Additionally: * Included the existing update by query java api docs in java-api docs. (for some reason it was never included, it needed some tweaking and then it was good to go) * moved delete-by-query / update-by-query code samples to java file so that we can verify that these samples at least compile. Closes #24203	2018-03-16 07:42:48 +01:00
Lee Hinman	8e8fdc4f0e	Decouple XContentBuilder from BytesReference (#28972 ) * Decouple XContentBuilder from BytesReference This commit removes all mentions of `BytesReference` from `XContentBuilder`. This is needed so that we can completely decouple the XContent code and move it into its own dependency. While this change appears large, it is due to two main changes, moving `.bytes()` and `.string()` out of XContentBuilder itself into static methods `BytesReference.bytes` and `Strings.toString` respectively. The rest of the change is code reacting to these changes (the majority of it in tests). Relates to #28504	2018-03-14 13:47:57 -06:00
Jack Conradson	42fe66162e	Fix Parsing Bug with Update By Query for Stored Scripts (#29039 ) This changes the parsing logic for stored scripts in update by query to match the parsing logic for scripts in general Elasticsearch. Closes #28002	2018-03-14 07:12:15 -07:00
Robin Neatherway	6dadce4761	Painless: Correct ClassToName string conversion (#28997 ) A typo of 'dimensions' rather than 'dimension' caused an infinite loop.	2018-03-13 13:16:48 -07:00
Jason Tedor	5904d936fa	Copy Lucene IOUtils (#29012 ) As we have factored Elasticsearch into smaller libraries, we have ended up in a situation that some of the dependencies of Elasticsearch are not available to code that depends on these smaller libraries but not server Elasticsearch. This is a good thing, this was one of the goals of separating Elasticsearch into smaller libraries, to shed some of the dependencies from other components of the system. However, this now means that simple utility methods from Lucene that we rely on are no longer available everywhere. This commit copies IOUtils (with some small formatting changes for our codebase) into the fold so that other components of the system can rely on these methods where they no longer depend on Lucene.	2018-03-13 12:49:33 -04:00
Martijn van Groningen	beb22d89c8	percolator: Take `matchAllDocs` and `verified` of the sub result into account when analyzing a function_score query. Before the `matchAllDocs` was ignored and this could lead to percolator queries not matching when the inner query was a match_all query and min_score was specified. Before when `verified` was not taken into account if the function_score query wrapped an unverified query this could lead to matching percolator queries that shouldn't match at all.	2018-03-09 07:16:21 +01:00
Lee Hinman	46a79127ed	Remove FastStringReader in favor of vanilla StringReader (#28944 ) This allows us to remove another dependency in the decoupling of the XContent code. Rather than move this class over or decouple it, it can simply be removed. Relates tangentially to #28504	2018-03-08 17:17:36 -07:00
Tal Levy	7784c1bff9	Continue registering pipelines after one pipeline parse failure. (#28752 ) Ingest has been failing to apply existing pipelines from cluster-state into the in-memory representation that are no longer valid. One example of this is a pipeline with a script processor. If a cluster starts up with scripting disabled, these pipelines will not be loaded. Even though GETing a pipeline worked, indexing operations claimed that this pipeline did not exist. This is because one gets information from cluster-state and the other is from an in-memory data-structure. Now, two things happen 1. suppress the exceptions until after other successful pipelines are loaded 2. replace failed pipelines with a placeholder pipeline If the pipeline execution service encounters the stubbed pipeline, it is known that something went wrong at the time of pipeline creation and an exception was thrown to the user at some point at start-up. closes #28269.	2018-03-08 15:22:59 -08:00
Martijn van Groningen	bcfb7ab591	Improved percolator's random candidate query duel test and fixed bugs that were exposed by this: * Duplicates query leafs were not detected in a multi level boolean query * Tracking fields for numeric range queries did not work properly. * The sorting that was used to find the less restrictive clauses in disjunction query did not work too.	2018-03-08 11:39:03 +01:00
Lee Hinman	818920a281	Decouple XContentType from StreamInput/Output (#28927 ) This removes the readFrom and writeTo methods from XContentType, instead using the more generic `readEnum` and `writeEnum` methods. Luckily they are both encoded exactly the same way, so there is no compatibility layer needed for backwards compatibility. Relates to #28504	2018-03-07 14:50:30 -07:00
Lee Hinman	e7d1e12675	Wrap stream passed to createParser in try-with-resources (#28897 ) * Wrap stream passed to createParser in try-with-resources This wraps the stream (`.streamInput()`) that is passed to many of the `createParser` instances in the enclosing (or a new) try-with-resources block. This ensures the `BytesReference.streamInput()` is closed. Relates to #28504 * Use try-with-resources instead of closing in a finally block	2018-03-04 16:48:03 -07:00
Luca Cavanna	1df711c5b7	Remove AcknowledgedRestListener in favour of RestToXContentListener (#28724 ) This commit makes AcknowledgedResponse implement ToXContentObject, so that the response knows how to print its own content out to XContent, which allows us to remove AcknowledgedRestListener.	2018-02-22 09:13:30 +01:00
Lee Hinman	d7eae4b90f	Pass InputStream when creating XContent parser (#28754 ) * Pass InputStream when creating XContent parser Rather than passing the raw `BytesReference` in when creating the xcontent parser, this passes the StreamInput (which is an InputStream), this allows us to decouple XContent from BytesReference. This also removes the use of `commons.Booleans` so it doesn't require more external commons classes. Related to #28504 * Undo boolean removal * Enhance deprecation javadoc	2018-02-21 11:03:25 -07:00
Martijn van Groningen	793cbc651a	Moved Grok helper code to a separate Gradle module and let ingest-common module depend on it.	2018-02-21 11:18:08 +01:00
Yu	7d8fb69d50	version set in ingest pipeline (#27573 ) Add support version and version_type in ingest pipelines Add support for setting document version and version type in set processor of an ingest pipeline.	2018-02-21 09:34:51 +01:00
Lee Hinman	d4fddfa2a0	Remove log4j dependency from elasticsearch-core (#28705 ) * Remove log4j dependency from elasticsearch-core This removes the log4j dependency from our elasticsearch-core project. It was originally necessary only for our jar classpath checking. It is now replaced by a `Consumer<String>` so that the es-core dependency doesn't have external dependencies. The parts of #28191 which were moved in conjunction (like `ESLoggerFactory` and `Loggers`) have been moved back where appropriate, since they are not required in the core jar. This is tangentially related to #28504 * Add javadocs for `output` parameter * Change @code to @link	2018-02-20 09:15:54 -07:00
Martijn van Groningen	9c405e8595	made load method private and add another static getter that users of Grok can use to get the builtin patterns.	2018-02-20 08:09:24 +01:00
Martijn van Groningen	3fad16e76c	renamed module	2018-02-20 08:02:02 +01:00

... 3 4 5 6 7 ...

5004 Commits