OpenSearch

Commit Graph

Author	SHA1	Message	Date
Adrien Grand	459916f5dd	Remove custom Base64 implementation. #18413 This replaces o.e.common.Base64 with java.util.Base64.	2016-05-23 11:32:42 +02:00
Tanguy Leroux	e7eb664c78	Change BlobPath.buildAsString() method	2016-05-23 10:50:40 +02:00
Ryan Ernst	37d36f2f4c	Merge branch 'master' into java9	2016-05-21 14:19:58 -07:00
Ryan Ernst	1d40c4bbc1	Make java9 work again This change makes ES compile with java9 again, build 118. * There are a handful of changes due to failure to determine types during compile. * The attachment plugins which use tika needed to have tika upgraded in order to pickup fixes there for java 9. * azure discovery and s3 repository indirectly depend on jaxb, which is no longer in the default modules. They now add a jaxb dependency externally, and make JarHell allow for this package.	2016-05-21 09:41:51 -07:00
javanna	63c5b31449	update shas for httpclient and httpcore	2016-05-20 14:10:55 +02:00
Tanguy Leroux	8486488627	Disable DeleteByQueryRestIT delete_by_query/10_basic/Basic delete_by_query Because of a REST test namespace conflict introduced by 18329. Issue tracked in 18469	2016-05-19 18:44:53 +02:00
Tanguy Leroux	35d3bdab84	Add Google Cloud Storage repository plugin Closes #12880	2016-05-19 13:26:23 +02:00
Jason Tedor	6e3b49c522	Fix inequality symbol in test assertion This commit fixes the inequality symbol used in a test assertion in RepositoryS3SettingsTests#testInvalidChunkBufferSizeRepositorySettings. The inequality symbol was previously backwards but fixed in commit `cad0608cdb` but fixing the inequality symbol here was missed in that commit. Closes #18449	2016-05-18 12:14:37 -04:00
Jason Tedor	db4809d906	Remove last vestigates of /bin/sh shebangs This commit removes the remaining /bin/sh shebangs in favor of /bin/bash. Relates #18448	2016-05-18 11:03:00 -04:00
$polyfractal$ polyfractal	72094feb12	[TEST] Add missing sort processor to tests, continued	2016-05-17 16:39:53 -04:00
Robert Muir	8d4c1befe5	Merge pull request #18364 from rmuir/nukeRunAsFloat Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-16 17:08:25 -04:00
Adrien Grand	864ed04059	Lessen leniency of the query dsl. #18276 This change does the following: - Queries that are currently unsupported such as prefix queries on numeric fields or term queries on geo fields now throw an error rather than returning a query that does not match anything. - Fuzzy queries on numeric, date and ip fields are now unsupported: they used to create range queries, we now expect users to use range queries directly. Fuzzy, regexp and prefix queries are now only supported on text/keyword fields (including `_all`). - The `_uid` and `_id` fields do not support prefix or range queries anymore as it would prevent us to store them more efficiently in the future, eg. by using a binary encoding. Note that it is still possible to ignore these errors by using the `lenient` option of the `match` or `query_string` queries.	2016-05-16 17:37:00 +02:00
Robert Muir	8edf213492	Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-15 22:59:28 -04:00
Robert Muir	2028691e66	painless: improve exception stacktraces closes #18319	2016-05-13 15:40:45 -04:00
Lee Hinman	9bcdafedda	Allow only a single extension for a scripting engine Previously multiple extensions could be provided, however, this can lead to confusion with on-disk scripts (ie, "foo.js" and "foo.javascript") having different content. Only a single extension is now supported. The only language currently supporting multiple extensions was the Javascript engine ("js" and "javascript"). It now only supports the `.js` extension. Relates to #10598	2016-05-13 09:54:31 -06:00
Lee Hinman	efff3918d8	Remove support for mulitple languages per scripting engine	2016-05-13 09:24:31 -06:00
Lee Hinman	a4060f7436	Remove vestiges of script engine sandboxing This removes all the mentions of the sandbox from the script engine services and permissions model. This means that the following settings are no longer supported: ```yaml script.inline: sandbox script.stored: sandbox ``` Instead, only a `true` or `false` value can be specified. Since this would otherwise break the default-allow parameter for languages like expressions, painless, and mustache, all script engines have been updated to have individual settings, for instance: ```yaml script.engine.groovy.inline: true ``` Would enable all inline scripts for groovy. (they can still be overridden on a per-operation basis). Expressions, Painless, and Mustache all default to `true` for inline, file, and stored scripts to preserve the old scripting behavior. Resolves #17114	2016-05-13 09:24:31 -06:00
John Barker	531dcbf20a	Add TAG_SETTING to list of allowed tags for the ec2 discovery plugin. I am unable to set ec2 discovery tags because this setting was accidentally omitted from the register settings list in Ec2DiscoveryPlugin.java. I get this: java.lang.IllegalArgumentException: unknown setting [discovery.ec2.tag.project]	2016-05-10 16:19:46 -04:00
David Pilato	e8ddf5de2f	Merge branch 'pr/hide-s3-repositories-credentials'	2016-05-10 20:22:39 +02:00
Adrien Grand	f481492af3	Remove FieldMapper.Builder.indexName. #18219 The ability to configure index names that are different from the full name was removed in 2.0.	2016-05-10 08:17:00 +02:00
Adrien Grand	5d8f684319	Mapping cleanups. #18180 This removes dead/duplicate code and makes the `_index` field not configurable. (Configuration used to jus be ignored, now we would throw an exception if any is provided.)	2016-05-10 08:14:18 +02:00
Chris Earle	5be79ed02c	Add Failure Details to every NodesResponse Most of the current implementations of BaseNodesResponse (plural Nodes) ignore FailedNodeExceptions. - This adds a helper function to do the grouping to TransportNodesAction - Requires a non-null array of FailedNodeExceptions within the BaseNodesResponse constructor - Reads/writes the array to output - Also adds StreamInput and StreamOutput methods for generically reading and writing arrays	2016-05-06 14:59:43 -04:00
Adrien Grand	7d8708716e	QueryBuilder does not need generics. #18133 QueryBuilder has generics, but those are never used: all call sites use `QueryBuilder<?>`. Only `AbstractQueryBuilder` needs generics so that the base class can contain a default implementation for setters that returns `this`.	2016-05-06 08:38:20 +02:00
Jason Tedor	784c9e5fb9	Introduce node handshake This commit introduces a handshake when initiating a light connection. During this handshake, node information, cluster name, and version are received from the target node of the connection. This information can be used to immediately validate that the target node is a member of the same cluster, and used to set the version on the stream. This will allow us to extend APIs that are used during initial cluster recovery without a major version change. Relates #15971	2016-05-04 20:06:47 -04:00
Jason Tedor	2dea449949	Remove Strings#splitStringToArray This commit removes the method Strings#splitStringToArray and replaces the call sites with invocations to String#split. There are only two explanations for the existence of this method. The first is that String#split is slightly tricky in that it accepts a regular expression rather than a character to split on. This means that if s is a string, s.split(".") does not split on the character '.', but rather splits on the regular expression '.' which splits on every character (of course, this is easily fixed by invoking s.split("\\.") instead). The second possible explanation is that (again) String#split accepts a regular expression. This means that there could be a performance concern compared to just splitting on a single character. However, it turns out that String#split has a fast path for the case of splitting on a single character and microbenchmarks show that String#split has 1.5x--2x the throughput of Strings#splitStringToArray. There is a slight behavior difference between Strings#splitStringToArray and String#split: namely, the former would return an empty array in cases when the input string was null or empty but String#split will just NPE at the call site on null and return a one-element array containing the empty string when the input string is empty. There was only one place relying on this behavior and the call site has been modified accordingly.	2016-05-04 08:12:41 -04:00
Martijn van Groningen	7aca1389e2	ingest: Add `date_index_name` processor. Closes #17814	2016-04-29 17:20:48 +02:00
Tal Levy	07c2fbf83a	Validate properties values according to database type (#17940 ) Fixes #17683.	2016-04-29 07:58:27 -07:00
Yannick Welsch	37382ecfb2	Add Azure discovery tests mocking Azure management endpoint (#18004 )	2016-04-29 15:54:15 +02:00
David Pilato	6c7a44ccd9	Fix test in mapper attachments plugin	2016-04-29 15:02:04 +02:00
David Pilato	2636703afa	Merge branch 'master' into pr/attachments-add-test-forced-values	2016-04-29 14:55:42 +02:00
David Pilato	6ef81c5dcd	S3 repositories credentials should be filtered When working on #18008 I found while reading the code that we don't filter anymore `repositories.s3.access_key` and `repositories.s3.secret_key`. Also fixed a typo in REST test	2016-04-27 14:11:17 +02:00
Alexander Reelsen	f71eb0b888	Version: Set version to 5.0.0-alpha2	2016-04-26 09:30:26 +02:00
Xu Zhang	3e4b470f83	Fix icu IndexScope setting	2016-04-22 15:03:02 -07:00
Ryan Ernst	d12a4bb51d	Merge pull request #17933 from rjernst/camelcase4 Remove camelCase support	2016-04-22 13:46:43 -07:00
xuzha	cd527c5b92	Add support for customizing the rule file in ICU tokenizer Lucene allows to create a ICUTokenizer with a special config argument enabling the customization of the rule based iterator by providing custom rules files. This commit enable this feature. Users could provide a list of RBBI rule files to ICU tokenizer. closes #13146	2016-04-22 12:39:20 -07:00
Ryan Ernst	55388590c1	Remove camelCase support Now that the current uses of magical camelCase support have been deprecated, we can remove these in master (sans remaining issues like BulkRequest). This change removes camel case support from ParseField, query types, analysis, and settings lookup. see #8988	2016-04-22 09:18:10 -07:00
Martijn van Groningen	c5ad2e2865	Changed indexed scripts to be stored in the cluster state instead of the `.scripts` index. Also added max script size soft limit for stored scripts. Closes #16651	2016-04-22 13:42:55 +02:00
Martijn van Groningen	dd2184ab25	ingest: Streamline option naming for several processors: * `rename` processor, renamed `to` to `target_field` * `date` processor, renamed `match_field` to `field` and renamed `match_formats` to `formats` * `geoip` processor, renamed `source_field` to `field` and renamed `fields` to `properties` * `attachment` processor, renamed `source_field` to `field` and renamed `fields` to `properties` Closes #17835	2016-04-21 13:40:43 +02:00
Jun Ohtani	9eb242a5fe	Analyze API : Rename filters/token_filters/char_filter to filter/token_filter/char_filter Closes #15189	2016-04-21 18:05:11 +09:00
Ryan Ernst	523b071836	Internal: Remove XContentBuilderString This was previously used by xcontentbuilder to support camelCase. However, it is no longer used, and can be replaced with just String.	2016-04-18 14:32:18 -07:00
Nik Everett	ff9b28d806	Deprecate remaining readXYZ\|writeXYZ methods	2016-04-18 16:19:45 -04:00
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. - Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Yannick Welsch	80cf9fc761	Add EC2 discovery tests to check permissions of AWS Java SDK (#17677 )	2016-04-13 10:01:49 +02:00
Adrien Grand	3bf6f4076c	Do not set analyzers on numeric fields. When it comes to query parsing, either a field is tokenized and it would go through analysis with its search_analyzer. Or it is not tokenized and the raw string should be passed to termQuery(). Since numeric fields are not tokenized and also declare a search analyzer, values would currently go through analysis twice...	2016-04-12 17:47:29 +02:00
Adrien Grand	013acf9179	Remove MappedFieldType.value. #17557 This commit removes `MappedFieldType.value` and simplifies `MappedFieldType.valueforSearch`. `valueforSearch` was used to post-process values that come for stored fields (eg. to convert a long back to a string representation of a date in the case of a date field) and also values that are extracted from the source but only in the case of GET calls: it would not be called when performing source filtering on search requests. `valueforSearch` is now only called for stored fields, since values that are extracted from the source should already be formatted as expected.	2016-04-12 09:12:56 +02:00
Adrien Grand	496c7fbd84	Upgrade Lucene 6 Release * upgrades numerics to new Point format * updates geo api changes * adds GeoPointDistanceRangeQuery as XGeoPointDistanceRangeQuery * cuts over to ES GeoHashUtils	2016-04-11 16:50:04 -05:00
Yannick Welsch	b08d453a0a	Fix EC2 Discovery settings (#17651 ) Fixes two bugs introduced by the settings refactoring in #16602	2016-04-11 16:17:55 +02:00
Alexander Reelsen	da19ddf3e6	Ingest Attachment: Allow to prevent base64 conversions by using raw bytes (#16601 ) CBOR is natively supported in Elasticsearch and allows for byte arrays. This means, that by using CBOR the user can prevent base64 conversions for the data being sent back and forth. This PR adds support to extract data from a byte array in addition to a string. This also required to add a ByteArrayValueSource class.	2016-04-11 14:14:56 +02:00
Adrien Grand	42526ac28e	Remove Settings.settingsBuilder. We have both `Settings.settingsBuilder` and `Settings.builder` that do exactly the same thing, so we should keep only one. I kept `Settings.builder` since it has my preference but also it is the one that we use in examples of the Java API.	2016-04-08 18:10:02 +02:00
David Pilato	c6b1beb083	Add a test for forced values in mapper-attachments plugin This PR just adds a new test where we check that we forcing a value in the JSON document actually works as expected: ```json { "file": { "_content": "BASE64" "_name": "12-240.pdf", "_language": "en", "_content_type": "pdf" } } ``` Note that we don't support forcing all values. So sending: ```json { "file": { "_content": "BASE64" "_name": "12-240.pdf", "_title": "12-240.pdf", "_keywords": "Div42 Src580 LGE Mechtech", "_language": "en", "_content_type": "pdf" } } ``` Will have absolutely no effect on fields `title` and `keywords`. Note that when `_language` is set, it only works if `index.mapping.attachment.detect_language` is set to `true`. Related to https://discuss.elastic.co/t/mapper-attachments/46615/4	2016-04-08 10:07:21 +02:00

1 2 3 4 5 ...

1328 Commits