IngestDocument now holds an additional map of transient metadata. The only field that gets added automatically is `timestamp`, which contains the time of ingestion in ISO8601 format. In the future it will be possible to add or modify these fields; they will not get indexed, but they will be available via templates to all of the processors.
Transient metadata will be displayed by the simulate API, although it will never get indexed. Moved WriteableIngestDocument to the simulate package, as it's only used by simulate and is now modelled for that specific use case.
Also took the chance to remove an IngestDocument constructor that was used only for testing (it accepted only a subset of the es metadata fields). While doing that, introduced some more randomization into some existing processor tests.
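The idea in a minimal sketch (the map and field names other than `timestamp` are illustrative, not the actual IngestDocument internals):
```java
import java.time.ZonedDateTime;
import java.time.format.DateTimeFormatter;
import java.util.HashMap;
import java.util.Map;

Map<String, Object> source = new HashMap<>();          // the document itself, gets indexed
Map<String, Object> ingestMetadata = new HashMap<>();  // transient metadata, never indexed
// the only field that is added automatically: the ingestion timestamp in ISO8601 format
ingestMetadata.put("timestamp",
        DateTimeFormatter.ISO_OFFSET_DATE_TIME.format(ZonedDateTime.now()));
```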
Closes #15036
Throw exception if a copy_to is within a multi field
`copy_to` within a multi field is ignored from 2.0 on, see #10802.
Instead of just ignoring it, we should throw an exception if it is
found in the mapping when a mapping is added. For already
existing indices we should at least log a warning.
We remove the `copy_to` in either case.
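A rough sketch of the intended behaviour (hypothetical helper, not the actual mapper code):
```java
import java.util.List;
import java.util.logging.Logger;

final class CopyToValidator {
    private static final Logger LOGGER = Logger.getLogger(CopyToValidator.class.getName());

    // new mappings with a copy_to inside a multi field are rejected outright;
    // already existing indices only get a warning, and the copy_to is dropped
    static void checkCopyToInMultiField(String field, List<String> copyToFields, boolean existingIndex) {
        if (copyToFields.isEmpty()) {
            return;
        }
        String message = "copy_to in multi fields is not allowed, found one in field [" + field + "]";
        if (existingIndex) {
            LOGGER.warning(message);
        } else {
            throw new IllegalArgumentException(message);
        }
    }
}
```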
Relates to #14946
This commit adds the infrastructure to make settings that are updateable
resettable, and changes the application of updates to be transactional. This means
setting updates are either applied or not: if the application fails, all values are rejected.
This initial commit converts all dynamic cluster settings to make use of the new infrastructure.
All cluster level dynamic settings are now resettable to their defaults or to the node level settings.
The infrastructure also allows listing default values and descriptions, which is not fully implemented yet.
Values can be reset using a list of keys or simple regular expressions. This has only been implemented on the Java
layer so far. For instance, to reset all recovery settings to their defaults a user can just specify `indices.recovery.*`.
This commit also adds strict settings validation: if a setting is unknown or can not be applied, the entire
settings update request will fail.
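The transactional semantics boil down to something like this sketch (hypothetical helpers, not the actual ClusterSettings API):
```java
import java.util.Map;
import java.util.function.BiConsumer;
import java.util.function.Predicate;

final class TransactionalSettingsApplier {
    private final Predicate<String> knownSetting;        // hypothetical registry lookup
    private final BiConsumer<String, String> validator;  // throws if a value can't be applied
    private final BiConsumer<String, String> applier;    // actually applies a value

    TransactionalSettingsApplier(Predicate<String> knownSetting,
                                 BiConsumer<String, String> validator,
                                 BiConsumer<String, String> applier) {
        this.knownSetting = knownSetting;
        this.validator = validator;
        this.applier = applier;
    }

    void apply(Map<String, String> updates) {
        // phase 1: strict validation of every key and value up front; an unknown
        // or inapplicable setting rejects the entire request
        for (Map.Entry<String, String> entry : updates.entrySet()) {
            if (!knownSetting.test(entry.getKey())) {
                throw new IllegalArgumentException("unknown setting [" + entry.getKey() + "]");
            }
            validator.accept(entry.getKey(), entry.getValue());
        }
        // phase 2: apply only after everything validated, so updates are all-or-nothing
        updates.forEach(applier);
    }
}
```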
When using S3 or EC2, it was possible to use a proxy to access the EC2 or S3 API, but a username and password for the proxy could not be set.
This commit adds support for that. Also, to make it all consistent, the proxy settings for both plugins have been renamed:
* from `cloud.aws.proxy_host` to `cloud.aws.proxy.host`
* from `cloud.aws.ec2.proxy_host` to `cloud.aws.ec2.proxy.host`
* from `cloud.aws.s3.proxy_host` to `cloud.aws.s3.proxy.host`
* from `cloud.aws.proxy_port` to `cloud.aws.proxy.port`
* from `cloud.aws.ec2.proxy_port` to `cloud.aws.ec2.proxy.port`
* from `cloud.aws.s3.proxy_port` to `cloud.aws.s3.proxy.port`
New settings are `proxy.username` and `proxy.password`.
```yml
cloud:
    aws:
        protocol: https
        proxy:
            host: proxy1.company.com
            port: 8083
            username: myself
            password: theBestPasswordEver!
```
You can also set different proxies for `ec2` and `s3`:
```yml
cloud:
    aws:
        s3:
            proxy:
                host: proxy1.company.com
                port: 8083
                username: myself1
                password: theBestPasswordEver1!
        ec2:
            proxy:
                host: proxy2.company.com
                port: 8083
                username: myself2
                password: theBestPasswordEver2!
```
Note that `password` is filtered with `SettingsFilter`.
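For reference, these settings roughly map onto the AWS SDK's `ClientConfiguration` (the setters exist in the SDK; the wiring shown here is a sketch, not the plugin's exact code):
```java
import com.amazonaws.ClientConfiguration;
import com.amazonaws.Protocol;

ClientConfiguration configuration = new ClientConfiguration();
configuration.setProtocol(Protocol.HTTPS);               // cloud.aws.protocol
configuration.setProxyHost("proxy1.company.com");        // cloud.aws.proxy.host
configuration.setProxyPort(8083);                        // cloud.aws.proxy.port
configuration.setProxyUsername("myself");                // cloud.aws.proxy.username
configuration.setProxyPassword("theBestPasswordEver!");  // cloud.aws.proxy.password
```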
We also fix a potential issue in the S3 repository: we were supposed to accept a key/secret pair set either under `cloud.aws` or `cloud.aws.s3`, but the actual code never implemented that.
It was:
```java
account = settings.get("cloud.aws.access_key");
key = settings.get("cloud.aws.secret_key");
```
We replaced it with:
```java
String account = settings.get(CLOUD_S3.KEY, settings.get(CLOUD_AWS.KEY));
String key = settings.get(CLOUD_S3.SECRET, settings.get(CLOUD_AWS.SECRET));
```
Also, we extract all settings for S3 in the `AwsS3Service`, as is already the case for the `AwsEc2Service` class.
Closes #15268.
Since 2.2 we run all scripts with minimal privileges, similar to applets in your browser.
The problem is, they have unrestricted access to other things they can muck with (ES, JDK, whatever),
so they can still easily do tons of bad things.
This PR restricts what classes scripts can load via the classloader mechanism, to make life more difficult.
The "standard" list was populated from the old list used for the groovy sandbox, though
a few more classes were needed for tests to pass (java.lang.String, java.util.Iterator, nothing scary there).
Additionally, each scripting engine typically needs permissions to some runtime stuff.
That is the downside of this "good old classloader" approach, but I like the transparency and simplicity,
and I don't want to waste my time with any feature provided by the engine itself for this; I don't trust them.
This is not perfect and the engines are not perfect, but you gotta start somewhere. For expert users that
need to tweak the permissions, we already support that via the standard Java security configuration files; the
specification is simple, supports wildcards, etc. (though we do not use them ourselves).
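The mechanism boils down to something like the following sketch (illustrative, not the actual engine wiring):
```java
import java.util.Set;

// a classloader that only resolves classes on an explicit whitelist
// and rejects everything else
final class ScriptClassLoader extends ClassLoader {
    private final Set<String> allowedClasses;

    ScriptClassLoader(ClassLoader parent, Set<String> allowedClasses) {
        super(parent);
        this.allowedClasses = allowedClasses;
    }

    @Override
    protected Class<?> loadClass(String name, boolean resolve) throws ClassNotFoundException {
        if (!allowedClasses.contains(name)) {
            throw new ClassNotFoundException("class [" + name + "] is not whitelisted for scripts");
        }
        return super.loadClass(name, resolve);
    }
}
```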
The Azure team released new versions of their Java SDK.
According to https://github.com/Azure/azure-sdk-for-java/wiki/Azure-SDK-for-Java-Features, it comes in two versions.
We should at least update to `0.9.0` of V1, but also consider moving to the new APIs (V2).
This commit first updates to the latest V1 API:
```xml
<dependency>
    <groupId>com.microsoft.azure</groupId>
    <artifactId>azure-svc-mgmt-compute</artifactId>
    <version>0.9.0</version>
</dependency>
```
Closes #15209
Failures to merge a mapping can come either as a MergeMappingException, if they
come from Mapper.merge, or as an IllegalArgumentException, if they come from
FieldTypeLookup.checkCompatibility. I think we should settle on one: this pull
request replaces all usage of MergeMappingException with
IllegalArgumentException.
The PipelineTests tried to verify that the map/list configured in the set processor wasn't modified while documents were ingested. Creating a pipeline programmatically created more noise than the test needed. The new tests in IngestDocumentTests have the same goal, but are much smaller and clearer, testing directly against IngestDocument.
This commit removes and now forbids all uses of the type-unsafe empty
Collections fields Collections#EMPTY_LIST, Collections#EMPTY_MAP, and
Collections#EMPTY_SET. The type-safe methods Collections#emptyList,
Collections#emptyMap, and Collections#emptySet should be used instead.
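The difference in a nutshell:
```java
import java.util.Collections;
import java.util.List;

List<String> unsafe = Collections.EMPTY_LIST;  // raw type, compiles only with an unchecked warning
List<String> safe   = Collections.emptyList(); // type-safe, inferred as List<String>
```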
This change attempts to simplify the gradle tasks for precommit. One
major part of that is using a "less groovy" style, as well as being more
consistent about how tasks are created and where they are configured. It
also allows the things creating the tasks to set up inter-task
dependencies, instead of assuming them (i.e. decoupling from tasks
elsewhere in the build).
Rename processor now checks whether the field to rename exists and throws an exception if it doesn't. It also checks that the new field name doesn't exist yet, and throws an exception otherwise. We also make sure that the rename operation is atomic, otherwise things may break between the remove and the set and we'd leave the document in an inconsistent state.
Note that requiring the new field name to not exist simplifies a use case like `{ "rename" : { "list.1": "list.2"} }`: such a rename wouldn't be accepted if `list` is actually a list, given that either `list.2` already exists or the index is out of bounds for the existing list. If one really wants to replace an existing field, that field needs to be removed first through the remove processor, and only then can rename be used.
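The checks and the atomicity look roughly like this (a sketch against the IngestDocument accessors described in these notes, not the exact processor code):
```java
void rename(IngestDocument document, String field, String newField) {
    if (!document.hasField(field)) {
        throw new IllegalArgumentException("field [" + field + "] doesn't exist");
    }
    if (document.hasField(newField)) {
        throw new IllegalArgumentException("field [" + newField + "] already exists");
    }
    Object value = document.getFieldValue(field, Object.class);
    document.removeField(field);
    try {
        document.setFieldValue(newField, value);
    } catch (Exception e) {
        // put the original field back so the document isn't left half-renamed
        document.setFieldValue(field, value);
        throw e;
    }
}
```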
1) It no longer extends Closeable.
2) Removed the config directory setter. Implementations that relied on it now get the location of the config dir via their constructors.
Validation is now done as part of the distance setter method and tested in GeoDistanceQueryBuilderTests. Fixed GeoDistanceTests to adapt to the new validation.
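In other words, the setter itself rejects bad input (a hedged sketch of setter-side validation, not the actual builder code):
```java
// hypothetical setter-side validation: an invalid distance fails fast
public Builder distance(double distance) {
    if (distance <= 0) {
        throw new IllegalArgumentException("distance must be greater than zero");
    }
    this.distance = distance;
    return this;
}
```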
Closes #15135
When reading, through #getFieldValue and #hasField, and a list is encountered, the next element in the path is treated as the index of the item that the path points to (e.g. `list.0.key`). If the index is not a number or is out of bounds, an exception gets thrown.
Added an #appendFieldValue method that has the same behaviour as setFieldValue, except that when a list is the last element in the path, instead of replacing the whole list it simply adds a new element to the existing list. This method is currently unused; we have to decide whether the set processor or a new processor should use it.
A few other changes made:
- Renamed hasFieldValue to hasField, as this method is not really about values but only about keys. It returns true if a key is there even when its value is null, and returns false only when a field is not there at all.
- Changed the null semantics of getFieldValue. null gets returned only when it was an actual value in the source; an exception is thrown when trying to access a non-existing field, so that null != field not present.
- Made remove stricter about non-existing fields. It throws an error when trying to remove a non-existing field, which is more consistent with the other methods in IngestDocument that are strict about fields that are not present.
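A few examples of the resulting semantics (assuming a document whose source is `{"list": [{"key": "a"}], "field": null}`; sketch only):
```java
document.getFieldValue("list.0.key", String.class); // -> "a"
document.getFieldValue("list.5.key", String.class); // throws: index 5 is out of bounds
document.hasField("field");                         // true: the key is there, its value is null
document.getFieldValue("field", Object.class);      // -> null: an actual null value in the source
document.getFieldValue("missing", Object.class);    // throws: the field is not present
document.removeField("missing");                    // throws: remove is strict about missing fields
document.appendFieldValue("list", "b");             // adds an element instead of replacing the list
```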
Relates to #14324