OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Pilato	87553bba16	Add ingest-attachment support for per document `indexed_chars` limit (#28977 ) We today support a global `indexed_chars` processor parameter. But in some cases, users would like to set this limit depending on the document itself. It used to be supported in mapper-attachments plugin by extracting the limit value from a meta field in the document sent to indexation process. We add an option which reads this limit value from the document itself by adding a setting named `indexed_chars_field`. Which allows running: ``` PUT _ingest/pipeline/attachment { "description" : "Extract attachment information. Used to parse pdf and office files", "processors" : [ { "attachment" : { "field" : "data", "indexed_chars_field" : "size" } } ] } ``` Then index either: ``` PUT index/doc/1?pipeline=attachment { "data": "BASE64" } ``` Which will use the default value (or the one defined by `indexed_chars`) Or ``` PUT index/doc/2?pipeline=attachment { "data": "BASE64", "size": 1000 } ``` Closes #28942	2018-03-14 19:07:20 +01:00
Shane O'Grady	7dcd48a0b5	[discovery-gce] Align code examples and documentation (#28876 ) The docs state that `_gce_` is recommended but the code sample states that `_gce:hostname_` is recommended. This aligns the code sample with the documentation. Also replace `type` with `zen.hosts_provider` as discovery.type was removed in #25080.	2018-03-12 15:37:11 +01:00
Daniel Mitterdorfer	0d78a5890e	Reduce heap-memory usage of ingest-geoip plugin (#28963 ) With this commit we reduce heap usage of the ingest-geoip plugin by memory-mapping the database files. Previously, we have stored these files gzip-compressed but this has resulted that data are loaded on the heap. Closes #28782	2018-03-12 08:07:33 +01:00
Tanguy Leroux	a6a138905d	Use client settings in repository-gcs (#28575 ) Similarly to what has been done for s3 and azure, this commit removes the repository settings `application_name` and `connect/read_timeout` in favor of client settings. It introduce a GoogleCloudStorageClientSettings class (similar to S3ClientSettings) and a bunch of unit tests for that, it aligns the documentation to be more coherent with the S3 one, it documents the connect/read timeouts that were not documented at all and also adds a new client setting that allows to define a custom endpoint.	2018-02-22 15:40:20 +01:00
Ryan Ernst	9a5199fae3	Docs: Remove references to elasticsearch directory in plugins (#28647 ) This directory was removed from plugins in #28589, but docs still referenced it. This commit cleans up the plugin author docs to no longer refer to it.	2018-02-13 09:15:57 -08:00
Clinton Gormley	bf6378e426	Fixed docs rendering of plugin-descriptor.properties in plugin author docs	2018-02-13 10:35:39 +01:00
Ivan Brusic	38c5f4efee	[Docs] Fix incomplete URLs (#28528 )	2018-02-06 09:25:28 +01:00
Deb Adair	459233d550	[DOCS] Fixed list formatting.	2018-02-02 11:01:40 -08:00
Tanguy Leroux	be74f11517	Replace jvm-example by two plugin examples (#28339 ) This pull request replaces the jvm-example plugin (from the jvm/site plugins era) by two new plugins: a custom-settings that shows how to register and use custom settings (including secured settings) in a plugin, and rest-handler plugin that shows how to register a rest handler. The two plugins now reside in the plugins/examples project. They can serve as sample plugins for users, a special attention has been put on documentation. The packaging tests have been adapted to use the custom-settings plugin.	2018-01-26 17:34:24 +01:00
Tyler Smalley	9d7799d356	[Docs] Remove Kerberos/SPNEGO Shield plugin (#28019 ) Plugin has not been updated in over two years and requires Shield and ES 2.3.1	2018-01-10 19:23:06 +01:00
Jim Ferenczi	5ac05faab5	[Docs] fix plugin properties inclusion for plugins authors	2018-01-10 00:00:27 +01:00
Jim Ferenczi	36729d1c46	Add the ability to bundle multiple plugins into a meta plugin (#28022 ) This commit adds the ability to package multiple plugins in a single zip. The zip file for a meta plugin must contains the following structure: \|____elasticsearch/ \| \|____ <plugin1> <-- The plugin files for plugin1 (the content of the elastisearch directory) \| \|____ <plugin2> <-- The plugin files for plugin2 \| \|____ meta-plugin-descriptor.properties <-- example contents below The meta plugin properties descriptor is mandatory and must contain the following properties: description: simple summary of the meta plugin. name: the meta plugin name The installation process installs each plugin in a sub-folder inside the meta plugin directory. The example above would create the following structure in the plugins directory: \|_____ plugins \| \|____ <name_of_the_meta_plugin> \| \| \|____ meta-plugin-descriptor.properties \| \| \|____ <plugin1> \| \| \|____ <plugin2> If the sub plugins contain a config or a bin directory, they are copied in a sub folder inside the meta plugin config/bin directory. \|_____ config \| \|____ <name_of_the_meta_plugin> \| \| \|____ <plugin1> \| \| \|____ <plugin2> \|_____ bin \| \|____ <name_of_the_meta_plugin> \| \| \|____ <plugin1> \| \| \|____ <plugin2> The sub-plugins are loaded at startup like normal plugins with the same restrictions; they have a separate class loader and a sub-plugin cannot have the same name than another plugin (or a sub-plugin inside another meta plugin). It is also not possible to remove a sub-plugin inside a meta plugin, only full removal of the meta plugin is allowed. Closes #27316	2018-01-09 18:28:43 +01:00
Martijn van Groningen	a38f70b5dd	fix doc mistake	2018-01-05 14:12:51 +01:00
Sian Lerk Lau	a4a7150b56	Added ASN support for Ingest GeoIP plugin. Closes #27849	2018-01-05 14:07:04 +01:00
Martijn van Groningen	4585cc8312	ingest: Upgraded the geolite2 databases.	2017-12-20 10:42:46 +01:00
Christoph Büscher	0d11b9fe34	[Docs] Unify spelling of Elasticsearch (#27567 ) Removes occurences of "elasticsearch" or "ElasticSearch" in favour of "Elasticsearch" where appropriate.	2017-11-29 09:44:25 +01:00
Jason Tedor	3c9b919dbf	Add note on plugin distributions in plugins folder This commit adds a note regarding not storing a plugin distribution in the plugins directory during installation or instllation will fail. Relates #27400	2017-11-15 13:33:59 -05:00
Tanguy Leroux	9c4d6c629a	Remove S3 output stream (#27280 ) Now the blob size information is available before writing anything, the repository implementation can know upfront what will be the more suitable API to upload the blob to S3. This commit removes the DefaultS3OutputStream and S3OutputStream classes and moves the implementation of the upload logic directly in the S3BlobContainer. related #26993 closes #26969	2017-11-10 12:22:33 +01:00
Tanguy Leroux	184dda9eb0	Update to AWS SDK 1.11.223 (#27278 )	2017-11-09 13:25:51 +01:00
Jay Greenberg	df5c8bb3bf	Update discovery-ec2.asciidoc Changed the recommendation to use Tribe Node to Cross Cluster Search.	2017-11-07 10:18:38 -05:00
Patrice Bourgougnon	4b7b1e2706	Add an active Elasticsearch WordPress plugin link (#27279 )	2017-11-06 18:13:27 +01:00
Jun Ohtani	77e11f6969	[Doc] Add Ingest CSV Processor Plugin to plugin as a community plugin (#27105 ) * [Doc] Add Ingest CSV Processor Plugin to plugin as a community plugin	2017-10-27 16:16:02 +09:00
Loading Zhang	149e558dd5	Docs: Fix ingest geoip config location (#27110 )	2017-10-25 07:16:42 -07:00
Deb Adair	875e582cc9	[DOCS] Fixed indentation of the definition list.	2017-10-10 12:08:21 -07:00
Chris Earle	dcc6b426ec	[DOCS] Plugin Installation for Windows (#21671 ) This shows an example of how to install a plugin on Windows, which is not as obvious at I would have expected.	2017-10-10 09:31:44 -06:00
wasserman	67845134de	[Docs] Fixed typo of configuration (#25058 )	2017-09-21 16:49:00 +02:00
kel	601be4f83e	Add azure storage endpoint suffix #26432 (#26568 ) Allow specifying azure storage endpoint suffix for an azure client.	2017-09-20 22:26:19 -07:00
David Pilato	a34db4e09f	Support for accessing Azure repositories through a proxy (#23518 ) You can define a proxy using the following settings: ```yml azure.client.default.proxy.host: proxy.host azure.client.default.proxy.port: 8888 azure.client.default.proxy.type: http ``` Supported values for `proxy.type` are `direct`, `http` or `socks`. Defaults to `direct` (no proxy). Closes #23506 BTW I changed a test `testGetSelectedClientBackoffPolicyNbRetries` as it was using an old setting name `cloud.azure.storage.azure.max_retries` instead of `azure.client.azure1.max_retries`.	2017-09-13 11:51:55 +02:00
David Pilato	b01b1c2a58	Remove azure deprecated settings (#26099 ) Follow up for #23405. We remove azure deprecated settings in 7.0: * The legacy azure settings which where starting with `cloud.azure.storage.` prefix have been removed. This includes `account`, `key`, `default` and `timeout`. You need to use settings which are starting with `azure.client.` prefix instead. * Global timeout setting `cloud.azure.storage.timeout` has been removed. You must set it per azure client instead. Like `azure.client.default.timeout: 10s` for example.	2017-09-12 16:51:44 +02:00
Nik Everett	6d2c40e546	Enforce that responses in docs are valid json (#26249 ) All of the snippets in our docs marked with `// TESTRESPONSE` are checked against the response from Elasticsearch but, due to the way they are implemented they are actually parsed as YAML instead of JSON. Luckilly, all valid JSON is valid YAML! Unfurtunately that means that invalid JSON has snuck into the exmples! This adds a step during the build to parse them as JSON and fail the build if they don't parse. But no! It isn't quite that simple. The displayed text of some of these responses looks like: ``` { ... "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` Note the `...` which isn't valid json but we like it anyway and want it in the output. We use substitution rules to convert the `...` into the response we expect. That yields a response that looks like: ``` { "took": $body.took,"timed_out": false,"_shards": $body._shards,"hits": $body.hits, "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` That is what the tests consume but it isn't valid JSON! Oh no! We don't want to go update all the substitution rules because that'd be huge and, ultimately, wouldn't buy much. So we quote the `$body.took` bits before parsing the JSON. Note the responses that we use for the `_cat` APIs are all converted into regexes and there is no expectation that they are valid JSON. Closes #26233	2017-08-17 09:02:10 -04:00
Ryan Ernst	a51faea79f	Docs: Cleanup docs for ec2 discovery (#26065 ) This commit clears up which settings are allowed for ec2 discovery, and clearly marks those that require setting in the keystore. closes #25619	2017-08-15 10:14:51 -07:00
Jason Tedor	e9687622bd	Rename CONF_DIR to ES_PATH_CONF The environment variable CONF_DIR was previously inconsistently used in our packaging to customize the location of Elasticsearch configuration files. The importance of this environment variable has increased starting in 6.0.0 as it's now used consistently to ensure Elasticsearch and all secondary scripts (e.g., elasticsearch-keystore) all use the same configuration. The name CONF_DIR is there for legacy reasons yet it's too generic. This commit renames CONF_DIR to ES_PATH_CONF. Relates #26197	2017-08-15 06:19:06 +09:00
David Pilato	80b142d218	Azure repository: Move to named configurations as we do for S3 repository We should have the same behavior for Azure repositories as we have for S3 (see #22762). Instead of: ```yml cloud: azure: storage: my_account1: account: your_azure_storage_account1 key: your_azure_storage_key1 default: true my_account2: account: your_azure_storage_account2 key: your_azure_storage_key2 ``` Support something like: ``` azure.client: default: account: your_azure_storage_account1 key: your_azure_storage_key1 my_account2: account: your_azure_storage_account2 key: your_azure_storage_key2 ``` Then instead of: ``` PUT _snapshot/my_backup3 { "type": "azure", "settings": { "account": "my_account2" } } ``` Use: ``` PUT _snapshot/my_backup3 { "type": "azure", "settings": { "config": "my_account2" } } ``` If someone uses: ``` PUT _snapshot/my_backup3 { "type": "azure" } ``` It will use the `default` azure repository settings. And mark as deprecated old settings. Closes #22763.	2017-08-08 15:14:47 +02:00
Ryan Ernst	0266479cab	Docs: Update s3 repository docs with client settings (#26033 ) This commit updates the s3 repository docs to clearly mark settings as part of the s3 client settings, as well as those that are secure and must be stored in the elasticsearch keystore. relates #25619	2017-08-04 11:22:49 -07:00
Ryan Ernst	e23919856e	Docs: Remove docs for aws region and signer type (#26006 ) These settings are removed in 6.0. This commit removes the corresponding documentation. relates #22872 relates #23984	2017-08-01 19:16:15 -04:00
Ryan Ernst	f978974bc6	Docs: Remove s3 repository integ test documentation (#26005 ) The s3 repository plugin has "third party" integ tests which rely on external service and configuration setup. These tests are really internal verification of the plugin (and should be moved to real integ tests). Running them is not something a user should do, and the documentation has been out of date for all of 5.x. This commit removes the docs, removing potential confusion for users.	2017-08-01 19:13:45 -04:00
Ryan Ernst	8ab0d10387	Add compatibility versions to main action response (#25799 ) This commit adds the min wire/index compat versions to the main action output. Not only will this make the compatility expected more transparent, but it also allows to test which version others think the compat versions are, similar to how we test the lucene version.	2017-07-20 13:01:41 -07:00
Clinton Gormley	ff4a2519f2	Update experimental labels in the docs (#25727 ) Relates https://github.com/elastic/elasticsearch/issues/19798 Removed experimental label from: * Painless * Diversified Sampler Agg * Sampler Agg * Significant Terms Agg * Terms Agg document count error and execution_hint * Cardinality Agg precision_threshold * Pipeline Aggregations * index.shard.check_on_startup * index.store.type (added warning) * Preloading data into the file system cache * foreach ingest processor * Field caps API * Profile API Added experimental label to: * Moving Average Agg Prediction Changed experimental to beta for: * Adjacency matrix agg * Normalizers * Tasks API * Index sorting Labelled experimental in Lucene: * ICU plugin custom rules file * Flatten graph token filter * Synonym graph token filter * Word delimiter graph token filter * Simple pattern tokenizer * Simple pattern split tokenizer Replaced experimental label with warning that details may change in the future: * Analysis explain output format * Segments verbose output format * Percentile Agg compression and HDR Histogram * Percentile Rank Agg HDR Histogram	2017-07-18 14:06:22 +02:00
Clinton Gormley	091b1b0765	Fixed page breaks for ICU Collation Keyword Fields	2017-07-03 17:49:28 +02:00
Lisa Cawley	03f952a838	[DOCS] Update docs to use shared attribute file (#25403 ) * [DOCS] Update docs to use shared attribute file * [DOCS] Add shared attributes to Versions.asciidoc	2017-06-27 08:33:28 -07:00
James Baiera	9c65073852	[DOCS] Clarify expected availability of HDFS for the HDFS Repository (#25220 ) If a cluster is configured with an HDFS repository and a node is started, that node must be able to reach HDFS, or else when it attempts to add the repository from the cluster state at start up it will fail to connect and the repository will be left in an inconsistent state. Adding a blurb in the docs to outline the expected availability for HDFS when using the repository plugin.	2017-06-16 09:47:44 -04:00
David Causse	ff9edb627e	[analysis-icu] Allow setting unicodeSetFilter (#20814 ) UnicodeSetFilter was only allowed in the icu_folding token filter. It seems useful to expose this setting in icu_normalizer token filter and char filter.	2017-06-16 11:08:39 +02:00
Russ Cam	a0f50e8aa4	Supported Azure Storage account types (#25167 ) * Supported Azure Storage account types Add important note for Azure Storage account types Relates #20844	2017-06-12 17:03:18 -07:00
Sanne Grinovero	a2d9b0edcd	List Hibernate Search (#25145 ) among the community framework integrations	2017-06-09 12:05:53 +02:00
Jason Tedor	9b4a189147	Add purge option to remove plugin CLI By default, the remove plugin CLI command preserves configuration files. This is so that if a user is upgrading the plugin (which is done by first removing the old version and then installing the new version) they do not lose their configuration file. Yet, there are circumstances where preserving the configuration file is not desired. This commit adds a purge option to the remove plugin CLI command. Relates #24981	2017-06-01 08:53:39 -04:00
Jason Tedor	bb63577f14	Fix plugin docs for using custom config dir This commit fixes an issue with the plugin docs incorrectly specifying how to set a custom configuration directory. The correct way is to use the environment variable CONF_DIR.	2017-05-26 13:40:35 -04:00
debadair	bb3a59fa70	[DOCS] Fixed cross doc xref in plugin docs.	2017-05-16 17:55:47 -07:00
Ryan Ernst	d74760c306	GCS Repository: Add secure storage of credentials (#24697 ) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated.	2017-05-16 17:17:37 -07:00
propulkit	77feabb3d5	Machine Permission update on Google Compute Machine permission can be updated without deleting instances. Backport of #24607 in master branch	2017-05-11 07:52:34 +02:00
Matt Weber	b24326271e	Add ICUCollationFieldMapper (#24126 ) Adds a new "icu_collation" field type that exposes lucene's ICUCollationDocValuesField. ICUCollationDocValuesField is the replacement for ICUCollationKeyFilter which has been deprecated since Lucene 5.	2017-05-10 10:35:11 +02:00

1 2 3 4 5 ...

360 Commits