OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tal Levy	a771912a22	Add Ingest-Processor specific Rest Endpoints & Add Grok endpoint (#25059 ) This PR enables Ingest plugins to leverage processor-scoped REST endpoints. First of which being the Grok endpoint that retrieves Grok Patterns for users to retrieve all the built-in patterns. Example usage: Kibana Grok Autocomplete!	2017-06-08 15:24:35 -07:00
Guillaume Le Floch	3f6d80aa66	Allow removing multiple fields in ingest processor (#24750 ) * Allow removing multiple fields in ingest processor * Iteration 2 * Few fixes	2017-06-08 13:17:44 -07:00
Tal Levy	e51246023a	add `exclude_keys` option to KeyValueProcessor (#24876 ) and modify data-structure of `include_keys` and `exclude_keys` to be backed by a HashSet	2017-06-05 14:12:48 -07:00
Tal Levy	dfe2ecaa28	add docs example for Ingest scripts manipulating document metadata (#24875 ) It may not be clear to users that the Ingest ScriptProcessor context object `ctx` can manipulate document metadata like `_index` and `_type`.	2017-05-25 07:45:19 -07:00
Ryan Ernst	463fe2f4d4	Scripting: Remove file scripts (#24627 ) This commit removes file scripts, which were deprecated in 5.5. closes #21798	2017-05-17 14:42:25 -07:00
Nik Everett	a01f846226	CONSOLEify a few more docs Adds CONSOLE to cross-cluster-search docs but skips them for testing because we don't have a second cluster set up. This gets us the `VIEW IN CONSOLE` and `COPY AS CURL` links and makes sure that they are valid yaml (not json, technically) but doesn't get testing. Which is better than we had before. Adds CONSOLE to the dynamic templates docs and ingest-node docs. The ingest-node docs contain a ton of non-console snippets. We might want to convert them to full examples later, but that can be a separate thing. Relates to #18160	2017-05-04 21:01:14 -04:00
Jason Tedor	4796557a30	Add primary term to doc write response This commit adds the primary term to the doc write response. Relates #24171	2017-04-19 14:44:22 -04:00
Glen Smith	3ff014d07d	ingest-node.asciidoc - Clarify json processor (#21876 ) Add examples for the json processor.	2017-04-18 23:27:26 -04:00
Clinton Gormley	5cf13f29bb	Update ingest-node.asciidoc Fixed docs typo	2017-03-22 10:44:11 +01:00
Gameldar	e3eb363882	Link directly to the attachments in arrays section The link should be made to the relevant section of the ingest attachments documentation, rather than the top of the page.	2016-12-22 20:52:08 +08:00
gameldar	d404ee3533	Add ingest-attachment-with-arrays section to ingest attachments doc Added a new section detailing how to use the attachment processor within an array. This reverts commit #22296 and instead links to the foreach processor.	2016-12-22 00:18:33 +08:00
Tal Levy	c53b2ee9cd	introduce KV Processor in Ingest Node (#22272 ) Now you can parse field values of the `key=value` variety and have `key` be inserted as a field name in an ingest document. Closes #22222.	2016-12-20 13:26:17 -08:00
Tal Levy	ad4b1ecdeb	[docs] update ingest-node delete docs to mention wildcarding (#22270 )	2016-12-20 10:52:17 -08:00
Tal Levy	bb37167946	Enables the ability to inject serialized json fields into root of document. (#22179 ) The JSON processor has an optional field called "target_field". If you don't specify target_field then target_field becomes what you specified as "field". There isn't anyway to add the fields to the root of a document. By setting `add_to_root`, now serialized fields will be inserted into the top-level fields of the ingest document. Closes #21898.	2016-12-16 10:17:27 -08:00
Pablo Musa	152efe95e6	Small typo fix in the docs. (#22180 ) There is a small typo in the convert processor code example.	2016-12-16 14:50:06 +01:00
Jason Tedor	d06a8903fd	Merge branch 'master' into feature/seq_no * master: (22 commits) Add proper toString() method to UpdateTask (#21582) Fix `InternalEngine#isThrottled` to not always return `false`. (#21592) add `ignore_missing` option to SplitProcessor (#20982) fix trace_match behavior for when there is only one grok pattern (#21413) Remove dead code from GetResponse.java Fixes date range query using epoch with timezone (#21542) Do not cache term queries. (#21566) Updated dynamic mapper section Docs: Clarify date_histogram bucket sizes for DST time zones Handle release of 5.0.1 Fix skip reason for stats API parameters test Reduce skip version for stats API parameter tests Strict level parsing for indices stats Remove cluster update task when task times out (#21578) [DOCS] Mention "all-fields" mode doesn't search across nested documents InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted Fixed bad asciidoc in boolean mapping docs Fixed bad asciidoc ID in node stats Be strict when parsing values searching for booleans (#21555) Fix time zone rounding edge case for DST overlaps ...	2016-11-16 09:10:35 -05:00
Tal Levy	6796464f16	add `ignore_missing` option to SplitProcessor (#20982 ) Closes #20840.	2016-11-16 15:46:09 +02:00
Tal Levy	04b712bdc5	fix trace_match behavior for when there is only one grok pattern (#21413 ) There is an issue in the Grok Processor, where trace_match: true does not inject the _ingest._grok_match_index into the ingest-document when there is just one pattern provided. This is due to an optimization in the regex construction. This commit adds a check for when this is the case, and injects a static index value of "0", since there is only one pattern matched (at the first index into the patterns). To make this clearer, more documentation was added to the grok-processor docs. Fixes #21371.	2016-11-16 15:41:54 +02:00
Jason Tedor	33f7cd5a16	Remove shard ID from doc write response This commit removes the shard ID from doc write response; this was useful for debugging but its time has passed. Relates #21508	2016-11-11 15:18:25 -05:00
Jason Tedor	d3417fb022	Merge branch 'master' into feature/seq_no * master: (516 commits) Avoid angering Log4j in TransportNodesActionTests Add trace logging when aquiring and releasing operation locks for replication requests Fix handler name on message not fully read Remove accidental import. Improve log message in TransportNodesAction Clean up of Script. Update Joda Time to version 2.9.5 (#21468) Remove unused ClusterService dependency from SearchPhaseController (#21421) Remove max_local_storage_nodes from elasticsearch.yml (#21467) Wait for all reindex subtasks before rethrottling Correcting a typo-Maan to Man-in README.textile (#21466) Fix InternalSearchHit#hasSource to return the proper boolean value (#21441) Replace all index date-math examples with the URI encoded form Fix typos (#21456) Adapt ES_JVM_OPTIONS packaging test to ubuntu-1204 Add null check in InternalSearchHit#sourceRef to prevent NPE (#21431) Add VirtualBox version check (#21370) Export ES_JVM_OPTIONS for SysV init Skip reindex rethrottle tests with workers Make forbidden APIs be quieter about classpath warnings (#21443) ...	2016-11-10 23:40:33 -05:00
Tal Levy	38c650f376	make painless the default scripting language for ScriptProcessor (#20981 ) - fixes a bug in the docs that mentions `lang` as optional - now `lang` defaults to "painless"	2016-10-18 16:22:01 -07:00
Chris Earle	9cf7214380	[DOCS] Add "version" to template and pipeline docs (#20407 ) * [DOCS] Add "version" to template and pipeline docs This adds details about the "version" to both the template and pipeline pages.	2016-10-18 11:56:18 -04:00
Pascal Borreli	fcb01deb34	Fixed typos (#20843 )	2016-10-10 14:51:47 -06:00
Boaz Leskes	27eab74510	merge from master	2016-09-30 17:19:30 +02:00
Martijn van Groningen	55dce523c2	docs: marked `foreach` processor as experimental Closes #19602	2016-09-30 12:23:42 +02:00
Jason Tedor	8879360f66	Fix failing doc tests in feature/seq_no This commit fixes failing doc tests in feature/seq_no after merging master into this branch.	2016-09-29 03:58:02 +02:00
Tal Levy	f3a5ee671b	[docs] [fix] `field` is no longer an option for the script processor (#20614 )	2016-09-29 03:04:43 +02:00
Tal Levy	550a0449bc	[docs] [fix] `field` is no longer an option for the script processor (#20614 )	2016-09-28 21:50:32 +02:00
Tal Levy	9f1f5fdedc	introduce the JSON Processor (#20128 ) introduce the JSON Processor	2016-09-09 14:34:32 -07:00
Tal Levy	dda32545bb	add ignore_missing option to relevant processors (#20194 )	2016-09-09 12:20:18 -07:00
Martijn van Groningen	6f6d17dc9c	ingest: Add `dot_expander` processor that can turn fields with dots in the field name into object fields.	2016-09-05 07:28:38 +02:00
Igor Motov	b36fbc4452	Add support for parameters to the script ingest processor The script processor should support `params` to be consistent with all other script consumers.	2016-08-24 16:49:48 -04:00
Tal Levy	bf046f8f93	update ingest date index name processor with runnable CONSOLE examples (#19935 )	2016-08-11 11:36:14 -07:00
Martijn van Groningen	a91bb29585	ingest: Made the response format of the get pipeline api match with the response format of the index template api Closes #19585	2016-07-29 17:58:30 +02:00
Martijn van Groningen	24d7fa6d54	ingest: Change the `foreach` processor to use the `_ingest._value` ingest metadata attribute to store the current array element being processed. Closes #19592	2016-07-27 09:35:09 +02:00
Martijn van Groningen	1bc12f5214	docs: fix broken link Closes #19430	2016-07-14 11:12:47 +02:00
Tal Levy	8fd01554bc	update foreach processor to only support one applied processor. (#19402 ) Closes #19345.	2016-07-13 13:13:00 -07:00
javanna	62462f5d9b	[TEST] replace ResponseBodyAssertion with existing MatchAssertion We introduced a special response_body assertion to test our docs snippets. The match assertion does the same job though and can be reused and adapted where needed. ResponseBodyAssertion contains provides much better and accurate errors though, which can be now utilized in MatchAssertion so that many more REST tests can benefit from readable error messages. Each response body gets always stashed and can be retrieved for later evaluations already. Instead of providing the response body as strings that get parsed to json objects separately, then converted to maps as ResponseBodyAssertion did, we parse everything once, the json is part of the yaml test, which is supported. The only downside is that json comments cannot be used, rather yaml comments should be used (// C style vs # ). There were only two docs tests that were using comments in ingest-node.asciidoc where I went ahead and remove the comments which didn't seem that useful anyways.	2016-07-01 11:13:10 +02:00
David Pilato	157645fe9e	Merge pull request #18981 from elastic/doc/ingest-foreach Wrong name for values field	2016-06-22 23:14:02 +02:00
Adrien Grand	db9af54ec0	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-22 08:35:54 +02:00
David Pilato	cb8073e990	Wrong name for values field We wrote that the document is: ```json { "value" : ["foo", "bar", "baz"] } ``` But the processor is using a `values` field: ```json { "foreach" : { "field" : "values", "processors" : [ // ... ] } } ``` It should be `values`.	2016-06-20 18:58:41 +02:00
Tal Levy	a26260fb72	new ScriptProcessor for Ingest (#18193 ) add new ScriptProcessor for executing ES Scripts within pipelines	2016-06-15 14:57:18 -07:00
Martijn van Groningen	a2ad5c0282	docs: fix typo Closes #18877	2016-06-15 10:56:46 +02:00
Christoph Wurm	d71894a226	Update ingest-node.asciidoc Fixed forgotten rename from `match_formats` to `formats` in documentation (changed in `dd2184ab25`)	2016-06-07 15:47:38 +02:00
Martijn van Groningen	766789b0f0	ingest: added `ignore_failure` option to all processors If this option is enabled on a processor it silently catches any processor related failure and continues executing the rest of the pipeline. Closes #18493	2016-06-01 10:29:12 +02:00
Tal Levy	edfbdf2748	add ability to specify multiple grok patterns (#18074 ) - now you can specify a list of grok patterns to match your field with and the first one to successfully match wins. - only non-null captures will be inserted into your matched document. Fixes #17903.	2016-05-25 12:20:39 -07:00
Zachary Tong	7c46b57ff2	Add a Sort ingest processor Sorts an array of values in ascending or descending order. If all elements are numerics, they will be sorted numerically. If values are strings, or mixtures of strings/numbers, the elements will be sorted lexicographically.	2016-05-17 12:06:48 -04:00
Clinton Gormley	3f594089c2	Renamed all AUTOSENSE snippets to CONSOLE (#18210 )	2016-05-09 15:42:23 +02:00
Nik Everett	4b1c116461	Generate and run tests from the docs Adds infrastructure so `gradle :docs:check` will extract tests from snippets in the documentation and execute the tests. This is included in `gradle check` so it should happen on CI and during a normal build. By default each `// AUTOSENSE` snippet creates a unique REST test. These tests are executed in a random order and the cluster is wiped between each one. If multiple snippets chain together into a test you can annotate all snippets after the first with `// TEST[continued]` to have the generated tests for both snippets joined. Snippets marked as `// TESTRESPONSE` are checked against the response of the last action. See docs/README.asciidoc for lots more. Closes #12583. That issue is about catching bugs in the docs during build. This catches some bugs in the docs during build which is a good start.	2016-05-05 13:58:03 -04:00
Martijn van Groningen	7aca1389e2	ingest: Add `date_index_name` processor. Closes #17814	2016-04-29 17:20:48 +02:00
Tal Levy	6302fb65a3	add ability to disable ability to override values of existing fields in set processor	2016-04-28 13:50:19 -07:00
Martijn van Groningen	dd2184ab25	ingest: Streamline option naming for several processors: * `rename` processor, renamed `to` to `target_field` * `date` processor, renamed `match_field` to `field` and renamed `match_formats` to `formats` * `geoip` processor, renamed `source_field` to `field` and renamed `fields` to `properties` * `attachment` processor, renamed `source_field` to `field` and renamed `fields` to `properties` Closes #17835	2016-04-21 13:40:43 +02:00
Clinton Gormley	102a398d9f	Fixed split processor example	2016-04-19 14:11:45 +02:00
Clinton Gormley	a2ab13ddd1	Update ingest-node.asciidoc Documented `separator` in the `split processor Closes https://github.com/elastic/elasticsearch/issues/17831	2016-04-19 11:11:58 +02:00
Martijn van Groningen	16fa3e546e	docs: remove mention of file based grok pattern	2016-04-13 22:51:12 +02:00
Martijn van Groningen	ca5bd89581	docs: adjust grok processor docs to not mention pattern files as these no longer exist Closes #17692	2016-04-13 12:37:50 +02:00
Tal Levy	2064fe3985	add type conversion support to ConvertProcessor	2016-03-29 07:56:53 -07:00
Clinton Gormley	432f0cc193	Docs: Added the ingest node to the modules/nodes page Closes #17113	2016-03-15 19:03:18 +01:00
Martijn van Groningen	2fa33d5c47	Added ingest statistics to node stats API The ingest stats include the following statistics: * `ingest.total.count`- The total number of document ingested during the lifetime of this node * `ingest.total.time_in_millis` - The total time spent on ingest preprocessing documents during the lifetime of this node * `ingest.total.current` - The total number of documents currently being ingested. * `ingest.total.failed` - The total number ingest preprocessing operations failed during the lifetime of this node Also these stats are returned on a per pipeline basis.	2016-03-10 13:21:43 +01:00
Martijn van Groningen	82d01e4315	Added ingest info to node info API, which contains a list of available processors. Internally the put pipeline API uses this information in node info API to validate if all specified processors in a pipeline exist on all nodes in the cluster.	2016-03-07 14:44:50 +01:00
DeDe Morton	6b52b0bdc3	Ingest node edits	2016-03-03 22:29:27 -08:00
Martijn van Groningen	ddc2b60c4b	docs: fix incorrect grok pattern parameter names Closes #16819	2016-02-26 06:38:25 -08:00
DeDe Morton	d3e3f19a1f	Change topic order	2016-02-12 15:00:07 -08:00
DeDe Morton	2734737d55	Add ingest docs to the build	2016-02-11 14:16:56 -08:00
Dongjoon Hyun	21ea552070	Fix typos in docs.	2016-02-09 02:07:32 -08:00
Martijn van Groningen	7a6adfd93a	ingest: Added foreach processor. This processor is useful when all elements of a json array need to be processed in the same way. This avoids that a processor needs to be defined for each element in an array. Also it is very likely that it is unknown how many elements are inside an json array.	2016-02-04 23:44:01 +01:00
Tal Levy	9e7e2ab10b	remove DeDotProcessor from Ingest	2016-02-02 14:16:01 -08:00
Tal Levy	61e4283a16	Add processor tags to on_failure metadata in ingest pipeline closes #16202	2016-01-29 14:36:11 -08:00
Martijn van Groningen	63bc108e7e	docs: s/processor_id/tag	2016-01-27 10:29:49 +01:00
Martijn van Groningen	b784b81665	docs: Remove the fact that ingest was a plugin from the docs.	2016-01-26 15:49:47 +01:00
Tal Levy	894efa3fb6	update ingest docs - move ingest plugin docs to core reference docs - move geoip processor docs to plugins/ingest-geoip.asciidoc - add missing options tables for some processors - add description of pipeline definition - add description of processor definitions including common parameters like "tag" and "on_failure"	2016-01-25 12:08:17 -08:00

1 2 3 4 5

221 Commits