3.8 KiB
3.8 KiB
layout | title | parent | grand_parent | nav_order |
---|---|---|---|---|
default | Grok | Processors | Pipelines | 54 |
Grok
The Grok processor uses pattern matching to structure and extract important keys from unstructured data.
Configuration
The following table describes options you can use with the Grok processor to structure your data and make your data easier to query.
Option | Required | Type | Description |
---|---|---|---|
break_on_match | No | Boolean | Specifies whether to match all patterns or stop once the first successful match is found. Default value is true . |
grok_when | No | String | Specifies under what condition the Grok processor should perform matching. Default is no condition. |
keep_empty_captures | No | Boolean | Enables the preservation of null captures. Default value is false . |
keys_to_overwrite | No | List | Specifies which existing keys will be overwritten if there is a capture with the same key value. Default value is [] . |
match | No | Map | Specifies which keys to match specific patterns against. Default value is an empty body. |
named_captures_only | No | Boolean | Specifies whether to keep only named captures. Default value is true . |
pattern_definitions | No | Map | Allows for custom pattern use inline. Default value is an empty body. |
patterns_directories | No | List | Specifies the path of directories that contain customer pattern files. Default value is an empty list. |
pattern_files_glob | No | String | Specifies which pattern files to use from the directories specified for pattern_directories . Default value is * . |
target_key | No | String | Specifies a parent-level key used to store all captures. Default value is null . |
timeout_millis | No | Integer | The maximum amount of time during which matching occurs. Setting to 0 disables the timeout. Default value is 30,000 . |
Conditional grok
The Grok processor can be configured to run conditionally by using the grok_when
option. The following is an example Grok processor configuration that uses grok_when
:
processor:
- grok:
grok_when: '/type == "ipv4"'
match:
message: ['%{IPV4:clientip} %{WORD:request} %{POSINT:bytes}']
- grok:
grok_when: '/type == "ipv6"'
match:
message: ['%{IPV6:clientip} %{WORD:request} %{POSINT:bytes}']
The grok_when
option can take a conditional expression. This expression is detailed in the Expression syntax documentation.
Metrics
The following table describes common Abstract processor metrics.
Metric name | Type | Description |
---|---|---|
recordsIn |
Counter | Metric representing the ingress of records to a pipeline component. |
recordsOut |
Counter | Metric representing the egress of records from a pipeline component. |
timeElapsed |
Timer | Metric representing the time elapsed during execution of a pipeline component. |
The Grok processor includes the following custom metrics.
Counter
grokProcessingMismatch
: Records the number of records that did not match any of the patterns specified in the match field.grokProcessingMatch
: Records the number of records that matched at least one pattern from thematch
field.grokProcessingErrors
: Records the total number of record processing errors.grokProcessingTimeouts
: Records the total number of records that timed out while matching.
Timer
grokProcessingTime
: The time taken by individual records to match againstmatch
patterns. Theavg
metric is the most useful metric for this timer because because it provides the average time taken to match records.