OpenSearch/x-pack/plugin
David Roberts b61202b0a8 [ML] Add a limit on line merging in find_file_structure (#42501)
When analysing a semi-structured text file the
find_file_structure endpoint merges lines to form
multi-line messages using the assumption that the
first line in each message contains the timestamp.
However, if the timestamp is misdetected then this
can lead to excessive numbers of lines being merged
to form massive messages.

This commit adds a line_merge_size_limit setting
(default 10000 characters) that halts the analysis
if a message bigger than this is created.  This
prevents significant CPU time being spent subsequently
trying to determine the internal structure of the
huge bogus messages.
2019-06-03 13:45:51 +01:00
..
ccr [Backport] Improve build configuration time (#42674) 2019-05-30 10:29:42 -07:00
core [ML] Add a limit on line merging in find_file_structure (#42501) 2019-06-03 13:45:51 +01:00
data-frame [ML Data Frame] Refactor stop logic (#42644) (#42763) 2019-06-03 06:53:44 -05:00
deprecation Deprecate support for chained multi-fields. (#42330) 2019-05-24 15:55:06 -07:00
graph Switch run task to use real distro (#41590) 2019-05-06 12:34:07 -07:00
ilm Switch run task to use real distro (#41590) 2019-05-06 12:34:07 -07:00
logstash Switch run task to use real distro (#41590) 2019-05-06 12:34:07 -07:00
ml [ML] Add a limit on line merging in find_file_structure (#42501) 2019-06-03 13:45:51 +01:00
monitoring Update ciphers for TLSv1.3 and JDK11 if available (#42082) 2019-05-20 09:45:36 -04:00
rollup Remove Log4j 1.2 API as a dependency (#42702) 2019-05-30 16:08:07 -04:00
security Create client-only AnalyzeRequest/AnalyzeResponse classes (#42197) 2019-06-03 09:46:36 +01:00
sql [DOCS] Remove unneeded options from `[source,sql]` code blocks (#42759) 2019-05-31 13:05:13 -04:00
src/test [ML] Add a limit on line merging in find_file_structure (#42501) 2019-06-03 13:45:51 +01:00
watcher [Backport] Improve build configuration time (#42674) 2019-05-30 10:29:42 -07:00
build.gradle Add support for API keys to access Elasticsearch (#38291) 2019-02-05 14:21:57 +11:00