OpenSearch/docs/reference/query-dsl/regexp-query.asciidoc

100 lines
3.0 KiB
Plaintext
Raw Normal View History

[[query-dsl-regexp-query]]
=== Regexp query
++++
<titleabbrev>Regexp</titleabbrev>
++++
2019-07-24 08:37:37 -04:00
Returns documents that contain terms matching a
{wikipedia}/Regular_expression[regular expression].
2019-07-24 08:37:37 -04:00
A regular expression is a way to match patterns in data using placeholder
characters, called operators. For a list of operators supported by the
`regexp` query, see <<regexp-syntax, Regular expression syntax>>.
2019-07-24 08:37:37 -04:00
[[regexp-query-ex-request]]
==== Example request
The following search returns documents where the `user.id` field contains any term
2019-07-24 08:37:37 -04:00
that begins with `k` and ends with `y`. The `.*` operators match any
characters of any length, including no characters. Matching
terms can include `ky`, `kay`, and `kimchy`.
[source,console]
2019-07-24 08:37:37 -04:00
----
GET /_search
{
"query": {
"regexp": {
"user.id": {
"value": "k.*y",
"flags": "ALL",
"case_insensitive": true,
"max_determinized_states": 10000,
"rewrite": "constant_score"
}
}
}
}
2019-07-24 08:37:37 -04:00
----
2019-07-24 08:37:37 -04:00
[[regexp-top-level-params]]
==== Top-level parameters for `regexp`
`<field>`::
(Required, object) Field you wish to search.
2019-07-24 08:37:37 -04:00
[[regexp-query-field-params]]
==== Parameters for `<field>`
`value`::
(Required, string) Regular expression for terms you wish to find in the provided
`<field>`. For a list of supported operators, see <<regexp-syntax, Regular
expression syntax>>.
+
--
By default, regular expressions are limited to 1,000 characters. You can change
this limit using the <<index-max-regex-length, `index.max_regex_length`>>
setting.
2019-07-24 08:37:37 -04:00
[WARNING]
=====
The performance of the `regexp` query can vary based on the regular expression
provided. To improve performance, avoid using wildcard patterns, such as `.*` or
`.*?+`, without a prefix or suffix.
=====
--
2019-07-24 08:37:37 -04:00
`flags`::
(Optional, string) Enables optional operators for the regular expression. For
valid values and more information, see <<regexp-optional-operators, Regular
expression syntax>>.
`case_insensitive`::
(Optional, boolean) allows case insensitive matching of the regular expression
value with the indexed field values when set to true. Setting to false is disallowed.
2019-07-24 08:37:37 -04:00
`max_determinized_states`::
+
--
(Optional, integer) Maximum number of
{wikipedia}/Deterministic_finite_automaton[automaton states]
2019-07-24 08:37:37 -04:00
required for the query. Default is `10000`.
{es} uses https://lucene.apache.org/core/[Apache Lucene] internally to parse
regular expressions. Lucene converts each regular expression to a finite
automaton containing a number of determinized states.
2019-07-24 08:37:37 -04:00
You can use this parameter to prevent that conversion from unintentionally
consuming too many resources. You may need to increase this limit to run complex
regular expressions.
--
2019-07-24 08:37:37 -04:00
`rewrite`::
(Optional, string) Method used to rewrite the query. For valid values and more
information, see the <<query-dsl-multi-term-rewrite, `rewrite` parameter>>.
[[regexp-query-notes]]
==== Notes
===== Allow expensive queries
Regexp queries will not be executed if <<query-dsl-allow-expensive-queries, `search.allow_expensive_queries`>>
is set to false.