[[painless-ingest-processor-context]]
=== Ingest processor context

Use a Painless script in an {ref}/script-processor.html[ingest processor]
to modify documents upon insertion.

*Variables*

`params` (`Map`, read-only)::
        User-defined parameters passed in as part of the script processor
        definition.

{ref}/mapping-index-field.html[`ctx['_index']`] (`String`)::
        The name of the index.

{ref}/mapping-type-field.html[`ctx['_type']`] (`String`)::
        The type of document within an index.

`ctx` (`Map`)::
        Contains extracted JSON in a `Map` and `List` structure for the fields
        that are part of the document.

*Side Effects*

{ref}/mapping-index-field.html[`ctx['_index']`]::
        Modify this to change the destination index for the current document.

{ref}/mapping-type-field.html[`ctx['_type']`]::
        Modify this to change the type for the current document.

`ctx` (`Map`)::
        Modify the values in the `Map/List` structure to add, modify, or delete
        the fields of a document.

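The side effects listed above amount to ordinary `Map` operations on `ctx`.
The following is a minimal plain-Java sketch, not Painless itself: the `ctx`
map is a stand-in built by hand (in an ingest script the context supplies it),
and the class name, the index name `seats-archive`, and the fields `sold`,
`cost`, and `row` are hypothetical values chosen for illustration.

```java
import java.util.HashMap;
import java.util.Map;

class CtxSideEffects {
    // Hypothetical stand-in for a script body that mutates the document.
    static void apply(Map<String, Object> ctx) {
        ctx.put("_index", "seats-archive");              // change the destination index
        ctx.put("sold", Boolean.TRUE);                   // add a field
        ctx.put("cost", (Integer) ctx.get("cost") + 5);  // modify an existing field
        ctx.remove("row");                               // delete a field
    }

    public static void main(String[] args) {
        // Hand-built document; assumed field names, not taken from the seat data.
        Map<String, Object> ctx = new HashMap<>();
        ctx.put("_index", "seats");
        ctx.put("cost", 20);
        ctx.put("row", 3);

        apply(ctx);
        System.out.println(ctx);
    }
}
```

In a Painless script the same operations are typically written against the
provided `ctx` variable directly, for example `ctx.sold = true`.
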
*Return*

void::
        No expected return value.

*API*

The standard <<painless-api-reference, Painless API>> is available.

*Example*

To run this example, first follow the steps in
<<painless-context-examples, context examples>>.

The seat data contains:

* A date in the format `YYYY-MM-DD` where the second digit of both month and day
  is optional.
* A time in the format `HH:MM*` where the second digit of both hours and minutes
  is optional. The star (`*`) represents either the `String` `AM` or `PM`.

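The two formats above map closely onto `java.time` pattern letters, where a
single `M`, `d`, `h`, or `m` accepts one or two digits. The following is a
plain-Java sketch of that idea, offered as an alternative way to read the same
inputs rather than as the method this example uses. The class and method names
are hypothetical, the `+08:00` offset is the one assumed by this example's
script, and whether every class used here is available to a Painless script
depends on the Painless API allowlist.

```java
import java.time.LocalDateTime;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeFormatterBuilder;
import java.util.Locale;

class SeatFormats {
    // yyyy-M-d: four-digit year, one- or two-digit month and day.
    // h:ma: one- or two-digit hour and minute, followed directly by AM or PM.
    static final DateTimeFormatter SEAT_FORMAT = new DateTimeFormatterBuilder()
        .parseCaseInsensitive()
        .appendPattern("yyyy-M-d h:ma")
        .toFormatter(Locale.ENGLISH);

    // Hypothetical helper: joins the two fields and returns epoch milliseconds.
    static long toEpochMillis(String date, String time) {
        LocalDateTime local =
            LocalDateTime.parse(date.trim() + " " + time.trim(), SEAT_FORMAT);
        return local.toInstant(ZoneOffset.ofHours(8)).toEpochMilli();
    }

    public static void main(String[] args) {
        // Both spellings describe 2018-04-01T17:00:00+08:00.
        System.out.println(toEpochMillis("2018-4-1", "5:00PM"));
        System.out.println(toEpochMillis("2018-04-01", "05:00PM"));
    }
}
```
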
The following ingest script processes the date and time `Strings` and stores the
result in a `datetime` field.

[source,Painless]
----
String[] dateSplit = ctx.date.splitOnToken("-"); <1>
String year = dateSplit[0].trim();
String month = dateSplit[1].trim();

if (month.length() == 1) { <2>
    month = "0" + month;
}

String day = dateSplit[2].trim();

if (day.length() == 1) { <3>
    day = "0" + day;
}

boolean pm = ctx.time.substring(ctx.time.length() - 2).equals("PM"); <4>
String[] timeSplit = ctx.time.substring(0,
        ctx.time.length() - 2).splitOnToken(":"); <5>
int hours = Integer.parseInt(timeSplit[0].trim());
int minutes = Integer.parseInt(timeSplit[1].trim());

if (pm) { <6>
    hours += 12;
}

String dts = year + "-" + month + "-" + day + "T" +
        (hours < 10 ? "0" + hours : "" + hours) + ":" +
        (minutes < 10 ? "0" + minutes : "" + minutes) +
        ":00+08:00"; <7>

ZonedDateTime dt = ZonedDateTime.parse(
        dts, DateTimeFormatter.ISO_OFFSET_DATE_TIME); <8>
ctx.datetime = dt.getLong(ChronoField.INSTANT_SECONDS)*1000L; <9>
----
<1> Uses the `splitOnToken` function to separate the date `String` from the
    seat data into year, month, and day `Strings`.
    Note::
    * The use of the `ctx` ingest processor context variable to retrieve the
      data from the `date` field.
<2> Prepends the <<string-literals, string literal>> `"0"` to a single-digit
    month since the format of the seat data allows for this case.
<3> Prepends the <<string-literals, string literal>> `"0"` to a single-digit
    day since the format of the seat data allows for this case.
<4> Sets the <<primitive-types, `boolean` type>>
    <<painless-variables, variable>> to `true` if the time `String` is a time
    in the afternoon or evening.
    Note::
    * The use of the `ctx` ingest processor context variable to retrieve the
      data from the `time` field.
<5> Uses the `splitOnToken` function to separate the time `String` from the
    seat data into hours and minutes `Strings`.
    Note::
    * The use of the `substring` method to remove the `AM` or `PM` portion of
      the time `String`.
    * The use of the `ctx` ingest processor context variable to retrieve the
      data from the `time` field.
<6> If the time `String` is an afternoon or evening value, adds the
    <<integer-literals, integer literal>> `12` to the existing hours to convert
    to a 24-hour time.
<7> Builds a new date and time `String` that is parsable using existing API
    methods.
<8> Creates a `ZonedDateTime` <<reference-types, reference type>> value by using
    the API method `parse` to parse the new date and time `String`.
<9> Sets the datetime field `datetime` to the number of milliseconds, computed
    by multiplying the seconds retrieved from the API method `getLong` by 1000.
    Note::
    * The use of the `ctx` ingest processor context variable to set the field
      `datetime`. Manipulate each document's fields with the `ctx` variable as
      each document is indexed.

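The script adds `12` to every `PM` hour, so a time such as `12:30PM` would
produce hour `24`, and `12:30AM` would stay at hour `12` rather than `0`. If
the seat data can contain times in the twelve o'clock hour, the conversion
needs a special case. The following plain-Java sketch shows one way to handle
it. The class and method names are hypothetical, and the `isParsable` helper
exists only to illustrate how the ISO parser treats an hour of `24`.

```java
import java.time.ZonedDateTime;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;

class TwelveHourClock {
    // Hypothetical helper: converts a 12-hour clock value to the range 0-23.
    static int to24Hour(int hours, boolean pm) {
        if (hours == 12) {
            return pm ? 12 : 0;          // 12:xxPM is noon, 12:xxAM is midnight
        }
        return pm ? hours + 12 : hours;  // every other hour follows the simple rule
    }

    // Reports whether the same parser call used by the script accepts the String.
    static boolean isParsable(String dts) {
        try {
            ZonedDateTime.parse(dts, DateTimeFormatter.ISO_OFFSET_DATE_TIME);
            return true;
        } catch (DateTimeParseException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(to24Hour(12, true));   // noon
        System.out.println(to24Hour(12, false));  // midnight
        System.out.println(isParsable("2018-04-01T24:30:00+08:00"));
        System.out.println(isParsable("2018-04-01T12:30:00+08:00"));
    }
}
```

The same `if` structure can be written in the ingest script in place of the
unconditional `hours += 12`.
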
Submit the following request:

[source,console]
----
PUT /_ingest/pipeline/seats
{
  "description": "update datetime for seats",
  "processors": [
    {
      "script": {
        "source": "String[] dateSplit = ctx.date.splitOnToken('-'); String year = dateSplit[0].trim(); String month = dateSplit[1].trim(); if (month.length() == 1) { month = '0' + month; } String day = dateSplit[2].trim(); if (day.length() == 1) { day = '0' + day; } boolean pm = ctx.time.substring(ctx.time.length() - 2).equals('PM'); String[] timeSplit = ctx.time.substring(0, ctx.time.length() - 2).splitOnToken(':'); int hours = Integer.parseInt(timeSplit[0].trim()); int minutes = Integer.parseInt(timeSplit[1].trim()); if (pm) { hours += 12; } String dts = year + '-' + month + '-' + day + 'T' + (hours < 10 ? '0' + hours : '' + hours) + ':' + (minutes < 10 ? '0' + minutes : '' + minutes) + ':00+08:00'; ZonedDateTime dt = ZonedDateTime.parse(dts, DateTimeFormatter.ISO_OFFSET_DATE_TIME); ctx.datetime = dt.getLong(ChronoField.INSTANT_SECONDS)*1000L;"
      }
    }
  ]
}
----