Remove usage of multi-types from the docs and added a page explaining type removal (#25543)

Closes #25401
This commit is contained in:
Clinton Gormley 2017-07-05 12:30:19 +02:00 committed by GitHub
parent fefcae3d45
commit 0170e0e8d3
42 changed files with 640 additions and 792 deletions

View File

@ -51,6 +51,7 @@ buildRestTests.expectedUnconvertedCandidates = [
'reference/indices/recovery.asciidoc', 'reference/indices/recovery.asciidoc',
'reference/indices/segments.asciidoc', 'reference/indices/segments.asciidoc',
'reference/indices/shard-stores.asciidoc', 'reference/indices/shard-stores.asciidoc',
'reference/mapping/removal_of_types.asciidoc',
'reference/search/field-stats.asciidoc', 'reference/search/field-stats.asciidoc',
'reference/search/profile.asciidoc', 'reference/search/profile.asciidoc',
] ]

View File

@ -1,24 +1,27 @@
[[search-aggregations-bucket-children-aggregation]] [[search-aggregations-bucket-children-aggregation]]
=== Children Aggregation === Children Aggregation
A special single bucket aggregation that enables aggregating from buckets on parent document types to buckets on child documents. A special single bucket aggregation that selects child documents that have the specified type, as defined in a <<parent-join,`join` field>>.
This aggregation relies on the <<mapping-parent-field,_parent field>> in the mapping. This aggregation has a single option: This aggregation has a single option:
* `type` - The child type that the buckets in the parent space should be mapped to. * `type` - The child type that should be selected.
For example, let's say we have an index of questions and answers. The answer type has the following `join` field in the mapping:
For example, let's say we have an index of questions and answers. The answer type has the following `_parent` field in the mapping:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT child_example PUT child_example
{ {
"settings": {
"mapping.single_type": false
},
"mappings": { "mappings": {
"answer" : { "doc": {
"_parent" : { "properties": {
"type" : "question" "join": {
"type": "join",
"relations": {
"question": "answer"
}
}
} }
} }
} }
@ -26,15 +29,19 @@ PUT child_example
-------------------------------------------------- --------------------------------------------------
// CONSOLE // CONSOLE
The question typed document contain a tag field and the answer typed documents contain an owner field. With the `children` The `question` document contain a tag field and the `answer` documents contain an owner field. With the `children`
aggregation the tag buckets can be mapped to the owner buckets in a single request even though the two fields exist in aggregation the tag buckets can be mapped to the owner buckets in a single request even though the two fields exist in
two different kinds of documents. two different kinds of documents.
An example of a question typed document: An example of a question document:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT child_example/question/1 PUT child_example/doc/1
{ {
"join": {
"name": "question"
},
"body": "<p>I have Windows 2003 server and i bought a new Windows 2008 server...", "body": "<p>I have Windows 2003 server and i bought a new Windows 2008 server...",
"title": "Whats the best way to file transfer my site from server to a newer one?", "title": "Whats the best way to file transfer my site from server to a newer one?",
"tags": [ "tags": [
@ -47,11 +54,16 @@ PUT child_example/question/1
// CONSOLE // CONSOLE
// TEST[continued] // TEST[continued]
Examples of `answer` typed documents: Examples of `answer` documents:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT child_example/answer/1?parent=1&refresh PUT child_example/doc/2?routing=1
{ {
"join": {
"name": "answer",
"parent": "1"
},
"owner": { "owner": {
"location": "Norfolk, United Kingdom", "location": "Norfolk, United Kingdom",
"display_name": "Sam", "display_name": "Sam",
@ -60,8 +72,13 @@ PUT child_example/answer/1?parent=1&refresh
"body": "<p>Unfortunately you're pretty much limited to FTP...", "body": "<p>Unfortunately you're pretty much limited to FTP...",
"creation_date": "2009-05-04T13:45:37.030" "creation_date": "2009-05-04T13:45:37.030"
} }
PUT child_example/answer/2?parent=1&refresh
PUT child_example/doc/3?routing=1&refresh
{ {
"join": {
"name": "answer",
"parent": "1"
},
"owner": { "owner": {
"location": "Norfolk, United Kingdom", "location": "Norfolk, United Kingdom",
"display_name": "Troll", "display_name": "Troll",
@ -117,10 +134,18 @@ Possible response:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
{ {
"timed_out": false,
"took": 25, "took": 25,
"_shards": { "total": 5, "successful": 5, "failed": 0 }, "timed_out": false,
"hits": { "total": 3, "max_score": 0.0, hits: [] }, "_shards": {
"total": 5,
"successful": 5,
"failed": 0
},
"hits": {
"total": 3,
"max_score": 0.0,
"hits": []
},
"aggregations": { "aggregations": {
"top-tags": { "top-tags": {
"doc_count_error_upper_bound": 0, "doc_count_error_upper_bound": 0,

View File

@ -184,7 +184,7 @@ segments |1.4kb
|`segments.version_map_memory` |`svmm`, `segmentsVersionMapMemory` |No |`segments.version_map_memory` |`svmm`, `segmentsVersionMapMemory` |No
|Memory used by version map |1.0kb |Memory used by version map |1.0kb
|`segments.fixed_bitset_memory` |`sfbm`, `fixedBitsetMemory` |No |`segments.fixed_bitset_memory` |`sfbm`, `fixedBitsetMemory` |No
|Memory used by fixed bit sets for nested object field types and type filters for types referred in _parent fields |1.0kb |Memory used by fixed bit sets for nested object field types and type filters for types referred in `join` fields |1.0kb
|`suggest.current` |`suc`, `suggestCurrent` |No |Number of current suggest operations |0 |`suggest.current` |`suc`, `suggestCurrent` |No |Number of current suggest operations |0
|`suggest.time` |`suti`, `suggestTime` |No |Time spent in suggest |0 |`suggest.time` |`suti`, `suggestTime` |No |Time spent in suggest |0
|`suggest.total` |`suto`, `suggestTotal` |No |Number of suggest operations |0 |`suggest.total` |`suto`, `suggestTotal` |No |Number of suggest operations |0

View File

@ -216,14 +216,6 @@ Each bulk item can include the routing value using the
`_routing`/`routing` field. It automatically follows the behavior of the `_routing`/`routing` field. It automatically follows the behavior of the
index / delete operation based on the `_routing` mapping. index / delete operation based on the `_routing` mapping.
[float]
[[bulk-parent]]
=== Parent
Each bulk item can include the parent value using the `_parent`/`parent`
field. It automatically follows the behavior of the index / delete
operation based on the `_parent` / `_routing` mapping.
[float] [float]
[[bulk-wait-for-active-shards]] [[bulk-wait-for-active-shards]]
=== Wait For Active Shards === Wait For Active Shards

View File

@ -84,23 +84,6 @@ When the `_routing` mapping is set as `required` and no routing value is
specified, the delete api will throw a `RoutingMissingException` and reject specified, the delete api will throw a `RoutingMissingException` and reject
the request. the request.
[float]
[[delete-parent]]
=== Parent
The `parent` parameter can be set, which will basically be the same as
setting the routing parameter.
Note that deleting a parent document does not automatically delete its
children. One way of deleting all child documents given a parent's id is
to use the <<docs-delete-by-query,Delete By Query API>> to perform a
index with the automatically generated (and indexed)
field _parent, which is in the format parent_type#parent_id.
When deleting a child document its parent id must be specified, otherwise
the delete request will be rejected and a `RoutingMissingException` will be
thrown instead.
[float] [float]
[[delete-index-creation]] [[delete-index-creation]]
=== Automatic index creation === Automatic index creation

View File

@ -58,14 +58,6 @@ call in-place to make the document visible. This will also make other documents
changed since the last refresh visible. In order to disable realtime GET, changed since the last refresh visible. In order to disable realtime GET,
one can set the `realtime` parameter to `false`. one can set the `realtime` parameter to `false`.
[float]
[[type]]
=== Optional Type
The get API allows for `_type` to be optional. Set it to `_all` in order
to fetch the first document matching the id across all types.
[float] [float]
[[get-source-filtering]] [[get-source-filtering]]
=== Source filtering === Source filtering
@ -178,7 +170,7 @@ The result of the above get operation is:
Field values fetched from the document it self are always returned as an array. Field values fetched from the document it self are always returned as an array.
Since the `counter` field is not stored the get request simply ignores it when trying to get the `stored_fields.` Since the `counter` field is not stored the get request simply ignores it when trying to get the `stored_fields.`
It is also possible to retrieve metadata fields like `_routing` and `_parent` fields: It is also possible to retrieve metadata fields like the `_routing` field:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
@ -223,7 +215,7 @@ will fail.
[float] [float]
[[_source]] [[_source]]
=== Getting the _source directly === Getting the +_source+ directly
Use the `/{index}/{type}/{id}/_source` endpoint to get Use the `/{index}/{type}/{id}/_source` endpoint to get
just the `_source` field of the document, just the `_source` field of the document,

View File

@ -1,6 +1,8 @@
[[docs-index_]] [[docs-index_]]
== Index API == Index API
IMPORTANT: See <<removal-of-types>>.
The index API adds or updates a typed JSON document in a specific index, The index API adds or updates a typed JSON document in a specific index,
making it searchable. The following example inserts the JSON document making it searchable. The following example inserts the JSON document
into the "twitter" index, under a type called "tweet" with an id of 1: into the "twitter" index, under a type called "tweet" with an id of 1:
@ -268,41 +270,6 @@ additional document parsing pass. If the `_routing` mapping is defined
and set to be `required`, the index operation will fail if no routing and set to be `required`, the index operation will fail if no routing
value is provided or extracted. value is provided or extracted.
[float]
[[parent-children]]
=== Parents & Children
A child document can be indexed by specifying its parent when indexing.
For example:
[source,js]
--------------------------------------------------
PUT blogs
{
"settings": {
"mapping.single_type": false
},
"mappings": {
"tag_parent": {},
"blog_tag": {
"_parent": {
"type": "tag_parent"
}
}
}
}
PUT blogs/blog_tag/1122?parent=1111
{
"tag" : "something"
}
--------------------------------------------------
// CONSOLE
When indexing a child document, the routing value is automatically set
to be the same as its parent, unless the routing value is explicitly
specified using the `routing` parameter.
[float] [float]
[[index-distributed]] [[index-distributed]]
=== Distributed === Distributed

View File

@ -297,7 +297,6 @@ change:
* `_index` * `_index`
* `_version` * `_version`
* `_routing` * `_routing`
* `_parent`
Setting `_version` to `null` or clearing it from the `ctx` map is just like not Setting `_version` to `null` or clearing it from the `ctx` map is just like not
sending the version in an indexing request. It will cause that document to be sending the version in an indexing request. It will cause that document to be

View File

@ -66,8 +66,8 @@ POST test/type1/1/_update
// TEST[continued] // TEST[continued]
In addition to `_source`, the following variables are available through In addition to `_source`, the following variables are available through
the `ctx` map: `_index`, `_type`, `_id`, `_version`, `_routing`, the `ctx` map: `_index`, `_type`, `_id`, `_version`, `_routing`
`_parent`, and `_now` (the current timestamp). and `_now` (the current timestamp).
We can also add a new field to the document: We can also add a new field to the document:
@ -286,13 +286,6 @@ Routing is used to route the update request to the right shard and sets the
routing for the upsert request if the document being updated doesn't exist. routing for the upsert request if the document being updated doesn't exist.
Can't be used to update the routing of an existing document. Can't be used to update the routing of an existing document.
`parent`::
Parent is used to route the update request to the right shard and sets the
parent for the upsert request if the document being updated doesn't exist.
Can't be used to update the `parent` of an existing document.
If an alias index routing is specified then it overrides the parent routing and it is used to route the request.
`timeout`:: `timeout`::
Timeout waiting for a shard to become available. Timeout waiting for a shard to become available.

View File

@ -47,7 +47,6 @@ A node can be configured to join a specific cluster by the cluster name. By defa
In a single cluster, you can have as many nodes as you want. Furthermore, if there are no other Elasticsearch nodes currently running on your network, starting a single node will by default form a new single-node cluster named `elasticsearch`. In a single cluster, you can have as many nodes as you want. Furthermore, if there are no other Elasticsearch nodes currently running on your network, starting a single node will by default form a new single-node cluster named `elasticsearch`.
[sect2]
[float] [float]
=== Index === Index
@ -58,7 +57,9 @@ In a single cluster, you can define as many indexes as you want.
[float] [float]
=== Type === Type
Within an index, you can define one or more types. A type is a logical category/partition of your index whose semantics is completely up to you. In general, a type is defined for documents that have a set of common fields. For example, let's assume you run a blogging platform and store all your data in a single index. In this index, you may define a type for user data, another type for blog data, and yet another type for comments data. deprecated[6.0.0,See <<removal-of-types>>]
A type used to be a logical category/partition of your index to allow you to store different types of documents in the same index, eg one type for users, another type for blog posts. It is no longer possible to create multiple types in an index, and the whole concept of types will be removed in a later version. See <<removal-of-types>> for more.
[float] [float]
=== Document === Document
@ -359,13 +360,11 @@ You might also notice that the customer index has a yellow health tagged to it.
=== Index and Query a Document === Index and Query a Document
Let's now put something into our customer index. Remember previously that in order to index a document, we must tell Elasticsearch which type in the index it should go to. Let's now put something into our customer index. We'll index a simple customer document into the customer index, with an ID of 1 as follows:
Let's index a simple customer document into the customer index, "external" type, with an ID of 1 as follows:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT /customer/external/1?pretty PUT /customer/doc/1?pretty
{ {
"name": "John Doe" "name": "John Doe"
} }
@ -378,7 +377,7 @@ And the response:
-------------------------------------------------- --------------------------------------------------
{ {
"_index" : "customer", "_index" : "customer",
"_type" : "external", "_type" : "doc",
"_id" : "1", "_id" : "1",
"_version" : 1, "_version" : 1,
"result" : "created", "result" : "created",
@ -394,7 +393,7 @@ And the response:
-------------------------------------------------- --------------------------------------------------
// TESTRESPONSE[s/"_seq_no" : 0/"_seq_no" : $body._seq_no/ s/"_primary_term" : 1/"_primary_term" : $body._primary_term/] // TESTRESPONSE[s/"_seq_no" : 0/"_seq_no" : $body._seq_no/ s/"_primary_term" : 1/"_primary_term" : $body._primary_term/]
From the above, we can see that a new customer document was successfully created inside the customer index and the external type. The document also has an internal id of 1 which we specified at index time. From the above, we can see that a new customer document was successfully created inside the customer index. The document also has an internal id of 1 which we specified at index time.
It is important to note that Elasticsearch does not require you to explicitly create an index first before you can index documents into it. In the previous example, Elasticsearch will automatically create the customer index if it didn't already exist beforehand. It is important to note that Elasticsearch does not require you to explicitly create an index first before you can index documents into it. In the previous example, Elasticsearch will automatically create the customer index if it didn't already exist beforehand.
@ -402,7 +401,7 @@ Let's now retrieve that document that we just indexed:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
GET /customer/external/1?pretty GET /customer/doc/1?pretty
-------------------------------------------------- --------------------------------------------------
// CONSOLE // CONSOLE
// TEST[continued] // TEST[continued]
@ -413,7 +412,7 @@ And the response:
-------------------------------------------------- --------------------------------------------------
{ {
"_index" : "customer", "_index" : "customer",
"_type" : "external", "_type" : "doc",
"_id" : "1", "_id" : "1",
"_version" : 1, "_version" : 1,
"found" : true, "found" : true,
@ -451,11 +450,11 @@ Before we move on, let's take a closer look again at some of the API commands th
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT /customer PUT /customer
PUT /customer/external/1 PUT /customer/doc/1
{ {
"name": "John Doe" "name": "John Doe"
} }
GET /customer/external/1 GET /customer/doc/1
DELETE /customer DELETE /customer
-------------------------------------------------- --------------------------------------------------
// CONSOLE // CONSOLE
@ -481,18 +480,18 @@ We've previously seen how we can index a single document. Let's recall that comm
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT /customer/external/1?pretty PUT /customer/doc/1?pretty
{ {
"name": "John Doe" "name": "John Doe"
} }
-------------------------------------------------- --------------------------------------------------
// CONSOLE // CONSOLE
Again, the above will index the specified document into the customer index, external type, with the ID of 1. If we then executed the above command again with a different (or same) document, Elasticsearch will replace (i.e. reindex) a new document on top of the existing one with the ID of 1: Again, the above will index the specified document into the customer index, with the ID of 1. If we then executed the above command again with a different (or same) document, Elasticsearch will replace (i.e. reindex) a new document on top of the existing one with the ID of 1:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT /customer/external/1?pretty PUT /customer/doc/1?pretty
{ {
"name": "Jane Doe" "name": "Jane Doe"
} }
@ -504,7 +503,7 @@ The above changes the name of the document with the ID of 1 from "John Doe" to "
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT /customer/external/2?pretty PUT /customer/doc/2?pretty
{ {
"name": "Jane Doe" "name": "Jane Doe"
} }
@ -520,7 +519,7 @@ This example shows how to index a document without an explicit ID:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
POST /customer/external?pretty POST /customer/doc?pretty
{ {
"name": "Jane Doe" "name": "Jane Doe"
} }
@ -538,7 +537,7 @@ This example shows how to update our previous document (ID of 1) by changing the
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
POST /customer/external/1/_update?pretty POST /customer/doc/1/_update?pretty
{ {
"doc": { "name": "Jane Doe" } "doc": { "name": "Jane Doe" }
} }
@ -550,7 +549,7 @@ This example shows how to update our previous document (ID of 1) by changing the
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
POST /customer/external/1/_update?pretty POST /customer/doc/1/_update?pretty
{ {
"doc": { "name": "Jane Doe", "age": 20 } "doc": { "name": "Jane Doe", "age": 20 }
} }
@ -562,7 +561,7 @@ Updates can also be performed by using simple scripts. This example uses a scrip
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
POST /customer/external/1/_update?pretty POST /customer/doc/1/_update?pretty
{ {
"script" : "ctx._source.age += 5" "script" : "ctx._source.age += 5"
} }
@ -580,7 +579,7 @@ Deleting a document is fairly straightforward. This example shows how to delete
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
DELETE /customer/external/2?pretty DELETE /customer/doc/2?pretty
-------------------------------------------------- --------------------------------------------------
// CONSOLE // CONSOLE
// TEST[continued] // TEST[continued]
@ -597,7 +596,7 @@ As a quick example, the following call indexes two documents (ID 1 - John Doe an
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
POST /customer/external/_bulk?pretty POST /customer/doc/_bulk?pretty
{"index":{"_id":"1"}} {"index":{"_id":"1"}}
{"name": "John Doe" } {"name": "John Doe" }
{"index":{"_id":"2"}} {"index":{"_id":"2"}}
@ -609,7 +608,7 @@ This example updates the first document (ID of 1) and then deletes the second do
[source,sh] [source,sh]
-------------------------------------------------- --------------------------------------------------
POST /customer/external/_bulk?pretty POST /customer/doc/_bulk?pretty
{"update":{"_id":"1"}} {"update":{"_id":"1"}}
{"doc": { "name": "John Doe becomes Jane Doe" } } {"doc": { "name": "John Doe becomes Jane Doe" } }
{"delete":{"_id":"2"}} {"delete":{"_id":"2"}}

View File

@ -45,7 +45,7 @@
[[glossary-id]] id :: [[glossary-id]] id ::
The ID of a <<glossary-document,document>> identifies a document. The The ID of a <<glossary-document,document>> identifies a document. The
`index/type/id` of a document must be unique. If no ID is provided, `index/id` of a document must be unique. If no ID is provided,
then it will be auto-generated. (also see <<glossary-routing,routing>>) then it will be auto-generated. (also see <<glossary-routing,routing>>)
[[glossary-field]] field :: [[glossary-field]] field ::
@ -64,8 +64,8 @@
[[glossary-index]] index :: [[glossary-index]] index ::
An index is like a _table_ in a relational database. It has a An index is like a _table_ in a relational database. It has a
<<glossary-mapping,mapping>> which defines the <<glossary-field,fields>> in the index, <<glossary-mapping,mapping>> which contains a <<glossary-type,type>>,
which are grouped by multiple <<glossary-type,type>>. which contains the <<glossary-field,fields>> in the index.
+ +
An index is a logical namespace which maps to one or more An index is a logical namespace which maps to one or more
<<glossary-primary-shard,primary shards>> and can have zero or more <<glossary-primary-shard,primary shards>> and can have zero or more
@ -74,8 +74,8 @@
[[glossary-mapping]] mapping :: [[glossary-mapping]] mapping ::
A mapping is like a _schema definition_ in a relational database. Each A mapping is like a _schema definition_ in a relational database. Each
<<glossary-index,index>> has a mapping, which defines each <<glossary-type,type>> <<glossary-index,index>> has a mapping, which defines a <<glossary-type,type>>,
within the index, plus a number of index-wide settings. plus a number of index-wide settings.
+ +
A mapping can either be defined explicitly, or it will be generated A mapping can either be defined explicitly, or it will be generated
automatically when a document is indexed. automatically when a document is indexed.
@ -179,11 +179,6 @@
[[glossary-type]] type :: [[glossary-type]] type ::
A type represents the _type_ of document, e.g. an `email`, a `user`, or a `tweet`. A type used to represent the _type_ of document, e.g. an `email`, a `user`, or a `tweet`.
The search API can filter documents by type. Types are deprecated and are in the process of being removed. See <<removal-of-types>>.
An <<glossary-index,index>> can contain multiple types, and each type has a
list of <<glossary-field,fields>> that can be specified for
<<glossary-document,documents>> of that type. Fields with the same
name in different types in the same index must have the same <<glossary-mapping,mapping>>
(which defines how each field in the document is indexed and made searchable).

View File

@ -244,8 +244,6 @@ GET /alias2/_search?q=user:kimchy&routing=2,3
// CONSOLE // CONSOLE
// TEST[continued] // TEST[continued]
If an index operation that uses index routing alias also has a parent routing, the
parent routing is ignored.
[float] [float]
[[alias-adding]] [[alias-adding]]

View File

@ -74,7 +74,7 @@ that can be set when creating an index, please check the
[[mappings]] [[mappings]]
=== Mappings === Mappings
The create index API allows to provide a set of one or more mappings: The create index API allows to provide a type mapping:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------

View File

@ -7,20 +7,7 @@ fields to an existing type:
[source,js] [source,js]
-------------------------------------------------- --------------------------------------------------
PUT twitter <1> PUT twitter <1>
{ {}
"settings": {
"mapping.single_type": false
},
"mappings": {
"tweet": {
"properties": {
"message": {
"type": "text"
}
}
}
}
}
PUT twitter/_mapping/user <2> PUT twitter/_mapping/user <2>
{ {
@ -31,19 +18,19 @@ PUT twitter/_mapping/user <2>
} }
} }
PUT twitter/_mapping/tweet <3> PUT twitter/_mapping/user <3>
{ {
"properties": { "properties": {
"user_name": { "email": {
"type": "text" "type": "keyword"
} }
} }
} }
-------------------------------------------------- --------------------------------------------------
// CONSOLE // CONSOLE
<1> <<indices-create-index,Creates an index>> called `twitter` with the `message` field in the `tweet` <<mapping-type,mapping type>>. <1> <<indices-create-index,Creates an index>> called `twitter` without any type mapping.
<2> Uses the PUT mapping API to add a new mapping type called `user`. <2> Uses the PUT mapping API to add a new mapping type called `user`.
<3> Uses the PUT mapping API to add a new field called `user_name` to the `tweet` mapping type. <3> Uses the PUT mapping API to add a new field called `email` to the `user` mapping type.
More information on how to define type mappings can be found in the More information on how to define type mappings can be found in the
<<mapping,mapping>> section. <<mapping,mapping>> section.
@ -139,80 +126,3 @@ PUT my_index/_mapping/user
Each <<mapping-params,mapping parameter>> specifies whether or not its setting Each <<mapping-params,mapping parameter>> specifies whether or not its setting
can be updated on an existing field. can be updated on an existing field.
[float]
[[merging-conflicts]]
=== Conflicts between fields in different types
Fields in the same index with the same name in two different types must have
the same mapping, as they are backed by the same field internally. Trying to
<<updating-field-mappings,update a mapping parameter>> for a field which
exists in more than one type will throw an exception, unless you specify the
`update_all_types` parameter, in which case it will update that parameter
across all fields with the same name in the same index.
TIP: The only parameters which are exempt from this rule -- they can be set to
different values on each field -- can be found in <<field-conflicts>>.
For example, this fails:
[source,js]
-----------------------------------
PUT my_index
{
"settings": {
"mapping.single_type": false
},
"mappings": {
"type_one": {
"properties": {
"text": { <1>
"type": "text",
"analyzer": "standard"
}
}
},
"type_two": {
"properties": {
"text": { <1>
"type": "text",
"analyzer": "standard"
}
}
}
}
}
PUT my_index/_mapping/type_one <2>
{
"properties": {
"text": {
"type": "text",
"analyzer": "standard",
"search_analyzer": "whitespace"
}
}
}
-----------------------------------
// CONSOLE
// TEST[catch:request]
<1> Create an index with two types, both of which contain a `text` field which have the same mapping.
<2> Trying to update the `search_analyzer` just for `type_one` throws an exception like `"Merge failed with failures..."`.
But this then running this succeeds:
[source,js]
-----------------------------------
PUT my_index/_mapping/type_one?update_all_types <1>
{
"properties": {
"text": {
"type": "text",
"analyzer": "standard",
"search_analyzer": "whitespace"
}
}
}
-----------------------------------
// CONSOLE
// TEST[continued]
<1> Adding the `update_all_types` parameter updates the `text` field in `type_one` and `type_two`.

View File

@ -498,7 +498,7 @@ The following example sets the `_id` metadata field of a document to `1`:
-------------------------------------------------- --------------------------------------------------
// NOTCONSOLE // NOTCONSOLE
The following metadata fields are accessible by a processor: `_index`, `_type`, `_id`, `_routing`, `_parent`. The following metadata fields are accessible by a processor: `_index`, `_type`, `_id`, `_routing`.
[float] [float]
[[accessing-ingest-metadata]] [[accessing-ingest-metadata]]

View File

@ -17,13 +17,14 @@ are stored and indexed. For instance, use mappings to define:
[float] [float]
[[mapping-type]] [[mapping-type]]
== Mapping Types == Mapping Type
Each index has one or more _mapping types_, which are used to divide the Each index has one _mapping type_ which determines how the document will be
documents in an index into logical groups. User documents might be stored in a indexed.
`user` type, and blog posts in a `blogpost` type.
Each mapping type has: deprecated[6.0.0,See <<removal-of-types>>].
A mapping type has:
<<mapping-fields,Meta-fields>>:: <<mapping-fields,Meta-fields>>::
@ -34,11 +35,8 @@ treated. Examples of meta-fields include the document's
<<mapping-types,Fields>> or _properties_:: <<mapping-types,Fields>> or _properties_::
Each mapping type contains a list of fields or `properties` pertinent to that A mapping type contains a list of fields or `properties` pertinent to the
type. A `user` type might contain `title`, `name`, and `age` fields, while a document.
`blogpost` type might contain `title`, `body`, `user_id` and `created` fields.
Fields with the same name in different mapping types in the same index
<<field-conflicts,must have the same mapping>>.
[float] [float]
@ -91,14 +89,12 @@ causing a mapping explosion:
== Dynamic mapping == Dynamic mapping
Fields and mapping types do not need to be defined before being used. Thanks Fields and mapping types do not need to be defined before being used. Thanks
to _dynamic mapping_, new mapping types and new field names will be added to _dynamic mapping_, new field names will be added automatically, just by
automatically, just by indexing a document. New fields can be added both to indexing a document. New fields can be added both to the top-level mapping
the top-level mapping type, and to inner <<object,`object`>> and type, and to inner <<object,`object`>> and <<nested,`nested`>> fields.
<<nested,`nested`>> fields.
The The <<dynamic-mapping,dynamic mapping>> rules can be configured to customise
<<dynamic-mapping,dynamic mapping>> rules can be configured to the mapping that is used for new fields.
customise the mapping that is used for new types and new fields.
[float] [float]
== Explicit mappings == Explicit mappings
@ -107,70 +103,33 @@ You know more about your data than Elasticsearch can guess, so while dynamic
mapping can be useful to get started, at some point you will want to specify mapping can be useful to get started, at some point you will want to specify
your own explicit mappings. your own explicit mappings.
You can create mapping types and field mappings when you You can create field mappings when you
<<indices-create-index,create an index>>, and you can add mapping types and <<indices-create-index,create an index>>, and you can add
fields to an existing index with the <<indices-put-mapping,PUT mapping API>>. fields to an existing index with the <<indices-put-mapping,PUT mapping API>>.
[float] [float]
== Updating existing mappings == Updating existing field mappings
Other than where documented, *existing type and field mappings cannot be Other than where documented, *existing field mappings cannot be
updated*. Changing the mapping would mean invalidating already indexed updated*. Changing the mapping would mean invalidating already indexed
documents. Instead, you should create a new index with the correct mappings documents. Instead, you should create a new index with the correct mappings
and reindex your data into that index. and <<docs-reindex,reindex>> your data into that index.
[[field-conflicts]]
[float]
== Fields are shared across mapping types
Mapping types are used to group fields, but the fields in each mapping type
are not independent of each other. Fields with:
* the _same name_
* in the _same index_
* in _different mapping types_
* map to the _same field_ internally,
* and *must have the same mapping*.
If a `title` field exists in both the `user` and `blogpost` mapping types, the
`title` fields must have exactly the same mapping in each type. The only
exceptions to this rule are the <<copy-to>>, <<dynamic>>, <<enabled>>,
<<ignore-above>>, and <<properties>> parameters, which may have different
settings per field.
Usually, fields with the same name also contain the same type of data, so
having the same mapping is not a problem. When conflicts do arise, these can
be solved by choosing more descriptive names, such as `user_title` and
`blog_title`.
[float] [float]
== Example mapping == Example mapping
A mapping for the example described above could be specified when creating the A mapping could be specified when creating an index, as follows:
index, as follows:
[source,js] [source,js]
--------------------------------------- ---------------------------------------
PUT my_index <1> PUT my_index <1>
{ {
"settings": {
"mapping.single_type": false
},
"mappings": { "mappings": {
"user": { <2> "doc": { <2>
"properties": { <3> "properties": { <3>
"title": { "type": "text" }, <4> "title": { "type": "text" }, <4>
"name": { "type": "text" }, <4> "name": { "type": "text" }, <4>
"age": { "type": "integer" } <4> "age": { "type": "integer" }, <4>
}
},
"blogpost": { <2>
"properties": { <3>
"title": { "type": "text" }, <4>
"body": { "type": "text" }, <4>
"user_id": {
"type": "keyword" <4>
},
"created": { "created": {
"type": "date", <4> "type": "date", <4>
"format": "strict_date_optional_time||epoch_millis" "format": "strict_date_optional_time||epoch_millis"
@ -182,13 +141,15 @@ PUT my_index <1>
--------------------------------------- ---------------------------------------
// CONSOLE // CONSOLE
<1> Create an index called `my_index`. <1> Create an index called `my_index`.
<2> Add mapping types called `user` and `blogpost`. <2> Add a mapping type called `doc`.
<3> Specify fields or _properties_ in each mapping type. <3> Specify fields or _properties_.
<4> Specify the data `type` and mapping for each field. <4> Specify the data `type` and mapping for each field.
-- --
include::mapping/removal_of_types.asciidoc[]
include::mapping/types.asciidoc[] include::mapping/types.asciidoc[]
include::mapping/fields.asciidoc[] include::mapping/fields.asciidoc[]

View File

@ -16,14 +16,10 @@ PUT data/counters/1 <1>
<1> Creates the `data` index, the `counters` mapping type, and a field <1> Creates the `data` index, the `counters` mapping type, and a field
called `count` with datatype `long`. called `count` with datatype `long`.
The automatic detection and addition of new types and fields is called The automatic detection and addition of new fields is called
_dynamic mapping_. The dynamic mapping rules can be customised to suit your _dynamic mapping_. The dynamic mapping rules can be customised to suit your
purposes with: purposes with:
<<default-mapping,`_default_` mapping>>::
Configure the base mapping to be used for new mapping types.
<<dynamic-field-mapping,Dynamic field mappings>>:: <<dynamic-field-mapping,Dynamic field mappings>>::
The rules governing dynamic field detection. The rules governing dynamic field detection.
@ -36,50 +32,9 @@ TIP: <<indices-templates,Index templates>> allow you to configure the default
mappings, settings and aliases for new indices, whether created mappings, settings and aliases for new indices, whether created
automatically or explicitly. automatically or explicitly.
[float]
=== Disabling automatic type creation
Automatic type creation can be disabled per-index by setting the `index.mapper.dynamic`
setting to `false` in the index settings:
[source,js]
--------------------------------------------------
PUT data/_settings
{
"index.mapper.dynamic":false <1>
}
--------------------------------------------------
// CONSOLE
// TEST[continued]
<1> Disable automatic type creation for the index named "data".
Automatic type creation can also be disabled for all indices by setting an index template:
[source,js]
--------------------------------------------------
PUT _template/template_all
{
"index_patterns": ["*"],
"order":0,
"settings": {
"index.mapper.dynamic": false <1>
}
}
--------------------------------------------------
// CONSOLE
// TEST[continued]
<1> Disable automatic type creation for all indices.
Regardless of the value of this setting, types can still be added explicitly
when <<indices-create-index,creating an index>> or with the
<<indices-put-mapping,PUT mapping>> API.
include::dynamic/default-mapping.asciidoc[]
include::dynamic/field-mapping.asciidoc[] include::dynamic/field-mapping.asciidoc[]
include::dynamic/templates.asciidoc[] include::dynamic/templates.asciidoc[]
include::dynamic/default-mapping.asciidoc[]

View File

@ -1,89 +1,14 @@
[[default-mapping]] [[default-mapping]]
=== `_default_` mapping === `_default_` mapping
The default mapping, which will be used as the base mapping for any new deprecated[6.0.0,See <<removal-of-types>>]
mapping types, can be customised by adding a mapping type with the name
The default mapping, which will be used as the base mapping for a new
mapping type, can be customised by adding a mapping type with the name
`_default_` to an index, either when `_default_` to an index, either when
<<indices-create-index,creating the index>> or later on with the <<indices-create-index,creating the index>> or later on with the
<<indices-put-mapping,PUT mapping>> API. <<indices-put-mapping,PUT mapping>> API.
The documentation for this feature has been removed as it no longer makes
sense in 6.x where there can be only a single type per index.
[source,js]
--------------------------------------------------
PUT my_index
{
"settings": {
"mapping.single_type": false
},
"mappings": {
"_default_": { <1>
"_source": {
"enabled": false
}
},
"user": {}, <2>
"blogpost": { <3>
"_source": {
"enabled": true
}
}
}
}
--------------------------------------------------
// CONSOLE
<1> The `_default_` mapping defaults the <<mapping-source-field,`_source`>> field to disabled.
<2> The `user` type inherits the settings from `_default_`.
<3> The `blogpost` type overrides the defaults and enables the <<mapping-source-field,`_source`>> field.
NOTE: When updating the `_default_` mapping with the
<<indices-put-mapping,PUT mapping>> API, the new mapping is not merged with
the existing mapping. Instead, the new `_default_` mapping replaces the
existing one.
While the `_default_` mapping can be updated after an index has been created,
the new defaults will only affect mapping types that are created afterwards.
The `_default_` mapping can be used in conjunction with
<<indices-templates,Index templates>> to control dynamically created types
within automatically created indices:
[source,js]
--------------------------------------------------
PUT _template/logging
{
"index_patterns": ["logs-*"], <1>
"settings": { "number_of_shards": 1 }, <2>
"mappings": {
"_default_": {
"_field_names": { <3>
"enabled": false
},
"dynamic_templates": [
{
"strings": { <4>
"match_mapping_type": "string",
"mapping": {
"type": "text",
"fields": {
"raw": {
"type": "keyword",
"ignore_above": 256
}
}
}
}
}
]
}
}
}
PUT logs-2015.10.01/event/1
{ "message": "error:16" }
--------------------------------------------------
// CONSOLE
<1> The `logging` template will match any indices beginning with `logs-`.
<2> Matching indices will be created with a single primary shard.
<3> The `_field_names` field will be disabled by default for new type mappings.
<4> String fields will be created with a `text` main field, and a `keyword` `.raw` field.

View File

@ -396,34 +396,3 @@ PUT my_index
<1> Like the default dynamic mapping rules, doubles are mapped as floats, which <1> Like the default dynamic mapping rules, doubles are mapped as floats, which
are usually accurate enough, yet require half the disk space. are usually accurate enough, yet require half the disk space.
[[override-default-template]]
=== Override default template
You can override the default mappings for all indices and all types
by specifying a `_default_` type mapping in an index template
which matches all indices.
For example, to disable the `_field_names` field by default for all types in all
new indices, you could create the following index template:
[source,js]
--------------------------------------------------
PUT _template/disable_field_names
{
"order": 0,
"index_patterns": ["*"], <1>
"mappings": {
"_default_": { <2>
"_field_names": { <3>
"enabled": false
}
}
}
}
--------------------------------------------------
// CONSOLE
<1> Applies the mappings to an `index` which matches the pattern `*`, in other
words, all new indices.
<2> Defines the `_default_` type mapping types within the index.
<3> Disables the `_field_names` field by default.

View File

@ -49,11 +49,7 @@ can be customised when a mapping type is created.
All fields in the document which contain non-null values. All fields in the document which contain non-null values.
[float] [float]
=== Routing meta-fields === Routing meta-field
<<mapping-parent-field,`_parent`>>::
Used to create a parent-child relationship between two mapping types.
<<mapping-routing-field,`_routing`>>:: <<mapping-routing-field,`_routing`>>::
@ -67,7 +63,6 @@ can be customised when a mapping type is created.
Application specific metadata. Application specific metadata.
include::fields/all-field.asciidoc[] include::fields/all-field.asciidoc[]
include::fields/field-names-field.asciidoc[] include::fields/field-names-field.asciidoc[]
@ -78,8 +73,6 @@ include::fields/index-field.asciidoc[]
include::fields/meta-field.asciidoc[] include::fields/meta-field.asciidoc[]
include::fields/parent-field.asciidoc[]
include::fields/routing-field.asciidoc[] include::fields/routing-field.asciidoc[]
include::fields/source-field.asciidoc[] include::fields/source-field.asciidoc[]

View File

@ -1,7 +1,7 @@
[[mapping-meta-field]] [[mapping-meta-field]]
=== `_meta` field === `_meta` field
Each mapping type can have custom meta data associated with it. These are not A mapping type can have custom meta data associated with it. These are not
used at all by Elasticsearch, but can be used to store application-specific used at all by Elasticsearch, but can be used to store application-specific
metadata, such as the class that a document belongs to: metadata, such as the class that a document belongs to:

View File

@ -1,169 +0,0 @@
[[mapping-parent-field]]
=== `_parent` field
A parent-child relationship can be established between documents in the same
index by making one mapping type the parent of another:
[source,js]
--------------------------------------------------
PUT my_index
{
"settings": {
"mapping.single_type": false
},
"mappings": {
"my_parent": {},
"my_child": {
"_parent": {
"type": "my_parent" <1>
}
}
}
}
PUT my_index/my_parent/1 <2>
{
"text": "This is a parent document"
}
PUT my_index/my_child/2?parent=1 <3>
{
"text": "This is a child document"
}
PUT my_index/my_child/3?parent=1&refresh=true <3>
{
"text": "This is another child document"
}
GET my_index/my_parent/_search
{
"query": {
"has_child": { <4>
"type": "my_child",
"query": {
"match": {
"text": "child document"
}
}
}
}
}
--------------------------------------------------
// CONSOLE
<1> The `my_parent` type is parent to the `my_child` type.
<2> Index a parent document.
<3> Index two child documents, specifying the parent document's ID.
<4> Find all parent documents that have children which match the query.
See the <<query-dsl-has-child-query,`has_child`>> and
<<query-dsl-has-parent-query,`has_parent`>> queries,
the <<search-aggregations-bucket-children-aggregation,`children`>> aggregation,
and <<parent-child-inner-hits,inner hits>> for more information.
The value of the `_parent` field is accessible in aggregations
and scripts, and may be queried with the
<<query-dsl-parent-id-query, `parent_id` query>>:
[source,js]
--------------------------
GET my_index/_search
{
"query": {
"parent_id": { <1>
"type": "my_child",
"id": "1"
}
},
"aggs": {
"parents": {
"terms": {
"field": "_parent", <2>
"size": 10
}
}
},
"script_fields": {
"parent": {
"script": {
"source": "doc['_parent']" <3>
}
}
}
}
--------------------------
// CONSOLE
// TEST[continued]
<1> Querying the id of the `_parent` field (also see the <<query-dsl-has-parent-query,`has_parent` query>> and the <<query-dsl-has-child-query,`has_child` query>>)
<2> Aggregating on the `_parent` field (also see the <<search-aggregations-bucket-children-aggregation,`children`>> aggregation)
<3> Accessing the `_parent` field in scripts
==== Parent-child restrictions
* The parent and child types must be different -- parent-child relationships
cannot be established between documents of the same type.
* The `_parent.type` setting can only point to a type that doesn't exist yet.
This means that a type cannot become a parent type after it has been
created.
* Parent and child documents must be indexed on the same shard. The `parent`
ID is used as the <<mapping-routing-field,routing>> value for the child,
to ensure that the child is indexed on the same shard as the parent.
This means that the same `parent` value needs to be provided when
<<docs-get,getting>>, <<docs-delete,deleting>>, or <<docs-update,updating>>
a child document.
==== Global ordinals
Parent-child uses <<eager-global-ordinals,global ordinals>> to speed up joins.
Global ordinals need to be rebuilt after any change to a shard. The more
parent id values are stored in a shard, the longer it takes to rebuild the
global ordinals for the `_parent` field.
Global ordinals, by default, are built eagerly: if the index has changed,
global ordinals for the `_parent` field will be rebuilt as part of the refresh.
This can add significant time the refresh. However most of the times this is the
right trade-off, otherwise global ordinals are rebuilt when the first parent-child
query or aggregation is used. This can introduce a significant latency spike for
your users and usually this is worse as multiple global ordinals for the `_parent`
field may be attempt rebuilt within a single refresh interval when many writes
are occurring.
When the parent/child is used infrequently and writes occur frequently it may
make sense to disable eager loading:
[source,js]
--------------------------------------------------
PUT my_index
{
"settings": {
"mapping.single_type": false
},
"mappings": {
"my_parent": {},
"my_child": {
"_parent": {
"type": "my_parent",
"eager_global_ordinals": false
}
}
}
}
--------------------------------------------------
// CONSOLE
The amount of heap used by global ordinals can be checked as follows:
[source,sh]
--------------------------------------------------
# Per-index
GET _stats/fielddata?human&fields=_parent
# Per-node per-index
GET _nodes/stats/indices/fielddata?human&fields=_parent
--------------------------------------------------
// CONSOLE

View File

@ -6,8 +6,7 @@ formula:
shard_num = hash(_routing) % num_primary_shards shard_num = hash(_routing) % num_primary_shards
The default value used for `_routing` is the document's <<mapping-id-field,`_id`>> The default value used for `_routing` is the document's <<mapping-id-field,`_id`>>.
or the document's <<mapping-parent-field,`_parent`>> ID, if present.
Custom routing patterns can be implemented by specifying a custom `routing` Custom routing patterns can be implemented by specifying a custom `routing`
value per document. For instance: value per document. For instance:
@ -133,5 +132,5 @@ less than `index.number_of_shards`.
Once enabled, the partitioned index will have the following limitations: Once enabled, the partitioned index will have the following limitations:
* Mappings with parent-child relationships cannot be created within it. * Mappings with <<parent-join,`join` field>> relationships cannot be created within it.
* All mappings within the index must have the `_routing` field marked as required. * All mappings within the index must have the `_routing` field marked as required.

View File

@ -1,6 +1,8 @@
[[mapping-type-field]] [[mapping-type-field]]
=== `_type` field === `_type` field
deprecated::[6.0.0,See <<removal-of-types>>]
Each document indexed is associated with a <<mapping-type-field,`_type`>> (see Each document indexed is associated with a <<mapping-type-field,`_type`>> (see
<<mapping-type>>) and an <<mapping-id-field,`_id`>>. The `_type` field is <<mapping-type>>) and an <<mapping-id-field,`_id`>>. The `_type` field is
indexed in order to make searching by type name fast. indexed in order to make searching by type name fast.
@ -11,28 +13,17 @@ scripts, and when sorting:
[source,js] [source,js]
-------------------------- --------------------------
# Example documents # Example documents
PUT my_index
{
"settings": {
"mapping.single_type": false
}
}
PUT my_index/type_1/1 PUT my_index/doc/1?refresh=true
{ {
"text": "Document with type 1" "text": "Document with type 'doc'"
}
PUT my_index/type_2/2?refresh=true
{
"text": "Document with type 2"
} }
GET my_index/_search GET my_index/_search
{ {
"query": { "query": {
"terms": { "term": {
"_type": [ "type_1", "type_2" ] <1> "_type": "doc" <1>
} }
}, },
"aggs": { "aggs": {

View File

@ -1,9 +1,7 @@
[[mapping-uid-field]] [[mapping-uid-field]]
=== `_uid` field === `_uid` field
deprecated[6.0.0, Now that types have been removed, documents are uniquely deprecated[6.0.0, Now that types have been removed, documents are uniquely identified by their `_id` and the `_uid` field has only been kept as a view over the `_id` field for backward compatibility.]
identified by their `_id` and the `_uid` field has only been kept as a view
over the `_id` field for backward compatibility.]
Each document indexed is associated with a <<mapping-type-field,`_type`>> (see Each document indexed is associated with a <<mapping-type-field,`_type`>> (see
<<mapping-type>>) and an <<mapping-id-field,`_id`>>. These values are <<mapping-type>>) and an <<mapping-id-field,`_id`>>. These values are

View File

@ -82,6 +82,5 @@ PUT my_index
<2> The `user` object inherits the type-level setting. <2> The `user` object inherits the type-level setting.
<3> The `user.social_networks` object enables dynamic mapping, so new fields may be added to this inner object. <3> The `user.social_networks` object enables dynamic mapping, so new fields may be added to this inner object.
TIP: The `dynamic` setting is allowed to have different settings for fields of TIP: The `dynamic` setting can be updated on existing fields
the same name in the same index. Its value can be updated on existing fields
using the <<indices-put-mapping,PUT mapping API>>. using the <<indices-put-mapping,PUT mapping API>>.

View File

@ -92,7 +92,6 @@ GET my_index/_mapping <3>
<2> The document can be retrieved. <2> The document can be retrieved.
<3> Checking the mapping reveals that no fields have been added. <3> Checking the mapping reveals that no fields have been added.
TIP: The `enabled` setting is allowed to have different settings for fields of TIP: The `enabled` setting can be updated on existing fields
the same name in the same index. Its value can be updated on existing fields
using the <<indices-put-mapping,PUT mapping API>>. using the <<indices-put-mapping,PUT mapping API>>.

View File

@ -31,8 +31,7 @@ Many APIs which support date values also support <<date-math,date math>>
expressions, such as `now-1m/d` -- the current time, minus one month, rounded expressions, such as `now-1m/d` -- the current time, minus one month, rounded
down to the nearest day. down to the nearest day.
TIP: The `format` setting must have the same setting for fields of the same TIP: The `format` setting can be updated on existing fields using the
name in the same index. Its value can be updated on existing fields using the
<<indices-put-mapping,PUT mapping API>>. <<indices-put-mapping,PUT mapping API>>.

View File

@ -4,4 +4,3 @@
The `index` option controls whether field values are indexed. It accepts `true` The `index` option controls whether field values are indexed. It accepts `true`
or `false`. Fields that are not indexed are not queryable. or `false`. Fields that are not indexed are not queryable.
NOTE: For the legacy mapping type <<string,`string`>> the `index` option only accepts legacy values `analyzed` (default, treat as full-text field), `not_analyzed` (treat as keyword field) and `no`.

View File

@ -85,6 +85,3 @@ GET my_index/groups/_search
<2> The phrase query matches our document which is weird, but its what we asked <2> The phrase query matches our document which is weird, but its what we asked
for in the mapping. for in the mapping.
TIP: The `position_increment_gap` setting is allowed to have different settings
for fields of the same name in the same index. Its value can be updated on
existing fields using the <<indices-put-mapping,PUT mapping API>>.

View File

@ -78,6 +78,5 @@ GET my_index/_search
See {defguide}/_index_time_search_as_you_type.html[Index time search-as-you- See {defguide}/_index_time_search_as_you_type.html[Index time search-as-you-
type] for a full explanation of this example. type] for a full explanation of this example.
TIP: The `search_analyzer` setting must have the same setting for fields of TIP: The `search_analyzer` setting can be updated on existing fields
the same name in the same index. Its value can be updated on existing fields
using the <<indices-put-mapping,PUT mapping API>>. using the <<indices-put-mapping,PUT mapping API>>.

View File

@ -0,0 +1,419 @@
[[removal-of-types]]
== Removal of mapping types
IMPORTANT: Indices created in Elasticsearch 6.0.0 or later may only contain a
single <<mapping-type,mapping type>>. Indices created in 5.x with multiple
mapping types will continue to function as before in Elasticsearch 6.x.
Mapping types will be completely removed in Elasticsearch 7.0.0.
[float]
=== What are mapping types?
Since the first release of Elasticsearch, each document has been stored in a
single index and assigned a single mapping type. A mapping type was used to
represent the type of document or entity being indexed, for instance a
`twitter` index might have a `user` type and a `tweet` type.
Each mapping type could have its own fields, so the `user` type might have a
`full_name` field, a `user_name` field, and an `email` field, while the
`tweet` type could have a `content` field, a `tweeted_at` field and, like the
`user` type, a `user_name` field.
Each document had a `_type` meta-field containing the type name, and searches
could be limited to one or more types by specifying the type name(s) in the
URL:
[source,js]
----
GET twitter/user,tweet/_search
{
"query": {
"match": {
"user_name": "kimchy"
}
}
}
----
// NOCONSOLE
The `_type` field was combined with the document's `_id` to generate a `_uid`
field, so documents of different types with the same `_id` could exist in a
single index.
Mapping types were also used to establish a
/guide/en/elasticsearch/reference/5.4/mapping-parent-field.html[parent-child relationship]
between documents, so documents of type `question` could be parents to
documents of type `answer`.
[float]
=== Why are mapping types being removed?
Initially, we spoke about an ``index'' being similar to a ``database'' in an
SQL database, and a ``type'' being equivalent to a
``table''.
This was a bad analogy that led to incorrect assumptions. In an SQL database,
tables are independent of each other. The columns in one table have no
bearing on columns with the same name in another table. This is not the case
for fields in a mapping type.
In an Elasticsearch index, fields that have the same name in different mapping
types are backed by the same Lucene field internally. In other words, using
the example above, the `user_name` field in the `user` type is stored in
exactly the same field as the `user_name` field in the `tweet` type, and both
`user_name` fields must have the same mapping (definition) in both types.
This can lead to frustration when, for example, you want `deleted` to be a
`date` field in one type and a `boolean` field in another type in the same
index.
On top of that, storing different entities that have few or no fields in
common in the same index leads to sparse data and interferes with Lucene's
ability to compress documents efficiently.
For these reasons, we have decided to remove the concept of mapping types from
Elasticsearch.
[float]
=== Alternatives to mapping types
[float]
==== Index per document type
The first alternative is to have an index per document type. Instead of
storing tweets and users in a single `twitter` index, you could store tweets
in the `tweets` index and users in the `user` index. Indices are completely
independent of each other and so there will be no conflict of field types
between indices.
This approach has two benefits:
* Data is more likely to be dense and so benefit from compression techniques
used in Lucene.
* The term statistics used for scoring in full text search are more likely to
be accurate because all documents in the same index represent a single
entity.
Each index can be sized appropriately for the number of documents it will
contain: you can use a smaller number of primary shards for `users` and a
larger number of primary shards for `tweets`.
[float]
==== Custom type field
Of course, there is a limit to how many primary shards can exist in a cluster
so you many not want to waste an entire shard for a collection of only a few
thousand documents. In this case, you can implement your own custom `type`
field which will work in a similar way to the old `_type`.
Let's take the `user`/`tweet` example above. Originally, the workflow would
have looked something like this:
[source,js]
----
PUT twitter
{
"mappings": {
"user": {
"properties": {
"name": { "type": "text" },
"user_name": { "type": "keyword" },
"email": { "type": "keyword" }
}
},
"tweet": {
"properties": {
"content": { "type": "text" },
"user_name": { "type": "keyword" },
"tweeted_at": { "type": "date" }
}
}
}
}
PUT twitter/user/kimchy
{
"name": "Shay Banon",
"user_name": "kimchy",
"email": "shay@kimchy.com"
}
PUT twitter/tweet/1
{
"user_name": "kimchy",
"tweeted_at": "2017-10-24T09:00:00Z",
"content": "Types are going away"
}
GET twitter/tweet/_search
{
"query": {
"match": {
"user_name": "kimchy"
}
}
}
----
// NOCONSOLE
You could achieve the same thing by adding a custom `type` field as follows:
[source,js]
----
PUT twitter
{
"mappings": {
"doc": {
"properties": {
"type": { "type": "keyword" }, <1>
"name": { "type": "text" },
"user_name": { "type": "keyword" },
"email": { "type": "keyword" },
"content": { "type": "text" },
"tweeted_at": { "type": "date" }
}
}
}
}
PUT twitter/doc/user-kimchy
{
"type": "user", <1>
"name": "Shay Banon",
"user_name": "kimchy",
"email": "shay@kimchy.com"
}
PUT twitter/doc/tweet-1
{
"type": "tweet", <1>
"user_name": "kimchy",
"tweeted_at": "2017-10-24T09:00:00Z",
"content": "Types are going away"
}
GET twitter/_search
{
"query": {
"bool": {
"must": {
"match": {
"user_name": "kimchy"
}
},
"filter": {
"match": {
"type": "tweet" <1>
}
}
}
}
}
----
// NOCONSOLE
<1> The explicit `type` field takes the place of the implicit `_type` field.
[float]
==== Parent/Child without mapping types
Previously, a parent-child relationship was represented by making one mapping
type the parent, and one or more other mapping types the children. Without
types, we can no longer use this syntax. The parent-child feature will
continue to function as before, except that the way of expressing the
relationship between documents has been changed to use the new
<<parent-join,`join` field>>.
[float]
=== Schedule for removal of mapping types
This is a big change for our users, so we have tried to make it as painless as
possible. The change will roll out as follows:
Elasticsearch 5.6.0::
* Setting `index.mapping.single_type: true` on an index will enable the
single-type-per-index behaviour which will be enforced in 6.0.
* The <<parent-join,`join` field>> replacement for parent-child is available
on indices created in 5.6.
Elasticsearch 6.x::
* Indices created in 5.x will continue to function in 6.x as they did in 5.x.
* Indices created in 6.x only allow a single-type per index. Any name
can be used for the type, but there can be only one.
* The `_type` name can no longer be combined with the `_id` to form the `_uid`
field. The `_uid` field has become an alias for the `_id` field.
* New indices no longer support the old-style of parent/child and should
use the <<parent-join,`join` field>> instead.
* The `_default_` mapping type is deprecated.
Elasticsearch 7.x::
* The `type` parameter in URLs are optional. For instance, indexing
a document no longer requires a document `type`.
* The `GET|PUT _mapping` APIs support a query string parameter
(`include_type_name`) which indicates whether the body should include
a layer for the type name. It defaults to `true`. 7.x indices which
don't have an explicit type will use the dummy type name `_doc`.
* The `_default_` mapping type is removed.
Elasticsearch 8.x::
* The `type` parameter is no longer supported in URLs.
* The `include_in_type` parameter defaults to `false`.
Elasticsearch 9.x::
* The `include_in_type` parameter is removed.
[float]
=== Migrating multi-type indices to single-type
The <<docs-reindex,Reindex API>> can be used to convert multi-type indices to
single-type indices. The following examples can be used in Elasticsearch 5.6
or Elasticsearch 6.x. In 6.x, there is no need to specify
`index.mapping.single_type` as that is the default.
[float]
==== Index per document type
This first example splits our `twitter` index into a `tweets` index and a
`users` index:
[source,js]
----
PUT users
{
"settings": {
"index.mapping.single_type": true
},
"mappings": {
"user": {
"properties": {
"name": {
"type": "text"
},
"user_name": {
"type": "keyword"
},
"email": {
"type": "keyword"
}
}
}
}
}
PUT tweets
{
"settings": {
"index.mapping.single_type": true
},
"mappings": {
"tweet": {
"properties": {
"content": {
"type": "text"
},
"user_name": {
"type": "keyword"
},
"tweeted_at": {
"type": "date"
}
}
}
}
}
POST _reindex
{
"source": {
"index": "twitter",
"type": "user"
},
"dest": {
"index": "users"
}
}
POST _reindex
{
"source": {
"index": "twitter",
"type": "tweet"
},
"dest": {
"index": "tweets"
}
}
----
// NOCONSOLE
[float]
==== Custom type field
This next example adds a custom `type` field and sets it to the value of the
original `_type`. It also adds the type to the `_id` in case there are any
documents of different types which have conflicting IDs:
[source,js]
----
PUT new_twitter
{
"mappings": {
"doc": {
"properties": {
"type": {
"type": "keyword"
},
"name": {
"type": "text"
},
"user_name": {
"type": "keyword"
},
"email": {
"type": "keyword"
},
"content": {
"type": "text"
},
"tweeted_at": {
"type": "date"
}
}
}
}
}
POST _reindex
{
"source": {
"index": "twitter"
},
"dest": {
"index": "new_twitter"
},
"script": {
"source": """
ctx._source.type = ctx._type;
ctx._id = ctx._type + '-' + ctx._id;
ctx._type = 'doc';
"""
}
}
----
// NOCONSOLE

View File

@ -78,8 +78,6 @@ include::types/numeric.asciidoc[]
include::types/object.asciidoc[] include::types/object.asciidoc[]
include::types/string.asciidoc[]
include::types/text.asciidoc[] include::types/text.asciidoc[]
include::types/token-count.asciidoc[] include::types/token-count.asciidoc[]

View File

@ -60,7 +60,7 @@ WARNING: It is required to index the lineage of a parent in the same shard so yo
always route child documents using their greater parent id. always route child documents using their greater parent id.
For instance the following index two children documents pointing to the same parent `1 For instance the following index two children documents pointing to the same parent `1`
with a `routing` value equals to the `id` of the parent: with a `routing` value equals to the `id` of the parent:
[source,js] [source,js]

View File

@ -1,18 +0,0 @@
[[string]]
=== String datatype
The `string` field is unsupported for indexes created in 5.x in favor of the
`text` and `keyword` fields. Attempting to create a string field in an index
created in 5.x will cause Elasticsearch to attempt to upgrade the `string` into
the appropriate `text` or `keyword` field. It will return an HTTP `Warning`
header telling you that `string` is deprecated. This upgrade process isn't
always perfect because there are some combinations of features that are
supported by `string` but not `text` or `keyword`. For that reason it is better
to use `text` or `keyword`.
Indexes imported from 2.x *only* support `string` and not `text` or `keyword`.
To ease the migration from 2.x Elasticsearch will downgrade `text` and `keyword`
mappings applied to indexes imported from 2.x into `string`. While long lived
indexes will eventually need to be recreated against 5.x before eventually
upgrading to 6.x, this downgrading smooths the process before you find time for
it.

View File

@ -128,8 +128,3 @@ The following parameters are accepted by `text` fields:
Whether term vectors should be stored for an <<mapping-index,`analyzed`>> Whether term vectors should be stored for an <<mapping-index,`analyzed`>>
field. Defaults to `no`. field. Defaults to `no`.
NOTE: Indexes imported from 2.x do not support `text`. Instead they will
attempt to downgrade `text` into `string`. This allows you to merge modern
mappings with legacy mappings. Long lived indexes will have to be recreated
before upgrading to 6.x but mapping downgrade gives you the opportunity to do
the recreation on your own schedule.

View File

@ -13,8 +13,8 @@ fields are used to index arrays of objects, where each object can be queried
<<query-dsl-has-child-query,`has_child`>> and <<query-dsl-has-parent-query,`has_parent`>> queries:: <<query-dsl-has-child-query,`has_child`>> and <<query-dsl-has-parent-query,`has_parent`>> queries::
A <<mapping-parent-field,parent-child relationship>> can exist between two A <<parent-join,`join` field relationship>> can exist between
document types within a single index. The `has_child` query returns parent documents within a single index. The `has_child` query returns parent
documents whose child documents match the specified query, while the documents whose child documents match the specified query, while the
`has_parent` query returns child documents whose parent document matches the `has_parent` query returns child documents whose parent document matches the
specified query. specified query.

View File

@ -1,37 +1,42 @@
[[query-dsl-parent-id-query]] [[query-dsl-parent-id-query]]
=== Parent Id Query === Parent Id Query
added[5.0.0]
The `parent_id` query can be used to find child documents which belong to a particular parent. The `parent_id` query can be used to find child documents which belong to a particular parent.
Given the following mapping definition: Given the following mapping definition:
[source,js] [source,js]
-------------------------------------------- --------------------------------------------
PUT /my_index PUT my_index
{ {
"settings": {
"mapping.single_type": false
},
"mappings": { "mappings": {
"blog_post": { "doc": {
"properties": { "properties": {
"name": { "my_join_field": {
"type": "keyword" "type": "join",
"relations": {
"my_parent": "my_child"
} }
} }
},
"blog_tag": {
"_parent": {
"type": "blog_post"
},
"_routing": {
"required": true
} }
} }
} }
} }
PUT my_index/doc/1?refresh
{
"text": "This is a parent document",
"my_join_field": "my_parent"
}
PUT my_index/doc/2?routing=1&refresh
{
"text": "This is a child document",
"my_join_field": {
"name": "my_child",
"parent": "1"
}
}
-------------------------------------------- --------------------------------------------
// CONSOLE // CONSOLE
// TESTSETUP // TESTSETUP
@ -42,7 +47,7 @@ GET /my_index/_search
{ {
"query": { "query": {
"parent_id": { "parent_id": {
"type" : "blog_tag", "type": "my_child",
"id": "1" "id": "1"
} }
} }
@ -50,36 +55,14 @@ GET /my_index/_search
-------------------------------------------------- --------------------------------------------------
// CONSOLE // CONSOLE
The above is functionally equivalent to using the following
<<query-dsl-has-parent-query, `has_parent`>> query, but performs
better as it does not need to do a join:
[source,js]
--------------------------------------------------
GET /my_index/_search
{
"query": {
"has_parent": {
"parent_type": "blog_post",
"query": {
"term": {
"_id": "1"
}
}
}
}
}
--------------------------------------------------
// CONSOLE
==== Parameters ==== Parameters
This query has two required parameters: This query has two required parameters:
[horizontal] [horizontal]
`type`:: The **child** type. This must be a type with `_parent` field. `type`:: The **child** type name, as specified in the <<parent-join,`join` field>>.
`id`:: The ID of the parent document.
`id`:: The required parent id select documents must referrer to.
`ignore_unmapped`:: When set to `true` this will ignore an unmapped `type` and will not match any `ignore_unmapped`:: When set to `true` this will ignore an unmapped `type` and will not match any
documents for this query. This can be useful when querying multiple indexes documents for this query. This can be useful when querying multiple indexes

View File

@ -55,6 +55,11 @@ directory. Instead, mappings should be created using the API with:
* <<indices-put-mapping>> * <<indices-put-mapping>>
* <<indices-templates>> * <<indices-templates>>
[role="exclude",id="mapping-parent-field"]
=== `_parent` field
The `_parent` field has been removed in favour of the <<parent-join,`join` field>>.
[role="exclude",id="modules-memcached"] [role="exclude",id="modules-memcached"]
=== memcached === memcached

View File

@ -39,8 +39,7 @@ be used for highlighting if it mapped to have `store` set to `true`.
================================== ==================================
The field name supports wildcard notation. For example, using `comment_*` The field name supports wildcard notation. For example, using `comment_*`
will cause all <<text,text>> and <<keyword,keyword>> fields (and <<string,string>> will cause all <<text,text>> and <<keyword,keyword>> fields that match the expression to be highlighted.
from versions before 5.0) that match the expression to be highlighted.
Note that all other fields will not be highlighted. If you use a custom mapper and want to Note that all other fields will not be highlighted. If you use a custom mapper and want to
highlight on a field anyway, you have to provide the field name explicitly. highlight on a field anyway, you have to provide the field name explicitly.

View File

@ -190,7 +190,7 @@ So in the above example only the comment part is returned per nested hit and not
document that contained the comment. document that contained the comment.
[[nested-inner-hits-source]] [[nested-inner-hits-source]]
==== Nested inner hits and _source ==== Nested inner hits and +_source+
Nested document don't have a `_source` field, because the entire source of document is stored with the root document under Nested document don't have a `_source` field, because the entire source of document is stored with the root document under
its `_source` field. To include the source of just the nested document, the source of the root document is parsed and just its `_source` field. To include the source of just the nested document, the source of the root document is parsed and just

View File

@ -40,8 +40,7 @@ GET /_search
If the requested fields are not stored (`store` mapping set to `false`), they will be ignored. If the requested fields are not stored (`store` mapping set to `false`), they will be ignored.
Stored field values fetched from the document itself are always returned as an array. On the contrary, metadata fields like `_routing` and Stored field values fetched from the document itself are always returned as an array. On the contrary, metadata fields like `_routing` are never returned as an array.
`_parent` fields are never returned as an array.
Also only leaf fields can be returned via the `field` option. So object fields can't be returned and such requests Also only leaf fields can be returned via the `field` option. So object fields can't be returned and such requests
will fail. will fail.