diff --git a/README.md b/README.md index b8d25ef6817..f0e327bf948 100644 --- a/README.md +++ b/README.md @@ -1,36 +1,86 @@ Mapper Attachments Type for ElasticSearch ========================================= -The mapper attachments plugin adds the `attachment` type to ElasticSearch using Tika. +The mapper attachments plugin adds the `attachment` type to ElasticSearch using [Apache Tika](http://lucene.apache.org/tika/). In order to install the plugin, simply run: `bin/plugin -install elasticsearch/elasticsearch-mapper-attachments/1.8.0`. - ------------------------------------------------------ - | Attachment Mapper Plugin | ElasticSearch | Tika | - ------------------------------------------------------ - | 1.9.0-SNAPSHOT (master) | 0.90.3 -> master | 1.4 | - ------------------------------------------------------ - | 1.8.0 | 0.90.3 -> master | 1.2 | - ------------------------------------------------------ - | 1.7.0 | 0.90 -> 0.90.2 | 1.2 | - ------------------------------------------------------ - | 1.6.0 | 0.19 -> 0.20 | 1.2 | - ------------------------------------------------------ - | 1.5.0 | 0.19 -> 0.20 | 1.2 | - ------------------------------------------------------ - | 1.4.0 | 0.19 -> 0.20 | 1.1 | - ------------------------------------------------------ - | 1.3.0 | 0.19 -> 0.20 | 1.0 | - ------------------------------------------------------ - | 1.2.0 | 0.19 -> 0.20 | 1.0 | - ------------------------------------------------------ - | 1.1.0 | 0.19 -> 0.20 | | - ------------------------------------------------------ - | 1.0.0 | 0.18 | | - ------------------------------------------------------ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Attachment Mapper PluginElasticSearchTikaRelease date
1.9.0-SNAPSHOT (master)0.90.3 -> master1.4
1.8.00.90.3 -> master1.207/08/2013
1.7.00.90 -> 0.90.21.226/02/2013
1.6.00.19 -> 0.201.228/09/2012
1.5.00.19 -> 0.201.219/09/2012
1.4.00.19 -> 0.201.125/03/2012
1.3.00.19 -> 0.201.007/03/2012
1.2.00.19 -> 0.201.015/02/2012
1.1.00.19 -> 0.200.1007/02/2012
1.0.00.180.1005/12/2011
-The `attachment` type allows to index different "attachment" type field (encoded as `base64`), for example, microsoft office formats, open document formats, ePub, HTML, and so on (full list can be found [here](http://lucene.apache.org/tika/0.10/formats.html)). +The `attachment` type allows to index different "attachment" type field (encoded as `base64`), for example, +microsoft office formats, open document formats, ePub, HTML, and so on (full list can be found [here](http://tika.apache.org/1.4/formats.html)). The `attachment` type is provided as a plugin extension. The plugin is a simple zip file that can be downloaded and placed under `$ES_HOME/plugins` location. It will be automatically detected and the `attachment` type will be added. @@ -112,8 +162,6 @@ By default, `100000` characters are extracted when indexing the content. This de Note, this feature is support since `1.3.0` version. -The plugin uses [Apache Tika](http://lucene.apache.org/tika/) to parse attachments, so many formats are supported, listed [here](http://lucene.apache.org/tika/0.10/formats.html). - Metadata parsing error handling -------------------------------