OpenSearch/docs/plugins/repository-gcs.asciidoc

[[repository-gcs]]
=== Google Cloud Storage Repository Plugin

The GCS repository plugin adds support for using the https://cloud.google.com/storage/[Google Cloud Storage]
service as a repository for {ref}/modules-snapshots.html[Snapshot/Restore].

:plugin_name: repository-gcs
include::install_remove.asciidoc[]

[[repository-gcs-usage]]
==== Getting started

The plugin uses the https://cloud.google.com/storage/docs/json_api/[Google Cloud Storage JSON API] (v1)
to connect to the Storage service. If this is the first time you use Google Cloud Storage, you first
need to connect to the https://console.cloud.google.com/[Google Cloud Platform Console] and create a new
project. Once your project is created, you must enable the Cloud Storage Service for your project.

[[repository-gcs-creating-bucket]]
===== Creating a Bucket

Google Cloud Storage service uses the concept of https://cloud.google.com/storage/docs/key-terms[Bucket]
as a container for all the data. Buckets are usually created using the
https://console.cloud.google.com/[Google Cloud Platform Console]. The plugin will not automatically
create buckets.

To create a new bucket:

1. Connect to the https://console.cloud.google.com/[Google Cloud Platform Console]
2. Select your project
3. Got to the https://console.cloud.google.com/storage/browser[Storage Browser]
4. Click the "Create Bucket" button
5. Enter a the name of the new bucket
6. Select a storage class
7. Select a location
8. Click the "Create" button

The bucket should now be created.

[[repository-gcs-service-authentication]]
===== Service Authentication

The plugin supports two authentication modes:

* The built-in <<repository-gcs-using-compute-engine, Compute Engine authentication>>. This mode is
recommended if your elasticsearch node is running on a Compute Engine virtual machine.

* Specifying <<repository-gcs-using-service-account, Service Account>> credentials.

[[repository-gcs-using-compute-engine]]
===== Using Compute Engine
When running on Compute Engine, the plugin use Google's built-in authentication mechanism to
authenticate on the Storage service. Compute Engine virtual machines are usually associated to a
default service account. This service account can be found in the VM instance details in the
https://console.cloud.google.com/compute/[Compute Engine console].

This is the default authentication mode and requires no configuration.

NOTE: The Compute Engine VM must be allowed to use the Storage service. This can be done only at VM
creation time, when "Storage" access can be configured to "Read/Write" permission. Check your
instance details at the section "Cloud API access scopes".

[[repository-gcs-using-service-account]]
===== Using a Service Account
If your elasticsearch node is not running on Compute Engine, or if you don't want to use Google's
built-in authentication mechanism, you can authenticate on the Storage service using a
https://cloud.google.com/iam/docs/overview#service_account[Service Account] file.

To create a service account file:
1. Connect to the https://console.cloud.google.com/[Google Cloud Platform Console]
2. Select your project
3. Got to the https://console.cloud.google.com/permissions[Permission] tab
4. Select the https://console.cloud.google.com/permissions/serviceaccounts[Service Accounts] tab
5. Click on "Create service account"
6. Once created, select the new service account and download a JSON key file

A service account file looks like this:

[source,js]
----
{
  "type": "service_account",
  "project_id": "your-project-id",
  "private_key_id": "...",
  "private_key": "-----BEGIN PRIVATE KEY-----\n...\n-----END PRIVATE KEY-----\n",
  "client_email": "service-account-for-your-repository@your-project-id.iam.gserviceaccount.com",
  "client_id": "...",
  "auth_uri": "https://accounts.google.com/o/oauth2/auth",
  "token_uri": "https://accounts.google.com/o/oauth2/token",
  "auth_provider_x509_cert_url": "https://www.googleapis.com/oauth2/v1/certs",
  "client_x509_cert_url": "..."
}
----
// NOTCONSOLE

This file must be stored in the {ref}/secure-settings.html[elasticsearch keystore], under a setting name
of the form `gcs.client.NAME.credentials_file`, where `NAME` is the name of the client congiguration.
The default client name is `default`, but a different client name can be specified in repository
settings using `client`.

For example, if specifying the credentials file in the keystore under
`gcs.client.my_alternate_client.credentials_file`, you can configure a repository to use these
credentials like this:

[source,js]
----
PUT _snapshot/my_gcs_repository
{
  "type": "gcs",
  "settings": {
    "bucket": "my_bucket",
    "client": "my_alternate_client"
  }
}
----
// CONSOLE
// TEST[skip:we don't have gcs setup while testing this]

[[repository-gcs-bucket-permission]]
===== Set Bucket Permission

The service account used to access the bucket must have the "Writer" access to the bucket:

1. Connect to the https://console.cloud.google.com/[Google Cloud Platform Console]
2. Select your project
3. Got to the https://console.cloud.google.com/storage/browser[Storage Browser]
4. Select the bucket and "Edit bucket permission"
5. The service account must be configured as a "User" with "Writer" access


[[repository-gcs-repository]]
==== Create a Repository

Once everything is installed and every node is started, you can create a new repository that
uses Google Cloud Storage to store snapshots:

[source,js]
----
PUT _snapshot/my_gcs_repository
{
  "type": "gcs",
  "settings": {
    "bucket": "my_bucket"
  }
}
----
// CONSOLE
// TEST[skip:we don't have gcs setup while testing this]

The following settings are supported:

`bucket`::

    The name of the bucket to be used for snapshots. (Mandatory)

`client`::

    The client congfiguration to use. This controls which credentials are used to connect
    to Compute Engine.

`base_path`::

    Specifies the path within bucket to repository data. Defaults to
    the root of the bucket.

`chunk_size`::

    Big files can be broken down into chunks during snapshotting if needed.
    The chunk size can be specified in bytes or by using size value notation,
    i.e. `1g`, `10m`, `5k`. Defaults to `100m`.

`compress`::

    When set to `true` metadata files are stored in compressed format. This
    setting doesn't affect index files that are already compressed by default.
    Defaults to `false`.

`application_name`::

    Name used by the plugin when it uses the Google Cloud JSON API. Setting
    a custom name can be useful to authenticate your cluster when requests
    statistics are logged in the Google Cloud Platform. Default to `repository-gcs`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`[[repository-gcs]]`
			`=== Google Cloud Storage Repository Plugin`

			`The GCS repository plugin adds support for using the https://cloud.google.com/storage/[Google Cloud Storage]`
			`service as a repository for {ref}/modules-snapshots.html[Snapshot/Restore].`

Added "release-state" support to plugin docs 2017-04-20 15:01:37 +02:00			`:plugin_name: repository-gcs`
			`include::install_remove.asciidoc[]`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
			`[[repository-gcs-usage]]`
			`==== Getting started`

			`The plugin uses the https://cloud.google.com/storage/docs/json_api/[Google Cloud Storage JSON API] (v1)`
			`to connect to the Storage service. If this is the first time you use Google Cloud Storage, you first`
			`need to connect to the https://console.cloud.google.com/[Google Cloud Platform Console] and create a new`
			`project. Once your project is created, you must enable the Cloud Storage Service for your project.`

			`[[repository-gcs-creating-bucket]]`
			`===== Creating a Bucket`

			`Google Cloud Storage service uses the concept of https://cloud.google.com/storage/docs/key-terms[Bucket]`
			`as a container for all the data. Buckets are usually created using the`
			`https://console.cloud.google.com/[Google Cloud Platform Console]. The plugin will not automatically`
			`create buckets.`

			`To create a new bucket:`

			`1. Connect to the https://console.cloud.google.com/[Google Cloud Platform Console]`
			`2. Select your project`
			`3. Got to the https://console.cloud.google.com/storage/browser[Storage Browser]`
			`4. Click the "Create Bucket" button`
			`5. Enter a the name of the new bucket`
			`6. Select a storage class`
			`7. Select a location`
			`8. Click the "Create" button`

			`The bucket should now be created.`

			`[[repository-gcs-service-authentication]]`
			`===== Service Authentication`

			`The plugin supports two authentication modes:`

GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`* The built-in <<repository-gcs-using-compute-engine, Compute Engine authentication>>. This mode is`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`recommended if your elasticsearch node is running on a Compute Engine virtual machine.`

GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`* Specifying <<repository-gcs-using-service-account, Service Account>> credentials.`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
			`[[repository-gcs-using-compute-engine]]`
			`===== Using Compute Engine`
GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`When running on Compute Engine, the plugin use Google's built-in authentication mechanism to`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`authenticate on the Storage service. Compute Engine virtual machines are usually associated to a`
			`default service account. This service account can be found in the VM instance details in the`
			`https://console.cloud.google.com/compute/[Compute Engine console].`

GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`This is the default authentication mode and requires no configuration.`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
			`NOTE: The Compute Engine VM must be allowed to use the Storage service. This can be done only at VM`
			`creation time, when "Storage" access can be configured to "Read/Write" permission. Check your`
			`instance details at the section "Cloud API access scopes".`

			`[[repository-gcs-using-service-account]]`
			`===== Using a Service Account`
GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`If your elasticsearch node is not running on Compute Engine, or if you don't want to use Google's`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`built-in authentication mechanism, you can authenticate on the Storage service using a`
			`https://cloud.google.com/iam/docs/overview#service_account[Service Account] file.`

			`To create a service account file:`
			`1. Connect to the https://console.cloud.google.com/[Google Cloud Platform Console]`
			`2. Select your project`
			`3. Got to the https://console.cloud.google.com/permissions[Permission] tab`
			`4. Select the https://console.cloud.google.com/permissions/serviceaccounts[Service Accounts] tab`
			`5. Click on "Create service account"`
			`6. Once created, select the new service account and download a JSON key file`

			`A service account file looks like this:`

Test docs for plugins We weren't doing it before because we weren't starting the plugins. Now we are. The hardest part of this was handling the files the tests expect to be on the filesystem. extraConfigFiles was broken. 2016-05-13 16:15:51 -04:00			`[source,js]`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`----`
			`{`
			`"type": "service_account",`
			`"project_id": "your-project-id",`
			`"private_key_id": "...",`
			`"private_key": "-----BEGIN PRIVATE KEY-----\n...\n-----END PRIVATE KEY-----\n",`
			`"client_email": "service-account-for-your-repository@your-project-id.iam.gserviceaccount.com",`
			`"client_id": "...",`
			`"auth_uri": "https://accounts.google.com/o/oauth2/auth",`
			`"token_uri": "https://accounts.google.com/o/oauth2/token",`
			`"auth_provider_x509_cert_url": "https://www.googleapis.com/oauth2/v1/certs",`
			`"client_x509_cert_url": "..."`
			`}`
			`----`
Remove most of the need for `// NOTCONSOLE` and be much more stingy about what we consider a console candidate. * Add `// CONSOLE` to check-running * Fix version in some snippets * Mark groovy snippets as groovy * Fix versions in plugins * Fix language marker errors * Fix language parsing in snippets This adds support for snippets who's language is written like `[source, txt]` and `["source","js",subs="attributes,callouts"]`. This also makes language required for snippets which is nice because then we can be sure we can grep for snippets in a particular language. 2016-08-29 17:33:25 -04:00			`// NOTCONSOLE`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
[DOCS] Fixed cross doc xref in plugin docs. 2017-05-16 17:55:47 -07:00			`This file must be stored in the {ref}/secure-settings.html[elasticsearch keystore], under a setting name`
GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			of the form `gcs.client.NAME.credentials_file`, where `NAME` is the name of the client congiguration.
			The default client name is `default`, but a different client name can be specified in repository
			settings using `client`.
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`For example, if specifying the credentials file in the keystore under`
			`gcs.client.my_alternate_client.credentials_file`, you can configure a repository to use these
			`credentials like this:`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
Test docs for plugins We weren't doing it before because we weren't starting the plugins. Now we are. The hardest part of this was handling the files the tests expect to be on the filesystem. extraConfigFiles was broken. 2016-05-13 16:15:51 -04:00			`[source,js]`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`----`
			`PUT _snapshot/my_gcs_repository`
			`{`
			`"type": "gcs",`
			`"settings": {`
			`"bucket": "my_bucket",`
GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`"client": "my_alternate_client"`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`}`
			`}`
			`----`
			`// CONSOLE`
Test docs for plugins We weren't doing it before because we weren't starting the plugins. Now we are. The hardest part of this was handling the files the tests expect to be on the filesystem. extraConfigFiles was broken. 2016-05-13 16:15:51 -04:00			`// TEST[skip:we don't have gcs setup while testing this]`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
			`[[repository-gcs-bucket-permission]]`
			`===== Set Bucket Permission`

			`The service account used to access the bucket must have the "Writer" access to the bucket:`

			`1. Connect to the https://console.cloud.google.com/[Google Cloud Platform Console]`
			`2. Select your project`
Update repository-gcs.asciidoc Fixed link 2016-05-27 17:44:58 +03:00			`3. Got to the https://console.cloud.google.com/storage/browser[Storage Browser]`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`4. Select the bucket and "Edit bucket permission"`
			`5. The service account must be configured as a "User" with "Writer" access`


			`[[repository-gcs-repository]]`
			`==== Create a Repository`

			`Once everything is installed and every node is started, you can create a new repository that`
			`uses Google Cloud Storage to store snapshots:`

Test docs for plugins We weren't doing it before because we weren't starting the plugins. Now we are. The hardest part of this was handling the files the tests expect to be on the filesystem. extraConfigFiles was broken. 2016-05-13 16:15:51 -04:00			`[source,js]`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`----`
			`PUT _snapshot/my_gcs_repository`
			`{`
			`"type": "gcs",`
			`"settings": {`
GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`"bucket": "my_bucket"`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00			`}`
			`}`
			`----`
			`// CONSOLE`
Test docs for plugins We weren't doing it before because we weren't starting the plugins. Now we are. The hardest part of this was handling the files the tests expect to be on the filesystem. extraConfigFiles was broken. 2016-05-13 16:15:51 -04:00			`// TEST[skip:we don't have gcs setup while testing this]`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
			`The following settings are supported:`

			`bucket`::

			`The name of the bucket to be used for snapshots. (Mandatory)`

GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`client`::
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
GCS Repository: Add secure storage of credentials (#24697) This commit adds gcs credential settings to the elasticsearch keystore. The setting name follows the same pattern as the s3 client settings, beginning with `gcs.client.`, followed by the client name, and then the setting name, in this case, `credentials_file`. Using the legacy service file setting is also deprecated. 2017-05-16 17:17:37 -07:00			`The client congfiguration to use. This controls which credentials are used to connect`
			`to Compute Engine.`
Add Google Cloud Storage repository plugin Closes #12880 2015-09-15 17:35:10 +02:00
			`base_path`::

			`Specifies the path within bucket to repository data. Defaults to`
			`the root of the bucket.`

			`chunk_size`::

			`Big files can be broken down into chunks during snapshotting if needed.`
			`The chunk size can be specified in bytes or by using size value notation,`
			i.e. `1g`, `10m`, `5k`. Defaults to `100m`.

			`compress`::

			When set to `true` metadata files are stored in compressed format. This
			`setting doesn't affect index files that are already compressed by default.`
			Defaults to `false`.

			`application_name`::

			`Name used by the plugin when it uses the Google Cloud JSON API. Setting`
			`a custom name can be useful to authenticate your cluster when requests`
			statistics are logged in the Google Cloud Platform. Default to `repository-gcs`