--- layout: default title: High-level Python client nav_order: 5 --- # High-level Python client The OpenSearch high-level Python client (`opensearch-dsl-py`) provides wrapper classes for common OpenSearch entities, like documents, so you can work with them as Python objects. Additionally, the high-level client simplifies writing queries and supplies convenient Python methods for common OpenSearch operations. The high-level Python client supports creating and indexing documents, searching with and without filters, and updating documents using queries. This getting started guide illustrates how to connect to OpenSearch, index documents, and run queries. For the client source code, see the [opensearch-dsl-py repo](https://github.com/opensearch-project/opensearch-dsl-py). ## Setup To add the client to your project, install it using [pip](https://pip.pypa.io/): ```bash pip install opensearch-dsl ``` {% include copy.html %} After installing the client, you can import it like any other module: ```python from opensearchpy import OpenSearch from opensearch_dsl import Search ``` {% include copy.html %} ## Connecting to OpenSearch To connect to the default OpenSearch host, create a client object with SSL enabled if you are using the Security plugin. You can use the default credentials for testing purposes: ```python host = 'localhost' port = 9200 auth = ('admin', 'admin') # For testing only. Don't store credentials in code. ca_certs_path = '/full/path/to/root-ca.pem' # Provide a CA bundle if you use intermediate CAs with your root CA. # Create the client with SSL/TLS enabled, but hostname verification disabled. client = OpenSearch( hosts = [{'host': host, 'port': port}], http_compress = True, # enables gzip compression for request bodies http_auth = auth, use_ssl = True, verify_certs = True, ssl_assert_hostname = False, ssl_show_warn = False, ca_certs = ca_certs_path ) ``` {% include copy.html %} If you have your own client certificates, specify them in the `client_cert_path` and `client_key_path` parameters: ```python host = 'localhost' port = 9200 auth = ('admin', 'admin') # For testing only. Don't store credentials in code. ca_certs_path = '/full/path/to/root-ca.pem' # Provide a CA bundle if you use intermediate CAs with your root CA. # Optional client certificates if you don't want to use HTTP basic authentication. client_cert_path = '/full/path/to/client.pem' client_key_path = '/full/path/to/client-key.pem' # Create the client with SSL/TLS enabled, but hostname verification disabled. client = OpenSearch( hosts = [{'host': host, 'port': port}], http_compress = True, # enables gzip compression for request bodies http_auth = auth, client_cert = client_cert_path, client_key = client_key_path, use_ssl = True, verify_certs = True, ssl_assert_hostname = False, ssl_show_warn = False, ca_certs = ca_certs_path ) ``` {% include copy.html %} If you are not using the Security plugin, create a client object with SSL disabled: ```python host = 'localhost' port = 9200 # Create the client with SSL/TLS and hostname verification disabled. client = OpenSearch( hosts = [{'host': host, 'port': port}], http_compress = True, # enables gzip compression for request bodies use_ssl = False, verify_certs = False, ssl_assert_hostname = False, ssl_show_warn = False ) ``` {% include copy.html %} ## Creating an index To create an OpenSearch index, use the `client.indices.create()` method. You can use the following code to construct a JSON object with custom settings: ```python index_name = 'my-dsl-index' index_body = { 'settings': { 'index': { 'number_of_shards': 4 } } } response = client.indices.create(index_name, body=index_body) ``` {% include copy.html %} ## Indexing a document You can create a class to represent the documents that you'll index in OpenSearch by extending the `Document` class: ```python class Movie(Document): title = Text(fields={'raw': Keyword()}) director = Text() year = Text() class Index: name = index_name def save(self, ** kwargs): return super(Movie, self).save(** kwargs) ``` {% include copy.html %} To index a document, create an object of the new class and call its `save()` method: ```python # Set up the opensearch-py version of the document Movie.init(using=client) doc = Movie(meta={'id': 1}, title='Moneyball', director='Bennett Miller', year='2011') response = doc.save(using=client) ``` {% include copy.html %} ## Performing bulk operations You can perform several operations at the same time by using the `bulk()` method of the client. The operations may be of the same type or of different types. Note that the operations must be separated by a `\n` and the entire string must be a single line: ```python movies = '{ "index" : { "_index" : "my-dsl-index", "_id" : "2" } } \n { "title" : "Interstellar", "director" : "Christopher Nolan", "year" : "2014"} \n { "create" : { "_index" : "my-dsl-index", "_id" : "3" } } \n { "title" : "Star Trek Beyond", "director" : "Justin Lin", "year" : "2015"} \n { "update" : {"_id" : "3", "_index" : "my-dsl-index" } } \n { "doc" : {"year" : "2016"} }' client.bulk(movies) ``` {% include copy.html %} ## Searching for documents You can use the `Search` class to construct a query. The following code creates a Boolean query with a filter: ```python s = Search(using=client, index=index_name) \ .filter("term", year="2011") \ .query("match", title="Moneyball") response = s.execute() ``` {% include copy.html %} The preceding query is equivalent to the following query in OpenSearch domain-specific language (DSL): ```json GET my-dsl-index/_search { "query": { "bool": { "must": { "match": { "title": "Moneyball" } }, "filter": { "term" : { "year": 2011 } } } } } ``` ## Deleting a document You can delete a document using the `client.delete()` method: ```python response = client.delete( index = 'my-dsl-index', id = '1' ) ``` {% include copy.html %} ## Deleting an index You can delete an index using the `client.indices.delete()` method: ```python response = client.indices.delete( index = 'my-dsl-index' ) ``` {% include copy.html %} ## Sample program The following sample program creates a client, adds an index with non-default settings, inserts a document, performs bulk operations, searches for the document, deletes the document, and then deletes the index: ```python from opensearchpy import OpenSearch from opensearch_dsl import Search, Document, Text, Keyword host = 'localhost' port = 9200 auth = ('admin', 'admin') # For testing only. Don't store credentials in code. ca_certs_path = 'root-ca.pem' # Create the client with SSL/TLS enabled, but hostname verification disabled. client = OpenSearch( hosts=[{'host': host, 'port': port}], http_compress=True, # enables gzip compression for request bodies # http_auth=auth, use_ssl=False, verify_certs=False, ssl_assert_hostname=False, ssl_show_warn=False, # ca_certs=ca_certs_path ) index_name = 'my-dsl-index' index_body = { 'settings': { 'index': { 'number_of_shards': 4 } } } response = client.indices.create(index_name, index_body) print('\nCreating index:') print(response) # Create the structure of the document class Movie(Document): title = Text(fields={'raw': Keyword()}) director = Text() year = Text() class Index: name = index_name def save(self, ** kwargs): return super(Movie, self).save(** kwargs) # Set up the opensearch-py version of the document Movie.init(using=client) doc = Movie(meta={'id': 1}, title='Moneyball', director='Bennett Miller', year='2011') response = doc.save(using=client) print('\nAdding document:') print(response) # Perform bulk operations movies = '{ "index" : { "_index" : "my-dsl-index", "_id" : "2" } } \n { "title" : "Interstellar", "director" : "Christopher Nolan", "year" : "2014"} \n { "create" : { "_index" : "my-dsl-index", "_id" : "3" } } \n { "title" : "Star Trek Beyond", "director" : "Justin Lin", "year" : "2015"} \n { "update" : {"_id" : "3", "_index" : "my-dsl-index" } } \n { "doc" : {"year" : "2016"} }' client.bulk(movies) # Search for the document. s = Search(using=client, index=index_name) \ .filter('term', year='2011') \ .query('match', title='Moneyball') response = s.execute() print('\nSearch results:') for hit in response: print(hit.meta.score, hit.title) # Delete the document. print('\nDeleting document:') print(response) # Delete the index. response = client.indices.delete( index = index_name ) print('\nDeleting index:') print(response) ``` {% include copy.html %}