OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	1cfb6a79f1	Parent/child: refactored _parent field mapper and parent/child queries * Cut the `has_child` and `has_parent` queries over to use Lucene's query time global ordinal join. The main benefit of this change is that parent/child queries can now execute efficiently if they are wrapped in a bigger boolean query. If the rest of the query only hits a few documents, both has_child and has_parent queries don't need to evaluate all parent or child documents any more. * Cut the `_parent` field over to use doc values. This significantly reduces the on heap memory footprint of parent/child, because the parent id values are never loaded into memory. Breaking changes: * The `type` option on the `_parent` field can only point to a parent type that doesn't exist yet, so this means that an existing type/mapping can't become a parent type any longer. * The `has_child` and `has_parent` queries can no longer be used in alias filters. All these changes, improvements and breaks in compatibility only apply for indices created with ES version 2.0 or higher. For indices created with ES < 2.0 the older implementation is used. It is highly recommended to re-index all your indices with parent and child documents to benefit from all the improvements that come with this refactoring. The easiest way to achieve this is by using the scan and bulk apis using a simple script. Closes #6107 Closes #8134	2015-05-29 21:44:17 +02:00
Areek Zillur	fb8cd53582	This commit removes the ability to use `filter` for PhraseSuggester collate. Only `query` can be used for collation. Internally, a collate query is executed as an exists query. So specifying a filter does not have any benefits.	2015-05-29 12:26:08 -04:00
Colin Goodheart-Smithe	35a58d874e	Scripting: Unify script and template requests across codebase This change unifies the way scripts and templates are specified for all instances in the codebase. It builds on the Script class added previously and adds request building and parsing support as well as the ability to transfer script objects between nodes. It also adds a Template class which aims to provide the same functionality for template APIs Closes #11091	2015-05-29 16:52:04 +01:00
Britta Weber	a031232c48	[doc] remove reference to seal, was removed in #11336	2015-05-29 11:40:34 +02:00
Britta Weber	87a0c76e9c	Merge remote-tracking branch 'boaz/index_seal_to_flush_sync'	2015-05-29 10:31:03 +02:00
Igor Motov	55fc3a727b	Core: refactor upgrade API to use transport and write minimum compatible version that the index was upgraded to In #11072 we are adding a check that will prevent opening of old indices. However, this check doesn't take into consideration the fact that indices can be made compatible with the current version through upgrade API. In order to make compatibility check aware of the upgrade, the upgrade API should write a new setting `index.version.minimum_compatible` that will indicate the minimum compatible version of lucene this index is compatible with and `index.version.upgraded` that will indicate the version of elasticsearch that performed the upgrade. Closes #11095	2015-05-28 05:23:49 -10:00
Zachary Tong	d32a80f37b	Docs: Fix misplaced images in moving_avg docs	2015-05-27 16:13:36 -04:00
Zachary Tong	491afbe01c	Aggregations: Add Holt-Winters model to `moving_avg` pipeline aggregation Closes #11043	2015-05-27 14:45:45 -04:00
Alexander Reelsen	fc224a0de8	Cat API: Add wildcard support for header names This adds wildcard support (simple regexes) for specifying header names. Aliases are supported as well. Closes #10811	2015-05-27 16:09:31 +02:00
Boaz Leskes	37bdbe074a	doc feedback	2015-05-27 15:40:02 +03:00
Tanguy Leroux	340b7ef6ef	Add common SystemD file for RPM/DEB package	2015-05-27 11:51:58 +02:00
javanna	fc28bc73f8	[DOCS] add kopf to site plugins	2015-05-27 10:28:53 +02:00
Ryan Schneider	8ec6bf7340	[DOCS] Update get.asciidoc Updated to not mislead the reader that the data is actually gone when a document is updated. For example if you have 100GB of docs and update each one you'll only be able to access 100GB of the data, but there would theoretically be 200GB of doc data. Closes #10375	2015-05-27 10:17:10 +02:00
Boaz Leskes	6d269cbf4d	feedback	2015-05-27 10:29:37 +03:00
javanna	6c81a8daf3	Internal: count api to become a shortcut to the search api The count api used to have its own execution path, although it would do the same (up to bugs!) as the search api. This commit makes it a shortcut to the search api with size set to 0. The change is made in a backwards compatible manner, by leaving all of the java api code around too, given that you may not want to get back a whole SearchResponse when asking only for the number of hits matching a query, and also because migrating from countResponse.getCount() to searchResponse.getHits().totalHits() doesn't look great from a user perspective. We can always decide to drop more code around the count api if we want to break backwards compatibility on the java api, making it a shortcut on the rest layer only. Closes #9117 Closes #11198	2015-05-26 19:12:11 +02:00
Alexander Reelsen	1fa21a76cf	Documentation: Fix elasticsearch documentation build The commit for closing #11033 was not building the asciidoc documentation.	2015-05-26 18:16:12 +02:00
Alexander Reelsen	045f01c085	Infra for deprecation logging Add support for a specific deprecation logging that can be turned on in order to notify users of a specific feature, flag, setting, parameter, ... being deprecated. The deprecation logger logs with a "deprecation." prefixed logger (or "org.elasticsearch.deprecation." if full name is used), and outputs the logging to a dedicated deprecation log file. Deprecation messages are logged under the DEBUG category. The idea is not to enable them by default (under WARN or ERROR) when running embedded in another application. By default they are turned off (INFO); in order to turn them on, the "deprecation" category needs to be set to DEBUG. This can be set in the logging file or using the cluster update settings API, see the documentation Closes #11033	2015-05-26 17:44:52 +02:00
Tanguy Leroux	ce63590bd6	API: Add response filtering with filter_path parameter This change adds a new "filter_path" parameter that can be used to filter and reduce the responses returned by the REST API of elasticsearch. For example, returning only the shards that failed to be optimized: ``` curl -XPOST 'localhost:9200/beer/_optimize?filter_path=_shards.failed' {"_shards":{"failed":0}}% ``` It supports multiple filters (separated by a comma): ``` curl -XGET 'localhost:9200/_mapping?pretty&filter_path=*.mappings.*.properties.name,*.mappings.*.properties.title' ``` It also supports the YAML response format. Here it returns only the `_id` field of a newly indexed document: ``` curl -XPOST 'localhost:9200/library/book?filter_path=_id' -d '---hello:\n world: 1\n' --- _id: "AU0j64-b-stVfkvus5-A" ``` It also supports wildcards. Here it returns only the host name of every node in the cluster: ``` curl -XGET 'http://localhost:9200/_nodes/stats?filter_path=nodes.*.host' {"nodes":{"lvJHed8uQQu4brS-SXKsNA":{"host":"portable"}}} ``` And "**" can be used to include sub fields without knowing the exact path. Here it returns only the Lucene version of every segment: ``` curl 'http://localhost:9200/_segments?pretty&filter_path=indices.**.version' { "indices" : { "beer" : { "shards" : { "0" : [ { "segments" : { "_0" : { "version" : "5.2.0" }, "_1" : { "version" : "5.2.0" } } } ] } } } } ``` Note that elasticsearch sometimes directly returns the raw value of a field, like the _source field. If you want to filter _source fields, you should consider combining the already existing _source parameter (see Get API for more details) with the filter_path parameter like this: ``` curl -XGET 'localhost:9200/_search?pretty&filter_path=hits.hits._source&_source=title' { "hits" : { "hits" : [ { "_source":{"title":"Book #2"} }, { "_source":{"title":"Book #1"} }, { "_source":{"title":"Book #3"} } ] } } ```	2015-05-26 13:51:04 +02:00
Britta Weber	eeeb29f900	spell correct and add single quotes	2015-05-26 11:41:19 +02:00
Britta Weber	37782c1745	analyzers: custom analyzers names and aliases must not start with _ closes #9596	2015-05-26 11:38:15 +02:00
Boaz Leskes	b376a3fbfb	Move index sealing terminology to synced flush #10032 introduced the notion of sealing an index by marking it with a special read only marker, allowing for a couple of optimizations to happen. The most important one was to speed up recoveries of shards where we know nothing has changed since they were online by skipping the file based sync phase. During the implementation we came up with a light notion which achieves the same recovery benefits but without the read only aspects, which we dubbed synced flush. The fact that it was lightweight and didn't put the index in read only mode allowed us to do it automatically in the background, which is a great advantage. However we also felt the need to allow users to manually trigger this operation. The implementation at #11179 added the synced flush internal logic and the manual REST API. The name of the API was modeled after the sealing terminology which may end up being confusing. This commit changes the API name to match the internal synced flush naming, namely `{index}/_flush/synced`. On top of that it contains a couple of other changes: - Remove all java client API. This feature is not supposed to be called programmatically by applications but rather by admins. - Improve rest responses making structure similar to other (flush) API - Change IndexShard#getOperationsCount to exclude the internal +1 on open shard. It's confusing to get 1 while there are actually no ongoing operations - Some minor other clean ups	2015-05-25 22:32:32 +03:00
Alex Chan	e31049988b	[Docs] Fix minor spelling errors Closes #11320	2015-05-25 19:56:43 +02:00
Eduardo Gurgel	0f3b3c0787	Docs: Fix typo on percolate_format description Closes #11215	2015-05-25 13:17:59 +02:00
Clinton Gormley	4d27d751fb	Docs: Move the page on facets into redirects.asciidoc	2015-05-24 23:34:23 +02:00
Clinton Gormley	6171ae6cc4	Docs: Added stub entries for pages deleted from 1.x	2015-05-24 17:57:34 +02:00
Clinton Gormley	4b854d10bd	Docs: Tidied up the field statistics docs	2015-05-24 15:12:44 +02:00
Britta Weber	4d0b40ca52	Merge pull request #11235 from nik9000/seal_docs Rewrote some _seal documentation	2015-05-22 18:24:23 +02:00
Clinton Gormley	cde2c91b5a	Docs: Example blocks can't contain warnings	2015-05-22 17:37:58 +02:00
Clinton Gormley	631e03c872	Docs: Tidied up term vectors docs Moved annotations out of titles Made the example titles into example blocks	2015-05-22 17:19:12 +02:00
Nik Everett	6da1e858dc	Rewrote some _seal documentation The first two paragraphs were confusing to me so I tried to rewrite them. I removed some passive voice because it irks me.	2015-05-22 10:51:21 -04:00
Clinton Gormley	20279a2556	Docs: Rename reference docs to Elasticsearch Reference	2015-05-22 14:49:11 +02:00
Adrien Grand	42f9053817	Merge pull request #11280 from jpountz/fix/remove_binary_compress Mappings: Remove the `compress`/`compress_threshold` options of the BinaryFieldMapper.	2015-05-22 14:21:13 +02:00
Adrien Grand	461683ac58	Mappings: Remove the `compress`/`compress_threshold` options of the BinaryFieldMapper. This option is broken currently since it potentially interprets an incoming binary value as compressed while it just happens that the first bytes are the same as the LZF header.	2015-05-22 14:20:42 +02:00
Colin Goodheart-Smithe	35deb7efea	Aggregations: Renaming reducers to Pipeline Aggregators	2015-05-21 14:57:23 +01:00
Igor Motov	dd41c68741	Snapshot/Restore: fix FSRepository location configuration Closes #11068	2015-05-20 22:14:31 -04:00
Lee Hinman	0a6f7ef379	[DOCS] Mention Integer.MAX_VALUE limit for http.max_content_length Fixes #11244	2015-05-20 13:08:59 -06:00
Clinton Gormley	5e4d5e1c64	Docs: Included the index-seal docs in the indices section	2015-05-20 11:20:12 +02:00
Simon Willnauer	488be75d19	Add some words about the purpose of a seal etc.	2015-05-19 12:26:08 +02:00
Simon Willnauer	9d2852f0ab	Merge branch 'master' into feature/synced_flush Conflicts: src/main/java/org/elasticsearch/index/engine/InternalEngine.java src/main/java/org/elasticsearch/index/shard/IndexShard.java src/main/java/org/elasticsearch/indices/recovery/RecoverySourceHandler.java src/test/java/org/elasticsearch/index/engine/InternalEngineTests.java	2015-05-19 12:16:22 +02:00
Adrien Grand	2c241e8a36	Mappings: Remove the `ignore_conflicts` option. Mappings conflicts should not be ignored. If I read the history correctly, this option was added when a mapping update to an existing field was considered a conflict, even if the new mapping was exactly the same. Now that mapping updates are smart enough to detect conflicting options, we don't need an option to ignore conflicts.	2015-05-18 15:28:23 +02:00
javanna	a843008b17	Highlighting: require_field_match set to true by default The default `false` for `require_field_match` is a bit odd and confusing for users, given that field names get ignored by default and every field gets highlighted if it contains terms extracted out of the query, regardless of which fields were queried. Changed the default to `true`, it can always be changed per request. Closes #10627 Closes #11067	2015-05-15 21:38:45 +02:00
Clinton Gormley	9d71816cd2	Docs: Fixed explanation of AUTO fuzziness Closes #11186	2015-05-15 21:25:11 +02:00
Scott Chamberlain	ae599f93a7	Added an R client to community clients page I maintain an R client called `elastic` at https://github.com/ropensci/elastic	2015-05-15 21:25:11 +02:00
javanna	46c521f7ec	Highlighting: nuke XPostingsHighlighter Our own fork of the lucene PostingsHighlighter is not easy to maintain and doesn't give us any added value at this point. In particular, it was introduced to support the require_field_match option and discrete per value highlighting, used in case one wants to highlight the whole content of a field, but get back one snippet per value. These two features won't make it into lucene as they slow things down and shouldn't have been supported from day one on our end probably. One other customization we had was support for a wider range of queries via custom rewrite etc. (yet another way to slow things down), which got added to lucene and works much much better than what we used to do (instead of or rewrite, terms are pulled out of the automata for multi term queries). Removing our fork means the following in terms of features: - dropped support for require_field_match: the postings highlighter will only highlight fields that were queried - some custom es queries won't be supported anymore, meaning they won't be highlighted. The only one I found up until now is the phrase_prefix. Postings highlighter rewrites against an empty reader to avoid slow operations (like the ones that we were performing with the fork that we are removing here), thus the prefix will not be expanded to any term. What the postings highlighter does instead is pulling the automata out of multi term queries, but this is not supported at the moment with our MultiPhrasePrefixQuery. Closes #10625 Closes #11077	2015-05-15 20:41:33 +02:00
Clinton Gormley	3a69b65e88	Docs: Fixed the backslash escaping on the pattern analyzer docs Closes #11099	2015-05-15 18:40:16 +02:00
Jun Ohtani	597c53a0bb	Add migration note for AnalyzeRequest	2015-05-16 00:25:53 +09:00
Adrien Grand	bf599d68dd	Merge pull request #11042 from jpountz/feature/aggs_missing Aggs: Make it possible to configure missing values.	2015-05-15 16:33:29 +02:00
Adrien Grand	32e23b9100	Aggs: Make it possible to configure missing values. Most aggregations (terms, histogram, stats, percentiles, geohash-grid) now support a new `missing` option which defines the value to consider when a field does not have a value. This can be handy if you, e.g., want a terms aggregation to handle documents that have "N/A" or no value for a `tag` field the same way. This works in a very similar way to the `missing` option on the `sort` element. One known issue is that this option sometimes cannot make the right decision in the unmapped case: it needs to replace all values with the `missing` value but might not know what kind of values source should be produced (numerics, strings, geo points?). For this reason, we might want to add an `unmapped_type` option in the future like we did for sorting. Related to #5324	2015-05-15 16:26:58 +02:00
Martijn van Groningen	719252a138	Merge pull request #11183 from martijnvg/parent-child/remove_id_cache_from_stats_and_clear_cache_apis Removed `id_cache` from stats and cat apis.	2015-05-15 14:39:35 +02:00
Martijn van Groningen	ece18f162e	Removed `id_cache` from stats and cat apis. Also removed the `id_cache` option from the clear cache api. Closes #5269	2015-05-15 14:06:18 +02:00


1470 Commits