OpenSearch

Commit Graph

Author	SHA1	Message	Date
Clinton Gormley	1fff379742	[DOCS] Documented the fact that binary fields are not stored by default	2014-03-20 12:43:43 +01:00
Florian Schilling	c0a092aa92	[Doc] Updated docs for distance scripting Updated docs for distance scripting and added missing geohash distance functions Closes #5397	2014-03-20 12:18:25 +01:00
Clinton Gormley	4c34615686	[DOCS] Fixed some bad UTF8	2014-03-19 12:46:06 +01:00
Shay Banon	0ef3b03be1	Move to use serial merge schedule by default Today, we use ConcurrentMergeScheduler, and this can be painful since it is concurrent on a shard level, with a max of 3 threads doing concurrent merges. If there are several shards being indexed, then there will be a minor explosion of threads trying to do merges, all being throttled by our merge throttling. Moving to serial merge scheduler will still maintain concurrency of merges across shards, as we have the merge thread pool that schedules those merges. It will just be a serial one on a specific shard. Also, on serial merge scheduler, we now have a limit of how many merges it will do at one go, so it will let other shards get their fair chance of merging. We use the pending merges on IW to check if merges are needed or not for it. Note, that if a merge is happening, it will not block due to a sync on the maybeMerge call at indexing (flush) time, since we wrap our merge scheduler with the EnabledMergeScheduler, where maybeMerge is not activated during indexing, only with explicit calls to IW#maybeMerge (see Merges). closes #5447	2014-03-18 13:17:00 +01:00
Igor Motov	a1192044f2	Add ability to get snapshot status for running snapshots Closes #4946	2014-03-17 20:13:49 -04:00
David Pilato	0805c01984	[DOCS] Add Azure storage repositories	2014-03-17 19:40:28 +01:00
markharwood	5f1d9af9fe	Documentation fix for significant_terms heading levels	2014-03-17 12:17:54 +00:00
Randy Stauner	1486188a3b	[DOCS] Reword clear-scroll sentence	2014-03-17 12:08:49 +01:00
lzhoucs	5a5171cb70	[DOCS] Fix typo in the reference doc. SuSe -> SUSE SUSE, as a Linux distribution, is never lower cased fixes #5354	2014-03-17 12:03:25 +01:00
Justin Etheredge	36219a1786	[DOCS] Updating scripting docs for geo functions Added a few functions are corrected the default unit where necessary	2014-03-17 11:59:02 +01:00
Boaz Leskes	ee8743f3f2	[Docs] added a missing reference to significantterms-aggergations Also fix header level mismatch issue reported by the build	2014-03-17 11:45:55 +01:00
David Pilato	f54e9246c1	Add _cat/plugins endpoint If we want to have a full picture of versions running in a cluster, we need to add a `_cat/plugins` endpoint. Response could look like: ```sh % curl es2:9200/_cat/plugins?v node component version type url desc es1 mapper-attachments 1.7.0 j Adds the attachment type allowing to parse difference attachment formats es1 lang-javascript 1.4.0 j JavaScript plugin allowing to add javascript scripting support es1 analysis-smartcn 1.9.0 j Smart Chinese analysis support es1 marvel 1.1.0 j/s http://localhost:9200/_plugins/marvel Elasticsearch Management & Monitoring es1 kopf 0.5.3 s http://localhost:9200/_plugins/kopf kopf - simple web administration tool for ElasticSearch es2 mapper-attachments 2.0.0.RC1 j Adds the attachment type allowing to parse difference attachment formats es2 lang-javascript 2.0.0.RC1 j JavaScript plugin allowing to add javascript scripting support es2 analysis-smartcn 2.0.0.RC1 j Smart Chinese analysis support ``` Closes #4824.	2014-03-16 12:16:09 +01:00
Clinton Gormley	fb934aff57	[DOCS] Documented gateway.local.auto_import_dangled Relates to #4996	2014-03-15 12:07:17 +01:00
rphadake	36a0cb99d7	[Doc] doc updates for date histogram interval Close #5308	2014-03-14 18:55:32 +01:00
Adrien Grand	65d3b61b97	Add an option to force _optimize operations. When forced, the index will be merged even if it contains a single segment with no deletions. Close #5243	2014-03-14 18:21:56 +01:00
Adrien Grand	eef71da650	[Doc] Add a chart about the relative error of the percentiles aggregation.	2014-03-14 12:23:23 +01:00
markharwood	767bef0596	Significant_terms aggregation identifies terms that are significant rather than merely popular in a set. Significance is related to the changes in document frequency observed between everyday use in the corpus and frequency observed in the result set. The asciidocs include extensive details on the applications of this feature. Closes #5146	2014-03-14 10:34:24 +00:00
Adrien Grand	5821fa042c	Cardinality aggregation. This aggregation computes unique term counts using the hyperloglog++ algorithm which uses linear counting to estimate low cardinalities and hyperloglog on higher cardinalities. Since this algorithm works on hashes, it is useful for high-cardinality fields to store the hash of values directly in the index, which is the purpose of the new `murmur3` field type. This is less necessary on low-cardinality string fields because the aggregator is smart enough to only compute the hash once per unique value per segment thanks to ordinals, or on numeric fields since hashing them is very fast. Close #5426	2014-03-13 19:19:56 +01:00
Florian Schilling	81e537bd5e	ContextSuggester ================ This commit extends the `CompletionSuggester` by context informations. In example such a context informations can be a simple string representing a category reducing the suggestions in order to this category. Three base implementations of these context informations have been setup in this commit. - a Category Context - a Geo Context All the mapping for these context informations are specified within a context field in the completion field that should use this kind of information.	2014-03-13 11:24:46 +01:00
Kurt Hurtado	ca6a2bb790	[DOCS] Various aggregation doc fixes	2014-03-13 09:05:25 +01:00
Costin Leau	9624b215fb	Add docs for plugin isolation	2014-03-11 12:32:58 +02:00
Boaz Leskes	b7a95d11a7	Introduced VersionType.FORCE & VersionType.EXTERNAL_GTE Also added "external_gt" as an alias name for VersionType.EXTERNAL , accessible for the rest layer. Closes #4213 , Closes #2946	2014-03-10 21:07:17 +01:00
javanna	d5aaa90f34	[TEST] Randomized number of shards used for indices created during tests Introduced two levels of randomization for the number of shards (between 1 and 10) when running tests: 1) through the existing random index template, which now sets a random number of shards that is shared across all the indices created in the same test method unless overwritten 2) through `createIndex` and `prepareCreate` methods, similar to what happens using the `indexSettings` method, which changes for every `createIndex` or `prepareCreate` unless overwritten (overwrites index template for what concerns the number of shards) Added the following facilities to deal with the random number of shards: - `getNumShards` to retrieve the number of shards of a given existing index, useful when doing comparisons based on the number of shards and we can avoid specifying a static number. The method returns an object containing the number of primaries, number of replicas and the total number of shards for the existing index - added `assertFailures` that checks that a shard failure happened during a search request, either partial failure or total (all shards failed). Checks also the error code and the error message related to the failure. This is needed as without knowing the number of shards upfront, when simulating errors we can run into either partial (search returns partial results and failures) or total failures (search returns an error) - added common methods similar to `indexSettings`, to be used in combination with `createIndex` and `prepareCreate` method and explicitly control the second level of randomization: `numberOfShards`, `minimumNumberOfShards` and `maximumNumberOfShards`. Added also `numberOfReplicas` despite the number of replicas is not randomized (default not specified but can be overwritten by tests) Tests that specified the number of shards have been reviewed and the results follow: - removed number_of_shards in node settings, ignored anyway as it would be overwritten by both mechanisms above - remove specific number of shards when not needed - removed manual shards randomization where present, replaced with ordinary one that's now available - adapted tests that didn't need a specific number of shards to the new random behaviour - fixed a couple of test bugs (e.g. 3 levels parent child test could only work on a single shard as the routing key used for grand-children wasn't correct) - also done some cleanup, shared code through shard size facets and aggs tests and used common methods like `assertAcked`, `ensureGreen`, `refresh`, `flush` and `refreshAndFlush` where possible - made sure that `indexSettings()` is always used as a basis when using `prepareCreate` to inject specific settings - converted indexRandom(false, ...) + refresh to indexRandom(true, ...)	2014-03-10 13:01:52 +01:00
Simon Willnauer	fbb8c0fafa	[DOCS] Add `coming` tag to multiple rescores Closes #5365	2014-03-10 09:27:44 +01:00
Andrew Raines	2f48be597e	Display all available endpoints by default at /_cat Closes #5106	2014-03-07 13:21:43 -06:00
Konrad Feldmeier	d7b0d547d4	[DOCS] Multiple doc fixes Closes #5047	2014-03-07 14:24:58 +01:00
Benjamin Devèze	2affa5004f	Fix small typo in percentiles doc	2014-03-07 10:10:19 +01:00
Adrien Grand	f359b7f38b	[DOC] The percentiles aggregation is coming in 1.1.0.	2014-03-07 10:03:15 +01:00
Brusic	95274c18c5	Added support for char filters in the analyze API Closes #5148	2014-03-06 12:23:51 +01:00
James Brook	a93d6d55a5	Added support for aliases to index templates Adapted existing PR (#2739) to updated code (post #4920), added tests and docs (@javanna) Closes #1825	2014-03-06 11:11:07 +01:00
uboness	9d0fc76f54	Added support for sorting buckets based on sub aggregations Supports sorting on sub-aggs down the current hierarchy. This is supported as long as the aggregation in the specified order path are of a single-bucket type, where the last aggregation in the path points to either a single-bucket aggregation or a metrics one. If it's a single-bucket aggregation, the sort will be applied on the document count in the bucket (i.e. doc_count), and if it is a metrics type, the sort will be applied on the pointed out metric (in case of a single-metric aggregations, such as avg, the sort will be applied on the single metric value) NOTE: this commit adds a constraint on what should be considered a valid aggregation name. Aggregations names must be alpha-numeric and may contain '-' and '_'. Closes #5253	2014-03-06 00:05:27 +01:00
Igor Motov	b723ee0d20	[DOCS] Update boolean mapping docs with a full list of values that are treated as false Closes #5337	2014-03-05 15:33:59 -05:00
Clinton Gormley	98ecf80f07	[DOCS] Formatting error Closes #5346	2014-03-05 17:40:51 +01:00
Kevin	2c7a3a49c5	[DOCS] add Elasticsearch Image Plugin	2014-03-05 14:16:56 +01:00
Zachary Tong	7b16c5857d	Percentiles aggregation. A new metric aggregation that can compute approximate values of arbitrary percentiles. Close #5323	2014-03-03 18:06:14 +01:00
Martijn van Groningen	dcb590398d	[DOCS] Better document the limitation of nested objects.	2014-03-03 14:12:18 +01:00
Binh Ly	7e49848697	Clarify range aggregations	2014-02-28 14:38:57 -05:00
Clinton Gormley	53ce0e8e27	[DOCS] Fixed added[] tag version number	2014-02-28 15:29:43 +01:00
Lee Hinman	e53a43800e	Add `explain` flag support to the reroute API By specifying the `explain` flag, an explanation for the reason a command can or cannot be executed is returned. No allocation commands are actually performed. Returns a response similar to: { "state": {...cluster state...}, "acknowledged": true, "explanations" : [ { "command" : "cancel", "parameters" : { "index" : "decide", "shard" : 0, "node" : "IvpoKRdtRiGrQ_WKtt4_4w", "allow_primary" : false }, "decisions" : [ { "decider" : "cancel_allocation_command", "decision" : "YES", "explanation" : "..." } ] }, { "command" : "move", "parameters" : { "index" : "decide", "shard" : 0, "from_node" : "IvpoKRdtRiGrQ_WKtt4_4w", "to_node" : "IvpoKRdtRiGrQ_WKtt4_4w" }, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "shard cannot be allocated on same node [IvpoKRdtRiGrQ_WKtt4_4w] it already exists on" }, etc ] }] } also removes AllocationExplanation from cluster state Closes #2483 Closes #5169	2014-02-27 09:48:51 -07:00
Simon Willnauer	9160516b28	Expose `filler_token` via ShingleTokenFilterFactory Lucene 4.7 supports a setter for the `filler_token` that is inserted if there are gaps in the token stream. This change exposes this setting. Closes #4307	2014-02-26 22:21:10 +01:00
Martijn van Groningen	1441fec068	[DOCS] Updated memory considerations for p/c queries and filters.	2014-02-26 22:16:51 +01:00
Simon Willnauer	90e57c15e8	[DOCS]: fixed small problem in example json	2014-02-26 16:40:04 +01:00
Clinton Gormley	03ad168b24	[DOCS] Added note about dely in clearing filter cache. Closes #5231	2014-02-24 11:36:22 +01:00
hura	818f8c0e2b	[DOCS] Fix wrong explanation in configuration.asciidoc Replaced network.host with node.name to match config file	2014-02-24 11:29:50 +01:00
Luca Cavanna	4e6610a798	Fixed multi term queries support in postings highlighter for non top-level queries In #4052 we added support for highlighting multi term queries using the postings highlighter. That worked only for top-level queries though, and not for multi term queries that are nested for instance within a bool query, or filtered query, or a constant score query. The way we make this work is by walking the query structure and temporarily overriding the query rewrite method with a method that allows for multi terms extraction. Closes #5102	2014-02-21 21:43:40 +01:00
Adrien Grand	edb854d952	Document the indices segments response format.	2014-02-21 12:01:32 +01:00
Lee Hinman	8f8cc7205d	Add "locale" parameter to query_string and simple_query_string Fixes #5128 Remove java 7 specific Locale functions, add "coming[1.1.0]" to documentation add LocaleUtils utility class for dealing with Locale functions	2014-02-20 15:53:08 -07:00
Martijn van Groningen	a81a4a5efe	[DOCS] Included the `_percolator` index breaking change to migration docs.	2014-02-20 16:43:06 +01:00
Isabel Drost-Fromm	48004ff8a5	Add mustache templating to query execution. Adds support for storing mustache based query templates that can later be filled with query parameter values at execution time. Templates may be both quoted, non-quoted and referencing templates stored in config/scripts/*.mustache by file name. See docs/reference/query-dsl/queries/template-query.asciidoc for templating examples. Implementation detail: mustache itself is being shaded as it depends directly on guava - so having it marked optional but included in the final distribution raises chances of version conflicts downstream. Fixes #4879	2014-02-20 12:21:59 +01:00
javanna	419db6ee12	[DOCS] Fixed typo in create index api	2014-02-19 17:49:38 +01:00
Boaz Leskes	e379f419e6	[DOCS] Remove clear flag from node-stats as it is not used anymore	2014-02-17 15:20:12 +01:00
Luca Cavanna	3afdf4a872	Added support for aliases to create index api It is now possible to specify aliases during index creation: curl -XPUT 'http://localhost:9200/test' -d ' { "aliases" : { "alias1" : {}, "alias2" : { "filter" : { "term" : {"field":"value"}} } } }' Closes #4920	2014-02-17 14:54:21 +01:00
Britta Weber	db3c6c2a8e	Enable percolation for nested documents closes #5082	2014-02-14 22:42:33 +01:00
Lee Hinman	c97bcc3602	Add support for `lowercase_expanded_terms` flag to simple_query_string Default the flag to true, making simple_query_string behave similarly to query_string Fixes #5008	2014-02-14 11:51:23 -07:00
Nik Everett	5c3f4ceafb	Add preserve original token option to ASCIIFolding Closes #4931	2014-02-14 19:37:00 +01:00
Luca Cavanna	6abd0a76bd	[DOCS] improved get docs - added _version to response - exists call use -XHEAD with -i flag to include headers in the output	2014-02-14 13:11:10 +01:00
Lars Francke	2a765415c8	Update get.asciidoc Minor improvements. curl -XHEAD doesn't actually print anything so I've changed to use -I which actually prints the headers received.	2014-02-14 13:11:10 +01:00
Brian Yoder	41dba68bda	Added the `DistanceUnit.NAUTICALMILES` enumeration label with the corresponding NM and nmi unit suffixes. Update the docs to match. Closes #5085	2014-02-14 19:48:58 +09:00
uboness	d335630e57	[docs] fixed errors in aggs docs - error in nested aggs example - error in terms aggs example	2014-02-13 20:36:02 +01:00
Oleg Anashkin	eb0e1aa38f	Fix typo in similarity docs DRF similarity -> DFR similarity	2014-02-13 07:45:30 -08:00
Luca Cavanna	179750f0f5	[DOCS] fixed count docs, it now requires a top-level query object, same as other apis Relates to #4074	2014-02-13 13:36:20 +01:00
Luca Cavanna	9902f04033	[DOCS] rephrased delete by query docs	2014-02-13 11:44:51 +01:00
Luca Cavanna	01abea5945	[DOCS] fixed count and validate query docs, they now require a top-level query object, same as other apis Relates to #4074 Closes #5111	2014-02-13 11:42:04 +01:00
Kevin	99942089a8	[DOCS] add DynamoDB river plugin	2014-02-13 10:38:04 +01:00
James Yu	699fe5e929	fixed markup and typo	2014-02-13 10:33:15 +01:00
Clinton Gormley	80c7619591	[DOCS] Changed coming[] to added[] for 1.0.0*	2014-02-12 17:17:25 +02:00
Luca Cavanna	1d8d58391f	[DOCS] added coming tags for `zen.discovery.publish_timeout` made dynamic	2014-02-12 15:24:38 +01:00
Luca Cavanna	16e4ac8713	[DOCS] Documented `discovery.zen.publish_timeout` setting	2014-02-12 10:45:37 +01:00
Luca Cavanna	847521b44c	[DOCS] added `discovery.zen.publish_timeout` to the dynamic settings list	2014-02-12 10:45:30 +01:00
Igor Motov	02ebe33758	[DOCS] Fix typo in rename_pattern in snapshot/restore documentation	2014-02-11 09:23:07 -05:00
Simon Willnauer	990ce658a4	[Docs] Remove `custom_score` from documentation and add a migration section.	2014-02-11 14:59:15 +01:00
Mihnea Dobrescu-Balaur	1f7efb5471	[DOCS] Add GitHub community river plugin	2014-02-11 11:55:24 +01:00
Alexander Reelsen	b02e6dc996	Migrating NodesInfo API to use plugins instead of singular plugin In order to be consistent (and because in 1.0 we switched from parameter driven information to specifzing the metrics as part of the URI) this patch moves from 'plugin' to 'plugins' in the Nodes Info API.	2014-02-11 10:05:10 +01:00
Luca Cavanna	7de7a0ace3	[TEST] fixed typo in _cat/thread_pool docs	2014-02-10 16:20:03 +01:00
Shay Banon	e5f43a1867	add version and master_node flags to cluster state	2014-02-10 02:24:03 +01:00
David Pilato	c214acc5e7	[DOCS] Add GridFS repository community plugin	2014-02-08 10:43:54 +01:00
Sean Gallagher	e935a301df	Doc fix explaining resynchronization with the Cancel command. Added line explaining resync process to Reroute/Cancel command. Closes #5025	2014-02-07 17:02:36 -05:00
Clinton Gormley	93930d6dc7	Removed 0.90.* deprecation and addition notifications Closes #5052	2014-02-07 20:52:49 +01:00
Adrien Grand	9cb17408cb	Make size=0 return all buckets for the geohash_grid aggregation. Close #4875	2014-02-07 09:55:10 +01:00
David Pilato	444dff7b40	[DOCS] delete by query requires a top-level query parameter Closes #5044 (cherry picked from commit 1e265b3)	2014-02-07 08:50:15 +01:00
Kevin	d9b704fd86	add redis transport plugin	2014-02-06 18:19:54 +01:00
Lee Hinman	d2078a5e28	Add fuzzy/slop support to `simple_query_string` Ports the change from https://issues.apache.org/jira/browse/LUCENE-5410	2014-02-06 10:05:10 -07:00
Costin Leau	f5a8de6321	[DOCS] organize a bit the repository plugins (cherry picked from commit 88e1c20c4581885db7e5e65edf7eb3629c2d31ca)	2014-02-06 19:01:58 +02:00
Simon Willnauer	162ca99376	Added `cross_fields` mode to multi_match query `cross_fields` attemps to treat fields with the same analysis configuration as a single field and uses maximum score promotion or combination of the scores based depending on the `use_dis_max` setting. By default scores are combined. `cross_fields` can also search across fields of hetrogenous types for instance if numbers can be part of the query it makes sense to search also on numeric fields if an analyzer is provided in the reqeust. Relates to #2959	2014-02-06 17:15:55 +01:00
Clinton Gormley	56479fb0e4	[DOCS] Make apt/yum repos more visible	2014-02-06 17:04:37 +01:00
Boaz Leskes	9bf263c741	[DOCS] Fix terms agg value script example	2014-02-06 16:35:49 +01:00
Boaz Leskes	ae4ed29f9b	[Docs] value_count supports script per 1.1	2014-02-06 15:04:50 +01:00
Clinton Gormley	17e2ca5259	[DOCS] Updated migration docs for multi_field to point to copy_to	2014-02-06 14:34:07 +01:00
Clinton Gormley	6238d406b5	[DOCS] Removed the experimental label from Tribe, Hot Threads and Completion Suggester	2014-02-06 14:19:17 +01:00
David Pilato	583f148334	[DOCS] add azure and gce discovery plugins Clean EC2 disco doc Add Azure disco doc Add Google Compute Engine doc Fix Zen doc (add `enabled` in `multicast` parameters list) - Fix #5032.	2014-02-06 09:18:42 +01:00
David Pilato	8b1a6fc5b6	Add S3 and HDFS repositories	2014-02-05 17:53:37 +01:00
Clinton Gormley	d9bdfe3fec	[DOCS] Deprecated the path setting in favour of copy_to Relates to #4729	2014-02-05 14:47:48 +01:00
Adrien Grand	6777be60ce	Add script support to value_count aggregations. Close #5001	2014-02-04 14:29:32 +01:00
Clinton Gormley	238b26a466	[DOC] Tidied up geohashgrid aggregations	2014-02-04 11:54:32 +01:00
Jun Ohtani	ba415b8ad2	Does not support "script" in value_clunt aggregation.	2014-02-04 10:26:07 +01:00
Adrien Grand	cc1ff560df	Rename `geohashgrid` to `geohash_grid` in documentation. It was renamed in `fc6bc4c477`. Close #4997	2014-02-04 09:39:55 +01:00
Lars Francke	1bd9dc129b	Fix confusing sentence The original sentence didn't make much sense. I hope this is a bit better. Taken heavy inspiration from `c63d8c4fb5`	2014-02-03 17:20:40 +01:00
Lars Francke	7cbd0962b5	Improve Aggregations documentation * Mostly minor things like typos and grammar stuff * Some clarifications * The note on the deprecation was ambiguous. I've removed the problematic part so that it now definitely says it's deprecated	2014-02-03 17:16:52 +01:00
Shay Banon	d36e345f1f	fix docs to reflect removal of byte buffer memory	2014-02-03 09:54:30 -05:00
Igor Motov	90da268237	Remove support for boost in copy_to field Currently, boosting on `copy_to` is misleading and does not work as originally specified in #4520. Instead of boosting just the terms from the origin field, it boosts the whole destination field. If two fields copy_to a third field, one with a boost of 2 and another with a boost of 3, all the terms in the third field end up with a boost of 6. This was not the intention. The alternative: to store the boost in a payload for every term, results in poor performance and inflexibility. Instead, users should either (1) query the common field AND the field that requires boosting, or (2) the multi_match query will soon be able to perform term-centric cross-field matching that will allow per-field boosting at query time (coming in 1.1).	2014-01-31 14:34:01 -05:00

1 2 3 4 5 ...

431 Commits