OpenSearch

Commit Graph

Author	SHA1	Message	Date
Randy Stauner	1486188a3b	[DOCS] Reword clear-scroll sentence	2014-03-17 12:08:49 +01:00
lzhoucs	5a5171cb70	[DOCS] Fix typo in the reference doc. SuSe -> SUSE SUSE, as a Linux distribution, is never lower cased fixes #5354	2014-03-17 12:03:25 +01:00
Justin Etheredge	36219a1786	[DOCS] Updating scripting docs for geo functions Added a few functions are corrected the default unit where necessary	2014-03-17 11:59:02 +01:00
Boaz Leskes	ee8743f3f2	[Docs] added a missing reference to significantterms-aggergations Also fix header level mismatch issue reported by the build	2014-03-17 11:45:55 +01:00
David Pilato	f54e9246c1	Add _cat/plugins endpoint If we want to have a full picture of versions running in a cluster, we need to add a `_cat/plugins` endpoint. Response could look like: ```sh % curl es2:9200/_cat/plugins?v node component version type url desc es1 mapper-attachments 1.7.0 j Adds the attachment type allowing to parse difference attachment formats es1 lang-javascript 1.4.0 j JavaScript plugin allowing to add javascript scripting support es1 analysis-smartcn 1.9.0 j Smart Chinese analysis support es1 marvel 1.1.0 j/s http://localhost:9200/_plugins/marvel Elasticsearch Management & Monitoring es1 kopf 0.5.3 s http://localhost:9200/_plugins/kopf kopf - simple web administration tool for ElasticSearch es2 mapper-attachments 2.0.0.RC1 j Adds the attachment type allowing to parse difference attachment formats es2 lang-javascript 2.0.0.RC1 j JavaScript plugin allowing to add javascript scripting support es2 analysis-smartcn 2.0.0.RC1 j Smart Chinese analysis support ``` Closes #4824.	2014-03-16 12:16:09 +01:00
Clinton Gormley	fb934aff57	[DOCS] Documented gateway.local.auto_import_dangled Relates to #4996	2014-03-15 12:07:17 +01:00
rphadake	36a0cb99d7	[Doc] doc updates for date histogram interval Close #5308	2014-03-14 18:55:32 +01:00
Adrien Grand	65d3b61b97	Add an option to force _optimize operations. When forced, the index will be merged even if it contains a single segment with no deletions. Close #5243	2014-03-14 18:21:56 +01:00
Adrien Grand	eef71da650	[Doc] Add a chart about the relative error of the percentiles aggregation.	2014-03-14 12:23:23 +01:00
markharwood	767bef0596	Significant_terms aggregation identifies terms that are significant rather than merely popular in a set. Significance is related to the changes in document frequency observed between everyday use in the corpus and frequency observed in the result set. The asciidocs include extensive details on the applications of this feature. Closes #5146	2014-03-14 10:34:24 +00:00
Adrien Grand	5821fa042c	Cardinality aggregation. This aggregation computes unique term counts using the hyperloglog++ algorithm which uses linear counting to estimate low cardinalities and hyperloglog on higher cardinalities. Since this algorithm works on hashes, it is useful for high-cardinality fields to store the hash of values directly in the index, which is the purpose of the new `murmur3` field type. This is less necessary on low-cardinality string fields because the aggregator is smart enough to only compute the hash once per unique value per segment thanks to ordinals, or on numeric fields since hashing them is very fast. Close #5426	2014-03-13 19:19:56 +01:00
Florian Schilling	81e537bd5e	ContextSuggester ================ This commit extends the `CompletionSuggester` by context informations. In example such a context informations can be a simple string representing a category reducing the suggestions in order to this category. Three base implementations of these context informations have been setup in this commit. - a Category Context - a Geo Context All the mapping for these context informations are specified within a context field in the completion field that should use this kind of information.	2014-03-13 11:24:46 +01:00
Kurt Hurtado	ca6a2bb790	[DOCS] Various aggregation doc fixes	2014-03-13 09:05:25 +01:00
Mohsin Husen	9fcee312dc	[DOCS] Added spring data elasticsearch integration	2014-03-13 08:44:17 +01:00
Costin Leau	9624b215fb	Add docs for plugin isolation	2014-03-11 12:32:58 +02:00
Boaz Leskes	b7a95d11a7	Introduced VersionType.FORCE & VersionType.EXTERNAL_GTE Also added "external_gt" as an alias name for VersionType.EXTERNAL , accessible for the rest layer. Closes #4213 , Closes #2946	2014-03-10 21:07:17 +01:00
javanna	d5aaa90f34	[TEST] Randomized number of shards used for indices created during tests Introduced two levels of randomization for the number of shards (between 1 and 10) when running tests: 1) through the existing random index template, which now sets a random number of shards that is shared across all the indices created in the same test method unless overwritten 2) through `createIndex` and `prepareCreate` methods, similar to what happens using the `indexSettings` method, which changes for every `createIndex` or `prepareCreate` unless overwritten (overwrites index template for what concerns the number of shards) Added the following facilities to deal with the random number of shards: - `getNumShards` to retrieve the number of shards of a given existing index, useful when doing comparisons based on the number of shards and we can avoid specifying a static number. The method returns an object containing the number of primaries, number of replicas and the total number of shards for the existing index - added `assertFailures` that checks that a shard failure happened during a search request, either partial failure or total (all shards failed). Checks also the error code and the error message related to the failure. This is needed as without knowing the number of shards upfront, when simulating errors we can run into either partial (search returns partial results and failures) or total failures (search returns an error) - added common methods similar to `indexSettings`, to be used in combination with `createIndex` and `prepareCreate` method and explicitly control the second level of randomization: `numberOfShards`, `minimumNumberOfShards` and `maximumNumberOfShards`. Added also `numberOfReplicas` despite the number of replicas is not randomized (default not specified but can be overwritten by tests) Tests that specified the number of shards have been reviewed and the results follow: - removed number_of_shards in node settings, ignored anyway as it would be overwritten by both mechanisms above - remove specific number of shards when not needed - removed manual shards randomization where present, replaced with ordinary one that's now available - adapted tests that didn't need a specific number of shards to the new random behaviour - fixed a couple of test bugs (e.g. 3 levels parent child test could only work on a single shard as the routing key used for grand-children wasn't correct) - also done some cleanup, shared code through shard size facets and aggs tests and used common methods like `assertAcked`, `ensureGreen`, `refresh`, `flush` and `refreshAndFlush` where possible - made sure that `indexSettings()` is always used as a basis when using `prepareCreate` to inject specific settings - converted indexRandom(false, ...) + refresh to indexRandom(true, ...)	2014-03-10 13:01:52 +01:00
Simon Willnauer	fbb8c0fafa	[DOCS] Add `coming` tag to multiple rescores Closes #5365	2014-03-10 09:27:44 +01:00
Clinton Gormley	8383f271d1	[DOCS] Updated the Perl docs	2014-03-09 19:45:16 +01:00
Andrew Raines	2f48be597e	Display all available endpoints by default at /_cat Closes #5106	2014-03-07 13:21:43 -06:00
Konrad Feldmeier	d7b0d547d4	[DOCS] Multiple doc fixes Closes #5047	2014-03-07 14:24:58 +01:00
Benjamin Devèze	2affa5004f	Fix small typo in percentiles doc	2014-03-07 10:10:19 +01:00
Adrien Grand	f359b7f38b	[DOC] The percentiles aggregation is coming in 1.1.0.	2014-03-07 10:03:15 +01:00
Brusic	95274c18c5	Added support for char filters in the analyze API Closes #5148	2014-03-06 12:23:51 +01:00
James Brook	a93d6d55a5	Added support for aliases to index templates Adapted existing PR (#2739) to updated code (post #4920), added tests and docs (@javanna) Closes #1825	2014-03-06 11:11:07 +01:00
uboness	9d0fc76f54	Added support for sorting buckets based on sub aggregations Supports sorting on sub-aggs down the current hierarchy. This is supported as long as the aggregation in the specified order path are of a single-bucket type, where the last aggregation in the path points to either a single-bucket aggregation or a metrics one. If it's a single-bucket aggregation, the sort will be applied on the document count in the bucket (i.e. doc_count), and if it is a metrics type, the sort will be applied on the pointed out metric (in case of a single-metric aggregations, such as avg, the sort will be applied on the single metric value) NOTE: this commit adds a constraint on what should be considered a valid aggregation name. Aggregations names must be alpha-numeric and may contain '-' and '_'. Closes #5253	2014-03-06 00:05:27 +01:00
Igor Motov	b723ee0d20	[DOCS] Update boolean mapping docs with a full list of values that are treated as false Closes #5337	2014-03-05 15:33:59 -05:00
Clinton Gormley	98ecf80f07	[DOCS] Formatting error Closes #5346	2014-03-05 17:40:51 +01:00
Kevin	2c7a3a49c5	[DOCS] add Elasticsearch Image Plugin	2014-03-05 14:16:56 +01:00
Binh Ly	612e95a321	[DOCS] Java API JSON typo	2014-03-03 18:20:49 -05:00
Zachary Tong	7b16c5857d	Percentiles aggregation. A new metric aggregation that can compute approximate values of arbitrary percentiles. Close #5323	2014-03-03 18:06:14 +01:00
Martijn van Groningen	dcb590398d	[DOCS] Better document the limitation of nested objects.	2014-03-03 14:12:18 +01:00
Binh Ly	7e49848697	Clarify range aggregations	2014-02-28 14:38:57 -05:00
Clinton Gormley	53ce0e8e27	[DOCS] Fixed added[] tag version number	2014-02-28 15:29:43 +01:00
Lee Hinman	e53a43800e	Add `explain` flag support to the reroute API By specifying the `explain` flag, an explanation for the reason a command can or cannot be executed is returned. No allocation commands are actually performed. Returns a response similar to: { "state": {...cluster state...}, "acknowledged": true, "explanations" : [ { "command" : "cancel", "parameters" : { "index" : "decide", "shard" : 0, "node" : "IvpoKRdtRiGrQ_WKtt4_4w", "allow_primary" : false }, "decisions" : [ { "decider" : "cancel_allocation_command", "decision" : "YES", "explanation" : "..." } ] }, { "command" : "move", "parameters" : { "index" : "decide", "shard" : 0, "from_node" : "IvpoKRdtRiGrQ_WKtt4_4w", "to_node" : "IvpoKRdtRiGrQ_WKtt4_4w" }, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "shard cannot be allocated on same node [IvpoKRdtRiGrQ_WKtt4_4w] it already exists on" }, etc ] }] } also removes AllocationExplanation from cluster state Closes #2483 Closes #5169	2014-02-27 09:48:51 -07:00
Simon Willnauer	9160516b28	Expose `filler_token` via ShingleTokenFilterFactory Lucene 4.7 supports a setter for the `filler_token` that is inserted if there are gaps in the token stream. This change exposes this setting. Closes #4307	2014-02-26 22:21:10 +01:00
Martijn van Groningen	1441fec068	[DOCS] Updated memory considerations for p/c queries and filters.	2014-02-26 22:16:51 +01:00
Simon Willnauer	90e57c15e8	[DOCS]: fixed small problem in example json	2014-02-26 16:40:04 +01:00
Clinton Gormley	03ad168b24	[DOCS] Added note about dely in clearing filter cache. Closes #5231	2014-02-24 11:36:22 +01:00
hura	818f8c0e2b	[DOCS] Fix wrong explanation in configuration.asciidoc Replaced network.host with node.name to match config file	2014-02-24 11:29:50 +01:00
Luca Cavanna	4e6610a798	Fixed multi term queries support in postings highlighter for non top-level queries In #4052 we added support for highlighting multi term queries using the postings highlighter. That worked only for top-level queries though, and not for multi term queries that are nested for instance within a bool query, or filtered query, or a constant score query. The way we make this work is by walking the query structure and temporarily overriding the query rewrite method with a method that allows for multi terms extraction. Closes #5102	2014-02-21 21:43:40 +01:00
Adrien Grand	edb854d952	Document the indices segments response format.	2014-02-21 12:01:32 +01:00
Lee Hinman	8f8cc7205d	Add "locale" parameter to query_string and simple_query_string Fixes #5128 Remove java 7 specific Locale functions, add "coming[1.1.0]" to documentation add LocaleUtils utility class for dealing with Locale functions	2014-02-20 15:53:08 -07:00
Martijn van Groningen	a81a4a5efe	[DOCS] Included the `_percolator` index breaking change to migration docs.	2014-02-20 16:43:06 +01:00
Isabel Drost-Fromm	48004ff8a5	Add mustache templating to query execution. Adds support for storing mustache based query templates that can later be filled with query parameter values at execution time. Templates may be both quoted, non-quoted and referencing templates stored in config/scripts/*.mustache by file name. See docs/reference/query-dsl/queries/template-query.asciidoc for templating examples. Implementation detail: mustache itself is being shaded as it depends directly on guava - so having it marked optional but included in the final distribution raises chances of version conflicts downstream. Fixes #4879	2014-02-20 12:21:59 +01:00
javanna	419db6ee12	[DOCS] Fixed typo in create index api	2014-02-19 17:49:38 +01:00
Boaz Leskes	e379f419e6	[DOCS] Remove clear flag from node-stats as it is not used anymore	2014-02-17 15:20:12 +01:00
Luca Cavanna	3afdf4a872	Added support for aliases to create index api It is now possible to specify aliases during index creation: curl -XPUT 'http://localhost:9200/test' -d ' { "aliases" : { "alias1" : {}, "alias2" : { "filter" : { "term" : {"field":"value"}} } } }' Closes #4920	2014-02-17 14:54:21 +01:00
Britta Weber	db3c6c2a8e	Enable percolation for nested documents closes #5082	2014-02-14 22:42:33 +01:00
Lee Hinman	c97bcc3602	Add support for `lowercase_expanded_terms` flag to simple_query_string Default the flag to true, making simple_query_string behave similarly to query_string Fixes #5008	2014-02-14 11:51:23 -07:00
Nik Everett	5c3f4ceafb	Add preserve original token option to ASCIIFolding Closes #4931	2014-02-14 19:37:00 +01:00
Luca Cavanna	6abd0a76bd	[DOCS] improved get docs - added _version to response - exists call use -XHEAD with -i flag to include headers in the output	2014-02-14 13:11:10 +01:00
Lars Francke	2a765415c8	Update get.asciidoc Minor improvements. curl -XHEAD doesn't actually print anything so I've changed to use -I which actually prints the headers received.	2014-02-14 13:11:10 +01:00
Brian Yoder	41dba68bda	Added the `DistanceUnit.NAUTICALMILES` enumeration label with the corresponding NM and nmi unit suffixes. Update the docs to match. Closes #5085	2014-02-14 19:48:58 +09:00
uboness	d335630e57	[docs] fixed errors in aggs docs - error in nested aggs example - error in terms aggs example	2014-02-13 20:36:02 +01:00
Oleg Anashkin	eb0e1aa38f	Fix typo in similarity docs DRF similarity -> DFR similarity	2014-02-13 07:45:30 -08:00
Luca Cavanna	179750f0f5	[DOCS] fixed count docs, it now requires a top-level query object, same as other apis Relates to #4074	2014-02-13 13:36:20 +01:00
Luca Cavanna	9902f04033	[DOCS] rephrased delete by query docs	2014-02-13 11:44:51 +01:00
Luca Cavanna	01abea5945	[DOCS] fixed count and validate query docs, they now require a top-level query object, same as other apis Relates to #4074 Closes #5111	2014-02-13 11:42:04 +01:00
Kevin	5d01aac87e	add elasticsearch-osem to integrations page	2014-02-13 11:02:36 +01:00
Kevin	99942089a8	[DOCS] add DynamoDB river plugin	2014-02-13 10:38:04 +01:00
James Yu	699fe5e929	fixed markup and typo	2014-02-13 10:33:15 +01:00
Kevin	1075b9ae33	[DOCS] should use setPostFilter instead of setFilter	2014-02-13 14:28:00 +11:00
Clinton Gormley	80c7619591	[DOCS] Changed coming[] to added[] for 1.0.0*	2014-02-12 17:17:25 +02:00
Luca Cavanna	1d8d58391f	[DOCS] added coming tags for `zen.discovery.publish_timeout` made dynamic	2014-02-12 15:24:38 +01:00
Luca Cavanna	16e4ac8713	[DOCS] Documented `discovery.zen.publish_timeout` setting	2014-02-12 10:45:37 +01:00
Luca Cavanna	847521b44c	[DOCS] added `discovery.zen.publish_timeout` to the dynamic settings list	2014-02-12 10:45:30 +01:00
Karel Minarik	91900ef346	[DOC] Updated the Ruby gem version for Elasticsearch 0.90.x	2014-02-11 16:12:53 +01:00
Igor Motov	02ebe33758	[DOCS] Fix typo in rename_pattern in snapshot/restore documentation	2014-02-11 09:23:07 -05:00
Simon Willnauer	990ce658a4	[Docs] Remove `custom_score` from documentation and add a migration section.	2014-02-11 14:59:15 +01:00
Mihnea Dobrescu-Balaur	1f7efb5471	[DOCS] Add GitHub community river plugin	2014-02-11 11:55:24 +01:00
Alexander Reelsen	b02e6dc996	Migrating NodesInfo API to use plugins instead of singular plugin In order to be consistent (and because in 1.0 we switched from parameter driven information to specifzing the metrics as part of the URI) this patch moves from 'plugin' to 'plugins' in the Nodes Info API.	2014-02-11 10:05:10 +01:00
Honza Král	d58118c641	[DOCS] adding a note on python client versioning schema	2014-02-11 03:43:53 +01:00
Luca Cavanna	7de7a0ace3	[TEST] fixed typo in _cat/thread_pool docs	2014-02-10 16:20:03 +01:00
Karel Minarik	e2b20843c8	[DOCS] Added a table with 0.90/1.0 compatibility and corresponding instructions	2014-02-10 11:58:42 +01:00
Shay Banon	e5f43a1867	add version and master_node flags to cluster state	2014-02-10 02:24:03 +01:00
David Pilato	c214acc5e7	[DOCS] Add GridFS repository community plugin	2014-02-08 10:43:54 +01:00
Sean Gallagher	e935a301df	Doc fix explaining resynchronization with the Cancel command. Added line explaining resync process to Reroute/Cancel command. Closes #5025	2014-02-07 17:02:36 -05:00
Clinton Gormley	164d52767c	[DOCS] Removed deprecated queries/filters from Java API docs	2014-02-07 20:59:42 +01:00
Clinton Gormley	93930d6dc7	Removed 0.90.* deprecation and addition notifications Closes #5052	2014-02-07 20:52:49 +01:00
Adrien Grand	9cb17408cb	Make size=0 return all buckets for the geohash_grid aggregation. Close #4875	2014-02-07 09:55:10 +01:00
David Pilato	444dff7b40	[DOCS] delete by query requires a top-level query parameter Closes #5044 (cherry picked from commit 1e265b3)	2014-02-07 08:50:15 +01:00
Clinton Gormley	2b0e580046	[DOCS] Added backwards compatibility instructions to Perl client	2014-02-06 19:10:46 +01:00
Kevin	d9b704fd86	add redis transport plugin	2014-02-06 18:19:54 +01:00
Lee Hinman	d2078a5e28	Add fuzzy/slop support to `simple_query_string` Ports the change from https://issues.apache.org/jira/browse/LUCENE-5410	2014-02-06 10:05:10 -07:00
Costin Leau	f5a8de6321	[DOCS] organize a bit the repository plugins (cherry picked from commit 88e1c20c4581885db7e5e65edf7eb3629c2d31ca)	2014-02-06 19:01:58 +02:00
Evan Wong	593f98a373	Fixed the string() code literal in the java client index api doc.	2014-02-06 17:29:40 +01:00
Simon Willnauer	162ca99376	Added `cross_fields` mode to multi_match query `cross_fields` attemps to treat fields with the same analysis configuration as a single field and uses maximum score promotion or combination of the scores based depending on the `use_dis_max` setting. By default scores are combined. `cross_fields` can also search across fields of hetrogenous types for instance if numbers can be part of the query it makes sense to search also on numeric fields if an analyzer is provided in the reqeust. Relates to #2959	2014-02-06 17:15:55 +01:00
Clinton Gormley	56479fb0e4	[DOCS] Make apt/yum repos more visible	2014-02-06 17:04:37 +01:00
Boaz Leskes	9bf263c741	[DOCS] Fix terms agg value script example	2014-02-06 16:35:49 +01:00
Boaz Leskes	ae4ed29f9b	[Docs] value_count supports script per 1.1	2014-02-06 15:04:50 +01:00
Clinton Gormley	17e2ca5259	[DOCS] Updated migration docs for multi_field to point to copy_to	2014-02-06 14:34:07 +01:00
Clinton Gormley	6238d406b5	[DOCS] Removed the experimental label from Tribe, Hot Threads and Completion Suggester	2014-02-06 14:19:17 +01:00
David Pilato	583f148334	[DOCS] add azure and gce discovery plugins Clean EC2 disco doc Add Azure disco doc Add Google Compute Engine doc Fix Zen doc (add `enabled` in `multicast` parameters list) - Fix #5032.	2014-02-06 09:18:42 +01:00
David Pilato	8b1a6fc5b6	Add S3 and HDFS repositories	2014-02-05 17:53:37 +01:00
Clinton Gormley	d9bdfe3fec	[DOCS] Deprecated the path setting in favour of copy_to Relates to #4729	2014-02-05 14:47:48 +01:00
Adrien Grand	6777be60ce	Add script support to value_count aggregations. Close #5001	2014-02-04 14:29:32 +01:00
Clinton Gormley	238b26a466	[DOC] Tidied up geohashgrid aggregations	2014-02-04 11:54:32 +01:00
Jun Ohtani	ba415b8ad2	Does not support "script" in value_clunt aggregation.	2014-02-04 10:26:07 +01:00
Adrien Grand	cc1ff560df	Rename `geohashgrid` to `geohash_grid` in documentation. It was renamed in `fc6bc4c477`. Close #4997	2014-02-04 09:39:55 +01:00
Lars Francke	1bd9dc129b	Fix confusing sentence The original sentence didn't make much sense. I hope this is a bit better. Taken heavy inspiration from `c63d8c4fb5`	2014-02-03 17:20:40 +01:00
Lars Francke	7cbd0962b5	Improve Aggregations documentation * Mostly minor things like typos and grammar stuff * Some clarifications * The note on the deprecation was ambiguous. I've removed the problematic part so that it now definitely says it's deprecated	2014-02-03 17:16:52 +01:00
Shay Banon	d36e345f1f	fix docs to reflect removal of byte buffer memory	2014-02-03 09:54:30 -05:00
Igor Motov	90da268237	Remove support for boost in copy_to field Currently, boosting on `copy_to` is misleading and does not work as originally specified in #4520. Instead of boosting just the terms from the origin field, it boosts the whole destination field. If two fields copy_to a third field, one with a boost of 2 and another with a boost of 3, all the terms in the third field end up with a boost of 6. This was not the intention. The alternative: to store the boost in a payload for every term, results in poor performance and inflexibility. Instead, users should either (1) query the common field AND the field that requires boosting, or (2) the multi_match query will soon be able to perform term-centric cross-field matching that will allow per-field boosting at query time (coming in 1.1).	2014-01-31 14:34:01 -05:00
Martijn van Groningen	7e1eed9814	The forceful no cache behaviour for range filter with now date match expression should only be active if no rounding has been specified for `now` in the date range range expression (for example: `now/d`). Also the automatic now detection in range filters is overrideable by the `_cache` option. Closes #4947 Relates to #4846	2014-01-30 15:51:33 +01:00
uboness	d3f2173ef9	fixed date_/histogram aggregation documentation - added documentation for the `min_doc_count` setting Closes #4944	2014-01-29 20:55:26 +01:00
Igor Motov	2755eecf65	Add throttling to snaphost and restore operations Closes #4855	2014-01-29 10:33:59 -05:00
Martijn van Groningen	c82f27577b	Added dedicated thread pool cat api, that can show all thread pool related statistic (size, rejected, queue etc.) for all thread pools (get, search, index etc.) By default active, rejected and queue thread statistics are included for the index, bulk and search thread pool. Other thread statistics of other thread pools can be included via the `h` query string parameter. Closes #4907	2014-01-29 13:25:06 +01:00
uboness	9f04e5fe38	fixed nested example response in docs Closes #4935	2014-01-29 13:09:12 +01:00
uboness	dd389d1cc5	Made all multi-bucket aggs return consistent response format Closes #4926	2014-01-28 17:46:57 +01:00
Luca Cavanna	b61ca9932a	[DOCS] Clarified docs for cluster.routing.allocation.same_shard.host cluster setting Clarified also javadocs for SameShardAllocationDecider	2014-01-28 12:32:37 +01:00
Luca Cavanna	95bf091dd6	[DOCS] unified index settings info and added warmers section in create index docs	2014-01-27 17:10:38 +01:00
Costin Leau	2690019e95	update link to Hadoop Snapshot/Restore plugin	2014-01-25 18:27:14 +02:00
Clinton Gormley	1aa1e83e03	[DOCS] Updated the breaking changes for the fields param Closes #4888	2014-01-25 12:34:15 +01:00
Karel Minarik	241bb09db1	[DOCS] More assertive statement about requiring `query` in _count, etc	2014-01-23 20:35:44 +01:00
Nik Everett	93a8e80aff	Support multiple rescores Detects if rescores arrive as an array instead of a plain object. If so then parse each element of the array as a separate rescore to be executed one after another. It looks like this: "rescore" : [ { "window_size" : 100, "query" : { "rescore_query" : { "match" : { "field1" : { "query" : "the quick brown", "type" : "phrase", "slop" : 2 } } }, "query_weight" : 0.7, "rescore_query_weight" : 1.2 } }, { "window_size" : 10, "query" : { "score_mode": "multiply", "rescore_query" : { "function_score" : { "script_score": { "script": "log10(doc['numeric'].value + 2)" } } } } } ] Rescores as a single object are still supported. Closes #4748	2014-01-23 16:29:07 +01:00
Nik Everett	37f80c8d80	Documentation for score_mode Closes #4742	2014-01-23 16:24:48 +01:00
Brusic	d9b71a8083	[DOCS] various docs fixes Removed unused misc.asciidoc file Added plugins directory to directory layout Fixed transport.tcp.connect_timeout value to match the code found in NetworkService.TcpSettings Clarified that phrase query does not preserve order of terms Clarified merge page Added instructions on how to build documentation to docs/README	2014-01-23 10:52:13 +01:00
Clinton Gormley	8685818ad3	[DOCS] Moved termvector and mtermvectors from search to docs	2014-01-22 14:10:26 +01:00
Simon Willnauer	cb3bcb05be	[DOCS]: Fix added version termvectors.asciidoc	2014-01-22 12:08:13 +01:00
Simon Willnauer	e6ace1313e	[DOCS]: fixed added / coming tags in docs	2014-01-22 12:02:37 +01:00
Martijn van Groningen	2981edca54	[DOCS] `coming` instead of `added` for copy_to feature.	2014-01-22 11:26:22 +01:00
Martijn van Groningen	5a61a8b098	[DOCS] annotated the multi fields and copy_to feature with the right version.	2014-01-22 11:16:41 +01:00
Adrien Grand	9282ae4ffd	Terms aggregations: make size=0 return all terms. Terms aggregations return up to `size` terms, so up to now, the way to get all matching terms back was to set `size` to an arbitrary high number that would be larger than the number of unique terms. Terms aggregators already made sure to not allocate memory based on the `size` parameter so this commit mostly consists in making `0` an alias for the maximum integer value in the TermsParser. Close #4837	2014-01-22 11:05:10 +01:00
Martijn van Groningen	75778d082b	[DOCS] Moved multi fields documentation into the core-types page Removed docs about setting inheriting (was never added) Made mapping samples formatting similar as other ones.	2014-01-22 10:05:58 +01:00
Lee Hinman	2c289fb538	Add the ability to retrieve fields from field data Adds a new FetchSubPhase, FieldDataFieldsFetchSubPhase, which loads the field data cache for a field and returns an array of values for the field. Also removes `doc['<field>']` and `_source.<field>` workaround no longer needed in field name resolving. Closes #4492	2014-01-21 09:13:32 -07:00
Adrien Grand	fe351f14e8	Document `index.shard.check_on_startup`.	2014-01-21 15:55:59 +01:00
Martijn van Groningen	66ed9a855a	[DOCS] Added multi fields link to mapping page.	2014-01-21 10:52:32 +01:00
Shay Banon	e29659e36d	add internal force local flag, used by tribe node tribe node to set it to true so all master read operations will automatically execute on the local tribe node	2014-01-20 22:40:26 +01:00
Luca Cavanna	bdb1992e85	Fixed typo	2014-01-20 19:32:50 +01:00
Martijn van Groningen	9bc3d996ff	[SPECS] Updated percolator specs.	2014-01-20 18:18:27 +01:00
Igor Motov	649f1b13da	Initial implementation of custom _all field Closes #4520	2014-01-20 10:44:33 -05:00
Simon Willnauer	f0bce08c30	Return `MatchNoDocsQuery` if query string is emtpy Closes #3952	2014-01-20 16:08:57 +01:00
Florian Gilcher	eed079aaac	Reference docs fixes * Make it clearer that `aggs` is an allowed synomym for the `aggregations` key * Fix broken example in for datehistogram, `1.5M` is not an allowed interval * Make use of colon before examples consistent * Fix typos	2014-01-20 12:14:17 +01:00
Dawid Weiss	ae71b25145	Documentation typo.	2014-01-20 11:51:08 +01:00
Martijn van Groningen	db394117c4	Made sure that any filter that wraps a p/c filter (has_child & has_parent) either directly or indirectly will never be cached by making CustomQueryWrappingFilter extend from NoCacheFilter. Closes #4757	2014-01-20 10:54:09 +01:00
Alexander Reelsen	e34a35244c	[DOCS] Added documentation for CAT Aliases API Added asciidoc. Added new lines in java class.	2014-01-20 09:23:00 +01:00
Clinton Gormley	5003ca9278	[DOCS] Fixed file:/// URL for installing plugins	2014-01-20 01:34:12 +01:00
Andy Goldstein	8f659bccb1	Add documentation for transport.publish_port	2014-01-17 22:06:22 +01:00
David Pilato	38874e5f9b	Remove the "-f" script argument from the documentation Closes #4778.	2014-01-17 11:44:30 +01:00
dpen2000	bb19412122	[DOCS] Fixed typo in frontends.asciidoc	2014-01-16 13:19:51 +01:00
Clinton Gormley	8cb091e55d	[DOCS] Tidied up asciidoc for migration page	2014-01-16 12:22:05 +01:00
Luca Cavanna	4126ae2631	[DOCS] updated json responses after #4310 and #4480 - Removed "ok": true from response examples - Added "created" flag to index response examples - Replaced exists flag with found in delete response examples	2014-01-16 12:01:39 +01:00
Luca Cavanna	3399f6926a	[DOCS] made it clearer that the _version is incremented by all write operations (deletes included)	2014-01-16 11:44:46 +01:00
Igor Motov	4643f78098	[DOCS] Add documentation for URL repository	2014-01-15 13:13:16 -05:00
Clinton Gormley	3d4891321b	[DOCS] Minor changes to the breaking changes doc	2014-01-15 18:23:03 +01:00
Alexander Reelsen	c6155c5142	release [1.0.0.RC1]	2014-01-15 17:02:22 +00:00
Clinton Gormley	9e3f527721	[DOCS] Fixed asciidoc issue	2014-01-15 18:00:13 +01:00
Clinton Gormley	faddd66e87	[DOCS] Added breaking changes in 1.0	2014-01-15 17:50:24 +01:00
Clinton Gormley	12a095d797	[DOCS] Tidied up the multi-indices docs	2014-01-15 16:13:38 +01:00
Clinton Gormley	93ba3b5e70	[DOCS] Tidied up layout of setup docs	2014-01-15 15:09:34 +01:00
Lee Hinman	3062e59f51	[DOCS] Fix default setting in circuit breaker documentation	2014-01-15 07:05:05 -07:00
Clinton Gormley	a0b993e2dc	[DOCS] Tidied up cluster settings docs	2014-01-15 14:51:18 +01:00
Clinton Gormley	f8a427e266	[DOCS] Moved fielddata circuit breaker higher up the page	2014-01-15 14:00:08 +01:00
Alexander Reelsen	349a8be4fd	Consistent REST API changes for GETting data * Made GET mappings consistent, supporting * /{index}/_mappings/{type} * /{index}/_mapping/{type} * /_mapping/{type} * Added "mappings" in the JSON response to align it with other responses * Made GET warmers consistent, support /{index}/_warmers/{type} and /_warmer, /_warner/{name} as well as wildcards and _all notation * Made GET aliases consistent, support /{index}/_aliases/{name} and /_alias, /_aliases/{name} as well as wildcards and _all notation * Made GET settings consistent, added /{index}/_setting/{name}, /_settings/{name} as well as supportings wildcards in settings name * Returning empty JSON instead of a 404, if a specific warmer/ setting/alias/type is missing * Added a ton of spec tests for all of the above * Added a couple of more integration tests for several features Relates #4071	2014-01-14 22:33:52 +01:00
Igor Motov	ba7699a38b	Add documentation for index.routing.allocation.._name and index.routing.allocation.._id options	2014-01-14 16:20:46 -05:00
Britta Weber	411739fe3b	Make PUT and DELETE consistent for _mapping, _alias and _warmer See issue #4071 PUT options for _mapping: Single type can now be added with `[PUT\|POST] {index\|_all\|\|regex\|blank}/[_mapping\|_mappings]/type` and `[PUT\|POST] {index\|_all\|\|regex\|blank}/type/[_mapping\|_mappings]` PUT options for _warmer: PUT with a single warmer can now be done with `[PUT\|POST] {index\|_all\|\|prefix\|blank}/{type\|_all\|\|prefix\|blank}/[_warmer\|_warmers]/warmer_name` PUT options for _alias: Single alias can now be PUT with `[PUT\|POST] {index\|_all\|\|prefix\|blank}/[_alias\|_aliases]/alias` DELETE options _mapping: Several mappings can be deleted at once by defining several indices and types with `[DELETE] /{index}/{type}` `[DELETE] /{index}/{type}/_mapping` `[DELETE] /{index}/_mapping/{type}` where `index= * \| _all \| glob pattern \| name1, name2, …` `type= * \| _all \| glob pattern \| name1, name2, …` Alternatively, the keyword `_mapings` can be used. DELETE options for _warmer: Several warmers can be deleted at once by defining several indices and names with `[DELETE] /{index}/_warmer/{type}` where `index= * \| _all \| glob pattern \| name1, name2, …` `type= * \| _all \| glob pattern \| name1, name2, …` Alternatively, the keyword `_warmers` can be used. DELETE options for _alias: Several aliases can be deleted at once by defining several indices and names with `[DELETE] /{index}/_alias/{type}` where `index= * \| _all \| glob pattern \| name1, name2, …` `type= * \| _all \| glob pattern \| name1, name2, …` Alternatively, the keyword `_aliases` can be used.	2014-01-14 20:02:43 +01:00
Benjamin Vetter	ba8e012be9	Referring to stop analyzer for stopword docs #329	2014-01-14 11:53:30 +01:00
Benjamin Vetter	22a96e6a18	Added stopwords: _none_ to the docs #329	2014-01-14 11:53:29 +01:00
Igor Motov	b987615f5e	Improve support for partial snapshots Fixes #4701. Changes behavior of the snapshot operation. The operation now fails if not all primary shards are available at the beginning of the snapshot operation. The restore operation no longer tries to restore indices with shards that failed or were missing during snapshot operation.	2014-01-13 16:59:21 -05:00
Lee Hinman	b379bf5668	Default to not accepting type wrapper in indexing requests Currently it is possible to index a document as: ``` POST /myindex/mytype/1 { "foo"...} ``` or as: ``` POST /myindex/mytype/1 { "mytype": { "foo"... } } ``` This makes indexing non-deterministic and fields can be misinterpreted as type names. This changes makes Elasticsearch accept only the first form by default, ie without the type wrapper. This can be changed by setting `index.mapping.allow_type_wrapper` to `true`` when creating the index. Closes #4484	2014-01-13 14:37:00 -07:00
Clinton Gormley	0751f0b7c6	[DOCS] Fixed link to tribe.asciidoc	2014-01-13 22:01:12 +01:00
Clinton Gormley	2e79246c1a	[DOCS] Added docs for tribe node Related #4708	2014-01-13 21:53:53 +01:00
Andrew Raines	e13f55dfca	[DOCS] Update cat/indices to reflect ?pri flag	2014-01-13 14:18:27 -06:00
markharwood	541059a4d1	Adds a new coerce flag for numeric field mappings which is defaulted to true. When set to false a new strict mode of parsing is employed which a) does not permit numbers to be passed as JSON strings in quotes b) rejects numbers with fractions that are passed to integer, short or long fields. Closes #4117	2014-01-13 17:58:18 +00:00
markharwood	2795f4e55d	Standardized use of “_length” for parameter names rather than “_len”. Java Builder apis drop old “len” methods in favour of new “length” Rest APIs support both old “len: and new “length” forms using new ParseField class to a) provide compiler-checked consistency between Builder and Parser classes and b) a common means of handling deprecated syntax in the DSL. Documentation and rest specs only document the new “*length” forms Closes #4083	2014-01-13 15:59:15 +00:00
Simon Willnauer	8247e4beae	Rename RobinEngine and friends to InternalEngine Closes #4633	2014-01-13 15:49:10 +01:00
LightGuard	e89d5d0d86	Fixing up code block delimeters for asciidoctor You can now successfully run the docs through asciidoctor	2014-01-13 15:26:53 +01:00
Simon Willnauer	7f63ddf94e	Default stopwords list should be `_none_` for all but language-specific analyzers `standard_html_strip` and `pattern` analyzer support stopwords which are set to the default `english` stopwords by default. Those analyzers should not use stopwords by default since they are language neutral Closes #4699	2014-01-13 14:44:10 +01:00
Adrien Grand	5c237fe834	Add new option `min_doc_count` to terms and histogram aggregations. `min_doc_count` is the minimum number of hits that a term or histogram key should match in order to appear in the response. `min_doc_count=0` replaces `compute_empty_buckets` for histograms and will behave exactly like facets' `all_terms=true` for terms aggregations. Close #4662	2014-01-13 10:09:38 +01:00
Martijn van Groningen	943b62634c	Replaced the multi-field type in favour for the multi fields option that can be set on any core field. When upgrading to ES 1.0 the existing mappings with a multi-field type automatically get replaced to a core field with the new `fields` option. If a `multi_field` type-ed field doesn't have a main / default field, a default field will be chosen for the multi fields syntax. The new main field type will be equal to the first `multi_field` fields' field or type string if no fields have been configured for the `multi_field` field and in both cases the default index will not be indexed (`index=no` is set on the default field). If a `multi_field` typed field has a default field, that field will replace the `multi_field` typed field. Closes to #4521	2014-01-13 09:21:53 +01:00
Florian Schilling	464037e0c1	Geo clean Up ============ The default unit for measuring distances is MILES in most cases. This commit moves ES over to the International System of Units and make it work on a default which relates to METERS . Also the current structures of the `GeoBoundingBox Filter` changed in order to define the Bounding by setting abitrary corners. Distances --------- Since the default unit for measuring distances has changed to a default unit `DistanceUnit.DEFAULT` relating to meters, the REST API has changed at the following places: * `ScriptDocValues.factorDistance()` returns meters instead of miles * `ScriptDocValues.factorDistanceWithDefault()` returns meters instead of miles * `ScriptDocValues.arcDistance()` returns meters instead of miles one might use `ScriptDocValues.arcDistanceInMiles()` * `ScriptDocValues.arcDistanceWithDefault()` returns meters instead of miles * `ScriptDocValues.distance()` returns meters instead of miles one might use `ScriptDocValues.distanceInMiles()` * `ScriptDocValues.distanceWithDefault()` returns meters instead of miles one might use `ScriptDocValues.distanceInMilesWithDefault()` * `GeoDistanceFilter` default unit changes from kilometers to meters * `GeoDistanceRangeFilter` default unit changes from miles to meters * `GeoDistanceFacet` default unit changes from miles to meters Geo Bounding Box Filter ----------------------- The naming of the GeoBoundingBoxFilter properties allows to set arbitrary corners (see #4084) namely `top_right`, `top_left`, `bottom_right` and `bottom_left`. This change also includes the fields `topRight` and `bottomLeft` Also it is be possible to set the single values by using just `top`, `bottom`, `left` and `right` parameters. Closes #4515, #4084	2014-01-11 21:30:29 +09:00
Boaz Leskes	5ac7bd83ad	Expose min/max open file descriptors in Cluster Stats API Also changes the response format of that section to: ``` "open_file_descriptors": { "min": 200, "max": 346, "avg": 273 } ``` Closes #4681 Note: this is an aggregate of 3 commits in the 0.90 branch	2014-01-10 12:15:56 +01:00
Shay Banon	fe2a70831f	remove bloom from clear cache API, add id_cache	2014-01-09 21:08:45 +01:00
Clinton Gormley	3ab73ab957	Deprecate document _boost Fixes #4664	2014-01-09 16:04:01 +01:00
Simon Willnauer	bc5a9ca342	Rename edit_distance/min_similarity to fuzziness A lot of different API's currently use different names for the same logical parameter. Since lucene moved away from the notion of a `similarity` and now uses an `fuzziness` we should generalize this and encapsulate the generation, parsing and creation of these settings across all queries. This commit adds a new `Fuzziness` class that handles the renaming and generalization in a backwards compatible manner. This commit also added a ParseField class to better support deprecated Query DSL parameters The ParseField class allows specifying parameger that have been deprecated. Those parameters can be more easily tracked and removed in future version. This also allows to run queries in `strict` mode per index to throw exceptions if a query is executed with deprected keys. Closes #4082	2014-01-09 15:14:51 +01:00
Martijn van Groningen	eb63bb259d	Added `action.destructive_requires_name` that controls whether wildcard expressions and `_all` is allowed to be used for destructive operat Also the delete index api requires always an index to be specified (either concrete index, alias or wildcard expression) Closes #4549 #4481	2014-01-09 11:36:50 +01:00
Alexander Reelsen	7042a9aa65	[DOCS] Fix HTTP endpoints after stats API changes	2014-01-09 11:30:28 +01:00
Alexander Reelsen	1652767ec8	[DOCS] Added documentation for SameShardAllocationDecider Closes #4615	2014-01-09 11:24:12 +01:00
Martijn van Groningen	e6f83248a2	Deprecated disable allocation decider which has the following options: `allocation.disable_new_allocation`, `allocation.disable_allocation`, `allocation.disable_replica_allocation`, in favour for the enable allocation decider which has a single option `allocation.enable` wich can be set to the following values: `none`, `new_primaries`, `primaries` and `all` (default). Closes #4488	2014-01-09 10:01:46 +01:00
Martijn van Groningen	7e341cefd0	Change the `sort` boolean option in percolate api to the sort dsl available in search api. Closes #4625	2014-01-09 09:58:34 +01:00
Martijn van Groningen	0973b2863c	Added extra rest endpoint for get settings api. Added rest test to also test the get settings' prefix option.	2014-01-09 09:44:40 +01:00
Clinton Gormley	2e4b70d40f	[DOCS] Fixed duplicate ID in highlighting	2014-01-09 00:37:18 +01:00
Nik Everett	bbf0ec52de	Add warning phrase suggester's max_errors large number can badly impact performance.	2014-01-08 23:06:41 +01:00
Igor Motov	bec6527312	Add support for flat_settings flag to all REST APIs that output settings Closes #4140	2014-01-08 10:36:36 -05:00
Martijn van Groningen	6dc434822c	Changed get index settings api to use new internal get index settings api instead of relying on the cluster state api. The new internal get index settings api is more efficient when it comes to sending the index settings from the master to the client via the Also the get index settings support now all the indices options. Closes #4620	2014-01-08 13:18:57 +01:00
Nik Everett	8bd9e34e39	Stop FVH from throwing away some query boosts The FVH was throwing away some boosts on queries stopping a number of ways to boost phrase matches to the top of the list of fragments from working. The plain highlighter also doesn't work for this but that is because it doesn't support the concept of the same term having a different score at different positions. Also update documentation claiming that FHV is nicer for weighing terms found by query combinations. Closes #4351	2014-01-08 11:51:48 +01:00
Nik Everett	522d620eb6	Use FHV's phraseLimit This prevents poisoning the FVH with documents that contain TONS of matches which take tons of memory and time to highlight. Closes #4645	2014-01-08 11:27:58 +01:00
Alexander Reelsen	ad50afbec8	Simplify usage of nodes info API Important: This breaks backwards compatibility with 0.90 * Removed endpoints: /_cluster/nodes, /_cluster/nodes/nodeId1,nodeId2 * Disallow usage of parameters, but make required metrics part of URI * Changed NodesInfoRequest to return everything by default * Fixed NPE in NodesInfoResponse Closes #4055	2014-01-08 09:46:04 +01:00
Alexander Reelsen	6ef6bb993c	Cluster state API: Improved consistency Instead of specifying what kind of data should be filtered, this commit streamlines the API to actually specify, what kind of data should be displayed. This makes its behaviour similar to the other requests, like NodeIndicesStats. A small feature has been added as well: If you specify an index to select on, not only the metadata, but also the routing tables are filtered by index in order to prevent too big cluster states to be returned. Also the CAT apis have been changed to only return the wanted data in order to keep network traffic as small as needed. Tests for the cluster state API filtering have been added as well. Note: This change breaks backwards compatibility with 0.90! Closes #4065	2014-01-08 09:25:20 +01:00
Igor Motov	5d98341d11	Fix typo in snapshot/restore documentation	2014-01-07 14:03:12 -05:00
Shay Banon	4aa5ef139e	randomize flush interval so multiple shards won't flush at the sam time - also, allow to update interval using update settings on an index	2014-01-07 19:58:28 +01:00
markharwood	602de04692	A GeoHashGrid aggregation that buckets GeoPoints into cells whose dimensions are determined by a choice of GeoHash resolution. Added a long-based representation of GeoHashes to GeoHashUtils for fast evaluation in aggregations. The new BucketUtils provides a common heuristic for determining the number of results to obtain from each shard in "top N" type requests.	2014-01-07 18:03:33 +00:00
Lee Hinman	2cb40fcb17	Rename "exists" to "found" in TermVector and Get responses - Adds the "created" field to the index action response - Reverses Delete class' notFound to Found to avoid double negative	2014-01-07 09:47:07 -07:00
Simon Willnauer	fa16969360	Cleanup comments and class names s/ElasticSearch/Elasticsearch * Clean up s/ElasticSearch/Elasticsearch on docs/* * Clean up s/ElasticSearch/Elasticsearch on src/* bin/* & pom.xml * Clean up s/ElasticSearch/Elasticsearch on NOTICE.txt and README.textile Closes #4634	2014-01-07 11:21:51 +01:00
Andrew Raines	c46721a25f	Document h/headers switcheroo.	2014-01-06 16:08:48 -06:00
Martijn van Groningen	32c5471d33	Rename `score` to `track_scores` in percolate api. Closes #4624	2014-01-06 14:57:39 +01:00
Adrien Grand	9763d079b8	Eager norms loading options. Norms can be eagerly loaded on a per-field basis by setting norms.loading to `eager` instead of the default `lazy`: ``` "my_string_field" : { "type": "string", "norms": { "loading": "eager" } } ``` In case this behavior should be applied to all fields, it is possible to change the default value by setting `index.norms.loading` to `eager`. Close #4079	2014-01-06 09:53:42 +01:00
Alexander Reelsen	bb275166f1	Simplify nodes stats API First, this breaks backwards compatibility! * Removed /_cluster/nodes/stats endpoint * Excpect the stats types not as parameters, but as part of the URL * Returning all indices stats by default, returning all nodes stats by default * Supporting groups & types in nodes stats now as well * Updated documentation & tests accordingly * Allow level parameter for "shards" and "indices" (cluster does not make sense here) Closes #4057	2014-01-06 08:33:32 +01:00
Alexander Reelsen	33878be1e8	Simplify indices stats API Note: This breaks backward compatibility * Removed clear/all parameters, now all stats are returned by default * Made the metrics part of the URL * Removed a lot of handlers * Added shards/indices/cluster level paremeter to change response serialization * Returning translog statistics in IndicesStats * Added TranslogStats class * Added IndexShard.translogStats() method to get the stats from concrete implementation * Updated documentation Closes #4054	2014-01-06 07:27:03 +01:00
Lee Hinman	47607a69a1	Default the circuit breaker limit to 80% of the maximum JVM heap	2014-01-03 16:21:55 -07:00
Lee Hinman	5463f7953f	Expose `simple_query_string` flags in `flags` parameter	2014-01-03 16:14:19 -07:00
Alexander Reelsen	811b7d7d78	Do not start packages on installation The reason to not start packages on installation is to allow to configure them before starting up (setting heap, cluster.name etc) Also the documentation was updated in order to show, which statements need to be executed. In addition, these statements are also printed out when the package is installed, depending on whether chkconfig, system or update-rc.d is used. Closes #3722	2014-01-03 17:40:27 +01:00
Martijn van Groningen	f1bf585089	The `fields` option should always return an array for json document fields and single valued field for metadata fields. Also the `fields` option can only be used to fetch leaf fields, trying to do fetch object fields will return in a client error. Closes #4542	2014-01-03 17:29:12 +01:00
David Pilato	0c7b494bb8	plugin manager: new `timeout` option When testing plugin manager with real downloads, it could happen that the test run forever. Fortunately, test suite will be interrupted after 20 minutes, but it could be useful not to fail the whole test suite but only warn in that case. By default, plugin manager still wait indefinitely but it can be modified using new `--timeout` option: ```sh bin/plugin --install elasticsearch/kibana --timeout 30s bin/plugin --install elasticsearch/kibana --timeout 1h ``` Closes #4603. Closes #4600.	2014-01-03 16:48:18 +01:00
Britta Weber	9f54e9782d	rename _shard -> _index and also rename classes and variables closes #4584	2014-01-03 14:00:23 +01:00
Lee Hinman	a754224751	Add field data memory circuit breaker. This adds the field data circuit breaker, which is used to estimate the amount of memory required to load field data before loading it. It then raises a CircuitBreakingException if the limit is exceeded. It is configured with two parameters: `indices.fielddata.cache.breaker.limit` - the maximum number of bytes of field data to be loaded before circuit breaking. Defaults to `indices.fielddata.cache.size` if set, unbounded otherwise. `indices.fielddata.cache.breaker.overhead` - a contast for all field data estimations to be multiplied with before aggregation. Defaults to 1.03. Both settings can be configured dynamically using the cluster update settings API.	2014-01-02 15:04:47 -07:00
Martijn van Groningen	aa548f5148	Remove GET `_aliases` api in favour for GET `_alias` api Currently there are two get aliases apis that both have the same functionality, but have a different response structure. The reason for having 2 apis is historic. The GET _alias api was added in 0.90.x and is more efficient since it only sends the needed alias data from the cluster state between the master node and the node that received the request. In the GET _aliases api the complete cluster state is send to the node that received the request and then the right information is filtered out and send back to the client. The GET _aliases api should be removed in favour for the alias api Closes to #4539	2014-01-02 13:56:11 +01:00
Martijn van Groningen	f4bf0d5112	Replaced `ignore_indices` with `ignore_unavailable`, `expand_wildcards` and `allow_no_indices`. * `ignore_unavailable` - Controls whether to ignore if any specified indices are unavailable, this includes indices that don't exist or closed indices. Either `true` or `false` can be specified. * `allow_no_indices` - Controls whether to fail if a wildcard indices expressions results into no concrete indices. Either `true` or `false` can be specified. For example if the wildcard expression `foo` is specified and no indices are available that start with `foo` then depending on this setting the request will fail. This setting is also applicable when `_all`, `` or no index has been specified. * `expand_wildcards` - Controls to what kind of concrete indices wildcard indices expression expand to. If `open` is specified then the wildcard expression if expanded to only open indices and if `closed` is specified then the wildcard expression if expanded only to closed indices. Also both values (`open,closed`) can be specified to expand to all indices. Closes to #4436	2014-01-02 12:19:45 +01:00
Britta Weber	1ede9a5730	make term statistics accessible in scripts term statistics can be accessed via the _shard variable. Below is a minimal example. See documentation on details. ``` DELETE paytest PUT paytest { "mappings": { "test": { "_all": { "auto_boost": true, "enabled": true }, "properties": { "text": { "index_analyzer": "fulltext_analyzer", "store": "yes", "type": "string" } } } }, "settings": { "analysis": { "analyzer": { "fulltext_analyzer": { "filter": [ "my_delimited_payload_filter" ], "tokenizer": "whitespace", "type": "custom" } }, "filter": { "my_delimited_payload_filter": { "delimiter": "+", "encoding": "float", "type": "delimited_payload_filter" } } }, "index": { "number_of_replicas": 0, "number_of_shards": 1 } } } POST paytest/test/1 { "text": "the+1 quick+2 brown+3 fox+4 is quick+10" } POST paytest/test/2 { "text": "the+1 quick+2 red+3 fox+4" } POST paytest/_refresh POST paytest/_search { "script_fields": { "ttf": { "script": "_shard[\"text\"][\"quick\"].ttf()" } } } POST paytest/_search { "script_fields": { "freq": { "script": "_shard[\"text\"][\"quick\"].freq()" } } } POST paytest/test/2/_termvector POST paytest/_search { "script_fields": { "payloads": { "script": "term = _shard[\"text\"].get(\"red\",_PAYLOADS);payloads = []; for(pos : term){payloads.add(pos.payloadAsFloat(-1));} return payloads;" } } } POST paytest/_search { "script_fields": { "tv": { "script": "_shard[\"text\"][\"quick\"].freq()" } }, "query": { "function_score": { "functions": [ { "script_score": { "script": "_shard[\"text\"][\"quick\"].freq()" } } ] } } } ``` closes #3772	2014-01-02 11:17:33 +01:00
Adrien Grand	1654ae8937	Explicit doc_values setting. Once doc values are enabled on a field, they can't be disabled. Close #4560	2013-12-30 11:10:52 +01:00
Adrien Grand	05448b6276	Doc values for geo points. This commits add doc values support to geo point using the exact same approach as for numeric data: geo points for a given document are stored uncompressed and sequentially in a single binary doc values field. Close #4207	2013-12-27 12:45:18 +01:00
Florian Schilling	bc452dff84	* setup accurate GeoDistance Function * adapt tests * introduced default GeoDistance function * Updated docs closes #4498	2013-12-27 19:15:19 +09:00
Andrew Raines	69d88a1edd	[DOCS] Add headers and help parameters.	2013-12-23 22:26:28 -06:00
Martijn van Groningen	eb86a3a6fe	[DOCS] Changed `shape_field_name` to `path` in geo_shape filter documentation. Relates to #4486	2013-12-23 11:27:06 +01:00
Clinton Gormley	998b7b3b86	[DOCS] Fixed community links to official clients	2013-12-20 12:16:58 +01:00
Clinton Gormley	dea6b112ae	[DOCS] Corrected bloom loading docs	2013-12-20 11:20:54 +01:00
Clinton Gormley	2b8c82c883	[DOCS] Documented index.codec.bloom.load for #4525	2013-12-20 10:51:17 +01:00
Clinton Gormley	51dc057244	[DOCS] Added the official PHP client to the community page.	2013-12-20 10:51:17 +01:00
Richard Pijnenburg	df85fdf88f	Add repository information to docs This adds the apt and yum repo information to the setup docs.	2013-12-19 15:58:08 +01:00
Adrien Grand	52db8eb324	More documentation improvements for fielddata loading.	2013-12-18 16:05:35 +01:00
Adrien Grand	07443089ce	Improve documentation of the new `disabled` field data format.	2013-12-18 15:44:57 +01:00
Boaz Leskes	3c5106ae98	Added cluster health status to the Cluster Stats API Relates to #4460	2013-12-18 12:03:49 +01:00
Chris Simpson	4f8c916eed	[Docs] Fix Typo Fixes small typo in the geo_distance aggregation docs.	2013-12-18 11:21:21 +01:00
spenceralger	89e6b9cfc4	Merge pull request #4494 from spenceralger/add_js_docs JavaScript client docs	2013-12-17 14:41:57 -08:00
Spencer Alger	a8ca8497c5	added doc page for the JavaScipt client, and listed it in the clients list.	2013-12-17 15:26:29 -07:00
Boaz Leskes	2b6214cff7	Added Cluster Stats API Closes #4460	2013-12-17 13:14:46 +01:00
Grégory Quatannens	c64abaae7e	Fixing typo and grammar	2013-12-17 11:39:02 +01:00
Adrien Grand	33599d9a34	Compressed geo-point field data. This commit allows to trade precision for memory when storing geo points. This new field data impl accepts a `precision` parameter that controls the maximum expected error for storing coordinates. This option can be updated on a live index with the PUT mapping API. Default precision is 1cm, which requires 8 bytes per geo-point (50% memory saving compared to using 2 doubles). Close #4386	2013-12-17 11:29:48 +01:00
Clinton Gormley	684affa5c7	[DOCS] Removed unused file	2013-12-17 11:28:19 +01:00
Alexander Reelsen	b713cf56ed	Allow to provide parameters not only through -D but as long parameters All getopt long style parameters are now set as es. properties, elasticsearch --path.data=/some/path results in -Des.path.data=/some/path Closes #4393	2013-12-17 10:43:27 +01:00
Alexander Reelsen	c30945a3d8	Start elasticsearch in the foreground by default Instead of using the '-f' parameter to start elasticsearch in the foreground, this is now the default modus. In order to start elasticsearch in the background, the '-d' parameter can be used. Closes #4392	2013-12-17 10:39:22 +01:00
Clinton Gormley	34b9b16233	[DOCS] Fixed some bad link refs	2013-12-16 18:07:33 +01:00
Martijn van Groningen	23d2b1ea7b	Renamed top level `filter` to `post_filter`. Closes #4119	2013-12-16 17:10:14 +01:00
Lee Hinman	db431b7cb3	Remove the `field` and `text` queries. The `text` query was replaced by the `match` query and has been deprecated for quite a while. The `field` query should be replaced by a `query_string` query with the `default_field` specified. Fixes #4033	2013-12-16 08:59:36 -07:00
Adrien Grand	4e7ce4ee02	Make field data changes immediately taken into account and add the ability to disallow field data loading. This commit changes field data configuration updates so that they are immediately taken into account for loading new segments. The way it works is that field data configuration is now cached separately from the field data cache, meaning that it is now possible to clear the field data configuration from IndexFieldDataService while the cache will stay around. On the next time that Elasticsearch will reload field data configuration, it will check if there is already a cache entry, and reuse it if it exists. To disable field data loading, all that is required is to change the field data format to "none" (supported by all field data types) using the update mapping API. Elasticsearch will then refuse to load field data on any new segment, but field data which has been loaded on the previous segments will remain available. So you need to clear the field data cache in order to reclaim memory (otherwise memory will be reclaimed slower, as segments get merged). Close #4430 Close #4431	2013-12-16 14:34:33 +01:00
Adrien Grand	36bd9cc432	Aggregations: Ordinals-based string bucketing support. When the ValuesSource has ordinals, terms ordinals are used as a cache key to bucket ordinals. This can make terms aggregations on String terms significantly faster. Close #4350	2013-12-13 15:34:02 +01:00
Martijn van Groningen	10e2528cce	Added the `force_source` option to highlighting that enforces to use of the _source even if there are stored fields. The percolator uses this option to deal with the fact that the MemoryIndex doesn't support stored fields, this is possible b/c the _source of the document being percolated is always present. Closes #4348	2013-12-13 13:39:53 +01:00
Lee Hinman	77fcf71338	Add new `simple_query_string` query type This adds support for Lucene's SimpleQueryParser by adding a new type of query called the `simple_query_string`. The `simple_query_string` query is designed to be able to parse human-entered queries without throwing any exceptions. Resolves #4159	2013-12-12 12:09:32 -07:00
Alexander Reelsen	81e13a870b	Packaging: Ensure setting of sysctl vm.max_map_count In order to be sure that memory mapped lucene directories are working one can configure the kernel about how many memory mapped areas a process may have. This setting ensure for the debian and redhat initscripts as well as the systemd startup, that this setting is set high enough. Closes #4397	2013-12-11 09:19:22 +01:00
Boaz Leskes	99b421925f	Add wildcard support to field resolving in the Get Field Mapping API Closes #4367	2013-12-10 23:46:37 +01:00
Simon Willnauer	6c189310b9	Remove 'term_index_interval' and 'term_index_divisor' These settings are no longer relevant since they are codec / postingsformat level settings since Lucene 4.0 Closes #3912	2013-12-10 16:54:08 +01:00
Martijn van Groningen	ebf6519965	Added aggs option to percolate api documentation.	2013-12-10 14:09:37 +01:00
Lee Hinman	bc9698a347	Support 'yaml' as a format for the Analyze API Fixes #4311	2013-12-08 15:08:00 -07:00
Martijn van Groningen	8c1de501e7	Update percolator highlighting docs.	2013-12-07 16:40:49 -05:00
Adrien Grand	32eb5ffa92	[Docs] Document which encoding should be used in order to make sense of the offsets returned by the term vectors API. Close #4363	2013-12-06 22:39:08 +01:00
Lee Hinman	a1d4731137	[DOCS] Fix outdated link to wonderdog in community integration	2013-12-06 12:05:43 -07:00
Shay Banon	28eff2ba29	remove help command, list all cat commands in /_cat?h endpoint	2013-12-05 14:36:27 +01:00
Markus Fischer	2da0611dfb	[DOCS] Completion suggest: Clarify de-duplication, optimize/merge This contribution is based on the feedback given in issue #4254 and issue #4255, and should clear things up, when suggestions are being removed and not displayed anymore after deletion of data.	2013-12-05 11:10:56 +01:00
Nik Everett	8e34057bc0	Add support for combining fields to the FVH The Fast Vector Highlighter can combine matches on multiple fields to highlight a single field using `matched_fields`. This is most intuitive for multifields that analyze the same string in different ways. Example: { "query": { "query_string": { "query": "content.plain:running scissors", "fields": ["content"] } }, "highlight": { "order": "score", "fields": { "content": { "matched_fields": ["content", "content.plain"], "type" : "fvh" } } } } Closes #3750	2013-12-03 11:10:01 +01:00
Yousef	302c762d5e	Wrong link to Token Filter	2013-12-03 10:39:13 +01:00
Nik Everett	7690b40ec6	Allow string fields to store token counts To use this one you send a string to a field of type 'token_count'. This makes the most sense with a multi-field.	2013-12-03 09:39:32 +01:00
Alexander Reelsen	6528df2764	[DOCS] Test framework documentation The java test framework using randomized testing is explained with a couple of examples.	2013-12-02 18:01:45 +01:00
Clinton Gormley	7d993fd917	[DOCS] Another cat?v change	2013-12-02 15:30:49 +01:00
Clinton Gormley	5b15ed73fa	[DOCS] Linked cat-pending to cluster-pending	2013-12-02 15:29:47 +01:00
Clinton Gormley	992b2d82b0	[DOCS] Changed the _cat docs to use ?v instead of ?v=true	2013-12-02 15:27:41 +01:00
Clinton Gormley	d9a480c97a	[DOCS] Typos in aggregations	2013-12-02 15:14:25 +01:00
Conrad Pankoff	87246af256	[DOCS] Fixed typos and corrected grammar	2013-12-02 10:08:26 +01:00
uboness	cdc7dfbb2c	Changed the "script_lang" parameter to "lang" in all value source based aggs - to be consistent with all other script based APIs.	2013-12-02 02:01:03 +01:00
Clinton Gormley	bc393b6d79	Changed the minScore comparator from > to >= Closes #4303	2013-11-29 20:29:20 +01:00
uboness	0d6a35b9a7	- Added support for term filtering based on include/exclude regex on the terms agg - Added javadoc to the TermsBuilder Closes #4267	2013-11-29 13:46:48 +01:00
uboness	afb0d119e4	- Added docs for the value_count aggregation - Fixed typos in the terms facets docs - Fixed aggregation docs layout - Added docs for shard_size in term aggregation	2013-11-29 12:35:42 +01:00
Clinton Gormley	b48344f296	[DOCS] Doc'ed cluster pending tasks	2013-11-29 08:21:26 +01:00
Andrew Raines	91999e14ce	Add _cat/pending_tasks. Closes #4251.	2013-11-29 01:09:06 -06:00
Lee Hinman	9939e81d88	[DOCS] Fix porter stem filter name in other stemming docs	2013-11-28 22:14:47 -07:00
Lee Hinman	fb4e903e35	[DOCS] Fix name of porter stemming token filter	2013-11-28 22:01:19 -07:00
Clinton Gormley	6ce3495029	[DOCS] Fixed a bad link	2013-11-27 17:54:25 +01:00
Clinton Gormley	cdc1935b6e	[DOCS] Documented rest.action.multi.allow_explicit_index	2013-11-27 17:33:09 +01:00
Boaz Leskes	c63d8c4fb5	[Docs] Added _source filtering to documentation Relates to #3301	2013-11-26 19:16:24 +01:00
Britta Weber	dbef64009f	[DOC] add doc for multi term vector api closes #3998	2013-11-26 17:03:14 +01:00
Alexander Reelsen	bf74f49fdd	Updated Analyzing/Fuzzysuggester from lucene trunk * Minor alignments (like setter to ctor) * FuzzySuggester has a unicode aware flag, which is not exposed in the fuzzy completion request parameters * Made XAnalyzingSuggester flags (PAYLOAD_SEP, END_BYTE, SEP_LABEL) to be written into the postings format, so we can retain backwards compatibility * The above change also implies, that these flags can be set per instantiated XAnalyzingSuggester * CompletionPostingsFormatTest now uses a randomProvider for writing data to check for bwc	2013-11-26 12:52:06 +01:00
Martijn van Groningen	a03556daa0	Added execution option to `range` filter, with the `index` and `fielddata` as values. Deprecated `numeric_range` filter in favor for the `range` filter with `fielddata` as execution. Closes #4034	2013-11-25 23:43:40 +01:00
uboness	c7f6c5266d	initial commit of the aggregations module Closes #3300	2013-11-24 03:13:08 -08:00
Jun Ohtani	7bbe453273	[DOCS] Added elasticsearch-extended-analyze plugin	2013-11-21 09:48:00 +01:00
Clinton Gormley	7c59ed4087	[DOCS] Fixed duplicate docs ID in delete	2013-11-21 17:38:51 +11:00
Shay Banon	a9880dcbf1	add timeout doc to delete	2013-11-20 12:50:03 -08:00
Matt Weber	a841a422f6	Add a field data based TermsFilter Add FieldDataTermsFilter that compares terms out of the fielddata cache. When filtering on a large set of terms this filter can be considerably faster than using a standard lucene terms filter. Add the "fielddata" execution mode to the terms filter parser to enable the use of the new FieldDataTermsFilter. Add supporting tests and documentation. Closes #4209	2013-11-19 19:18:16 +01:00
Andrew Raines	8fabeb1c0b	First pass at cat docs.	2013-11-14 21:37:02 -05:00
Andrew Raines	5c085c1204	Fix misspellings.	2013-11-14 20:10:36 -05:00
Luca Cavanna	0aaa39d00a	Minor improvements to indices filter and query & updated docs Slightly simplified indices filter and query parsers code Trimmed down tests where possible	2013-11-14 17:25:34 +01:00
Olivier Favre	fa80ca97b2	Indices query/filter skip parsing altogether for irrelevant indices when possible Closes #2416	2013-11-14 17:24:49 +01:00
Igor Motov	510397aecd	Initial implementation of Snapshot/Restore API Closes #3826	2013-11-10 18:26:56 -05:00
Lee Hinman	f7d5d1e5c9	[DOCS] Update store docs to indicate mmapfs is now the default on 64-bit Linux	2013-11-09 11:42:43 -07:00
Clinton Gormley	5af4e02d6c	[DOCS] Fix link to statsd plugin Fixes #4128	2013-11-08 20:29:51 +01:00
Clinton Gormley	7189310764	In ctor of GeoPointFieldMapper, geohash_prefix now implicitly enables geohash option Also improved docs for geopoint type and geohash_cell filte Closes #3951	2013-11-08 13:52:17 +01:00
Cory G Watson	6bbcc34061	Add wabisabi to Scala clients.	2013-11-08 10:34:14 +01:00
Clinton Gormley	b27976fbed	[DOCS] Fixed the fielddata regex example on core mapping	2013-11-07 17:09:18 +01:00
Clinton Gormley	3465e69e83	[DOCS] Changed all store:yes/no to store:true/false which is how this setting is stored internally	2013-11-07 16:57:18 +01:00
Simon Willnauer	77bc5d5ecf	release [1.0.0.Beta1]	2013-11-06 15:32:43 +01:00
Simon Willnauer	9654631186	Change 'standart' analyzer to use emtpy stopword list by default. The 'default' / 'standard' analyzer can be a trappy default sicne it filters english stopwords by default. Yet a default should not be dedicated to a certain language since elasticsearch is used in many different scenarios where a standard analysis chain with specialization to english full-text might be rather counter productive. This commit changes the 'standard' analyzer to use an empty stopword list for indices that are created from 1.0.0.Beta1 version onwards but will maintain backwards compatibiliy for older indices. Closes #3775	2013-11-05 21:07:21 +01:00
Shay Banon	7c32269f4f	Dist. Percolation: Use .percolator instead of _percolator for type name Use .percolator as the internal (hidden) type name for percolators within the index. Seems nicer name to represent "hidden" types within an index. closes #4090	2013-11-05 20:02:59 +01:00
Boaz Leskes	a9fdcadf01	[DOCS] Added documentation for the keep word token filter	2013-11-04 18:38:44 +01:00
Clinton Gormley	356de95840	Added simplified range syntax to query string docs	2013-11-04 18:18:36 +01:00
Karel Minarik	b93dac678f	[DOC] Added a link to the official Ruby client to the "Clients" page	2013-11-04 11:47:14 +01:00
Karel Minarik	7023ef2e3f	[DOCS] Added a basic information about the official Ruby client to documentation	2013-11-04 11:46:36 +01:00
Ben McCann	46edfc484a	[DOCS] Add some documentation about the performance of `_source` usage in scripts.	2013-11-04 11:05:55 +01:00
Igor Motov	c724f0de5d	Initial implementation of ResourceWatcherService Closes #4062	2013-11-03 21:55:54 -05:00
Dan Everton	6df60b7271	[DOC] Improve documentation on search stats groups Document the ability to return all search statistics groups and provide examples of returning search statistics for groups.	2013-11-01 13:53:39 +01:00
Martijn van Groningen	30ab6f841d	[DOCS] Fixed percolate docs errors	2013-11-01 11:44:07 +01:00
Clinton Gormley	4206cc988e	[DOCS] Typo on shingle tokenfilter	2013-10-31 20:18:00 +01:00
Opak Alex	6856cfc5e3	add reference for ember-data-elasticsearch-kit to integrations page	2013-10-31 11:40:01 +01:00
Alexander Reelsen	dfcb3ca2d4	RegexpQueryBuilder now implements MultiTermQueryBuilder This allows the RegexpQueryBuilder to be used in span queries Added tests for all span multi term queries. Also updated the documentation and removed mentioning of numeric range queries for span queries (they have to be terms). Closes #3392	2013-10-31 09:12:57 +01:00
Boaz Leskes	8819f91d47	Add a GetFieldMapping API This new API allows to get the mapping for a specific set of fields rather than get the whole index mapping and traverse it. The fields to be retrieved can be specified by their full path, index name and field name and will be resolved in this order. In case multiple field match, the first one will be returned. Since we are now generating the output (rather then fall back to the stored mapping), you can specify `include_defaults`=true on the request to have default values returned. Closes #3941	2013-10-30 16:16:36 +01:00
Clinton Gormley	8b2efd4849	[DOCS] Added a version flag to percolation	2013-10-30 13:59:03 +01:00
Clinton Gormley	0585890a5f	[DOCS] Fixed a typo	2013-10-30 13:57:18 +01:00
Alexander Reelsen	2ec9742147	[DOCS] Extending setup as a service documentation * Tell people to use ES_JAVA_OPTS for es.node.name or similar parameters * Showing a simple way to install Oracle JDK on ubuntu/debian Closes #3999	2013-10-29 13:58:06 +01:00
David Pilato	5d90abf701	mget API should support global routing parameter mget API support `_routing` field but not `routing` parameter. Reproduction here: ```sh curl -XDELETE "http://localhost:9200/test/"; echo curl -XPUT "http://localhost:9200/test/" -d'{ "settings": { "number_of_replicas": 0, "number_of_shards": 5 } }'; echo curl -XPUT 'http://localhost:9200/test/order/1-1?routing=key1' -d '{ "productName":"doc 1" }'; echo curl -XPUT 'http://localhost:9200/test/order/1-2?routing=key1' -d '{ "productName":"doc 2" }'; echo curl -XPUT 'http://localhost:9200/test/order/1-3?routing=key1&refresh=true' -d '{ "productName":"doc 3" }'; echo curl -XPOST 'http://localhost:9200/test/order/_mget?pretty' -d '{ "docs" : [ { "_index" : "test", "_type" : "order", "_id" : "1-1", "_routing" : "key1" }, { "_index" : "test", "_type" : "order", "_id" : "1-2", "_routing" : "key1" }, { "_index" : "test", "_type" : "order", "_id" : "1-3", "_routing" : "key1" } ] }'; echo curl -XPOST 'http://localhost:9200/test/order/_mget?pretty&routing=key1' -d '{ "ids": [ "1-1", "1-2", "1-3" ] }'; echo ``` Closes #3996.	2013-10-28 21:05:55 +01:00
Britta Weber	c9dab6991e	rename and document "index.mapping.date.parse_upper_inclusive" setting for date fields The setting causes the upper bound for a range query/filter to be rounded up, therefore the name `round_ceil` seems to make more sense. Also this commit removes the redundant fourth parameter to DateMathParser.parse(..) which was never used. was: parse(String text, long now, boolean roundUp, boolean upperInclusive) is now: parse(String text, long now, boolean roundCeil) closes #3914	2013-10-28 15:48:31 +01:00
Ben McCann	cc4bc7d57d	Fix nonsensical sentence in standard analyzer documentation so that it is more understandable	2013-10-25 00:18:32 +02:00
Luca Cavanna	48ac9747a8	Added third highlighter type based on lucene postings highlighter Requires field index_options set to "offsets" in order to store positions and offsets in the postings list. Considerably faster than the plain highlighter since it doesn't require to reanalyze the text to be highlighted: the larger the documents the better the performance gain should be. Requires less disk space than term_vectors, needed for the fast_vector_highlighter. Breaks the text into sentences and highlights them. Uses a BreakIterator to find sentences in the text. Plays really well with natural text, not quite the same if the text contains html markup for instance. Treats the document as the whole corpus, and scores individual sentences as if they were documents in this corpus, using the BM25 algorithm. Uses forked version of lucene postings highlighter to support: - per value discrete highlighting for fields that have multiple values, needed when number_of_fragments=0 since we want to return a snippet per value - manually passing in query terms to avoid calling extract terms multiple times, since we use a different highlighter instance per doc/field, but the query is always the same The lucene postings highlighter api is quite different compared to the existing highlighters api, the main difference being that it allows to highlight multiple fields in multiple docs with a single call, ensuring sequential IO. The way it is introduced in elasticsearch in this first round is a compromise trying not to change the current highlight api, which works per document, per field. The main disadvantage is that we lose the sequential IO, but we can always refactor the highlight api to work with multiple documents. Supports pre_tag, post_tag, number_of_fragments (0 highlights the whole field), require_field_match, no_match_size, order by score and html encoding. Closes #3704	2013-10-24 23:38:00 +02:00
Luca Cavanna	e981e411d7	[DOCS] rephrased docs for highlight no_match_size parameter (removed 0.90.6 coming tag as it's needed only in 0.90 branch)	2013-10-24 14:38:32 +02:00
Nik Everett	14a709f563	Highlighting can return excerpt with no highlights You can configure the highlighting api to return an excerpt of a field even if there wasn't a match on the field. The FVH makes excerpts from the beginning of the string to the first boundary character after the requested length or the boundary_max_scan, whichever comes first. The Plain highlighter makes excerpts from the beginning of the string to the end of the last token before the requested length. Closes #1171	2013-10-24 14:38:32 +02:00
Boaz Leskes	0e6e6f97dc	Merge pull request #3940 from rboulton/patch-1 [Docs] Clean up wording in cluster health api doc	2013-10-22 04:09:13 -07:00
Markus Fischer	782d315da3	Fix markup	2013-10-21 16:11:09 +02:00
Richard Boulton	b62cc7c716	Clean up wording to reduce confusion The description of the timeout parameter was worded misleadingly; it implied that the API would wait until the cluster reached the desired level and then stayed at that level for the timeout. I've tweaked the sentence to remove the risk of confusion.	2013-10-21 12:37:50 +01:00
Clinton Gormley	b2d82d7e75	[DOCS] Reorganised the highlight_query docs and added a version flag	2013-10-18 18:03:31 +02:00
Matt Weber	1e0a834c68	Document strict dynamic type mapping.	2013-10-18 08:29:31 -07:00
Nik Everett	60550e4cc2	phrase_len is not called phrase_length	2013-10-18 09:29:53 -04:00
Clinton Gormley	adf0c8424b	[DOCS] How to check max_file_descriptors	2013-10-17 11:54:36 +02:00
David Pilato	4efd94e7cf	Java API Documentation (0.90+) needs update for accessors in Facets docs Closes #3921. (cherry picked from commit a753c48)	2013-10-17 09:50:15 +02:00
Honza Kral	dd43d932f1	Added a link to official Python client to the client list, fixed perl link	2013-10-16 17:51:50 +02:00
Honza Kral	4f3ad73854	Added brief overview of the python client to the guide	2013-10-16 17:45:05 +02:00
Martijn van Groningen	b7c4adeea3	[Docs] update reference to remove documentation about percolating during an index, bulk or update request.	2013-10-16 16:31:36 +02:00
Martijn van Groningen	1d0841e2b8	Added initial documentation for the redesigned percolator.	2013-10-16 14:12:19 +02:00
Boaz Leskes	18e12ef66c	[Docs] updated refrences to dynamic_date_formats	2013-10-16 12:04:31 +02:00
Boaz Leskes	57b2d45142	[Docs] added document for the lenient option in match queries	2013-10-16 10:53:25 +02:00
Clinton Gormley	f5e2cf9785	[Docs] Typo	2013-10-15 17:27:05 +02:00
Clinton Gormley	4798425da6	[Docs] Added a page for the Perl client	2013-10-15 17:22:34 +02:00
Alexander Reelsen	4d19239ec4	Add support for Lucene SuggestStopFilter The suggest stop filter is an improved version of the stop filter, which takes stopwords only into account if the last char of a query is a whitespace. This allows you to keep stopwords, but to allow suggesting for "a". Example: Index document content "a word". You are now able to suggest for "a" and get back results in the completion suggester, if the suggest stop filter is used on the query side, but will not get back any results for "a " as this is identified as a stopword. The implementation allows to set the `remove_trailing` parameter for a custom stop filter and thus use the suggest stop filter instead of the standard stop filter.	2013-10-15 16:12:02 +02:00
Clinton Gormley	870346070e	[DOCS] Added compound_on_flush docs and updated compound_format docs to include note about accepting a float	2013-10-15 13:30:56 +02:00
Clinton Gormley	d67331b554	[DOCS] Added script.disable_dynamic to the scripting page	2013-10-15 12:25:07 +02:00
steve mayzak	48656fd1ed	removed a duplicate paragraphin config docs	2013-10-14 15:33:56 -07:00
Britta Weber	34441f3897	fix naming in function_score - "boost" should be "boost_factor" - "mult" should be "multiply" Also, store combine function names in ImmutableMap instead of iterating over all possible names each time. closes #3872 for master	2013-10-14 14:56:59 +02:00
Simon Willnauer	25d6f04f13	[DOCS] Note that cutoff_frequency doesn't handle stacked tokens gracefully	2013-10-14 14:09:38 +02:00
Britta Weber	c3ab79a10e	[DOCS] Add doc for delimited payload token filter	2013-10-14 13:41:35 +02:00
Clinton Gormley	9a062e465c	[DOCS] Reorganised common API conventions	2013-10-13 16:46:56 +02:00
Clinton Gormley	4316b13880	[DOCS] Render common options on the same page	2013-10-13 14:14:50 +02:00
Shay Banon	420b3396f4	Set queue sizes by default on bulk/index thread pools Now that we properly fixed the ability to set the queue size on the index / bulk thread pool, we should actually set them to a somehow reasonable value to protect from users potentially overflowing our system. I suggest defaults to be 50 for bulk, and 200 for indexing. Also, set the thread pool for get, which we should set (in a similar value to a "read" queue size we have today). closes #3888	2013-10-12 21:51:37 +02:00
Subhash Gopalakrishnan	b758b76da4	Support year units in date math expressions According to http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-date-format.html, the date math expressions support M (month), w (week), h (hour), m (minute), and s (second) units. Why years are not supported? Please add support for year units. Closes #3828. Closes #3874.	2013-10-11 09:24:52 +02:00
Clinton Gormley	8462f88c39	[DOCS] Added more specific versions to the suggesters	2013-10-10 20:59:12 +02:00
Adrien Grand	f2d75654bf	Add clear warnings that only the default codec, postings format and doc values format have backward compatibility warranties.	2013-10-10 13:30:08 +02:00
Clinton Gormley	ba1b4886e3	[DOCS] Moved "named filters/queries" up one level	2013-10-10 11:23:08 +02:00
Jonathan CHAMPION	278e99ef69	Fix small doc mistakes	2013-10-10 11:20:13 +02:00
Adrien Grand	4fa8f6f61f	Doc values integration. This commit allows for using Lucene doc values as a backend for field data, moving the cost of building field data from the refresh operation to indexing. In addition, Lucene doc values can be stored on disk (partially, or even entirely), so that memory management is done at the operating system level (file-system cache) instead of the JVM, avoiding long pauses during major collections due to large heaps. So far doc values are supported on numeric types and non-analyzed strings (index:no or index:not_analyzed). Under the hood, it uses SORTED_SET doc values which is the only type to support multi-valued fields. Since the field data API set is a bit wider than the doc values API set, some operations are not supported: - field data filtering: this will fail if doc values are enabled, - field data cache clearing, even for memory-based doc values formats, - getting the memory usage for a specific field, - knowing whether a field is actually multi-valued. This commit also allows for configuring doc-values formats on a per-field basis similarly to postings formats. In particular the doc values format of the _version field can be configured through its own field mapper (it used to be handled in UidFieldMapper previously). Closes #3806	2013-10-09 16:34:30 +02:00
Matt Weber	3225375a77	Add monitoring link for es2graphite.	2013-10-09 10:47:59 +02:00
Lee Hinman	dede6ee874	Remove extra 'processors' anchor in threadpool docs	2013-10-09 01:56:49 -06:00
Adrien Grand	97958ed02a	Improved warm-up of new segments. * Merged segments are now warmed-up at the end of the merge operation instead of _refresh, so that _refresh doesn't pay the price for the warm-up of merged segments, which is often higher than flushed segments because of their size. * Even when no _warmer is registered, some basic warm-up of the segments is performed: norms, doc values (_version). This should help a bit people who forget to register warmers. * Eager loading support for the parent id cache and field data: when one can't predict what terms will be present in the index, it is tempting to use a match_all query in a warmer, but in that case, query execution might not be much faster than field data loading so having a warmer that only loads field data without running a query can be useful. Closes #3819	2013-10-08 23:06:55 +02:00
Clinton Gormley	264a00a40f	[DOCS] Added pages explaining lucene query parser syntax and regular expression syntax	2013-10-07 14:42:49 +02:00
Alexander Reelsen	f0cf97c0ac	Changed documentation to use getter notation Updated some java documentation to reflect the use of getters instead of calling methods based on field names. Relates to #2657	2013-10-06 21:18:43 +02:00
Clinton Gormley	7a53d41446	[DOCS] Changed capitalization of operator in rescore query	2013-10-05 17:18:15 +02:00

... 5 6 7 8 9 ...

713 Commits