OpenSearch

Commit Graph

Author	SHA1	Message	Date
mikemccand	00fcf4d560	#6081 : set IO throttling back to 20 MB/sec now that #6018 is fixed	2014-05-12 14:42:26 -04:00
mikemccand	b6ae7fbadb	#5882 : fix docs	2014-05-12 14:16:27 -04:00
mikemccand	254ebc2f88	#6120 Remove SerialMergeScheduler (master only) It's dangerous to expose SerialMergeScheduler as an option: since it only allows one merge at a time, it can easily cause merging to fall behind. Closes #6120	2014-05-12 14:06:20 -04:00
Lee Hinman	e7e4ef859a	Add /_cat/fielddata to display fielddata usage Closes #4593	2014-05-09 13:18:02 +02:00
Alex Ksikes	dae48d9fe8	Added the ability to include the queried document for More Like This API. By default More Like This API excludes the queried document from the response. However, when debugging or when comparing scores across different queries, it could be useful to have the best possible matched hit. So this option lets users explicitly specify the desired behavior. Closes #6067	2014-05-09 12:59:39 +02:00
Alex Ksikes	48b7172ee7	Provided some insights as to how More Like This works internally. In the Google Groups forum there appears to be some confusion as to what mlt does. This documentation update should hopefully help demystifying this feature, and provide some understanding as to how to use its parameters. Closes #6092	2014-05-09 12:13:29 +02:00
javanna	bd2a616c82	[DOCS] fixed broken json in multi term vectors docs	2014-05-08 16:01:13 +02:00
javanna	2999152e19	[DOCS] fixed typo in multi term vectors docs	2014-05-08 15:50:24 +02:00
Ivan Brusic	bac0627c5e	Update fielddata.asciidoc Spelling correction	2014-05-08 10:59:24 +02:00
Ivan Brusic	59e0c34cdb	Update fielddata.asciidoc Fixed default value for circuit breaker	2014-05-08 10:58:10 +02:00
Andrew Selden	f23274523a	Integration tests for benchmark API. - Randomized integration tests for the benchmark API. - Negative tests for cases where the cluster cannot run benchmarks. - Return 404 on missing benchmark name. - Allow to specify 'types' as an array in the JSON syntax when describing a benchmark competition. - Don't record slowest for single-request competitions. Closes #6003, #5906, #5903, #5904	2014-05-07 14:14:54 -07:00
mikemccand	9daaae27b3	clarify that CMS defaults change is coming in 1.2	2014-05-07 13:49:54 -04:00
uboness	fc52db1209	Changed the respnose structure of the percentiles aggregation where now all the percentiles are placed under a `values` object (or `values` array in case the `keyed` flag is set to `false` Closes #5870	2014-05-07 18:35:24 +02:00
Chris Earle	12f758e811	[DOCS] Update nodes documentation with all headers Adds a table with the exhaustive list of all available headers with a brief description (mostly from `org.elasticsearch.rest.action.cat.RestNodesAction`) so that people do not need to go searching for them in the code like I did, or search through `nodes?help`.	2014-05-07 11:18:22 -05:00
Britta Weber	7944369fd1	Add `shard_min_doc_count` parameter for significant terms similar to `shard_size` Significant terms internally maintain a priority queue per shard with a size potentially lower than the number of terms. This queue uses the score as criterion to determine if a bucket is kept or not. If many terms with low subsetDF score very high but the `min_doc_count` is set high, this might result in no terms being returned because the pq is filled with low frequent terms which are all sorted out in the end. This can be avoided by increasing the `shard_size` parameter to a higher value. However, it is not immediately clear to which value this parameter must be set because we can not know how many terms with low frequency are scored higher that the high frequent terms that we are actually interested in. On the other hand, if there is no routing of docs to shards involved, we can maybe assume that the documents of classes and also the terms therein are distributed evenly across shards. In that case it might be easier to not add documents to the pq that have subsetDF <= `shard_min_doc_count` which can be set to something like `min_doc_count`/number of shards because we would assume that even when summing up the subsetDF across shards `min_doc_count` will not be reached. closes #5998 closes #6041	2014-05-07 18:02:56 +02:00
Richard Boulton	fdb5eb6555	Update keyword-tokenizer.asciidoc	2014-05-07 15:04:07 +02:00
violuke	9ed34b5a9e	Correcting gramma	2014-05-06 18:00:19 +02:00
田传武	78b85d658c	[DOCS] Added vertx elasticsearch integration	2014-05-06 17:57:35 +02:00
Clinton Gormley	394a3e4332	[DOCS] Updated the mapping and field mapping docs to use the new format Closes #6057	2014-05-06 17:21:09 +02:00
Keiji Yoshida	80d7bc3423	Update getting-started.asciidoc Fixed "Jone Done" to "Jone Doe"	2014-05-06 16:32:33 +02:00
Matthieu Bacconnier	7fd5f18539	Update asciifolding-tokenfilter.asciidoc Typo	2014-05-06 16:30:09 +02:00
Benjamin Devèze	6feeac98c8	s/boost_factor/boost in custom_filters_score doc I may be wrong but I think custom_filters_score used boost rather than boost factor?	2014-05-06 16:15:36 +02:00
Clinton Gormley	2e03a6629b	Update create-index.asciidoc Document defaults for `number_of_shards` and `number_of_replicas` Closes #5899	2014-05-06 16:10:23 +02:00
Audrey	d7023fbb3f	Update "Character classes" part	2014-05-06 16:05:51 +02:00
Kevin Wang	33d256119d	fix field data stats doc	2014-05-06 15:57:00 +02:00
gabriel-tessier	7b0efcbd96	fix typo	2014-05-06 15:54:36 +02:00
Radu Gheorghe	c4477f0ded	Removed mention of Spatial4J and JTS requirement AFAIK, on 1.0 at least (and later), those libraries are included.	2014-05-06 14:49:48 +02:00
pickypg	2c11475bdd	Update geo-shape-type documentation Update `geo-shape-type.asciidoc` to include all `GeoShapeType`s supported by the `org.elasticsearch.common.geo.builders.ShapeBuilder`. Changes include: 1. A tabular mapping of GeoJSON types to Elasticsearch types 2. Listing all types, with brief examples, for all support Elasticsearch types 3. Putting non-standard types to the bottom (really just moving Envelope to the bottom) 4. Linking to all GeoJSON types. 5. Adding whitespace around tightly nested arrays (particularly `multipolygon`) for readability	2014-05-06 14:41:00 +02:00
Kevin Wang	19468880a8	[DOCS] add compass and compress_threshold to binary field mapping doc	2014-05-06 14:27:35 +02:00
Ali Bozorgkhan	f1af845795	[DOCS] Fixed a typo Close #5963	2014-05-06 10:28:13 +02:00
Igal	20b05b56c4	[DOCS] Update client.asciidoc Should be classpath rather than classloader. Close #5965	2014-05-06 10:28:13 +02:00
Audrey	52d2f2d229	[DOCS] Update phrase-suggest.asciidoc Grammatical error Close #5993	2014-05-06 10:28:13 +02:00
Adrien Grand	fc78dd2f13	[DOC] Fix default values for filter cache size and field data circuit breaker. Relates to #5990	2014-05-06 10:13:05 +02:00
mikemccand	07563379dc	fix docs for merging and throttling	2014-05-05 16:22:00 -04:00
Clinton Gormley	7a9aad30f4	[DOCS] Changed score_type to score_mode for has_child/parent queries	2014-05-05 18:30:12 +02:00
Alexander Reelsen	d4fcf23057	Cluster State API: Remove index template filtering The possibility of filtering for index templates in the cluster state API had been introduced before there was a dedicated index templates API. This commit removes this support from the cluster state API, as it was not really clean, requiring you to specify the metadata and the index templates. Closes #4954	2014-05-05 14:54:14 +02:00
gabriel-tessier	48930c2950	[DOC] Fix typo in function score query documentation.	2014-05-02 23:44:56 +02:00
Alex Ksikes	b55d8ed2e3	Fix behavior on default boost factor for More Like This. A boost terms factor of 1.0 is not the same as no boosting of terms. The desired behavior is to deactivate boosting by default. If the user specifies any value other than 0, then boosting is activated. Closes #6021	2014-05-02 16:59:09 +02:00
Mansur Ashraf	d5f90e9803	[DOCS] Added Twitter Storehaus client Added Twitter Storehaus client	2014-05-02 12:08:05 +02:00
Holger Hoffstätte	f5c9bf6f0f	Update JNA to latest version Updating to this version allows to configure a special JNA directory, in case the /tmp directory is mounted with the noexec option, as JNA extracts some data and tries to execute parts of it. Also updated documentation to clarify mlockall and memory settings as well as pointing to the new jna.tmpdir system property. Closes #5493	2014-05-02 11:52:57 +02:00
Martijn van Groningen	013b319415	Added `reverse_nested` aggregation. The `reverse_nested` aggregation allows to aggregate on properties outside of the nested scope of a `nested` aggregation. Closes #5507	2014-05-01 00:23:05 +07:00
Binh Ly	fe89b8735a	[DOC] Fixed filtered_query typo	2014-04-29 10:24:52 -04:00
Robert Muir	8e0a479316	Upgrade to Lucene 4.8 Closes #5932	2014-04-28 06:45:50 -04:00
Chris Earle	5528370e24	Added type, max, min, queueSize & keepAlive to _cat/thread_pool Closes #5366	2014-04-28 12:00:27 +02:00
Simon Willnauer	f285ffc610	Multi value handling in decay functions Decay functions currently only use the first value in a field that contains multiple values to compute the distance to the origin. Instead, it should consider all distances if more values are in the field and then use one of min/max/sum/avg which is defined by the user. Relates to #3960 closes #5940	2014-04-28 11:55:32 +02:00
javanna	5d1d5d6754	[DOCS] Removed leftover indices status link	2014-04-28 11:39:12 +02:00
javanna	1685e3611c	[DOCS] Fixed get asciidoc missing section warning	2014-04-28 11:39:12 +02:00
javanna	16468f9ca3	[DOCS] Fixed scripting example	2014-04-28 11:39:12 +02:00
Clinton Gormley	4b9f1d261d	Removed indices-status docs. Related #4854	2014-04-28 10:40:45 +02:00
Lee Hinman	81e83cca74	Disable dynamic scripting by default Closes #5853	2014-04-25 15:08:26 -06:00
Boaz Leskes	051beb51a3	Version types `EXTERNAL` & `EXTERNAL_GTE` test for version equality in read operation & disallow them in the Update API Separate version check logic for reads and writes for all version types, which allows different behavior in these cases. Change `VersionType.EXTERNAL` & `VersionType.EXTERNAL_GTE` to behave the same as `VersionType.INTERNAL` for read operations. The previous behavior was fit for writes but is useless in reads. This commit also makes the usage of `EXTERNAL` & `EXTERNAL_GTE` in the update api raise a validation error as it make cause data to be lost. Closes #5663 , Closes #5661, Closes #5929	2014-04-25 23:06:12 +02:00
Uwe Dauernheim	080c4ade25	Fix typo	2014-04-25 14:59:10 -06:00
Benoss	ed33b022d3	Update setup repositories documentation Update doc so http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/setup-repositories.html example is going to 1.1 instead of 0.90	2014-04-25 14:57:23 -06:00
Clinton Gormley	c1e03bf860	Update keyword-repeat-tokenfilter.asciidoc	2014-04-24 16:44:02 +02:00
Clinton Gormley	39705aa236	[DOCS] rewrite -> fuzzy_rewrite in match query Fixed typo	2014-04-23 21:05:14 +02:00
Simon Willnauer	b36ef995bb	Change default recovery throttling to 50MB / sec The current setting of 20MB/sec seems to be too conservative given the capabilities of modern hardware / network throughput. A 50MB default should provide better out of the box performance.	2014-04-23 15:40:21 +02:00
Robert Muir	8568c18e6f	Change default numeric precision_step Change the default numeric precision_step to 16 for 64-bit types, 8 for 32-bit and 16-bit types. Disable precision_step for the 8-bit byte type. Closes #5905	2014-04-23 09:01:25 -04:00
Simon Willnauer	b4f0603169	Change default merge throttling to 50MB / sec The current setting of 20MB/sec seems to be too conservative given the capabilities of modern hardware. Even on cloud infrastructure this seems to be too lowish. A 50MB default should provide better out of the box performance	2014-04-22 21:08:40 +02:00
Binh Ly	1746f2f792	[DOCS] getting started tutorial	2014-04-22 13:33:03 -04:00
Lee Hinman	57bee03193	[DOCS] Add /_search_shards documentation	2014-04-22 08:54:32 -06:00
Simon Willnauer	1cf62e7782	Use unlimited flush_threshold_ops for translog Currently we use 5k operations as a flush threshold. Indexing 5k documents per second is rather common which would cause the index to be committed on the lucene level each time the flush logic runs which is 5 seconds by default. We should rather use a size based threshold similar to the lucene index writer that doesn't cause such agressive commits which can slow down indexing significantly especially since they cause the underlying devices to fsync their data.	2014-04-22 16:37:07 +02:00
Clinton Gormley	3ba8fbbef8	Update benchmark.asciidoc Fixed incorrect parameter spec for benchmark nodes	2014-04-22 14:16:10 +02:00
Clinton Gormley	0e782331be	Update benchmark.asciidoc	2014-04-21 20:39:33 +02:00
Samuel Molinari	909cf4de44	Update function-score-query.asciidoc	2014-04-20 13:39:32 +02:00
David Pilato	f3fe50aac4	[DOCS] fix typo	2014-04-19 22:44:44 +02:00
Xiao Yu	4b5e8cec8e	Add a site plugin into list Howdy, Not sure if this is kosher but I would like to add my site plugin to the list in the docs.	2014-04-17 19:28:37 +02:00
Christoph Frick	e3e631eca5	Update allocation.asciidoc	2014-04-17 14:42:58 +02:00
Igor Motov	4c3027729e	[DOCS] Make snapshot repository examples consistent	2014-04-16 17:28:43 -04:00
Clinton Gormley	65906d176a	Update multi-match-query.asciidoc Typo	2014-04-16 15:41:38 +02:00
Kouhei Sutou	de59cde926	Remove garbage	2014-04-15 17:57:25 +02:00
Simon Willnauer	9898eed30c	[DOCS] Update merge docs to reflect the max_merge_at_once property	2014-04-15 16:42:23 +02:00
Simon Willnauer	320a206352	Switch back to ConcurrentMergeScheduler Load tests showed that SerialMS has problems to keep up with the merges under high load. We should switch back to CMS until we have a better story to balance merge threads / efforts across shards on a single node. Closes #5817	2014-04-15 16:42:23 +02:00
Scott Wilkerson	9ea0e3a95b	Update percolate.asciidoc fix typo	2014-04-15 16:01:44 +02:00
eliasah	c61110c28d	Update core-types.asciidoc Missing bracket	2014-04-15 15:57:04 +02:00
Yousef	d7fda621e9	Updated date_formats to new dynamic_date_formats	2014-04-15 15:44:08 +02:00
Andrew Selden	2cf66c4115	Benchmark documentation Moving benchmark documentation under the search section. Closes #5786	2014-04-14 14:08:41 -07:00
Peter Dyson	f8537183b9	[DOCS] update old status of plugins	2014-04-13 20:18:19 -04:00
Malte Schirnacher	8ce3bba010	Fix typos in percolate.asciidoc Close #5762 #5763 #5764	2014-04-11 18:09:16 +02:00
Sean Gallagher	80ebd49253	[DOCS] Added tables and fixes to upgrade.asciidoc, fixed version in README.textile Author: Sean Gallagher Date: 10 Apr 2014 15:23 EDT	2014-04-10 15:23:07 -04:00
Nik Everett	40f1913cf3	[Docs] Add experimental highlighter plugin	2014-04-10 13:32:34 -04:00
Andrew Selden	e2c8ff92ba	Benchmark API Add an API endpoint at /_bench for submitting, listing, and aborting search benchmarks. This API can be used for timing search requests, subject to various user-defined settings. Benchmark results provide summary and detailed statistics on such values as min, max, and mean time. Values are reported per-node so that it is easy to spot outliers. Slow requests are also reported. Long running benchmarks can be viewed with a GET request, or aborted with a POST request. Benchmark results are optionally stored in an index for subsequent analysis. Closes #5407	2014-04-09 13:06:55 -07:00
Nik Everett	af0278b51b	[Docs] Allocation setting explanation Closes #5748	2014-04-09 12:11:36 -06:00
Costin Leau	960d353dbd	Remove plugin isolation feature for a future version relates #5261	2014-04-09 17:28:11 +03:00
Andrew O'Brien	48031b6236	Fixes typo in "Scan" search type documention	2014-04-07 16:01:37 -06:00
Sean Gallagher	5138083e13	Author: Sean Gallagher Date: Tue Apr 1 12:28:00 2014 Added upgrade.asciidoc and links to it from setup.asciidoc Author: Sean Gallagher Date: Apr 1 2014 Added upgrade.asciidoc Add upgrade instructions Author: Sean Gallagher Date: 4/4/14 Closes issue #5651 Fixed upgrade.asciidoc typo and incorrect usage. Author: Sean Gallagher Date: 4 Apr 2014 Closes 5651	2014-04-07 14:43:35 -04:00
wittyameta	94278d81e3	Update advanced-scripting.asciidoc	2014-04-07 07:20:13 -06:00
Richard Pijnenburg	c6caeea887	Update link to puppet module and remove link to other RPM repo as we have our own.	2014-04-07 14:24:10 +02:00
Richard Pijnenburg	d8364e89a7	Fix typo and add more clients	2014-04-07 13:52:06 +02:00
Richard Pijnenburg	043d78565f	Removing EOL client rubberband and adding official php client	2014-04-07 13:51:44 +02:00
Kevin Wang	ecab74fe6c	add lucene language model similarities (Dirichlet & JelinekMercer)	2014-04-07 10:48:03 +02:00
Kevin Wang	866c520abb	Add doc value for binary field. Close #5669	2014-04-07 10:18:55 +02:00
gabriel-tessier	000c33aac3	fix typo	2014-04-07 09:23:46 +02:00
Martijn van Groningen	ade1d0ef57	Added global ordinals (unique incremental numbering for terms) to fielddata. Added a terms aggregation implementations that work on global ordinals, which is also the default. Closes #5672	2014-04-07 11:06:41 +07:00
Lee Hinman	211f740100	Add `getAsRatio` to Settings class, allow DiskThresholdDecider to take percentages Adds new RatioValue class that parses ratios between 0-100% expressed in either floating-point (0.13) or percentage (51.12%) notation. Closes #5690	2014-04-04 13:19:35 -06:00
Karl Meisterheim	6d993bc810	[DOCS] A few grammar and word use corrections	2014-04-04 19:26:38 +02:00
Peter Dyson	233279bb64	[DOCS] Fixed typo	2014-04-04 17:37:56 +02:00
Lee Hinman	c3089701f2	[DOCS] remove extraneous ` from cache page	2014-04-02 16:07:00 -06:00
Alexander Reelsen	e547e113e1	Geo context suggester: Require precision in mapping The default precision was way too exact and could lead people to think that geo context suggestions are not working. This patch now requires you to set the precision in the mapping, as elasticsearch itself can never tell exactly, what the required precision for the users suggestions are. Closes #5621	2014-04-02 23:51:14 +02:00
Radu Gheorghe	b9cb70198e	Typo in the description for include_in_all I know this is uber-minor, but I was confused by the phrase "the raw field value to be copied". I assume "is" was supposed to be instead of "to"	2014-04-02 12:02:12 +02:00
Binh Ly	51a6a95de3	[DOC] Fixed flags example incorrect syntax	2014-04-01 14:43:38 -04:00
Igor Motov	d13850814e	[DOCS] "F" is not valid false value for boolean type	2014-04-01 08:16:43 -04:00
Nik Everett	1df942b463	[docs] Indices stats groups in nodes api Closes #5349	2014-03-31 19:54:48 +02:00
javanna	8fe6fe638d	[DOCS] fixed transport client link in java api docs	2014-03-31 18:35:57 +02:00
Hannes Korte	c11293ad78	Fix some typos in documentation.	2014-03-31 13:48:17 +02:00
Alex Brasetvik	cd8ed388d9	Document http.cors-settings	2014-03-31 11:34:46 +02:00
Andrew O'Brien	bd9c1bc8d9	Update has-parent-filter.asciidoc "This filter return child..." => This filter returns child...	2014-03-31 00:06:35 +02:00
Kevin Wang	ceed22fe00	Add suggest stats closes #4032	2014-03-28 11:13:54 +01:00
Lee Hinman	8fbd1bdd48	Add the `field_value_factor` function to the function_score query The `field_value_factor` function uses the value of a field in the document to influence the score. A query that looks like: { "query": { "function_score": { "query": {"match": { "body": "foo" }}, "functions": [ { "field_value_factor": { "field": "popularity", "factor": 1.1, "modifier": "square" } } ], "score_mode": "max", "boost_mode": "sum" } } } Would have the score modified by: square(1.1 * doc['popularity'].value) Closes #5519	2014-03-27 14:29:37 -06:00
Shay Banon	6fce15beec	Tribe: Index level blocks, index conflict settings allow to configure on the index level which blocks can optionally be applied using tribe.blocks.indices prefix settings. allow to control what will be done when a conflict is detected on index names coming from several clusters using the tribe.on_conflict setting. Defaults remains "any", but now support also "drop" and "prefer_[tribeName]". closes #5501	2014-03-27 09:45:20 -07:00
Peter Dyson	029c7b174a	Adding Kopf to community list of monitoring tools. Adding versatile monitoring and administration tool Kopf to the community section of the documentation.	2014-03-27 17:07:49 +01:00
David Pilato	85b9aafaad	[DOCS] `_type` instead of Type Field	2014-03-27 08:35:15 +01:00
Igor Motov	3ffd0a1dfa	Remove deprecated gateways Closes #5422	2014-03-26 18:10:51 -04:00
Igor Motov	c2e38fbf78	[DOCS] Clarify nested type documentation	2014-03-26 11:57:41 -04:00
javanna	42c36ef72d	[DOCS] fixed typo Closes #5272	2014-03-26 14:51:02 +01:00
Kevin Wang	374b633a4b	add uppercase token filter closes #5539	2014-03-26 15:07:43 +07:00
bleskes	5d832374dd	Update Documentation Feature Flags [1.1.0]	2014-03-25 17:51:30 +01:00
Adrien Grand	c977a49b76	[DOC] Clarify settings and documentation about norms.	2014-03-25 16:05:23 +01:00
Boaz Leskes	fc8dc3f733	[Docs] updated the search template and query template docs	2014-03-25 15:25:02 +01:00
Adrien Grand	1c0b6da0ac	Allow to disable norms on an existing field. Close #4813	2014-03-25 14:13:06 +01:00
Alexander Reelsen	4fc461a97c	[DOCS] Moved the template query documentation into search section	2014-03-25 10:01:41 +01:00
Simon Willnauer	b4e504df99	[Docs] Add coming tag for context suggester docs	2014-03-25 09:46:49 +01:00
Igor Motov	3414deb215	[DOCS] Mark snapshot status API as coming in 1.1.0	2014-03-24 21:55:19 -04:00
Kevin	1496b03458	Merge null_value for boolean field and remove include_in_all for boolean field in doc Close #5502	2014-03-24 11:00:57 +01:00
Kevin Wang	bfd3236378	Merge GeoPoint specific mapping properties Close #5505	2014-03-24 09:30:55 +01:00
Jun Ohtani	20e596cb86	fix typo joda-time link	2014-03-21 10:02:53 +01:00
Andrew Selden	89e45fde9c	Recovery API Adds a new API endpoint at /_recovery as well as to the Java API. The recovery API allows one to see the recovery status of all shards in the cluster. It will report on percent complete, recovery type, and which files are copied. Closes #4637	2014-03-20 10:13:30 -07:00
Alexander Reelsen	8f6e1d4720	Query Templates: Adding dedicated /_search/template endpoint In order to simplify query template execution an own endpoint has been added Closes #5353	2014-03-20 17:43:40 +01:00
uboness	7d6ad8d91c	Added extended_bounds support for date_/histogram aggs By default the date_/histogram returns all the buckets within the range of the data itself, that is, the documents with the smallest values (on which with histogram) will determine the min bucket (the bucket with the smallest key) and the documents with the highest values will determine the max bucket (the bucket with the highest key). Often, when when requesting empty buckets (min_doc_count : 0), this causes a confusion, specifically, when the data is also filtered. To understand why, let's look at an example: Lets say the you're filtering your request to get all docs from the last month, and in the date_histogram aggs you'd like to slice the data per day. You also specify min_doc_count:0 so that you'd still get empty buckets for those days to which no document belongs. By default, if the first document that fall in this last month also happen to fall on the first day of the second week of the month, the date_histogram will not return empty buckets for all those days prior to that second week. The reason for that is that by default the histogram aggregations only start building buckets when they encounter documents (hence, missing on all the days of the first week in our example). With extended_bounds, you now can "force" the histogram aggregations to start building buckets on a specific min values and also keep on building buckets up to a max value (even if there are no documents anymore). Using extended_bounds only makes sense when min_doc_count is 0 (the empty buckets will never be returned if the min_doc_count is greater than 0). Note that (as the name suggest) extended_bounds is not filtering buckets. Meaning, if the min bounds is higher than the values extracted from the documents, the documents will still dictate what the min bucket will be (and the same goes to the extended_bounds.max and the max bucket). For filtering buckets, one should nest the histogram agg under a range filter agg with the appropriate min/max. Closes #5224	2014-03-20 14:48:27 +01:00
Clinton Gormley	1fff379742	[DOCS] Documented the fact that binary fields are not stored by default	2014-03-20 12:43:43 +01:00
Florian Schilling	c0a092aa92	[Doc] Updated docs for distance scripting Updated docs for distance scripting and added missing geohash distance functions Closes #5397	2014-03-20 12:18:25 +01:00
Clinton Gormley	4c34615686	[DOCS] Fixed some bad UTF8	2014-03-19 12:46:06 +01:00
Clinton Gormley	1f497c6678	[DOCS] Updated Drupal integration	2014-03-19 11:49:39 +01:00
Shay Banon	0ef3b03be1	Move to use serial merge schedule by default Today, we use ConcurrentMergeScheduler, and this can be painful since it is concurrent on a shard level, with a max of 3 threads doing concurrent merges. If there are several shards being indexed, then there will be a minor explosion of threads trying to do merges, all being throttled by our merge throttling. Moving to serial merge scheduler will still maintain concurrency of merges across shards, as we have the merge thread pool that schedules those merges. It will just be a serial one on a specific shard. Also, on serial merge scheduler, we now have a limit of how many merges it will do at one go, so it will let other shards get their fair chance of merging. We use the pending merges on IW to check if merges are needed or not for it. Note, that if a merge is happening, it will not block due to a sync on the maybeMerge call at indexing (flush) time, since we wrap our merge scheduler with the EnabledMergeScheduler, where maybeMerge is not activated during indexing, only with explicit calls to IW#maybeMerge (see Merges). closes #5447	2014-03-18 13:17:00 +01:00
Igor Motov	a1192044f2	Add ability to get snapshot status for running snapshots Closes #4946	2014-03-17 20:13:49 -04:00
David Pilato	0805c01984	[DOCS] Add Azure storage repositories	2014-03-17 19:40:28 +01:00
markharwood	5f1d9af9fe	Documentation fix for significant_terms heading levels	2014-03-17 12:17:54 +00:00
Randy Stauner	1486188a3b	[DOCS] Reword clear-scroll sentence	2014-03-17 12:08:49 +01:00
lzhoucs	5a5171cb70	[DOCS] Fix typo in the reference doc. SuSe -> SUSE SUSE, as a Linux distribution, is never lower cased fixes #5354	2014-03-17 12:03:25 +01:00
Justin Etheredge	36219a1786	[DOCS] Updating scripting docs for geo functions Added a few functions are corrected the default unit where necessary	2014-03-17 11:59:02 +01:00
Boaz Leskes	ee8743f3f2	[Docs] added a missing reference to significantterms-aggergations Also fix header level mismatch issue reported by the build	2014-03-17 11:45:55 +01:00
David Pilato	f54e9246c1	Add _cat/plugins endpoint If we want to have a full picture of versions running in a cluster, we need to add a `_cat/plugins` endpoint. Response could look like: ```sh % curl es2:9200/_cat/plugins?v node component version type url desc es1 mapper-attachments 1.7.0 j Adds the attachment type allowing to parse difference attachment formats es1 lang-javascript 1.4.0 j JavaScript plugin allowing to add javascript scripting support es1 analysis-smartcn 1.9.0 j Smart Chinese analysis support es1 marvel 1.1.0 j/s http://localhost:9200/_plugins/marvel Elasticsearch Management & Monitoring es1 kopf 0.5.3 s http://localhost:9200/_plugins/kopf kopf - simple web administration tool for ElasticSearch es2 mapper-attachments 2.0.0.RC1 j Adds the attachment type allowing to parse difference attachment formats es2 lang-javascript 2.0.0.RC1 j JavaScript plugin allowing to add javascript scripting support es2 analysis-smartcn 2.0.0.RC1 j Smart Chinese analysis support ``` Closes #4824.	2014-03-16 12:16:09 +01:00
Clinton Gormley	fb934aff57	[DOCS] Documented gateway.local.auto_import_dangled Relates to #4996	2014-03-15 12:07:17 +01:00
rphadake	36a0cb99d7	[Doc] doc updates for date histogram interval Close #5308	2014-03-14 18:55:32 +01:00
Adrien Grand	65d3b61b97	Add an option to force _optimize operations. When forced, the index will be merged even if it contains a single segment with no deletions. Close #5243	2014-03-14 18:21:56 +01:00
Adrien Grand	eef71da650	[Doc] Add a chart about the relative error of the percentiles aggregation.	2014-03-14 12:23:23 +01:00
markharwood	767bef0596	Significant_terms aggregation identifies terms that are significant rather than merely popular in a set. Significance is related to the changes in document frequency observed between everyday use in the corpus and frequency observed in the result set. The asciidocs include extensive details on the applications of this feature. Closes #5146	2014-03-14 10:34:24 +00:00
Adrien Grand	5821fa042c	Cardinality aggregation. This aggregation computes unique term counts using the hyperloglog++ algorithm which uses linear counting to estimate low cardinalities and hyperloglog on higher cardinalities. Since this algorithm works on hashes, it is useful for high-cardinality fields to store the hash of values directly in the index, which is the purpose of the new `murmur3` field type. This is less necessary on low-cardinality string fields because the aggregator is smart enough to only compute the hash once per unique value per segment thanks to ordinals, or on numeric fields since hashing them is very fast. Close #5426	2014-03-13 19:19:56 +01:00
Florian Schilling	81e537bd5e	ContextSuggester ================ This commit extends the `CompletionSuggester` by context informations. In example such a context informations can be a simple string representing a category reducing the suggestions in order to this category. Three base implementations of these context informations have been setup in this commit. - a Category Context - a Geo Context All the mapping for these context informations are specified within a context field in the completion field that should use this kind of information.	2014-03-13 11:24:46 +01:00
Kurt Hurtado	ca6a2bb790	[DOCS] Various aggregation doc fixes	2014-03-13 09:05:25 +01:00
Mohsin Husen	9fcee312dc	[DOCS] Added spring data elasticsearch integration	2014-03-13 08:44:17 +01:00
Costin Leau	9624b215fb	Add docs for plugin isolation	2014-03-11 12:32:58 +02:00
Boaz Leskes	b7a95d11a7	Introduced VersionType.FORCE & VersionType.EXTERNAL_GTE Also added "external_gt" as an alias name for VersionType.EXTERNAL , accessible for the rest layer. Closes #4213 , Closes #2946	2014-03-10 21:07:17 +01:00
javanna	d5aaa90f34	[TEST] Randomized number of shards used for indices created during tests Introduced two levels of randomization for the number of shards (between 1 and 10) when running tests: 1) through the existing random index template, which now sets a random number of shards that is shared across all the indices created in the same test method unless overwritten 2) through `createIndex` and `prepareCreate` methods, similar to what happens using the `indexSettings` method, which changes for every `createIndex` or `prepareCreate` unless overwritten (overwrites index template for what concerns the number of shards) Added the following facilities to deal with the random number of shards: - `getNumShards` to retrieve the number of shards of a given existing index, useful when doing comparisons based on the number of shards and we can avoid specifying a static number. The method returns an object containing the number of primaries, number of replicas and the total number of shards for the existing index - added `assertFailures` that checks that a shard failure happened during a search request, either partial failure or total (all shards failed). Checks also the error code and the error message related to the failure. This is needed as without knowing the number of shards upfront, when simulating errors we can run into either partial (search returns partial results and failures) or total failures (search returns an error) - added common methods similar to `indexSettings`, to be used in combination with `createIndex` and `prepareCreate` method and explicitly control the second level of randomization: `numberOfShards`, `minimumNumberOfShards` and `maximumNumberOfShards`. Added also `numberOfReplicas` despite the number of replicas is not randomized (default not specified but can be overwritten by tests) Tests that specified the number of shards have been reviewed and the results follow: - removed number_of_shards in node settings, ignored anyway as it would be overwritten by both mechanisms above - remove specific number of shards when not needed - removed manual shards randomization where present, replaced with ordinary one that's now available - adapted tests that didn't need a specific number of shards to the new random behaviour - fixed a couple of test bugs (e.g. 3 levels parent child test could only work on a single shard as the routing key used for grand-children wasn't correct) - also done some cleanup, shared code through shard size facets and aggs tests and used common methods like `assertAcked`, `ensureGreen`, `refresh`, `flush` and `refreshAndFlush` where possible - made sure that `indexSettings()` is always used as a basis when using `prepareCreate` to inject specific settings - converted indexRandom(false, ...) + refresh to indexRandom(true, ...)	2014-03-10 13:01:52 +01:00
Simon Willnauer	fbb8c0fafa	[DOCS] Add `coming` tag to multiple rescores Closes #5365	2014-03-10 09:27:44 +01:00
Clinton Gormley	8383f271d1	[DOCS] Updated the Perl docs	2014-03-09 19:45:16 +01:00
Andrew Raines	2f48be597e	Display all available endpoints by default at /_cat Closes #5106	2014-03-07 13:21:43 -06:00
Konrad Feldmeier	d7b0d547d4	[DOCS] Multiple doc fixes Closes #5047	2014-03-07 14:24:58 +01:00
Benjamin Devèze	2affa5004f	Fix small typo in percentiles doc	2014-03-07 10:10:19 +01:00
Adrien Grand	f359b7f38b	[DOC] The percentiles aggregation is coming in 1.1.0.	2014-03-07 10:03:15 +01:00
Brusic	95274c18c5	Added support for char filters in the analyze API Closes #5148	2014-03-06 12:23:51 +01:00
James Brook	a93d6d55a5	Added support for aliases to index templates Adapted existing PR (#2739) to updated code (post #4920), added tests and docs (@javanna) Closes #1825	2014-03-06 11:11:07 +01:00
uboness	9d0fc76f54	Added support for sorting buckets based on sub aggregations Supports sorting on sub-aggs down the current hierarchy. This is supported as long as the aggregation in the specified order path are of a single-bucket type, where the last aggregation in the path points to either a single-bucket aggregation or a metrics one. If it's a single-bucket aggregation, the sort will be applied on the document count in the bucket (i.e. doc_count), and if it is a metrics type, the sort will be applied on the pointed out metric (in case of a single-metric aggregations, such as avg, the sort will be applied on the single metric value) NOTE: this commit adds a constraint on what should be considered a valid aggregation name. Aggregations names must be alpha-numeric and may contain '-' and '_'. Closes #5253	2014-03-06 00:05:27 +01:00
Igor Motov	b723ee0d20	[DOCS] Update boolean mapping docs with a full list of values that are treated as false Closes #5337	2014-03-05 15:33:59 -05:00
Clinton Gormley	98ecf80f07	[DOCS] Formatting error Closes #5346	2014-03-05 17:40:51 +01:00
Kevin	2c7a3a49c5	[DOCS] add Elasticsearch Image Plugin	2014-03-05 14:16:56 +01:00
Binh Ly	612e95a321	[DOCS] Java API JSON typo	2014-03-03 18:20:49 -05:00
Zachary Tong	7b16c5857d	Percentiles aggregation. A new metric aggregation that can compute approximate values of arbitrary percentiles. Close #5323	2014-03-03 18:06:14 +01:00
Martijn van Groningen	dcb590398d	[DOCS] Better document the limitation of nested objects.	2014-03-03 14:12:18 +01:00
Binh Ly	7e49848697	Clarify range aggregations	2014-02-28 14:38:57 -05:00
Clinton Gormley	53ce0e8e27	[DOCS] Fixed added[] tag version number	2014-02-28 15:29:43 +01:00
Lee Hinman	e53a43800e	Add `explain` flag support to the reroute API By specifying the `explain` flag, an explanation for the reason a command can or cannot be executed is returned. No allocation commands are actually performed. Returns a response similar to: { "state": {...cluster state...}, "acknowledged": true, "explanations" : [ { "command" : "cancel", "parameters" : { "index" : "decide", "shard" : 0, "node" : "IvpoKRdtRiGrQ_WKtt4_4w", "allow_primary" : false }, "decisions" : [ { "decider" : "cancel_allocation_command", "decision" : "YES", "explanation" : "..." } ] }, { "command" : "move", "parameters" : { "index" : "decide", "shard" : 0, "from_node" : "IvpoKRdtRiGrQ_WKtt4_4w", "to_node" : "IvpoKRdtRiGrQ_WKtt4_4w" }, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "shard cannot be allocated on same node [IvpoKRdtRiGrQ_WKtt4_4w] it already exists on" }, etc ] }] } also removes AllocationExplanation from cluster state Closes #2483 Closes #5169	2014-02-27 09:48:51 -07:00
Simon Willnauer	9160516b28	Expose `filler_token` via ShingleTokenFilterFactory Lucene 4.7 supports a setter for the `filler_token` that is inserted if there are gaps in the token stream. This change exposes this setting. Closes #4307	2014-02-26 22:21:10 +01:00
Martijn van Groningen	1441fec068	[DOCS] Updated memory considerations for p/c queries and filters.	2014-02-26 22:16:51 +01:00
Simon Willnauer	90e57c15e8	[DOCS]: fixed small problem in example json	2014-02-26 16:40:04 +01:00
Clinton Gormley	03ad168b24	[DOCS] Added note about dely in clearing filter cache. Closes #5231	2014-02-24 11:36:22 +01:00
hura	818f8c0e2b	[DOCS] Fix wrong explanation in configuration.asciidoc Replaced network.host with node.name to match config file	2014-02-24 11:29:50 +01:00
Luca Cavanna	4e6610a798	Fixed multi term queries support in postings highlighter for non top-level queries In #4052 we added support for highlighting multi term queries using the postings highlighter. That worked only for top-level queries though, and not for multi term queries that are nested for instance within a bool query, or filtered query, or a constant score query. The way we make this work is by walking the query structure and temporarily overriding the query rewrite method with a method that allows for multi terms extraction. Closes #5102	2014-02-21 21:43:40 +01:00
Adrien Grand	edb854d952	Document the indices segments response format.	2014-02-21 12:01:32 +01:00
Lee Hinman	8f8cc7205d	Add "locale" parameter to query_string and simple_query_string Fixes #5128 Remove java 7 specific Locale functions, add "coming[1.1.0]" to documentation add LocaleUtils utility class for dealing with Locale functions	2014-02-20 15:53:08 -07:00
Martijn van Groningen	a81a4a5efe	[DOCS] Included the `_percolator` index breaking change to migration docs.	2014-02-20 16:43:06 +01:00
Isabel Drost-Fromm	48004ff8a5	Add mustache templating to query execution. Adds support for storing mustache based query templates that can later be filled with query parameter values at execution time. Templates may be both quoted, non-quoted and referencing templates stored in config/scripts/*.mustache by file name. See docs/reference/query-dsl/queries/template-query.asciidoc for templating examples. Implementation detail: mustache itself is being shaded as it depends directly on guava - so having it marked optional but included in the final distribution raises chances of version conflicts downstream. Fixes #4879	2014-02-20 12:21:59 +01:00
javanna	419db6ee12	[DOCS] Fixed typo in create index api	2014-02-19 17:49:38 +01:00
Boaz Leskes	e379f419e6	[DOCS] Remove clear flag from node-stats as it is not used anymore	2014-02-17 15:20:12 +01:00
Luca Cavanna	3afdf4a872	Added support for aliases to create index api It is now possible to specify aliases during index creation: curl -XPUT 'http://localhost:9200/test' -d ' { "aliases" : { "alias1" : {}, "alias2" : { "filter" : { "term" : {"field":"value"}} } } }' Closes #4920	2014-02-17 14:54:21 +01:00
Britta Weber	db3c6c2a8e	Enable percolation for nested documents closes #5082	2014-02-14 22:42:33 +01:00
Lee Hinman	c97bcc3602	Add support for `lowercase_expanded_terms` flag to simple_query_string Default the flag to true, making simple_query_string behave similarly to query_string Fixes #5008	2014-02-14 11:51:23 -07:00
Nik Everett	5c3f4ceafb	Add preserve original token option to ASCIIFolding Closes #4931	2014-02-14 19:37:00 +01:00
Luca Cavanna	6abd0a76bd	[DOCS] improved get docs - added _version to response - exists call use -XHEAD with -i flag to include headers in the output	2014-02-14 13:11:10 +01:00
Lars Francke	2a765415c8	Update get.asciidoc Minor improvements. curl -XHEAD doesn't actually print anything so I've changed to use -I which actually prints the headers received.	2014-02-14 13:11:10 +01:00
Brian Yoder	41dba68bda	Added the `DistanceUnit.NAUTICALMILES` enumeration label with the corresponding NM and nmi unit suffixes. Update the docs to match. Closes #5085	2014-02-14 19:48:58 +09:00
uboness	d335630e57	[docs] fixed errors in aggs docs - error in nested aggs example - error in terms aggs example	2014-02-13 20:36:02 +01:00
Oleg Anashkin	eb0e1aa38f	Fix typo in similarity docs DRF similarity -> DFR similarity	2014-02-13 07:45:30 -08:00
Luca Cavanna	179750f0f5	[DOCS] fixed count docs, it now requires a top-level query object, same as other apis Relates to #4074	2014-02-13 13:36:20 +01:00
Luca Cavanna	9902f04033	[DOCS] rephrased delete by query docs	2014-02-13 11:44:51 +01:00
Luca Cavanna	01abea5945	[DOCS] fixed count and validate query docs, they now require a top-level query object, same as other apis Relates to #4074 Closes #5111	2014-02-13 11:42:04 +01:00
Kevin	5d01aac87e	add elasticsearch-osem to integrations page	2014-02-13 11:02:36 +01:00
Kevin	99942089a8	[DOCS] add DynamoDB river plugin	2014-02-13 10:38:04 +01:00
James Yu	699fe5e929	fixed markup and typo	2014-02-13 10:33:15 +01:00
Kevin	1075b9ae33	[DOCS] should use setPostFilter instead of setFilter	2014-02-13 14:28:00 +11:00
Clinton Gormley	80c7619591	[DOCS] Changed coming[] to added[] for 1.0.0*	2014-02-12 17:17:25 +02:00

... 2 3 4 5 6 ...

699 Commits