OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	437488ae64	Remove the MissingQueryBuilder which was deprecated in 2.2.0. As a replacement use ExistsQueryBuilder inside a mustNot() clause. So instead of using `new ExistsQueryBuilder(name)` now use: `new BoolQueryBuilder().mustNot(new ExistsQueryBuilder(name))`. Closes #14112	2015-12-10 12:55:05 +01:00
Johnny Lim	a66be6cfb9	Polish doc Closes #15011	2015-11-28 18:25:42 +01:00
Christoph Büscher	71e734647b	Docs: `exists` query does not need to be wrapped inside `constant_score` In the example we show an `exists` query inside a constant score query. While this is possible, it can mislead users to think it is necessary so we should remove it.	2015-11-20 18:08:51 +01:00
Isabel Drost-Fromm	a656f90a72	Revert back to type instead of child_type... for has child queries. Relates to #14249	2015-11-19 14:47:01 +01:00
Nicholas Knize	0d349854d3	Add CONTAINS relation to geo_shape query At the time of geo_shape query conception, CONTAINS was not yet a supported spatial operation in Lucene. Since it is now available this commit adds ShapeRelation.CONTAINS to GeoShapeQuery. Randomized testing is included and documentation is updated.	2015-11-18 14:15:35 -06:00
Clinton Gormley	685a0462d8	Merge pull request #13894 from dmland/patch-1 Update query-string-syntax.asciidoc	2015-11-18 15:50:30 +01:00
Christoph Büscher	ecbc7774ca	Docs: Add query to Query DSL documentation We already introduced the MatchNoneQueryBuilder query that does not return any documents, mainly because we needed it for internal representation of the NONE option in the IndicesQueryBuilder. However, the query was requested at least once also for the query dsl, and since we can parser it already we should document it as `match_none` query in the relevant reference docs as well.	2015-11-18 14:39:28 +01:00
Kopylov German	951ddcf4ea	Fix missed comma in bool query	2015-11-18 03:02:52 +03:00
Isabel Drost-Fromm	7698ab7bfc	Fix typos in query dsl docs. When passing the example json snippets through the query parser while working on #14249 some of the examples could not be parsed. This PR fixes those examples. Relates to #14249	2015-11-17 13:40:10 +01:00
javanna	6076ccb7b2	Remove support for filter element in nested query Replaced by query.	2015-10-28 10:50:29 +01:00
javanna	f5fb669a46	[DOCS] remove tip on indices query elements order Since we parse queries on the coordinating node, the order of the elements doesn't count anymore.	2015-10-28 10:50:28 +01:00
Christoph Büscher	ca57763d2c	Update indices-query.asciidoc	2015-10-23 22:55:40 +02:00
Christoph Büscher	0362550fd7	Update indices-query.asciidoc The current description of the query seemed confusing to some people (e.g. https://twitter.com/wielinde/status/654582620630687744), this is just a proposal to simplify it.	2015-10-23 21:14:52 +02:00
debadair	69acde33c2	Fixed broken xrefs to query-dsl-not-query, which has been removed.	2015-10-20 13:01:37 -07:00
Christoph Büscher	5d25bc30cd	Query DSL: Remove NotQueryBuilder The NotQueryBuilder has been deprecated on the 2.x branches and can be removed with the next major version. It can be replaced by boolean query with added mustNot() clause. Closes #13761	2015-10-20 19:43:16 +02:00
bharvi	fd412551e4	Changed boost keyword with bool in bounding box query	2015-10-18 15:12:47 +05:30
bharvi	00db388722	The boost keyword changed to bool	2015-10-18 13:45:10 +05:30
Clinton Gormley	14b6314766	Update fuzzy-query.asciidoc Closes #14092	2015-10-13 21:03:46 +02:00
Clinton Gormley	dc018cf622	Updated docs for 3.0.0-beta	2015-10-07 13:27:46 +02:00
javanna	3d3197342d	Merge branch 'master' into merge/query-refactoring	2015-09-25 14:38:53 +02:00
André Carvalho	070c836dce	Gives more emphasys	2015-09-24 11:30:00 -03:00
André Carvalho	d90fc8f7fd	Documents that now is not affected by time_zone in range queries	2015-09-23 22:00:49 -03:00
javanna	73f7df510e	Merge branch 'master' into feature/query-refactoring Conflicts: core/src/main/java/org/elasticsearch/index/query/HasChildQueryBuilder.java core/src/main/java/org/elasticsearch/index/query/HasChildQueryParser.java core/src/main/java/org/elasticsearch/index/query/HasParentQueryBuilder.java core/src/main/java/org/elasticsearch/index/query/HasParentQueryParser.java core/src/main/java/org/elasticsearch/index/query/functionscore/FunctionScoreQueryBuilder.java core/src/main/java/org/elasticsearch/index/query/functionscore/FunctionScoreQueryParser.java core/src/main/java/org/elasticsearch/index/query/functionscore/factor/FactorParser.java core/src/test/java/org/elasticsearch/ExceptionSerializationTests.java core/src/test/java/org/elasticsearch/benchmark/search/child/ChildSearchBenchmark.java core/src/test/java/org/elasticsearch/benchmark/search/child/ChildSearchShortCircuitBenchmark.java core/src/test/java/org/elasticsearch/index/query/SimpleIndexQueryParserTests.java core/src/test/java/org/elasticsearch/percolator/PercolatorIT.java core/src/test/java/org/elasticsearch/search/child/ChildQuerySearchIT.java docs/reference/query-dsl/has-parent-query.asciidoc	2015-09-11 14:15:12 +02:00
Adrien Grand	86f1b07df0	Docs: Remove docs for the `filtered`, `and`, `or` and `(f)query` queries.	2015-09-11 11:00:54 +02:00
Martijn van Groningen	ab0847e0df	parent/child: several cleanups * Dropped ScoreType in favour of Lucene's ScoreMode * Removed `score_type` option from `has_child` and `has_parent` queries in favour for the already existing `score_mode` option. * Removed the score mode `sum` in favour for the already existing `total` score mode. (`sum` doesn't exist in Lucene's ScoreMode class) * If `max_children` is set to `0` it now really means that zero children are allowed to match.	2015-09-10 17:15:41 +02:00
Simon Willnauer	d2e53e0e0c	Split HasParentQueryParser into toQuery and formXContent This commit splits HasParentQueryParser into toQuery and fromXContent. This change also deprecates several keys in favor of simplified settings and adds basic unittests for HasParentQueryParser. Relates to #10217	2015-09-09 12:06:44 +02:00
Clinton Gormley	0b5a027d6a	Docs: Fixed bad ID in geo bound box	2015-08-18 12:20:00 +02:00
Clinton Gormley	d13078546a	Docs: Fixed malforme table in geo-polygon query	2015-08-18 12:16:49 +02:00
Nicholas Knize	b2ba3847f7	Refactor geo_point validate* and normalize* options to ignore_malformed and coerce* For consistency geo_point mapper's validate and normalize options are converted to ignore_malformed and coerced	2015-08-17 14:46:23 -05:00
Clinton Gormley	c6c3a40cb6	Docs: Updated annotations for 2.0.0-beta1	2015-08-14 10:51:09 +02:00
Adrien Grand	00093a21dc	Merge pull request #12294 from jpountz/fix/multi_match_boost `multi_match` query applies boosts too many times.	2015-08-06 19:07:44 +02:00
Adrien Grand	8d5fff37ae	`multi_match` query applies boosts too many times. The `multi_match` query groups terms that have the same analyzer together and then applies the boost of the first query in each group. This is not necessary given that boosts for each term are already applied another way.	2015-08-06 19:07:12 +02:00
Clinton Gormley	ac2b8951c6	Docs: Mapping docs completely rewritten for 2.0	2015-08-06 17:24:51 +02:00
Clinton Gormley	0b0846f84b	Updated multi-match-query.asciidoc Corrected note about which field is boosted in a cross-fields multi_match query. Relates to #12294	2015-08-05 10:52:56 +02:00
Ryan Ernst	f4ba5ac6b8	Merge pull request #12357 from rjernst/fix/12317 Update time_zone specification	2015-07-21 00:27:27 -07:00
Ryan Ernst	dba42a83e2	Docs: Update time_zone specification closes #12317	2015-07-21 00:22:53 -07:00
Clinton Gormley	ecf801692e	Docs: Fixed the date math expression on filtered query	2015-07-18 14:13:26 +02:00
Clinton Gormley	bb4c4b55d2	Merge pull request #12264 from peschlowp/patch-6 Update match-query.asciidoc	2015-07-15 13:22:23 +02:00
Clinton Gormley	2b512f1f29	Docs: Use "js" instead of "json" and "sh" instead of "shell" for source highlighting	2015-07-14 18:14:09 +02:00
Jakob Reiter	daa5a8da24	Fixed typos in examples on common-terms-query.asciidoc. JSON was invalid before	2015-07-08 17:47:04 +02:00
Adrien Grand	da5fa6c4f3	Minor fixes to the `match` query. Fixed documentation since the default rewrite method for fuzzy queries is to select top terms, fixed usage of the fuzzy rewrite method, and removed unused `rewrite` parameter. Close #6932	2015-07-08 16:51:41 +02:00
Adrien Grand	8238f497d8	Expose Lucene's new TopTermsBlendedFreqScoringRewrite. This rewrite method is interesting because it computes scores as if all terms had the same frequencies, which avoids disappointments with ranking when a fuzzy query ranks typos first given that they are less frequent than the correct term.	2015-07-08 16:01:47 +02:00
Christoph Büscher	1548e845e4	Update filtered-query.asciidoc	2015-07-06 15:00:13 +02:00
Clinton Gormley	1e5e4439ce	Merge pull request #11983 from cdosborn/patch-1 Grammar fix to match query docs	2015-07-05 18:24:06 +02:00
Martijn van Groningen	47a43e4063	nested query: Added `min` score mode. This score mode was added with the Lucene 5.2 release, but the `nested` query parser hasn't been changed to use it.	2015-06-29 12:26:30 +02:00
Clinton Gormley	765ac45168	Docs: Tidied up function score query docs Closes #5991	2015-06-26 17:31:32 +02:00
Christoph Büscher	f5f73259e4	Docs: Update Joda URLs in documentation.	2015-06-26 10:23:02 +02:00
Igor Motov	d32443bfb5	Docs: add description of the analyze_wildcard parameter to the simple query string query docs	2015-06-22 18:26:31 -04:00
Alex Ksikes	3f6dae1a73	More Like This: renamed `ignore_like` to `unlike` This changes the parameter name `ignore_like` to the more user friendly name `unlike`. This later feature generates a query from the terms in `A` but not from the terms in `B`. This translates to a result set which is like `A` but unlike `B`. We could have further negatively boosted any documents that have some `B`, but these documents already do not receive any contribution from having `B`, and would therefore negatively compete with documents having `A`. Closes #11117	2015-06-17 17:18:50 -05:00
Clinton Gormley	6e71f60b82	Update bool-query.asciidoc Emphasise section about using bool query in filter context	2015-06-10 21:46:23 +02:00
Clinton Gormley	a138f627be	Docs: removed the unused query_dsl/index.asciidoc	2015-06-04 19:31:28 +02:00
Boaz Leskes	708320446e	Doc: Minor typo fix in query_filter_context.asciidoc	2015-06-04 15:42:55 +02:00
Clinton Gormley	f85a17ff1a	Docs: Fixed heading level for in query DSL docs	2015-06-04 13:16:32 +02:00
Clinton Gormley	171687d207	Docs: Reorganised the Query DSL docs into families and explaing query vs filter context	2015-06-04 01:59:37 +02:00
Martijn van Groningen	1cfb6a79f1	Parent/child: refactored _parent field mapper and parent/child queries * Cut the `has_child` and `has_parent` queries over to use Lucene's query time global ordinal join. The main benefit of this change is that parent/child queries can now efficiently execute if parent/child queries are wrapped in a bigger boolean query. If the rest of the query only hit a few documents both has_child and has_parent queries don't need to evaluate all parent or child documents any more. * Cut the `_parent` field over to use doc values. This significantly reduces the on heap memory footprint of parent/child, because the parent id values are never loaded into memory. Breaking changes: * The `type` option on the `_parent` field can only point to a parent type that doesn't exist yet, so this means that an existing type/mapping can't become a parent type any longer. * The `has_child` and `has_parent` queries can no longer be use in alias filters. All these changes, improvements and breaks in compatibility only apply for indices created with ES version 2.0 or higher. For indices creates with ES <= 2.0 the older implementation is used. It is highly recommended to re-index all your indices with parent and child documents to benefit from all the improvements that come with this refactoring. The easiest way to achieve this is by using the scan and bulk apis using a simple script. Closes #6107 Closes #8134	2015-05-29 21:44:17 +02:00
Colin Goodheart-Smithe	35a58d874e	Scripting: Unify script and template requests across codebase This change unifies the way scripts and templates are specified for all instances in the codebase. It builds on the Script class added previously and adds request building and parsing support as well as the ability to transfer script objects between nodes. It also adds a Template class which aims to provide the same functionality for template APIs Closes #11091	2015-05-29 16:52:04 +01:00
Clinton Gormley	6171ae6cc4	Docs: Added stub entries for pages deleted from 1.x	2015-05-24 17:57:34 +02:00
Martijn van Groningen	ece18f162e	Removed `id_cache` from stats and cat apis. Also removed the `id_cache` option from the clear cache api. Closes #5269	2015-05-15 14:06:18 +02:00
Adrien Grand	630757906a	Query DSL: Add `filter` clauses to `bool` queries. These clauses filter the document space without affecting scoring and map to Lucene's BooleanClause.Occur.FILTER. The `filtered` query is now deprecated and ```json { "filtered": { "query": { //query }, "filter": { //filter } } } ``` should be replaced with ```json { "bool": { "must": { //query }, "filter": { //filter } } } ```	2015-05-13 12:04:56 +02:00
Martijn van Groningen	acdd9a5dd9	parent/child: Removed the `top_children` query.	2015-05-10 16:30:19 +02:00
Clinton Gormley	a536bd5f81	Docs: Rewrote the term query docs to explain analyzed vs not_analyzed	2015-05-08 08:32:13 +02:00
Ryan Ernst	e29492ce94	Docs: Cleanup meta field docs Meta fields were locked down to not allow exotic options to the underlying field types in #8143. This change fixes the docs to no longer refer to the old settings. closes #10879	2015-05-07 11:26:49 -07:00
Adrien Grand	a0af88e996	Query DSL: Remove filter parsers. This commit makes queries and filters parsed the same way using the QueryParser abstraction. This allowed to remove duplicate code that we had for similar queries/filters such as `range`, `prefix` or `term`.	2015-05-07 20:14:34 +02:00
Martijn van Groningen	f7c29457d0	parent/child: Deprecated the `top_children` in favour of the `has_child` query.	2015-05-07 09:27:54 +02:00
Pascal Borreli	af6d890ad5	Docs: Fixed typos Closes #10973	2015-05-05 10:38:05 +02:00
Clément Salaün	c0659ce4d4	Docs: Update geo-distance-range-filter.asciidoc missing comma Closes #10957	2015-05-04 17:17:48 +02:00
Adrien Grand	b72f27a410	Core: Cut over to the Lucene filter cache. This removes Elasticsearch's filter cache and uses Lucene's instead. It has some implications: - custom cache keys (`_cache_key`) are unsupported - decisions are made internally and can't be overridden by users ('_cache`) - not only filters can be cached but also all queries that do not need scores - parent/child queries can now be cached, however cached entries are only valid for the current top-level reader so in practice it will likely only be used on read-only indices - the cache deduplicates filters, which plays nicer with large keys (eg. `terms`) - better stats: we already had ram usage and evictions, but now also hit count, miss count, lookup count, number of cached doc id sets and current number of doc id sets in the cache - dynamically changing the filter cache size is not supported anymore Internally, an important change is that it removes the NoCacheFilter infrastructure in favour of making Query.rewrite specializing the query for the current reader so that it will only be cached on this reader (look for IndexCacheableQuery). Note that consuming filters with the query API (createWeight/scorer) instead of the filter API (getDocIdSet) is important for parent/child queries because otherwise a QueryWrapperFilter(ParentQuery) would run the wrapped query per segment while relations might be cross segments.	2015-05-04 09:02:15 +02:00
Robert Muir	dfe1d1463c	fix doc typo	2015-04-30 23:46:37 -04:00
Robert Muir	aade6194b7	Add span within/containing queries. Expose new span queries from https://issues.apache.org/jira/browse/LUCENE-6083 Within returns matches from 'little' that are enclosed inside of a match from 'big'. Containing returns matches from 'big' that enclose matches from 'little'.	2015-04-30 23:31:31 -04:00
Nik Everett	cb89a14010	Add default to field_value_factor field_value_factor now takes a default that is used if the document doesn't have a value for that field. It looks like: "field_value_factor": { "field": "popularity", "missing": 1 } Closes #10841	2015-04-28 11:06:24 -04:00
navins	84636557e1	Docs: correct three mis-match of brackets Closes #10806	2015-04-26 19:43:14 +02:00
Christine	9e81e4c09b	Docs: Update bool-filter.asciidoc from, to deprecated in favour of gt, lt Closes #10682	2015-04-26 19:23:11 +02:00
Mark Mulder	690c16e81a	Docs: Fix minor spelling mistakes in Match Query doc Closes #10751	2015-04-26 16:29:41 +02:00
Oliver Eilhard	95e9b86505	Mustache tags syntax Hi there. I've been experimenting with the search templates recently and I'm a bit confused. Shouldn't the Mustache tags be written like `{{tagname}}` instead of `{tagname}`? Your using `{{...}}` [here](http://www.elastic.co/guide/en/elasticsearch/reference/current/search-template.html) BTW. Using the first example in that page seems to indicate that something's wrong, or am I missing something? ``` $ curl 'localhost:9200/test/_search' -d '{"query":{"template":{"query":{"match":{"text":"{keywords}"}},"params":{"keywords":"value1_foo"}}}}' {"took":1,"timed_out":false,"_shards":{"total":1,"successful":1,"failed":0},"hits":{"total":0,"max_score":null,"hits":[]}} $ curl 'localhost:9200/test/_search' -d '{"query":{"template":{"query":{"match":{"text":"{{keywords}}"}},"params":{"keywords":"value1_foo"}}}}' {"took":1,"timed_out":false,"_shards":{"total":1,"successful":1,"failed":0},"hits":{"total":1,"max_score":1.0,"hits":[{"_index":"test","_type":"testtype","_id":"1","_score":1.0,"_source":{"text":"value1_foo"}}]}} ```	2015-04-24 21:23:58 +02:00
Adrien Grand	d7abb12100	Replace deprecated filters with equivalent queries. In Lucene 5.1 lots of filters got deprecated in favour of equivalent queries. Additionally, random-access to filters is now replaced with approximations on scorers. This commit - replaces the deprecated NumericRangeFilter, PrefixFilter, TermFilter and TermsFilter with NumericRangeQuery, PrefixQuery, TermQuery and TermsQuery, wrapped in a QueryWrapperFilter - replaces XBooleanFilter, AndFilter and OrFilter with a BooleanQuery in a QueryWrapperFilter - removes DocIdSets.isBroken: the new two-phase iteration API will now help execute slow filters efficiently - replaces FilterCachingPolicy with QueryCachingPolicy Close #8960	2015-04-21 15:32:43 +02:00
Alex Ksikes	d339ee4005	Term Vectors: terms filtering This adds a new feature to the Term Vectors API which allows for filtering of terms based on their tf-idf scores. With `dfs` option on, this could be useful for finding out a good characteric vector of a document or a set of documents. The parameters are similar to the ones used in the MLT Query. Closes #9561	2015-04-14 19:11:09 +02:00
Benoit Delbosc	1b35854768	Docs: Fix simple_query_string example The "&" is not part of the simple_query_string DSL Closes #10563	2015-04-13 14:46:47 +02:00
Adrien Grand	ab8926bc6a	Docs: fix build.	2015-04-10 17:38:36 +02:00
Adrien Grand	5b3cc2f07c	Search: deprecate the limit filter. This is really a Collector instead of a filter. This commit deprecates the `limit` filter, makes it a no-op and recommends to use the `terminate_after` parameter instead that we introduced in the meantime.	2015-04-10 17:18:50 +02:00
Adrien Grand	919589b908	Queries: Remove fuzzy-like-this support. The fuzzy-like-this query builds very expensive queries and only serves esoteric use-cases.	2015-04-10 17:16:02 +02:00
Clinton Gormley	abc7de96ae	Docs: Updated version annotations in master	2015-04-09 14:50:11 +02:00
Isabel Drost-Fromm	60bb65c4d9	Docs: Note on shard vs. index level doc frequencies. Relates to #10154 and #10150 Adds link to additional information on how document frequencies are treated across shards to the cutoff_frequency parameter documentation. Closes #10451	2015-04-07 14:28:01 +02:00
joelbourbon	3c52bc1098	Docs: Missing 1 escape character in example Closes #10446	2015-04-07 14:10:17 +02:00
Patrick Peschlow	a9af488bb3	Update prefix-filter.asciidoc text said phrase instead of prefix, probably due to copy-paste	2015-03-31 09:25:15 +02:00
Lee Hinman	6aec68cd29	Revert "[QUERY] Remove lowercase_expanded_terms and locale options" This reverts commit `d1f7bd97cb`. Ryan pointed out that this needs to work with the multi term query, so additional analysis and tests should be added.	2015-03-13 13:51:44 -06:00
Lee Hinman	d1f7bd97cb	[QUERY] Remove lowercase_expanded_terms and locale options The analysis chain should be used instead of relying on this, as it is confusing when dealing with different per-field analysers. The `locale` option was only used for `lowercase_expanded_terms`, which, once removed, is no longer needed, so it was removed as well. Fixes #9978 Relates to #9973	2015-03-13 13:17:27 -06:00
Clinton Gormley	3f9d4f9635	Update query-string-syntax.asciidoc Closes #9965	2015-03-03 20:03:51 +01:00
cgp	b1e6df3b6c	Update span-multi-term-query.asciidoc Added comma - there is no "term range" query Closes #9855	2015-02-28 03:05:05 +01:00
Lee Hinman	2e9ea4abaf	Add support for `minimum_should_match` to `simple_query_string` This behaves similar to the way that `minimum_should_match` works for the `match` query (in fact it is implemented in the exact same way) Fixes #6449	2015-02-25 11:35:33 -07:00
Michael Sander	fd6c6058ce	Remove Triple Negative! Double negatives are confusing, but a triple negative (1 no, 2 non, 3 null)? It takes five minutes to understand this little sentence. Cleaned that up a bit. Closes #9789	2015-02-23 20:09:05 +01:00
Blake Niemyjski	8cba6c3abb	Fixed an invalid query Closes #9682	2015-02-13 21:11:42 +01:00
Clinton Gormley	856b0fa1a0	Docs: Fixed explanation of how the query string query is rewritten	2015-02-12 12:46:44 +01:00
Alfredo Serafini	e607e53591	Update span-multi-term-query.asciidoc added wildcard to the list of possible nested queries Closes #9586	2015-02-09 16:01:46 +01:00
Adam	928ea82188	Docs: Updated documentation for query-string-syntax to include '>' '<' and '=' as reserved characters Closes #9518	2015-02-04 17:55:15 +01:00
Clinton Gormley	8978aa5465	Docs: Improved the template query docs Added the `file` and `id` parameters. Closes #9458	2015-01-28 14:19:59 +01:00
Alex Ksikes	615513ee9b	Docs: clearer MLT documentation Closes #9351	2015-01-20 16:42:39 +01:00
Adrien Grand	bc86796592	Core: Remove terms filter cache. This is our only cache which is not 'exact' and might allow for stalled results. Additionally, a similar cache that we have and needs to perform lookups in other indices in order to run queries is the script index, and for this index we rely on the filesystem cache, so we should probably do the same with terms filters lookups. Close #9056	2015-01-06 17:21:20 +01:00
Clinton Gormley	f83909f7ae	Docs: The regexp query defaults to the `ALL` flag, and removed the `AUTOMATON` flag which is not used in Elasticsearch. Closes #6180	2014-12-30 19:53:31 +01:00
Adrien Grand	fb6c3b7c29	[Docs] Improve documentation of the new caching policy for filters.	2014-12-22 17:14:47 +01:00
Adrien Grand	ce11e0ee6d	Filter cache: add a `_cache: auto` option and make it the default. Up to now, all filters could be cached using the `_cache` flag that could be set to `true` or `false` and the default was set depending on the type of the `filter`. For instance, `script` filters are not cached by default while `terms` are. For some filters, the default is more complicated and eg. date range filters are cached unless they use `now` in a non-rounded fashion. This commit adds a 3rd option called `auto`, which becomes the default for all filters. So for all filters a cache wrapper will be returned, and the decision will be made at caching time, per-segment. Here is the default logic: - if there is already a cache entry for this filter in the current segment, then return the cache entry. - else if the doc id set cannot iterate (eg. script filter) then do not cache. - else if the doc id set is already cacheable and it has been used twice or more in the last 1000 filters then cache it. - else if the filter is costly (eg. multi-term) and has been used twice or more in the last 1000 filters then cache it. - else if the doc id set is not cacheable and it has been used 5 times or more in the last 1000 filters, then load it into a cacheable set and cache it. - else return the uncached set. So for instance geo-distance filters and script filters are going to use this new default and are not going to be cached because of their iterators. Similarly, date range filters are going to use this default all the time, but it is very unlikely that those that use `now` in a not rounded fashion will get reused so in practice they won't be cached. `terms`, `range`, ... filters produce cacheable doc id sets with good iterators so they will be cached as soon as they have been used twice. Filters that don't produce cacheable doc id sets such as the `term` filter will need to be used 5 times before being cached. This ensures that we don't spend CPU iterating over all documents matching such filters unless we have good evidence of reuse. One last interesting point about this change is that it also applies to compound filters. So if you keep on repeating the same `bool` filter with the same underlying clauses, it will be cached on its own while up to now it used to never be cached by default. `_cache: true` has been changed to only cache on large segments, in order to not pollute the cache since small segments should not be the bottleneck anyway. However `_cache: false` still has the same semantics. Close #8449	2014-12-18 15:51:36 +01:00
Timothy Perisho	ceafde41e9	Docs: typo on "frequent" I replaced "high frequent terms" with "high frequency terms" and "low frequent terms" with "low frequency terms". Alternatively, we could write, "highly frequent terms" and "minimally frequent terms" (or just "rare terms"). Closes #8962	2014-12-15 19:59:50 +01:00
John Michael Luy	01ef80a33d	Update range-filter.asciidoc Closes #8741	2014-12-02 18:00:38 +01:00
John Michael Luy	f20f6ffe22	Docs: Update range-query.asciidoc Closes #8740	2014-12-02 12:55:44 +01:00
Alex Ksikes	256712640f	MLT Query: Support for ignore docs Adds a `ignore_like` parameter to the MLT Query, which simply tells the algorithm to skip all the terms from the given documents. This could be useful in order to better guide nearest neighbor search by telling the algorithm to never explore the space spanned by the given `ignore_like` docs. In essence we are interested about the characteristic of a given item, but not of the ones provided by `ignore_like`, thereby forcing the algorithm to go deeper in its selection of terms. Note that this is different than simply performing a must not boolean query on the unliked items. The syntax is exactly the same as the `like` parameter. Closes #8674	2014-11-28 14:48:43 +01:00
pmamat	9e2eaeece4	Docs: Additional info about _score calculation Description taken from http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/multi-query-strings.html / 110_Multi_Field_Search/05_Multiple_query_strings.asciidoc Closes #8635	2014-11-28 13:54:45 +01:00
Britta Weber	59507cf793	function_score: match only document with score above custom score threshold functon_score matched each document regardless of the computed score. This commit adds a query parameter `min_score` (-Float.MAX_VALUE default). Documents that have a score lower than this threshold will not be mached. closes #6952	2014-11-28 12:35:26 +01:00
Britta Weber	f00b431c18	[docs] explain default settings for parameters of decay functions relates to #8624	2014-11-27 19:18:55 +01:00
Martijn van Groningen	1d7cdd7d22	Applied PR, changed the way defaults are handled and updated the docs. Closes #4452	2014-11-24 13:32:41 +01:00
dw	bb81055c33	Docs: Remove reference to imaginary "no_docs_query" No reference to it in the source code except this file. Closes #8566	2014-11-23 13:56:33 +01:00
Michael McCandless	8aebb9656b	Core: add max_determinized_states to query_string and regexp query/filter This prevents too-difficult regular expressions from consuming excessive RAM/CPU; the default max_determinized_states is 10,000 (same as Lucene) but query_string and regepx query/filter can override per-request. The also upgrades to a new Lucene 5.0.0 snapshot. Closes #8386 Closes #8357	2014-11-10 13:43:48 -05:00
Clinton Gormley	cff544dcc2	Docs: Removed old coming/added tags	2014-11-10 14:41:24 +01:00
Britta Weber	c5a4c1d6b4	[docs] add 2d vis for decay functions and parameters closes #8420	2014-11-10 10:56:41 +01:00
Veres Lajos	4059e4ac86	typo fixes - https://github.com/vlajos/misspell_fixer Closes #8323	2014-11-08 18:55:57 +01:00
Clinton Gormley	b9149f836b	Docs: Improve the exists/missing filters documentation Closes #7274	2014-11-08 16:57:41 +01:00
Robert Muir	610ce078fb	Upgrade master to lucene 5.0 snapshot This has a lot of improvements in lucene, particularly around memory usage, merging, safety, compressed bitsets, etc. On the elasticsearch side, summary of the larger changes: API changes: postings API became a "pull" rather than "push", collector API became per-segment, etc. packaging changes: add lucene-backwards-codecs.jar as a dependency. improvements to boolean filtering: especially ensuring it will not be slow for SparseBitSet. use generic BitSet api in plumbing so that concrete bitset type is an implementation detail. use generic BitDocIdSetFilter api for dedicated bitset cache, so there is type safety. changes to support atomic commits implement Accountable.getChildResources (detailed memory usage API) for fielddata, etc change handling of IndexFormatTooOld/New, since they no longer extends CorruptIndexException Closes #8347. Squashed commit of the following: commit d90d53f5f21b876efc1e09cbd6d63c538a16cd89 Author: Simon Willnauer <simonw@apache.org> Date: Wed Nov 5 21:35:28 2014 +0100 Make default codec/postings/docvalues format constants commit cb66c22c71cd304a36e7371b199a8c279908ae37 Merge: d4e2f6d `ad4ff43` Author: Robert Muir <rmuir@apache.org> Date: Wed Nov 5 11:41:13 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit d4e2f6dfe767a5128c9b9ae9e75036378de08f47 Merge: 4e5445c `4111d93` Author: Robert Muir <rmuir@apache.org> Date: Wed Nov 5 06:26:32 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 4e5445c775f580730eb01360244e9330c0dc3958 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 16:19:19 2014 -0500 FixedBitSet -> BitSet commit 9887ea73e8b857eeda7f851ef3722ef580c92acf Merge: 1bf8894 `fc84666` Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 15:26:25 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 1bf8894430de3e566d0dc5623b0cc28b0d674ebb Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 15:22:51 2014 -0500 remove nocommit commit a9c2a2259ff79c69bae7806b64e92d5f472c18c8 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:48:43 2014 -0500 turn jenkins red again commit 067baaaa4d52fce772c81654dcdb5051ea79139f Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:18:21 2014 -0500 unzip from stream commit 82b6fba33d362aca2313cc0ca495f28f5ebb9260 Merge: b2214bb `6523cd9` Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:10:59 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit b2214bb093ec2f759003c488c3c403c8931db914 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 13:09:53 2014 -0500 go back to my URL until we can figure out what is up with jenkins commit e7d614172240175a51f580aeaefb6460d21cede9 Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 10:52:54 2014 -0500 try this jenkins commit 337a3c7704efa7c9809bf373152d711ee55f876c Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 16:17:49 2014 +0100 Rename temp-files under lock to prevent metadata reads while renaming commit 77d5ba80d0a76efa549dd753b9f114b2f2d2d29c Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 10:07:11 2014 -0500 continue to treat too-old/too-new as corruption for now commit 98d0fd2f4851bc50e505a94ca592a694d502c51c Author: Robert Muir <rmuir@apache.org> Date: Tue Nov 4 09:24:21 2014 -0500 fix last nocommit commit 643fceed66c8caf22b97fc489d67b4a2a90a1a1c Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 14:46:17 2014 +0100 remove NoSuchDirectoryException commit 2e43c4feba05cfaf451df70f946c0930cbcc4557 Merge: 93826e4 `8163107` Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 14:38:00 2014 +0100 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 93826e4d56a6a97c2074669014af77ff519bde63 Merge: 7f10129 `44e24d3` Author: Simon Willnauer <simonw@apache.org> Date: Tue Nov 4 12:54:27 2014 +0100 Merge branch 'master' into enhancement/lucene_5_0_upgrade Conflicts: src/main/java/org/elasticsearch/index/store/DistributorDirectory.java src/main/java/org/elasticsearch/index/store/Store.java src/main/java/org/elasticsearch/indices/recovery/RecoveryStatus.java src/test/java/org/elasticsearch/index/store/DistributorDirectoryTest.java src/test/java/org/elasticsearch/index/store/StoreTest.java src/test/java/org/elasticsearch/indices/recovery/RecoveryStatusTests.java commit 7f10129364623620575c109df725cf54488b3abb Author: Adrien Grand <jpountz@gmail.com> Date: Tue Nov 4 11:32:24 2014 +0100 Fix TopHitsAggregator to not ignore the top-level/leaf collector split. commit 042fadc8603b997bdfdc45ca44fec70dc86774a6 Author: Adrien Grand <jpountz@gmail.com> Date: Tue Nov 4 11:31:20 2014 +0100 Remove MatchDocIdSet in favor of DocValuesDocIdSet. commit 7d877581ff5db585a674c95ac391ac78a0282826 Author: Adrien Grand <jpountz@gmail.com> Date: Tue Nov 4 11:10:08 2014 +0100 Make the and filter use the cost API. Lucene 5 ensured that cost() can safely be used, and this will have the benefit that the order in which filters are specified is not important anymore (only for slow random-access filters in practice). commit 78f1718aa2cd82184db7c3a8393e6215f43eb4a8 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 23:55:17 2014 -0500 fix previous eclipse import braindamage commit 186c40e9258ce32f22a9a714ab442a310b6376e0 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 22:32:34 2014 -0500 allow child queries to exhaust iterators again commit b0b1271305e1b6d0c4c4da51a3c54df1aa5c0605 Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 14:50:44 2014 -0800 Fix nocommit for mapping output. index_options will not be printed if the field is not indexed. commit ba223eb85e399c9620a347a983e29bf703953e7a Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 14:07:26 2014 -0800 Remove no commit for chinese analyzer provider. We should have a separate issue to address not using this provider on new indexes. commit ca554b03c4471797682b2fb724f25205cf040c4a Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 13:41:59 2014 -0800 Fix stop tests commit de67c4653ec47dee9c671390536110749d2bb05f Author: Ryan Ernst <ryan@iernst.net> Date: Mon Nov 3 12:51:17 2014 -0800 Remove analysis nocommits, switching over to Lucene43*Filters for backcompat commit 50cae9bec72c25c33a1ab8a8931bccb3355171e2 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 15:32:25 2014 -0500 add ram accounting and TODO lazy-loading (its no worse than master, can be a followup improvement) for suggesters commit 7a7f0122f138684b312d0f0b03dc2a9c16c15f9c Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 15:11:26 2014 -0500 bump lucene version commit cd0cae5c35e7a9e049f49ae45431f658fb86676b Merge: 446bc09 `3c72073` Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 14:49:05 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 446bc09b4e8bf4602d3c252b53ddaa0da65cce2f Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 14:46:30 2014 -0500 remove hack commit a19d85a968d82e6d00292b49630ef6ff2dbf2f32 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 12:53:11 2014 -0500 dont create exceptions with circular references on corruption (will open a PR for this) commit 0beefb9e821d97c37e90ec556d81ac7b00369b8a Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 11:47:14 2014 -0500 temporarily add craptastic detector for this horrible bug commit e9f2d298bff75f3d1591f8622441e459c3ce7ac3 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 10:56:01 2014 -0500 add nocommit commit e97f1d50a91a7129650b8effc7a9ecf74ca0569a Merge: c57a3c8 `f1f50ac` Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 10:12:12 2014 -0500 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit c57a3c8341ed61dca62eaf77fad6b8b48aeb6940 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 10:11:46 2014 -0500 fix nocommit commit dd0e77e4ec07c7011ab5f6b60b2ead33dc2333d2 Author: Robert Muir <rmuir@apache.org> Date: Mon Nov 3 09:54:09 2014 -0500 nocommit -> TODO, this is in much more places in the codebase, bigger issue commit 3cc3bf56d72d642059f8fe220d6f2fed608363e9 Author: Ryan Ernst <ryan@iernst.net> Date: Sat Nov 1 23:59:17 2014 -0700 Remove nocommit and awaitsfix for edge ngram filter test. commit 89f115245155511c0fbc0d5ee62e63141c3700c1 Author: Ryan Ernst <ryan@iernst.net> Date: Sat Nov 1 23:57:44 2014 -0700 Fix EdgeNGramTokenFilter logic for version <= 4.3, and fixed instanceof checks in corresponding tests to correctly check for reverse filter when applicable. commit 112df869cd199e36aab0e1a7a288bb1fdb2ebf1c Author: Robert Muir <rmuir@apache.org> Date: Sun Nov 2 00:08:30 2014 -0400 execute geo disjoint query/filter as intersects commit e5061273cc685f1252e9a3a9ae4877ec9bce7752 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:58:59 2014 -0400 remove chinese analyzer from docs commit ea1af11b8978fcc551f198e24fe21d52806993ef Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:29:00 2014 -0400 fix ram accounting bug commit 53c0a42c6aa81aa6bf81d3aa77b95efd513e0f81 Merge: e3bcd3c `6011a18` Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:16:29 2014 -0400 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit e3bcd3cc07a4957e12c7b3affc462c31290a9186 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:15:01 2014 -0400 fix url-email back compat (thanks ryan) commit 91d6b096a96c357755abee167098607223be1aad Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 22:11:26 2014 -0400 bump lucene version commit d2bb9568df72b37ec7050d25940160b8517394bc Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 20:33:07 2014 -0400 remove nocommit commit 1d049c471e19e5c457262c7399c5bad9e023b2e3 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 20:28:58 2014 -0400 fix eclipse to group org/com imports together: without this, its madness commit 09d8c1585ee99b6e63be032732c04ef6fed84ed2 Author: Robert Muir <rmuir@apache.org> Date: Sat Nov 1 14:27:41 2014 -0400 remove nocommit, if you dont liek it, print assembly and tell me how it can be better commit 8a6a294313fdf33b50c7126ec20c07867ecd637c Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 20:01:55 2014 +0100 Remove deprecated usage of DocIdSets.newDocIDSet. commit 601bee60543610558403298124a84b1b3bbd1045 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 14:13:18 2014 -0400 maybe one of these zillions of annotations will stop thread leaks commit 9d3f69abc7267c5e455aefa26db95cb554b02d62 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 14:05:39 2014 -0400 fix some analysis nocommits commit 312e3a29c77214b8142d21c33a6b2c2b151acf9a Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 18:28:45 2014 +0100 Remove XConstantScoreQuery/XFilteredQuery/ApplyAcceptedDocsFilter. commit 5a0cb9f8e167215df7f1b1fad11eec6e6c74940f Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 17:06:45 2014 +0100 Fix misleading documentation of DocIdSets.toCacheable. commit 8b4ef2b5b476fff4c79c0c2a0e4769ead26cf82b Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 17:05:59 2014 +0100 Fix CustomRandomAccessFilterStrategy to override the right method. commit d7a9a407a615987cfffc651f724fbd8795c9c671 Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 16:21:35 2014 +0100 Better handle the special case when there is a single SHOULD clause. commit 648ad389f07e92dfc451f345549c9841ba5e4c9a Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 15:53:38 2014 +0100 Cut over XBooleanFilter to BitDocIdSet.Builder. The idea is similar to what happened to Lucene's BooleanFilter. Yet XBooleanFilter is a bit more sophisticated and I had to slightly change the way it is implemented in order to make it work. The main difference with before is that slow filters are now applied lazily, so eg. if you have 3 MUST clauses, two with a fast iterator and the third with a slow iterator, the previous implementation used to apply the fast iterators first and then only check the slow filter for bits which were set in the bit set. Now we are computing a bit set based on the fast must clauses and then basically returning a BitsFilteredDocIdSet.wrap(bitset, slowClause). Other than that, BooleanFilter still uses the bitset optimizations when or-ing and and-ind filters. Another improvement is that BooleanFilter is now aware of the cost API. commit b2dad312b4bc9f931dc3a25415dd81c0d9deee08 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 10:18:53 2014 -0400 clear nocommit commit 4851d2091e744294336dfade33906c75fbe695cd Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 15:15:16 2014 +0100 cut over to RoaringDocIdSet commit ca6aec24a901073e65ce4dd6b70964fd3612409e Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:57:30 2014 +0100 make nocommit more explicit commit d0742ee2cb7a6c48b0bbb31580b7fbcebdb6ec40 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 09:55:24 2014 -0400 fix standardtokenizer nocommit commit 7d6faccafff22a86af62af0384838391d46695ca Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:54:08 2014 +0100 fix compilation commit a038a405c1ff6458ad294e6b5bc469e622f699d0 Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:53:43 2014 +0100 fix compilation commit 30c9e307b1f5d80e2deca3392c0298682241207f Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:52:35 2014 +0100 fix compilation commit e5139bc5a0a9abd2bdc6ba0dfbcb7e3c2e7b8481 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 09:52:16 2014 -0400 clear nocommit here commit 85dd2cedf7a7994bed871ac421cfda06aaf5c0a5 Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:46:17 2014 +0100 fix CompletionPostingsFormatTest commit c0f3781f616c9b0ee3b5c4d0998810f595868649 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 09:38:00 2014 -0400 add tests for these analyzers commit 51f9999b4ad079c283ae762c862fd0e22d00445f Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 14:10:26 2014 +0100 remove nocommit - this is not an issue commit fd1388fa03e622b0738601c8aeb2dbf7949a6dd2 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Fri Oct 31 14:07:01 2014 +0100 Remove redundant null check commit 3d6dd51b0927337ba941a235446b22e8cd500dc3 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Fri Oct 31 14:01:37 2014 +0100 Removed the work around to prevent p/c error when invoking #iterator() twice, because the custom query filter wrapper now doesn't transform the result to a cache doc id set any more. I think the transforming to a cachable doc id set in CustomQueryWrappingFilter isn't needed at all, because we use the DocIdSet only once and because of that is just slowed things down. commit 821832a537e00cd1216064b379df3e01d2911d3a Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:54:33 2014 +0100 one more nocommit commit 77eb9ea4c4ea50afb2680c29682ddcb3851a9d4f Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Fri Oct 31 13:52:29 2014 +0100 Remove cast commit a400573c034ed602221f801b20a58a9186a06eae Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:49:24 2014 +0100 fix stop filter commit 51746087cf8ec34c4d20aa05ba8dbff7b3b43eec Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:21:36 2014 +0100 fix changed semantics of FBS.nextSetBit to check for NO_MORE_DOCS commit 8d0a4e2511310f1293860823fe3ba80ac771bbe3 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 08:13:44 2014 -0400 do the bogus cast differently commit 46a5cc5732dea096c0c80ae5ce42911c9c51e44e Author: Simon Willnauer <simonw@apache.org> Date: Fri Oct 31 13:00:16 2014 +0100 I hate it but P/C now passes commit 580c0c2f82bbeacf217e594f22312b11d1bdb839 Merge: a9d3c00 `1645434` Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 31 06:54:31 2014 -0400 fix nocommit/classcast commit a9d3c004d62fe04989f49a897e6ff84973c06eb9 Author: Adrien Grand <jpountz@gmail.com> Date: Fri Oct 31 08:49:31 2014 +0100 Update TODO. commit aa75af0b407792aeef32017f03a6f442ed970baa Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 19:18:25 2014 -0400 clear obselete nocommits from lucene bump commit d438534cf41fcbe2d88070e2f27c994625e082c2 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 18:53:20 2014 -0400 throw classcastexception when ES abuses regular filtercache for nested docs commit 2c751f3a8feda43ec127c34769b069de21f3d16f Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 18:31:34 2014 -0400 bump lucene revision, fix tests commit d6ef7f6304ae262bf6228a7d661b2a452df332be Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 22:37:58 2014 +0100 fix merge problems commit de9d361f88a9ce6bb3fba85285de41f223c95767 Merge: 41f6aab `f6b37a3` Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 22:28:59 2014 +0100 Merge branch 'master' into enhancement/lucene_5_0_upgrade Conflicts: pom.xml src/main/java/org/elasticsearch/Version.java src/main/java/org/elasticsearch/gateway/local/state/meta/MetaDataStateFormat.java commit 41f6aab388aa80c40b08a2facab2617576203a0d Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 17:48:46 2014 +0100 fix potiential NPE commit c4428b12e1ae838b91e847df8b4a8be7f49e10f4 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 17:38:46 2014 +0100 don't advance iterator in a match(doc) method commit 28ab948e99e3ea4497c9b1e468384806ba7e1790 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 17:34:58 2014 +0100 don't advance iterator in a match(doc) method commit eb0f33f6634fadfcf4b2bf7327400e568f0427bb Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 16:55:54 2014 +0100 fix GeoUtilsTest commit 7f711fe3eaf73b6c2268cf42d5a41132a61ad831 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 16:43:16 2014 +0100 Use a dedicated default index option if field type is not indexed by default commit 78e3f37ab779e3e1b25b45a742cc86ab5f975149 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 10:56:14 2014 -0400 disable this test with AwaitsFix to reduce noise commit 9a590f563c8e03a99ecf0505c92d12d7ab20d11d Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 09:38:49 2014 +0100 fix lucene version commit abe3ca1d8bb6b5101b545198f59aec44bacfa741 Author: Simon Willnauer <simonw@apache.org> Date: Thu Oct 30 09:35:05 2014 +0100 fix AnalyzingCompletionLookupProvider to wrok with new codec API commit 464293b245852d60bde050c6d3feb5907dcfbf5f Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 00:26:00 2014 -0400 don't try to write stuff to tests class directory commit 031cc6c19f4fe4423a034b515f77e5a0e282a124 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 00:12:36 2014 -0400 AwaitsFix these known issues to reduce noise commit 4600d51891e35847f2d344247d6f915a0605c0d1 Author: Robert Muir <rmuir@apache.org> Date: Thu Oct 30 00:06:53 2014 -0400 openbitset lives on commit 8492bae056249e2555d24acd55f1046b66a667c4 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 23:42:54 2014 -0400 fixes for filter tests commit 31f24ce4efeda31f97eafdb122346c7047a53bf2 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 23:12:38 2014 -0400 don't use fieldcache commit 8480789942fdff14a6d2b2cd8134502fe62f20c8 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 23:04:29 2014 -0400 ancient index no longer supported commit 02e78dc7ebdd827533009f542582e8db44309c57 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 23:37:02 2014 +0100 fix more tests commit ff746c6df23c50b3f3ec24922413b962c8983080 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 23:08:19 2014 +0100 fix all mapper commit e4fb84b517107b25cb064c66f83c9aa814a311b2 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 22:55:54 2014 +0100 fix distributor tests and cut over to FileStore API commit 20c850e2cfe3210cd1fb9e232afed8d4ac045857 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 22:42:18 2014 +0100 use DOCS_ONLY if index=true and current options == null commit 44169c108418413cfe51f5ce23ab82047463e4c2 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 22:33:36 2014 +0100 Fix index=yes\|no settings in mappers commit a3c5f77987461a18121156ed345d42ded301c566 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 21:51:41 2014 +0100 fix several field mappers conversion from setIndexed to indexOptions commit df84d736908e88a031d710f98e222be68ae96af1 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 21:33:35 2014 +0100 fix SourceFieldMapper to be not indexed commit b2bf01d12a8271a31fb2df601162d0e89924c8f5 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 21:23:08 2014 +0100 Cut over to .liv files in store and corruption tests commit 619004df436f9ef05d24bef1b6a7f084c6b0ad75 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 17:05:52 2014 +0100 fix more tests commit b7ed653a8b464de446e00456bce0a89e47627c38 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 16:19:08 2014 +0100 [STORE] Add dedicated method to write temporary files Recovery writes temporary files which might not end up in the right distributor directories today. This commit adds a dedicated API that allows specifying the target file name in order to create the tempoary file in the correct directory. commit 7d574659f6ae04adc2b857146ad0d8d56ca66f12 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 10:28:49 2014 -0400 add some leniency to temporary bogus method commit f97022ea7c2259f7a5cf97d924c59ed75ab65b32 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 10:24:17 2014 -0400 fix MultiCollector bug commit b760533128c2b4eb10ad76e9689ef714293dd819 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:56:08 2014 +0100 CheckIndex is now closeable we need to close it commit 9dae9fb6d63546a6c2427be2a2d5c8358f5b1934 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:45:11 2014 +0100 s/Lucene51/Lucene50 commit 7aea9b86856a8c1b06a08e7c312ede1168af1287 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:42:30 2014 +0100 fix BloomFilterPostingsFormat commit 16fea6fe842e88665d59cc091e8224e8dc6ce08c Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:41:16 2014 +0100 fix some codec format issues commit 3d77aa97dd2c4012b63befef3f2ba2525965e8a6 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:30:43 2014 +0100 fix CodecTests commit 6ef823b1fde25657438ace1aabd9d552d6ae215e Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:26:47 2014 +0100 make it compile commit 9991eee1fe99435118d4dd42b297ffc83fce5ec5 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 09:12:43 2014 -0400 add an ugly hack for TopHitsAggregator for now commit 03e768a01fcae6b1f4cb50bcceec7d42977ac3e6 Author: Simon Willnauer <simonw@apache.org> Date: Wed Oct 29 14:01:02 2014 +0100 cut over ES090PostingsFormat commit 463d281faadb794fdde3b469326bdaada25af048 Merge: 0f8740a `8eac79c` Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 08:30:36 2014 -0400 Merge branch 'master' into enhancement/lucene_5_0_upgrade commit 0f8740a782455a63524a5a82169f6bbbfc613518 Author: Robert Muir <rmuir@apache.org> Date: Wed Oct 29 01:00:15 2014 -0400 fix/hack remaining filter and analysis issues commit df534488569da13b31d66e581456dfd4b55156b9 Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 23:11:47 2014 -0400 fix ngrams / openbitset usage commit 11f5dc3b9887f4da80a0fa1818e1350b30599329 Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 22:42:44 2014 -0400 hack over sort comparators commit 4ebdc754350f512596f6a02770d223e9f5f7975a Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 21:27:07 2014 -0400 compiler errors < 100 commit 2d60c9e29de48ccb0347dd87f7201f47b67b83a0 Author: Robert Muir <rmuir@apache.org> Date: Tue Oct 28 03:13:08 2014 -0400 clear some nocommits around ram usage commit aaf47fe6c0aabcfb2581dd456fc50edf871da758 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 12:27:34 2014 -0400 migrate fieldinfo handling commit ef6ed6d15d8def71cd880d97249678136cd29fe3 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 12:07:13 2014 -0400 more simple fixes commit f475e1048ae697dd9da5bd9da445102b0b7bc5b3 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 11:58:21 2014 -0400 more fielddata ram accounting fixes commit 16b4239eaa9b4262df258257df4f31d39f28a3a2 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 16:47:32 2014 +0100 add missing file commit 5b542fa2a6da81e36a0c35b8e891a1d8bc58f663 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 16:43:29 2014 +0100 cut over completion posting formats - still some nocommits commit ecdea49404c4ec4e1b78fb54575825f21b4e096e Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 11:21:09 2014 -0400 fielddata accountable fixes commit d43da265718917e20c8264abd43342069198fe9c Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 16:19:53 2014 +0100 cut over BloomFilterPostings to new API commit 29b192ba621c14820175775d01242162b88bd364 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 10:22:51 2014 -0400 fix more analyzers commit 74b4a0c5283e323a7d02490df469497c722780d2 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 09:54:25 2014 -0400 fix tests commit 554084ccb4779dd6b1c65fa7212ad1f64f3a6968 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:51:48 2014 +0100 maintain supressed exceptions on CorruptIndexException commit cf882d9112c5e8ef1e9f2b0f800f7aa59001a4f2 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:47:17 2014 +0100 commitOnClose=false commit ebb2a9189ab2f459b7c6c9985be610fd90dfe410 Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:46:06 2014 +0100 cut over indexwriter closeing in InternalEngine commit cd21b3d4706f0b562bd37792d077d60832aff65f Author: Simon Willnauer <simonw@apache.org> Date: Mon Oct 27 14:38:10 2014 +0100 fix constant commit f93f900c4a1c90af3a21a4af5735a7536423fe28 Author: Robert Muir <rmuir@apache.org> Date: Mon Oct 27 09:50:49 2014 -0400 fix test commit a9a752940b1ab4699a6a08ba8b34afca82b843fe Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Mon Oct 27 09:26:18 2014 +0100 Be explicit about the index options commit d9ee815babd030fa2ceaec9f467c105ee755bf6b Author: Simon Willnauer <simonw@apache.org> Date: Sun Oct 26 20:03:44 2014 +0100 cut over store and directory commit b3f5c8e39039dd8f5caac0c4dd1fc3b1116e64ca Author: Robert Muir <rmuir@apache.org> Date: Sun Oct 26 13:08:39 2014 -0400 more test fixes commit 8842f2684e3606aae0860c27f7a4c53e273d47fb Author: Robert Muir <rmuir@apache.org> Date: Sun Oct 26 12:14:52 2014 -0400 tests manual labor commit c43de5aec337919a3fdc3638406dff17fc80bc98 Author: Robert Muir <rmuir@apache.org> Date: Sun Oct 26 11:04:13 2014 -0400 BytesRef -> BytesRefBuilder commit 020c0d087a2f37566a1db390b0e044ebab030138 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 15:53:37 2014 +0100 Moved over to BitSetFilter commit 48dd1b909e6c52cef733961c9ecebfe4f67109fe Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 15:53:11 2014 +0100 Left over Collector api change in ScanContext commit 6ec248ef63f262bcda400181b838fd9244752625 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 15:47:40 2014 +0100 Moved indexed() over to indexOptions != null or indexOptions == null commit 9937aebfd8546ae4bb652cd976b3b43ac5ab7a63 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Sun Oct 26 13:26:31 2014 +0100 Fixed many compile errors. Mainly around the breaking Collector api change in 5.0. commit fec32c4abc0e3309cf34260c8816305a6f820c9e Author: Robert Muir <rmuir@apache.org> Date: Sat Oct 25 11:22:17 2014 -0400 more easy fixes commit dab22531d801800d17a65dc7c9464148ce8ebffd Author: Robert Muir <rmuir@apache.org> Date: Sat Oct 25 09:33:41 2014 -0400 more progress commit 414767e9a955010076b0497cc4f6d0c1850b48d3 Author: Robert Muir <rmuir@apache.org> Date: Sat Oct 25 06:33:17 2014 -0400 more progress commit ad9d969fddf139a8830254d3eb36a908ba87cc12 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 24 14:28:01 2014 -0400 current state of fun commit 464475eecb0be15d7d084135ed16051f76a7e521 Author: Robert Muir <rmuir@apache.org> Date: Fri Oct 24 11:42:41 2014 -0400 bump to 5.0 snapshot	2014-11-05 15:48:51 -05:00
Alex Ksikes	35f55608cc	MLT Field Query: remove it from master The MLT field query is simply replaced by a MLT query set to specififc field. To simplify code maintenance we should deprecate it in 1.4 and remove it in 2.0. Closes #8238	2014-10-29 10:19:00 +01:00
Alex Ksikes	0be5c60bce	MLT Query: use ParseField#withAllDeprecated for percent_terms_to_match Also the parameter was deprecated but not removed so we keep it in the doc and mark it as deprecated ... Closes #8241	2014-10-27 17:35:06 +01:00
Alex Ksikes	991f3e2cd3	Docs: fix tags for dfs and new like parameter	2014-10-27 15:42:44 +01:00
Alex Ksikes	4da407a869	MLT Query: versatile 'like' parameter The MLT query has a lot of parameters. For example, a set of documents is specified with either `like_text`, `ids` or `docs`, with at least one parameter required. This commit groups all the document specification parameters under one called `like`. The syntax is described below and could easily be extended to allow for new means of specifying document input. The `like_text`, `ids` and `docs` parameters are deprecated. As a single piece text: { "query": { "more_like_this": { "like": "some text here" } } } As a single item: { "query": { "more_like_this": { "like": { "_index": "imdb", "_type": "movies", "_id": "88247" } } } } Or as a mixture of all: { "query": { "more_like_this": { "like": [ "Some random text ...", { "_index": "imdb", "_type": "movies", "_id": "88247" }, { "_index": "imdb", "_type": "movies", "doc": { "title": "Document with an artificial title!" } } ] } } } Closes #8039	2014-10-25 11:04:51 +02:00
Andrei Kolosok	c31a783930	Docs: Update filtered-query.asciidoc Fix mistyping Closes #8167	2014-10-21 09:45:19 +02:00
Andrei Kolosok	92abfc8e24	Docs: Update minimum-should-match.asciidoc Add %-sign to examle in the last section Closes #8157	2014-10-21 09:43:55 +02:00
David Pilato	0ff61e1d6f	Add time_zone setting for query_string Query String query now supports a new `time_zone` option based on JODA time zones. When using a range on date field, the time zone is applied. ```json { "query": { "query_string": { "text": "date:[2012 TO 2014]", "timezone": "Europe/Paris" } } } ``` Closes #7880.	2014-10-20 19:09:45 +02:00
John Michael Luy	86f6e21cdf	Docs: Update query-string-query.asciidoc Update sample Closes #8024	2014-10-16 20:21:12 +02:00
Patrick Peschlow	340e0efcaa	Docs: cross_type query needs to be cross_fields Closes #8021	2014-10-15 18:27:48 +02:00
Chris Earle	9b84ad3c7b	Adding "min" score mode to parent-child queries Support for "max", "sum", and "avg" already existed.	2014-10-15 09:23:36 -05:00
Hang Su	dd62e50861	fix mismatched curly bracket	2014-10-14 13:24:44 +02:00
David Pilato	6ae6a078de	Search: add `format` support for date range filter and queries When the date format is defined in mapping, you can not use another format when querying using range date query or filter. For example, this won't work: ``` DELETE /test PUT /test/t/1 { "date": "2014-01-01" } GET /test/_search { "query": { "filtered": { "filter": { "range": { "date": { "from": "01/01/2014" } } } } } } ``` It causes: ``` Caused by: org.elasticsearch.ElasticsearchParseException: failed to parse date field [01/01/2014], tried both date format [dateOptionalTime], and timestamp number ``` It could be nice if we can support at query time another date format just like we support `analyzer` at search time on String fields. Something like: ``` GET /test/_search { "query": { "filtered": { "filter": { "range": { "date": { "from": "01/01/2014", "format": "dd/MM/yyyy" } } } } } } ``` Same for queries: ``` GET /test/_search { "query": { "range": { "date": { "from": "01/01/2014", "format": "dd/MM/yyyy" } } } } ``` Closes #7189.	2014-10-06 15:59:18 +02:00
Alex Ksikes	349b7a3a8b	Term Vectors/MLT Query: support for different analyzers than default at field This adds a `per_field_analyzer` parameter to the Term Vectors API, which allows to override the default analyzer at the field. If the field already stores term vectors, then they will be re-generated. Since the MLT Query uses the Term Vectors API under its hood, this commits also adds the same ability to the MLT Query, thereby allowing users to fine grain how each field item should be processed and analyzed. Closes #7801	2014-10-03 16:40:17 +02:00
Clinton Gormley	ea7ed51014	Docs: Typoes on the range filter page	2014-09-30 19:28:35 +02:00
Alex Ksikes	b118558962	MLT Query: Support for artificial documents Previously, the only way to specify a document not present in the index was to use `like_text`. This would usually lead to complex queries made of multiple MLT queries per document field. This commit adds the ability to the MLT query to directly specify documents not present in the index (artificial documents). The syntax is similar to the Percolator API or to the Multi Term Vector API. Closes #7725	2014-09-29 15:49:13 +02:00
Alex Ksikes	5014158d6b	MLT Query: use minimum should match more extensive syntax The minimum number of optional should clauses of the generated query to match can now be set using the more extensive minimum should match syntax. This makes the `percent_terms_to_match` parameter deprecated, and replaced in favor to a new `minimum_should_match` parameter. Closes #7898	2014-09-29 11:14:56 +02:00
Thom Dixon	2d7264cd03	GeoJSON is lon,lat Although emphasized in the text, the example was backwards. Closes #7843	2014-09-28 11:05:42 +02:00
Clinton Gormley	cb00d4a542	Docs: Removed all the added/deprecated tags from 1.x	2014-09-26 21:04:42 +02:00
Alex Ksikes	51bf3e6730	MLT Query: fix percent_terms_to_match The parameter `percent_terms_to_match` (percentage of terms that must match in the generated query) was wrongly set to the top level boolean query. This would lead to zero or all results type of situations. This commit ensures that the parameter is indeed applied to the query of generated terms. Closes #7754	2014-09-25 09:56:53 +02:00
Clinton Gormley	df8b949c45	Docs: Typos on range filter Closes #7713	2014-09-24 20:19:05 +02:00
David Pilato	f2be8d83dc	Docs: fix typo	2014-09-23 23:41:07 +02:00
Ulrich Reffle	4ed52073fa	Docs: A simple typo. Closes #7535	2014-09-11 11:17:49 +02:00
Clinton Gormley	e0684c4f6e	Docs: Update regexp-syntax.asciidoc Closes #7419	2014-09-07 11:42:33 +02:00
Clinton Gormley	0578a703df	Docs: Update prefix-query.asciidoc Typo (missing word) Closes #7283	2014-09-07 11:32:05 +02:00
Clinton Gormley	cf001d6837	Docs: Adds missing explanation to multi-match Closes #7281	2014-09-07 11:30:52 +02:00
pachanta	8bae96485e	Docs: Update span-not-query.asciidoc The example given does not clearly explain what the query does. Closes #7049	2014-09-07 11:28:19 +02:00
Britta Weber	c5ff70bf43	function_score: add optional weight parameter per function Weights can be defined per function like this: ``` "function_score": { "functions": [ { "filter": {}, "FUNCTION": {}, "weight": number } ... ``` If `weight` is given without `FUNCTION` then `weight` behaves like `boost_factor`. This commit deprecates `boost_factor`. The following is valid: ``` POST testidx/_search { "query": { "function_score": { "weight": 2 } } } POST testidx/_search { "query": { "function_score": { "functions": [ { "weight": 2 }, ... ] } } } POST testidx/_search { "query": { "function_score": { "functions": [ { "FUNCTION": {}, "weight": 2 }, ... ] } } } POST testidx/_search { "query": { "function_score": { "functions": [ { "filter": {}, "weight": 2 }, ... ] } } } POST testidx/_search { "query": { "function_score": { "functions": [ { "filter": {}, "FUNCTION": {}, "weight": 2 }, ... ] } } } ``` The following is not valid: ``` POST testidx/_search { "query": { "function_score": { "weight": 2, "FUNCTION(including boost_factor)": 2 } } } POST testidx/_search { "query": { "function_score": { "functions": [ { "weight": 2, "boost_factor": 2 } ] } } } ```` closes #6955 closes #7137	2014-09-01 11:04:40 +02:00
Ryan Ernst	65afa1d93b	FunctionScore: Refactor RandomScoreFunction to be consistent, and return values in rang [0.0, 1.0] RandomScoreFunction previously relied on the order the documents were iterated in from Lucene. This caused changes in ordering, with the same seed, if documents moved to different segments. With this change, a murmur32 hash of the _uid for each document is used as the "random" value. Also, the hash is adjusted so as to only return values between 0.0 and 1.0 to enable easier manipulation to fit into users' scoring models. closes #6907, #7446	2014-08-27 08:37:25 -07:00
Lee Hinman	c5c11b1837	[DOCS] Document NOT as a simple_query_string flag Fixes #7426	2014-08-25 10:27:24 +02:00
Alex Ksikes	e78694ae82	More Like This Query: defaults to all possible fields for items Items with no specified field now defaults to all the possible fields from the document source. Previously, we had required 'fields' to be specified either as a top level parameter or for each item. The default behavior is now similar to the MLT API. Closes #7382	2014-08-22 15:07:22 +02:00
Alex Ksikes	f1a6b4e9fe	More Like This Query: Switch to using the multi-termvectors API The term vector API can now generate term vectors on the fly, if the terms are not already stored in the index. This commit exploits this new functionality for the MLT query. Now the terms are directly retrieved using multi- termvectors API, instead of generating them from the texts retrieved using the multi-get API. Closes #7014	2014-08-21 12:18:21 +02:00
Adrien Grand	ea96359d82	Facets: Removal from master. Close #7337	2014-08-21 10:34:39 +02:00
Britta Weber	639692943f	Docs: Document distance type and sort mode for many to many geo_points closes #7280	2014-08-18 16:15:55 +02:00
Lee Hinman	99b0faed14	[DOCS] document simple_query_string's `lenient` option	2014-08-18 15:56:06 +02:00
Xiao Yu	4f417aa774	Docs: Update nested-filter.asciidoc Fix whitespace to standardize on spaces for indents on both code examples for readability. Closes #6568	2014-08-18 13:09:00 +02:00
Britta Weber	aa1dbc0778	Docs: Fix ambigous explanation of the "fields" parameter in `query_string` query Closes #7292	2014-08-18 13:06:48 +02:00
Gheorghita Catalin Bordianu	ca77db1c83	Fixed a typo	2014-08-14 16:27:30 +02:00
Konrad Feldmeier	657b954528	Resolve wording inconsistency AND and OR filter docs talk about different targets for the operators. I believe that both should be described in terms of modifying other 'filters'. I also added articles for easier (human) parsing. This fixes #4762 Closes #7165	2014-08-05 17:16:53 +02:00
David Pilato	873a45eaba	Search: add time zone setting for relative date math in range filter/query Filters and Queries now supports `time_zone` parameter which defines which time zone should be applied to the query or filter to convert it to UTC time based value. When applied on `date` fields the `range` filter and queries accept also a `time_zone` parameter. The `time_zone` parameter will be applied to your input lower and upper bounds and will move them to UTC time based date: [source,js] -------------------------------------------------- { "constant_score": { "filter": { "range" : { "born" : { "gte": "2012-01-01", "lte": "now", "time_zone": "+1:00" } } } } } { "range" : { "born" : { "gte": "2012-01-01", "lte": "now", "time_zone": "+1:00" } } } -------------------------------------------------- In the above examples, `gte` will be actually moved to `2011-12-31T23:00:00` UTC date. NOTE: if you give a date with a timezone explicitly defined and use the `time_zone` parameter, `time_zone` will be ignored. For example, setting `from` to `2012-01-01T00:00:00+01:00` with `"time_zone":"+10:00"` will still use `+01:00` time zone. Closes #3729.	2014-08-04 15:42:03 +02:00
Clinton Gormley	c367ae09e3	Update nested-query.asciidoc Changed score_mode `total` to `sum` to be consistent with parent-child etc	2014-07-26 22:32:28 +02:00
Britta Weber	10201d511c	[doc] Correct decay function equations in function_score description Impact of decay and scale was missing from the equations. Closes #6983	2014-07-23 17:33:22 +02:00
Simon Willnauer	5bfea56457	[DOCS] move all coming tags to added in master	2014-07-23 16:37:19 +02:00
babeya	81a83aab22	Docs: Update query-string-syntax.asciidoc Closes #6253	2014-07-23 16:32:32 +02:00
Konrad Feldmeier	48812ff1f2	Reflect that 'field_value_factor' is only in 1.2.x While the blogpost http://www.elasticsearch.org/blog/2014-04-02-this-week-in-elasticsearch/ states, that feature #5519 was added to 1.x, the release notes for, e.g. v1.1.2, however tell otherwise. Only the release notes for 1.2.0 list #5519 as a new feature. Since the 1.x docs deprecate/discourage from using `_boost`, and seemingly give a migration example at http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-boost-field.html#function-score-instead-of-boost users of 1.1.x should be warned.	2014-07-23 15:49:03 +02:00
Clinton Gormley	254aa71693	Docs: Added Tiki Wiki integration Closes #6746	2014-07-23 11:00:46 +02:00
Clinton Gormley	a862732434	Docs: Typo	2014-07-21 18:51:49 +02:00
Brian Murphy	cbd2a97abd	[DOCS] : Indexed scripts/templates These are the docs for the indexed scripts/templates feature. Also moved the namespace for the REST endpoints. Closes #6851	2014-07-16 10:49:02 +01:00
Malte Schirnacher	647a2a64a1	Docs: Update query-string-syntax.asciidoc Closes #6853	2014-07-14 16:35:17 +02:00
Matthew L Daniel	53f2301eea	Docs: Add clarifying text about regexp and terms For the casual reader, the reference to "term queries" may be glossed over, yielding an unexpected result when using `regexp` queries. This attempts to make that distinction more prominent. Closes #6698	2014-07-03 11:39:57 +02:00
jnguyenx	1883f74cc0	Docs: Fixed missing comma in multi match query example	2014-07-03 08:17:09 +02:00
Clinton Gormley	30c80319c0	Match query with operator and, cutoff_frequency and stacked tokens If the match query with cutoff_frequency encounters stacked tokens, like synonyms in the same position, it returns a boolean query instead of a common terms query. However, if the original operator was set to "and", it was ignoring that and resetting the operator to "or". In fact, if operator is "and" then there is little benefit in using a common terms query as a must query is already executed efficiently.	2014-06-25 17:53:43 +02:00
Clinton Gormley	fac724cc99	Docs: Updated the explanation about memory usage with parent/child	2014-06-21 16:32:29 +02:00
Clinton Gormley	adf6e794b6	Docs: Rewrote the filtered query docs to be clearer Closes #1688	2014-06-19 16:34:26 +02:00
Boaz Leskes	7fb16c783d	Added caching support to geohash_filter Caching is turned off by default. Closes #6478	2014-06-12 22:19:34 +02:00
shadow000fire	1b45b216fd	Update nested-query.asciidoc Added note that fields inside a nested query must be full qualified.	2014-06-12 12:48:23 +02:00
Yervand Aghababyan	cb22417cc1	[DOCS] Fixed the fuzzy query docs with correct default value max_expansion option	2014-06-05 19:52:12 +02:00
violuke	4f99f0c6f1	[DOCS] Improved readability of multi-match query docs	2014-06-03 14:23:34 +02:00
Steve Fuller	b800be891f	[DOCS] fixed typo in fucntion-score query docs	2014-06-03 12:05:59 +02:00
violuke	0020e5fc0a	[DOCS] Improved grammar in multi-match query docs	2014-06-03 11:50:41 +02:00
Clinton Gormley	46a67b638d	Parent/Child: Added min_children/max_children to has_child query/filter Added support for min_children and max_children parameters to the has_child query and filter. A parent document will only be considered if a match if the number of matching children fall between the min/max bounds. Closes #6019	2014-05-30 19:38:39 +02:00
Adrien Grand	328a7e513c	[DOCS] Document filtered query strategies.	2014-05-28 17:57:43 +02:00
Alex Ksikes	2546c06131	More Like This Query: allow for both 'like_text' and 'docs/ids' to be specified. Closes #6246	2014-05-22 13:50:17 +02:00
Alex Ksikes	a29b4a800d	More Like This Query: replaced 'exclude' with 'include' to avoid double negation when set. Closes #6248	2014-05-21 18:45:03 +02:00
Alex Ksikes	db991dc3a4	More Like This Query: Added searching for multiple items. The syntax to specify one or more items is the same as for the Multi GET API. If only one document is specified, the results returned are the same as when using the More Like This API. Relates #4075 Closes #5857	2014-05-17 19:14:56 +02:00
ericheiker	0eb7b5024d	Update match-query.asciidoc	2014-05-14 11:59:12 +02:00
Alex Ksikes	48b7172ee7	Provided some insights as to how More Like This works internally. In the Google Groups forum there appears to be some confusion as to what mlt does. This documentation update should hopefully help demystifying this feature, and provide some understanding as to how to use its parameters. Closes #6092	2014-05-09 12:13:29 +02:00
Benjamin Devèze	6feeac98c8	s/boost_factor/boost in custom_filters_score doc I may be wrong but I think custom_filters_score used boost rather than boost factor?	2014-05-06 16:15:36 +02:00
Audrey	d7023fbb3f	Update "Character classes" part	2014-05-06 16:05:51 +02:00
Clinton Gormley	7a9aad30f4	[DOCS] Changed score_type to score_mode for has_child/parent queries	2014-05-05 18:30:12 +02:00
gabriel-tessier	48930c2950	[DOC] Fix typo in function score query documentation.	2014-05-02 23:44:56 +02:00
Alex Ksikes	b55d8ed2e3	Fix behavior on default boost factor for More Like This. A boost terms factor of 1.0 is not the same as no boosting of terms. The desired behavior is to deactivate boosting by default. If the user specifies any value other than 0, then boosting is activated. Closes #6021	2014-05-02 16:59:09 +02:00
Simon Willnauer	f285ffc610	Multi value handling in decay functions Decay functions currently only use the first value in a field that contains multiple values to compute the distance to the origin. Instead, it should consider all distances if more values are in the field and then use one of min/max/sum/avg which is defined by the user. Relates to #3960 closes #5940	2014-04-28 11:55:32 +02:00
Clinton Gormley	39705aa236	[DOCS] rewrite -> fuzzy_rewrite in match query Fixed typo	2014-04-23 21:05:14 +02:00
Samuel Molinari	909cf4de44	Update function-score-query.asciidoc	2014-04-20 13:39:32 +02:00
Clinton Gormley	65906d176a	Update multi-match-query.asciidoc Typo	2014-04-16 15:41:38 +02:00
Binh Ly	51a6a95de3	[DOC] Fixed flags example incorrect syntax	2014-04-01 14:43:38 -04:00
Andrew O'Brien	bd9c1bc8d9	Update has-parent-filter.asciidoc "This filter return child..." => This filter returns child...	2014-03-31 00:06:35 +02:00
Lee Hinman	8fbd1bdd48	Add the `field_value_factor` function to the function_score query The `field_value_factor` function uses the value of a field in the document to influence the score. A query that looks like: { "query": { "function_score": { "query": {"match": { "body": "foo" }}, "functions": [ { "field_value_factor": { "field": "popularity", "factor": 1.1, "modifier": "square" } } ], "score_mode": "max", "boost_mode": "sum" } } } Would have the score modified by: square(1.1 * doc['popularity'].value) Closes #5519	2014-03-27 14:29:37 -06:00
javanna	42c36ef72d	[DOCS] fixed typo Closes #5272	2014-03-26 14:51:02 +01:00
bleskes	5d832374dd	Update Documentation Feature Flags [1.1.0]	2014-03-25 17:51:30 +01:00
Boaz Leskes	fc8dc3f733	[Docs] updated the search template and query template docs	2014-03-25 15:25:02 +01:00
Alexander Reelsen	4fc461a97c	[DOCS] Moved the template query documentation into search section	2014-03-25 10:01:41 +01:00
Alexander Reelsen	8f6e1d4720	Query Templates: Adding dedicated /_search/template endpoint In order to simplify query template execution an own endpoint has been added Closes #5353	2014-03-20 17:43:40 +01:00
Clinton Gormley	98ecf80f07	[DOCS] Formatting error Closes #5346	2014-03-05 17:40:51 +01:00
Martijn van Groningen	1441fec068	[DOCS] Updated memory considerations for p/c queries and filters.	2014-02-26 22:16:51 +01:00
Simon Willnauer	90e57c15e8	[DOCS]: fixed small problem in example json	2014-02-26 16:40:04 +01:00
Lee Hinman	8f8cc7205d	Add "locale" parameter to query_string and simple_query_string Fixes #5128 Remove java 7 specific Locale functions, add "coming[1.1.0]" to documentation add LocaleUtils utility class for dealing with Locale functions	2014-02-20 15:53:08 -07:00
Isabel Drost-Fromm	48004ff8a5	Add mustache templating to query execution. Adds support for storing mustache based query templates that can later be filled with query parameter values at execution time. Templates may be both quoted, non-quoted and referencing templates stored in config/scripts/*.mustache by file name. See docs/reference/query-dsl/queries/template-query.asciidoc for templating examples. Implementation detail: mustache itself is being shaded as it depends directly on guava - so having it marked optional but included in the final distribution raises chances of version conflicts downstream. Fixes #4879	2014-02-20 12:21:59 +01:00
Lee Hinman	c97bcc3602	Add support for `lowercase_expanded_terms` flag to simple_query_string Default the flag to true, making simple_query_string behave similarly to query_string Fixes #5008	2014-02-14 11:51:23 -07:00
Clinton Gormley	93930d6dc7	Removed 0.90.* deprecation and addition notifications Closes #5052	2014-02-07 20:52:49 +01:00
Lee Hinman	d2078a5e28	Add fuzzy/slop support to `simple_query_string` Ports the change from https://issues.apache.org/jira/browse/LUCENE-5410	2014-02-06 10:05:10 -07:00
Simon Willnauer	162ca99376	Added `cross_fields` mode to multi_match query `cross_fields` attemps to treat fields with the same analysis configuration as a single field and uses maximum score promotion or combination of the scores based depending on the `use_dis_max` setting. By default scores are combined. `cross_fields` can also search across fields of hetrogenous types for instance if numbers can be part of the query it makes sense to search also on numeric fields if an analyzer is provided in the reqeust. Relates to #2959	2014-02-06 17:15:55 +01:00
Martijn van Groningen	7e1eed9814	The forceful no cache behaviour for range filter with now date match expression should only be active if no rounding has been specified for `now` in the date range range expression (for example: `now/d`). Also the automatic now detection in range filters is overrideable by the `_cache` option. Closes #4947 Relates to #4846	2014-01-30 15:51:33 +01:00
Brusic	d9b71a8083	[DOCS] various docs fixes Removed unused misc.asciidoc file Added plugins directory to directory layout Fixed transport.tcp.connect_timeout value to match the code found in NetworkService.TcpSettings Clarified that phrase query does not preserve order of terms Clarified merge page Added instructions on how to build documentation to docs/README	2014-01-23 10:52:13 +01:00
Simon Willnauer	e6ace1313e	[DOCS]: fixed added / coming tags in docs	2014-01-22 12:02:37 +01:00
Simon Willnauer	f0bce08c30	Return `MatchNoDocsQuery` if query string is emtpy Closes #3952	2014-01-20 16:08:57 +01:00
Martijn van Groningen	db394117c4	Made sure that any filter that wraps a p/c filter (has_child & has_parent) either directly or indirectly will never be cached by making CustomQueryWrappingFilter extend from NoCacheFilter. Closes #4757	2014-01-20 10:54:09 +01:00
markharwood	2795f4e55d	Standardized use of “_length” for parameter names rather than “_len”. Java Builder apis drop old “len” methods in favour of new “length” Rest APIs support both old “len: and new “length” forms using new ParseField class to a) provide compiler-checked consistency between Builder and Parser classes and b) a common means of handling deprecated syntax in the DSL. Documentation and rest specs only document the new “*length” forms Closes #4083	2014-01-13 15:59:15 +00:00
LightGuard	e89d5d0d86	Fixing up code block delimeters for asciidoctor You can now successfully run the docs through asciidoctor	2014-01-13 15:26:53 +01:00
Florian Schilling	464037e0c1	Geo clean Up ============ The default unit for measuring distances is MILES in most cases. This commit moves ES over to the International System of Units and make it work on a default which relates to METERS . Also the current structures of the `GeoBoundingBox Filter` changed in order to define the Bounding by setting abitrary corners. Distances --------- Since the default unit for measuring distances has changed to a default unit `DistanceUnit.DEFAULT` relating to meters, the REST API has changed at the following places: * `ScriptDocValues.factorDistance()` returns meters instead of miles * `ScriptDocValues.factorDistanceWithDefault()` returns meters instead of miles * `ScriptDocValues.arcDistance()` returns meters instead of miles one might use `ScriptDocValues.arcDistanceInMiles()` * `ScriptDocValues.arcDistanceWithDefault()` returns meters instead of miles * `ScriptDocValues.distance()` returns meters instead of miles one might use `ScriptDocValues.distanceInMiles()` * `ScriptDocValues.distanceWithDefault()` returns meters instead of miles one might use `ScriptDocValues.distanceInMilesWithDefault()` * `GeoDistanceFilter` default unit changes from kilometers to meters * `GeoDistanceRangeFilter` default unit changes from miles to meters * `GeoDistanceFacet` default unit changes from miles to meters Geo Bounding Box Filter ----------------------- The naming of the GeoBoundingBoxFilter properties allows to set arbitrary corners (see #4084) namely `top_right`, `top_left`, `bottom_right` and `bottom_left`. This change also includes the fields `topRight` and `bottomLeft` Also it is be possible to set the single values by using just `top`, `bottom`, `left` and `right` parameters. Closes #4515, #4084	2014-01-11 21:30:29 +09:00
Simon Willnauer	bc5a9ca342	Rename edit_distance/min_similarity to fuzziness A lot of different API's currently use different names for the same logical parameter. Since lucene moved away from the notion of a `similarity` and now uses an `fuzziness` we should generalize this and encapsulate the generation, parsing and creation of these settings across all queries. This commit adds a new `Fuzziness` class that handles the renaming and generalization in a backwards compatible manner. This commit also added a ParseField class to better support deprecated Query DSL parameters The ParseField class allows specifying parameger that have been deprecated. Those parameters can be more easily tracked and removed in future version. This also allows to run queries in `strict` mode per index to throw exceptions if a query is executed with deprected keys. Closes #4082	2014-01-09 15:14:51 +01:00
Lee Hinman	5463f7953f	Expose `simple_query_string` flags in `flags` parameter	2014-01-03 16:14:19 -07:00
Florian Schilling	bc452dff84	* setup accurate GeoDistance Function * adapt tests * introduced default GeoDistance function * Updated docs closes #4498	2013-12-27 19:15:19 +09:00
Martijn van Groningen	eb86a3a6fe	[DOCS] Changed `shape_field_name` to `path` in geo_shape filter documentation. Relates to #4486	2013-12-23 11:27:06 +01:00
Grégory Quatannens	c64abaae7e	Fixing typo and grammar	2013-12-17 11:39:02 +01:00
Clinton Gormley	34b9b16233	[DOCS] Fixed some bad link refs	2013-12-16 18:07:33 +01:00
Lee Hinman	db431b7cb3	Remove the `field` and `text` queries. The `text` query was replaced by the `match` query and has been deprecated for quite a while. The `field` query should be replaced by a `query_string` query with the `default_field` specified. Fixes #4033	2013-12-16 08:59:36 -07:00
Lee Hinman	77fcf71338	Add new `simple_query_string` query type This adds support for Lucene's SimpleQueryParser by adding a new type of query called the `simple_query_string`. The `simple_query_string` query is designed to be able to parse human-entered queries without throwing any exceptions. Resolves #4159	2013-12-12 12:09:32 -07:00
Martijn van Groningen	a03556daa0	Added execution option to `range` filter, with the `index` and `fielddata` as values. Deprecated `numeric_range` filter in favor for the `range` filter with `fielddata` as execution. Closes #4034	2013-11-25 23:43:40 +01:00
Matt Weber	a841a422f6	Add a field data based TermsFilter Add FieldDataTermsFilter that compares terms out of the fielddata cache. When filtering on a large set of terms this filter can be considerably faster than using a standard lucene terms filter. Add the "fielddata" execution mode to the terms filter parser to enable the use of the new FieldDataTermsFilter. Add supporting tests and documentation. Closes #4209	2013-11-19 19:18:16 +01:00
Luca Cavanna	0aaa39d00a	Minor improvements to indices filter and query & updated docs Slightly simplified indices filter and query parsers code Trimmed down tests where possible	2013-11-14 17:25:34 +01:00
Olivier Favre	fa80ca97b2	Indices query/filter skip parsing altogether for irrelevant indices when possible Closes #2416	2013-11-14 17:24:49 +01:00
Clinton Gormley	7189310764	In ctor of GeoPointFieldMapper, geohash_prefix now implicitly enables geohash option Also improved docs for geopoint type and geohash_cell filte Closes #3951	2013-11-08 13:52:17 +01:00
Clinton Gormley	356de95840	Added simplified range syntax to query string docs	2013-11-04 18:18:36 +01:00
Alexander Reelsen	dfcb3ca2d4	RegexpQueryBuilder now implements MultiTermQueryBuilder This allows the RegexpQueryBuilder to be used in span queries Added tests for all span multi term queries. Also updated the documentation and removed mentioning of numeric range queries for span queries (they have to be terms). Closes #3392	2013-10-31 09:12:57 +01:00
Martijn van Groningen	b7c4adeea3	[Docs] update reference to remove documentation about percolating during an index, bulk or update request.	2013-10-16 16:31:36 +02:00
Boaz Leskes	57b2d45142	[Docs] added document for the lenient option in match queries	2013-10-16 10:53:25 +02:00
Britta Weber	34441f3897	fix naming in function_score - "boost" should be "boost_factor" - "mult" should be "multiply" Also, store combine function names in ImmutableMap instead of iterating over all possible names each time. closes #3872 for master	2013-10-14 14:56:59 +02:00
Simon Willnauer	25d6f04f13	[DOCS] Note that cutoff_frequency doesn't handle stacked tokens gracefully	2013-10-14 14:09:38 +02:00
Clinton Gormley	264a00a40f	[DOCS] Added pages explaining lucene query parser syntax and regular expression syntax	2013-10-07 14:42:49 +02:00
Luca Cavanna	b0fee6c01b	Changed nested filter example to use an inner bool filter instead of a bool query, to demonstrate the usage of a filter rather than a query.	2013-10-04 14:08:37 +02:00
Lee Hinman	0442b737be	Add more anchor links to documentation Related to #3679	2013-09-30 13:13:16 -06:00
Martijn van Groningen	d365a4ccba	Added nested filter join option to the docs. Closes #3738	2013-09-20 21:22:56 +02:00
Adrien Grand	90524d7ad2	Fix formatting of the documentation. Remaining '@'s have been replaced with '`'s.	2013-09-18 12:35:44 +02:00
Britta Weber	b7c3b50909	add date field to decay function doc	2013-09-17 19:54:31 +02:00
David Pilato	1e3ffa0df7	Add distance supported units	2013-09-17 14:21:45 +02:00
Clinton Gormley	c2eb4a1c40	[DOCS] Tidied up function score	2013-09-16 15:57:08 +02:00
Clinton Gormley	d6ecdecc19	[DOCS] Deprecated the from/to/include_lower/include_upper params in the range query, range filter and numeric range filter. Better to use gt/gte/lt/lte as they are explicit.	2013-09-12 15:07:36 +02:00
Clinton Gormley	9e6d30a14a	[DOCS] Changed the deprecation of custom_boost/score/filters_score queries to 0.90.4	2013-09-05 12:14:10 +02:00
Clinton Gormley	2b3a762c27	[DOCS] Function score was added in 0.90.4 not 1.00.Beta	2013-09-05 11:25:06 +02:00
Clinton Gormley	765bd026f5	[DOCS] Added function score query	2013-09-04 23:20:55 +02:00
Clinton Gormley	9f5d0b6e89	[DOCS] Added a few clarifications to the docs from the issues list	2013-09-04 23:20:55 +02:00
Clinton Gormley	393c28bee4	[DOCS] Removed outdated new/deprecated version notices	2013-09-03 21:28:31 +02:00
Clinton Gormley	822043347e	Migrated documentation into the main repo	2013-08-29 01:24:34 +02:00

... 3 4 5 6 7 ...

449 Commits