OpenSearch

Commit Graph

Author	SHA1	Message	Date
xuzha	cd527c5b92	Add support for customizing the rule file in ICU tokenizer Lucene allows to create a ICUTokenizer with a special config argument enabling the customization of the rule based iterator by providing custom rules files. This commit enable this feature. Users could provide a list of RBBI rule files to ICU tokenizer. closes #13146	2016-04-22 12:39:20 -07:00
chenxiang	a0aea5baf7	Update terms-query.asciidoc user id of tweet hould exist in the `followers`, otherwise the search result is empty	2016-04-22 10:56:13 -06:00
ericamick	069eb72604	Update bucket.asciidoc	2016-04-22 10:54:25 -06:00
ericamick	f081bf4e26	Update bulk.asciidoc	2016-04-22 10:51:33 -06:00
ericamick	3004c45f7b	Update update.asciidoc	2016-04-22 10:50:42 -06:00
ericamick	276b89242c	Update get.asciidoc	2016-04-22 10:48:58 -06:00
Nik Everett	61f0b665b8	Fix fallback setting for two get/2	2016-04-22 11:10:01 -04:00
Christoph Büscher	a1c9025eaa	Update completion-suggest.asciidoc Removed trailing comma.	2016-04-22 14:00:37 +02:00
Martijn van Groningen	c5ad2e2865	Changed indexed scripts to be stored in the cluster state instead of the `.scripts` index. Also added max script size soft limit for stored scripts. Closes #16651	2016-04-22 13:42:55 +02:00
Clinton Gormley	e4df68b627	Added cautionary note to match_phrase_prefix explaining its shortcomings Closes #17655	2016-04-22 12:45:12 +02:00
Christoph Büscher	0ec4ffcb3a	Remove QueryFilterBuilder section from migration docs. This query builder was deprecated in 2.0 and has been removed.	2016-04-21 18:11:01 +02:00
Martijn van Groningen	dd2184ab25	ingest: Streamline option naming for several processors: * `rename` processor, renamed `to` to `target_field` * `date` processor, renamed `match_field` to `field` and renamed `match_formats` to `formats` * `geoip` processor, renamed `source_field` to `field` and renamed `fields` to `properties` * `attachment` processor, renamed `source_field` to `field` and renamed `fields` to `properties` Closes #17835	2016-04-21 13:40:43 +02:00
Jun Ohtani	9eb242a5fe	Analyze API : Rename filters/token_filters/char_filter to filter/token_filter/char_filter Closes #15189	2016-04-21 18:05:11 +09:00
Zachary Tong	80288ad60c	Add `fingerprint` token filter and `fingerprint` analyzer Adds a `fingerprint` token filter which uses Lucene's FingerprintFilter, and a `fingerprint` analyzer that combines the Fingerprint filter with lowercasing, stop word removal and asciifolding. Closes #13325	2016-04-20 16:10:56 -04:00
Martijn van Groningen	81449fc912	percolator: renamed `percolator` query to `percolate` query	2016-04-20 15:23:54 +02:00
Clinton Gormley	ca8ea36b30	Updated decay-function image in function_score query Closes #17479	2016-04-20 13:37:52 +02:00
Clinton Gormley	b89e6cd5d8	Added link to breaking changes to release notes	2016-04-19 20:05:18 +02:00
Lee Hinman	b8899cdb78	Merge remote-tracking branch 'dakrone/allow-bad-json'	2016-04-19 10:02:53 -06:00
Martijn van Groningen	ba08313417	settings: Removed `action.get.realtime` setting Closes #12543	2016-04-19 17:14:23 +02:00
Lee Hinman	a1e8fb794c	Allow JSON with unquoted field names by enabling system property In Elasticsearch 5.0.0, by default unquoted field names in JSON will be rejected. This can cause issues, however, for documents that were already indexed with unquoted field names. To alleviate this, a system property has been added that can be enabled so migration can occur. This system property will be removed in Elasticsearch 6.0.0 Resolves #17674	2016-04-19 09:14:13 -06:00
Clinton Gormley	102a398d9f	Fixed split processor example	2016-04-19 14:11:45 +02:00
Clinton Gormley	68f96868a6	Percolator docs missing a callout	2016-04-19 14:11:23 +02:00
Russ Cam	e53131dd79	Update has-parent-query.asciidoc (#17841 ) Change reference to `score_mode` to `score`	2016-04-19 11:56:05 +02:00
Clinton Gormley	c024504842	Update search.asciidoc Corrected breaking changes for `has_parent`. Relates to https://github.com/elastic/elasticsearch/pull/17841	2016-04-19 11:54:48 +02:00
Martijn van Groningen	8e63ce00f0	docs: removed confusing statement.	2016-04-19 11:49:51 +02:00
Martijn van Groningen	40c22fc654	percolator: removed .percolator type instead a field of type `percolator` should be configured before indexing percolator queries * Added an extra `field` parameter to the `percolator` query to indicate what percolator field should be used. This must be an existing field in the mapping of type `percolator`. * The `.percolator` type is now forbidden. (just like any type that starts with a `.`) This only applies for new indices created on 5.0 and later. Indices created on previous versions the .percolator type is still allowed to exist. The new `percolator` field type isn't active in such indices and the `PercolatorQueryCache` knows how to load queries from these legacy indices. The `PercolatorQueryBuilder` will not enforce that the `field` parameter is of type `percolator`.	2016-04-19 11:20:31 +02:00
Clinton Gormley	a2ab13ddd1	Update ingest-node.asciidoc Documented `separator` in the `split processor Closes https://github.com/elastic/elasticsearch/issues/17831	2016-04-19 11:11:58 +02:00
Clinton Gormley	40b84d2ef6	Update mapping.asciidoc Correct `fielddata.frequency.regex` to `fielddata.filter.regex` in breaking changes	2016-04-18 21:00:27 +02:00
Danilo Vaz	2e2d8c1442	Updated copyright years to include 2016 (#17808 )	2016-04-18 12:39:23 +02:00
Sergii Golubev	5ce3eb96b0	tophits-aggregation.asciidoc: fix a typo	2016-04-18 09:23:39 +02:00
LeonardGC	0b8be7f894	Update field-mapping.asciidoc (#17670 )	2016-04-15 09:22:38 +02:00
bloublou	83944c5628	Typo correction heap_size.asciidoc (#17745 ) * Typo correction Xms Xmx Typo correction on "-Xms4000mb -Xmx4000mb" * Change mb to m for Xms/Xmx	2016-04-14 20:37:37 +02:00
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. - Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Colin Goodheart-Smithe	c595322d90	Adds ignore_unmapped option to geo queries The change adds a new option to the geo_* queries: ignore_unmapped. If this option is set to false, the toQuery method on the QueryBuilder will throw an exception if the field specified in the query is unmapped. If the option is set to true, the toQuery method on the QueryBuilder will return a MatchNoDocsQuery. The default value is false so the queries work how they do today (throwing an exception on unmapped field)	2016-04-14 15:29:07 +01:00
Colin Goodheart-Smithe	686aff1545	Adds ignore_unmapped option to nested and P/C queries The change adds a new option to the `nested`, `has_parent`, `has_children` and `parent_id` queries: `ignore_unmapped`. If this option is set to false, the `toQuery` method on the QueryBuilder will throw an exception if the type/path specified in the query is unmapped. If the option is set to true, the `toQuery` method on the QueryBuilder will return a MatchNoDocsQuery. The default value is `false`so the queries work how they do today (throwing an exception on unmapped paths/types)	2016-04-14 10:34:30 +01:00
Clinton Gormley	acec464eb8	Docs: Clarified the purpose of the parent_id query	2016-04-14 11:25:26 +02:00
Sergii Golubev	434a563fe0	terms-aggregation.asciidoc tiny edit	2016-04-13 16:51:47 -06:00
Martijn van Groningen	16fa3e546e	docs: remove mention of file based grok pattern	2016-04-13 22:51:12 +02:00
Clinton Gormley	447f099544	Improve glossary to not refer to types as "like a table" (#17704 ) Closes #17673	2016-04-13 14:29:47 +02:00
Nik Everett	0f9804b0e2	reindex: gracefully handle when _source is disabled Closes #17666	2016-04-13 08:19:58 -04:00
Sergii Golubev	39b914bd77	histogram-aggregation.asciidoc: tiny edit (#17706 )	2016-04-13 14:19:05 +02:00
Martijn van Groningen	ca5bd89581	docs: adjust grok processor docs to not mention pattern files as these no longer exist Closes #17692	2016-04-13 12:37:50 +02:00
Daniel Mitterdorfer	0c7795f53d	Merge remote-tracking branch 'danielmitterdorfer/bulk-size-limit' Closes #17133	2016-04-13 10:43:00 +02:00
Clinton Gormley	a62b9296c6	Docs: Fixed link to phonetic plugin	2016-04-13 10:17:46 +02:00
Clinton Gormley	bdf62b5615	More asciidoc errors	2016-04-13 10:14:09 +02:00
Clinton Gormley	1a15e55f94	More asciidoc errors	2016-04-13 10:02:09 +02:00
Clinton Gormley	b201605a81	Fix bad asciidoc	2016-04-13 09:57:00 +02:00
Daniel Mitterdorfer	52b2016447	Limit request size on transport level With this commit we limit the size of all in-flight requests on transport level. The size is guarded by a circuit breaker and is based on the content size of each request. By default we use 100% of available heap meaning that the parent circuit breaker will limit the maximum available size. This value can be changed by adjusting the setting network.breaker.inflight_requests.limit Relates #16011	2016-04-13 09:54:59 +02:00
Clinton Gormley	d26b7457cf	Docs: Added note about older versions of RPM not being supported, and mentioned CentOS 5	2016-04-13 09:43:38 +02:00
Clinton Gormley	29b75960df	Docs: Added note about RPMs not being supported on SLES 11	2016-04-13 09:34:01 +02:00

1 2 3 4 5 ...

2798 Commits