OpenSearch

Commit Graph

Author	SHA1	Message	Date
Konrad Feldmeier	657b954528	Resolve wording inconsistency AND and OR filter docs talk about different targets for the operators. I believe that both should be described in terms of modifying other 'filters'. I also added articles for easier (human) parsing. This fixes #4762 Closes #7165	2014-08-05 17:16:53 +02:00
Jun Ohtani	5be8aecd10	add Proxy setting for using plugin command Closes #7150	2014-08-05 12:44:02 +02:00
David Pilato	873a45eaba	Search: add time zone setting for relative date math in range filter/query Filters and Queries now supports `time_zone` parameter which defines which time zone should be applied to the query or filter to convert it to UTC time based value. When applied on `date` fields the `range` filter and queries accept also a `time_zone` parameter. The `time_zone` parameter will be applied to your input lower and upper bounds and will move them to UTC time based date: [source,js] -------------------------------------------------- { "constant_score": { "filter": { "range" : { "born" : { "gte": "2012-01-01", "lte": "now", "time_zone": "+1:00" } } } } } { "range" : { "born" : { "gte": "2012-01-01", "lte": "now", "time_zone": "+1:00" } } } -------------------------------------------------- In the above examples, `gte` will be actually moved to `2011-12-31T23:00:00` UTC date. NOTE: if you give a date with a timezone explicitly defined and use the `time_zone` parameter, `time_zone` will be ignored. For example, setting `from` to `2012-01-01T00:00:00+01:00` with `"time_zone":"+10:00"` will still use `+01:00` time zone. Closes #3729.	2014-08-04 15:42:03 +02:00
Britta Weber	f84dc23b96	Docs: remove duplicate label	2014-08-04 08:43:44 +02:00
Britta Weber	5706858722	Add parameter to GET for checking if generated fields can be retrieved Fields of type `token_count`, `murmur3`, `_all` and `_field_names` are generated only when indexing. If a GET requests accesses the transaction log (because no refresh between indexing and GET request) then these fields cannot be retrieved at all. Before the behavior was so: `_all, _field_names`: The field was siletly ignored `murmur3, token_count`: `NumberFormatException` because GET tried to parse the values from the source. In addition, if these fields were not stored, the same behavior occured if the fields were retrieved with GET after a `refresh()` because here also the source was used to get the fields. Now, GET accepts a parameter `ignore_errors_on_generated_fields` which has the following effect: - Throw exception with meaningful error message explaining the problem if set to false (default) - Ignore the field if set to true - Always ignore the field if it was not set to stored This changes the behavior for `_all` and `_field_names` as now an Exception is thrown if a user tries to GET them before a `refresh()`. closes #6676 closes #6973	2014-08-04 08:15:34 +02:00
Britta Weber	a3cefd919e	significant terms: add google normalized distance, add chi square closes #6858	2014-08-04 08:15:26 +02:00
Shay Banon	95762e8126	Support "default" for tcpNoDelay and tcpKeepAlive Allow to set the value default to network.tcp.no_delay and network.tcp.keep_alive so they won't be set at all, since on solaris, setting tcpNoDelay can actually cause failure relates to #7115	2014-08-02 17:32:41 +02:00
uboness	3c9c9f33e2	Aggregations Added Filters aggregation A multi-bucket aggregation where multiple filters can be defined (each filter defines a bucket). The buckets will collect all the documents that match their associated filter. This aggregation can be very useful when one wants to compare analytics between different criterias. It can also be accomplished using multiple definitions of the single filter aggregation, but here, the user will only need to define the sub-aggregations only once. Closes #6118	2014-08-01 16:01:08 +01:00
Adrien Grand	d9d5b35be9	Sort: Make `ignore_unmapped` work for cross-index queries. Close #2255	2014-08-01 15:30:17 +02:00
Stefan Antoni	8e862f15c1	[DOCS] fixed small typo in percolate.asciidoc	2014-08-01 12:38:35 +02:00
Britta Weber	d6a18ab2ba	Docs: add 1.4.0 label to many to many geo distance sort	2014-08-01 12:30:08 +02:00
Kurt Hurtado	66560acebb	Update fielddata-fields.asciidoc	2014-08-01 09:20:19 +02:00
Areek Zillur	1d581e6286	Search Exists API: Checks if any matching documents exist for a given query Implements a new Exists API allowing users to do fast exists check on any matched documents for a given query. This API should be faster then using the Count API as it will: - early terminate the search execution once any document is found to exist - return the response as soon as the first shard reports matched documents closes #6995	2014-07-31 15:42:30 -04:00
David Pilato	85eb0ea0e7	Generate timestamp when path is null Index process fails when having `_timestamp` enabled and `path` option is set. It fails with a `TimestampParsingException[failed to parse timestamp [null]]` message. Reproduction: ``` DELETE test PUT test { "mappings": { "test": { "_timestamp" : { "enabled" : "yes", "path" : "post_date" } } } } PUT test/test/1 { "foo": "bar" } ``` You can define a default value for when timestamp is not provided within the index request or in the `_source` document. By default, the default value is `now` which means the date the document was processed by the indexing chain. You can disable that default value by setting `default` to `null`. It means that `timestamp` is mandatory: ``` { "tweet" : { "_timestamp" : { "enabled" : true, "default" : null } } } ``` If you don't provide any timestamp value, indexation will fail. You can also set the default value to any date respecting timestamp format: ``` { "tweet" : { "_timestamp" : { "enabled" : true, "format" : "YYYY-MM-dd", "default" : "1970-01-01" } } } ``` If you don't provide any timestamp value, indexation will fail. Closes #4718. Closes #7036.	2014-07-31 19:48:22 +02:00
Britta Weber	fe86c8bc88	_geo_distance sort: allow many to many geo point distance Add computation of disyance to many geo points. Example request: ``` { "sort": [ { "_geo_distance": { "location": [ { "lat":1.2, "lon":3 }, { "lat":1.2, "lon":3 } ], "order": "desc", "unit": "km", "sort_mode": "max" } } ] } ``` closes #3926	2014-07-31 17:33:45 +02:00
Clinton Gormley	4b0a89d4fb	Update translog.asciidoc Documented `index.gateway.local.sync`	2014-07-31 14:06:24 +02:00
Clinton Gormley	36e1c7928c	Rewrote post-filter.asciidoc Closes #5166	2014-07-31 12:56:11 +02:00
Nik Everett	34426eb8c2	Docs: Fix syntax on lang-analyzer Some of the language analyzer documentation contained invalid json. Closes #7098	2014-07-30 20:17:27 +02:00
Alex Ksikes	e3b3b6c055	Term Vectors API: adds support for wildcards in selected fields This could useful to generate all term vectors or a chosen set of them. Closes #7061	2014-07-30 17:44:37 +02:00
gabriel-tessier	eaac8141cc	Docs: Fix typo in scripting.asciidoc Replace the mvel by groovy in the forgotten place. I add the previous change in this one. Sorry for the spam! Closes #7071	2014-07-29 12:30:09 +02:00
gabriel-tessier	c2c2190d27	Docs: Fix typo in scripting.asciidoc Closes #7070	2014-07-29 12:28:41 +02:00
Adrien Grand	1fe76b891b	Docs: Add links to the equivalent aggs in facets documentation.	2014-07-28 15:22:49 +02:00
Lee Hinman	6abe4c951d	Add HierarchyCircuitBreakerService Adds a breaker for request BigArrays, which are used for parent/child queries as well as some aggregations. Certain operations like Netty HTTP responses and transport responses increment the breaker, but will not trip. This also changes the output of the nodes' stats endpoint to show the parent breaker as well as the fielddata and request breakers. There are a number of new settings for breakers now: `indices.breaker.total.limit`: starting limit for all memory-use breaker, defaults to 70% `indices.breaker.fielddata.limit`: starting limit for fielddata breaker, defaults to 60% `indices.breaker.fielddata.overhead`: overhead for fielddata breaker estimations, defaults to 1.03 (the fielddata breaker settings also use the backwards-compatible setting `indices.fielddata.breaker.limit` and `indices.fielddata.breaker.overhead`) `indices.breaker.request.limit`: starting limit for request breaker, defaults to 40% `indices.breaker.request.overhead`: request breaker estimation overhead, defaults to 1.0 The breaker service infrastructure is now generic and opens the path to adding additional circuit breakers in the future. Fixes #6129 Conflicts: src/main/java/org/elasticsearch/index/fielddata/IndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/IndexFieldDataService.java src/main/java/org/elasticsearch/index/fielddata/RamAccountingTermsEnum.java src/main/java/org/elasticsearch/index/fielddata/ordinals/GlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/ordinals/InternalGlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/plain/AbstractIndexOrdinalsFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/DisabledIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/IndexIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/NonEstimatingEstimator.java src/main/java/org/elasticsearch/index/fielddata/plain/PackedArrayIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/ParentChildIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/SortedSetDVOrdinalsIndexFieldData.java src/main/java/org/elasticsearch/node/internal/InternalNode.java src/test/java/org/elasticsearch/index/aliases/IndexAliasesServiceTests.java src/test/java/org/elasticsearch/index/codec/CodecTests.java src/test/java/org/elasticsearch/index/fielddata/AbstractFieldDataTests.java src/test/java/org/elasticsearch/index/fielddata/IndexFieldDataServiceTests.java src/test/java/org/elasticsearch/index/mapper/MapperTestUtils.java src/test/java/org/elasticsearch/index/query/IndexQueryParserFilterCachingTests.java src/test/java/org/elasticsearch/index/query/SimpleIndexQueryParserTests.java src/test/java/org/elasticsearch/index/query/guice/IndexQueryParserModuleTests.java src/test/java/org/elasticsearch/index/search/FieldDataTermsFilterTests.java src/test/java/org/elasticsearch/index/search/child/ChildrenConstantScoreQueryTests.java src/test/java/org/elasticsearch/index/similarity/SimilarityTests.java	2014-07-28 11:27:33 +02:00
Clinton Gormley	be86556946	Update request-body.asciidoc Added link from `timeout` to time-units Closes #6361	2014-07-28 11:08:59 +02:00
mikemccand	96ecec34d1	Docs: fix documentation for bloom filter defaults	2014-07-27 18:39:29 -04:00
Clinton Gormley	c367ae09e3	Update nested-query.asciidoc Changed score_mode `total` to `sum` to be consistent with parent-child etc	2014-07-26 22:32:28 +02:00
Clinton Gormley	10b4177def	Docs: Fixed path to search-shards	2014-07-26 15:05:53 +02:00
Clinton Gormley	88c8754a3c	Docs: Removed search-shards from request-body	2014-07-26 14:52:50 +02:00
Clinton Gormley	93d9628975	Docs: Reorganised the search-shards API docs	2014-07-26 14:51:44 +02:00
Colin Goodheart-Smithe	655157c83a	Aggregations: Added an option to show the upper bound of the error for the terms aggregation. This is only applicable when the order is set to _count. The upper bound of the error in the doc count is calculated by summing the doc count of the last term on each shard which did not return the term. The implementation calculates the error by summing the doc count for the last term on each shard for which the term IS returned and then subtracts this value from the sum of the doc counts for the last term from ALL shards. Closes #6696	2014-07-25 14:24:24 +01:00
Justin Honold	593fffc7a1	Docs: Changing ES_MAX_MEM default from '1gb' to '1g' If you set ES_HEAP_SIZE to '1gb' as suggested, Java will yield an "Invalid initial heap size". Closes #6824	2014-07-25 12:50:59 +02:00
rendel	50634e6a3d	Docs: Added new entry for the SIREn plugin. Closes #6961	2014-07-25 12:49:50 +02:00
Alexander Reelsen	a1e335b1e9	CORS: Support regular expressions for origin to match against This commit adds regular expression support for the allow-origin header depending on the value of the request `Origin` header. The existing HttpRequestBuilder is also extended to support the OPTIONS HTTP method. Relates #5601 Closes #6891	2014-07-25 10:51:22 +02:00
Lee Hinman	1fb9f404df	[DOCS] correct documentation about groovy/mvel defaults and deprecations	2014-07-25 10:39:33 +02:00
Simon Willnauer	bd51d7a07f	Add `wait_if_ongoing` option to _flush requests This commit adds the ability to force blocking on the flush operaition to make sure all files have been written and synced to disk. Without this option a flush might be executing at the same time causing the current flush to fail and return before all files being synced. Closes #6996	2014-07-24 15:34:53 +02:00
Areek Zillur	5487c56c70	Search & Count: Add option to early terminate doc collection Allow users to control document collection termination, if a specified terminate_after number is set. Upon setting the newly added parameter, the response will include a boolean terminated_early flag, indicating if the document collection for any shard terminated early. closes #6876	2014-07-23 15:10:15 -04:00
Lee Hinman	a1a03a184c	[DOCS] Fix nested root object indexing documentation Types can no longer be specified when indexing, see: https://github.com/elasticsearch/elasticsearch/pull/4552	2014-07-23 18:34:27 +02:00
Britta Weber	10201d511c	[doc] Correct decay function equations in function_score description Impact of decay and scale was missing from the equations. Closes #6983	2014-07-23 17:33:22 +02:00
Clinton Gormley	0f943850a0	Update named-queries-and-filters.asciidoc	2014-07-23 17:28:49 +02:00
Simon Willnauer	5bfea56457	[DOCS] move all coming tags to added in master	2014-07-23 16:37:19 +02:00
babeya	81a83aab22	Docs: Update query-string-syntax.asciidoc Closes #6253	2014-07-23 16:32:32 +02:00
Konrad Feldmeier	48812ff1f2	Reflect that 'field_value_factor' is only in 1.2.x While the blogpost http://www.elasticsearch.org/blog/2014-04-02-this-week-in-elasticsearch/ states, that feature #5519 was added to 1.x, the release notes for, e.g. v1.1.2, however tell otherwise. Only the release notes for 1.2.0 list #5519 as a new feature. Since the 1.x docs deprecate/discourage from using `_boost`, and seemingly give a migration example at http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-boost-field.html#function-score-instead-of-boost users of 1.1.x should be warned.	2014-07-23 15:49:03 +02:00
Peter Johnson @insertcoffee	9a4abc2620	Docs: typo example fails in bash Closes #6977	2014-07-23 12:43:43 +02:00
mikemccand	cc4d7c6272	Core: don't load bloom filters by default This change just changes the default for index.codec.bloom.load to false: with recent performance improvements to ID lookup, such as #6298, bloom filters don't give much of a performance gain anymore, and they can consume non-trivial RAM when there are many tiny documents. For now, we still index the bloom filters, so if a given app wants them back, it can just update the index.codec.bloom.load to true. Closes #6959	2014-07-23 05:58:41 -04:00
Clinton Gormley	3f9aea883f	Docs: Made current version, branch and jdk into asciidoc attributes	2014-07-23 11:55:35 +02:00
Clinton Gormley	17df714229	Docs: Change public signing key instructions to work with sudo Closes #6823	2014-07-23 11:13:03 +02:00
Clinton Gormley	254aa71693	Docs: Added Tiki Wiki integration Closes #6746	2014-07-23 11:00:46 +02:00
Areek Zillur	f39d4e1f89	PhraseSuggester: Collate option should allow returning phrases with no matching docs A new option `prune` has been added to allow users to control phrase suggestion pruning when `collate` is set. If the new option is set, the phrase suggestion option will contain a boolean `collate_match` indicating whether the respective result had hits in collation. CLoses #6927	2014-07-22 17:17:15 -04:00
Brian Murphy	b98f19a54b	[DOCS] Fix typo	2014-07-22 14:51:31 +01:00
Brian Murphy	3c5de7d4a1	[DOCS] Fix indentation	2014-07-22 14:49:45 +01:00
Brian Murphy	e3b1aed0fc	[DOCS] Update examples to groovy.	2014-07-22 14:45:46 +01:00
Nik Everett	79433d23e3	Update: Detect noop updates sent with doc_as_upsert This should help prevent spurious updates that just cause extra writing and cache invalidation for no real reason. Close #6822	2014-07-22 14:55:34 +02:00
Clinton Gormley	8aefaef68a	Update scripting.asciidoc Added an ID for native java scripts	2014-07-22 11:36:40 +02:00
Peter Johnson @insertcoffee	77a2c979ab	typo causes the example to fail in bash	2014-07-21 19:09:22 +02:00
Clinton Gormley	a862732434	Docs: Typo	2014-07-21 18:51:49 +02:00
Adrien Grand	abeefbddea	Docs: Update documentation about execution hints for the terms aggregation.	2014-07-21 11:55:57 +02:00
Clinton Gormley	6a7a77eada	Docs: Add links to client helper classes for bulk/scroll/reindexing	2014-07-18 13:55:47 +02:00
Alex Ksikes	f22f3db30f	Term Vectors API: Computes term vectors on the fly if not stored in the index. Adds the ability to the Term Vector API to generate term vectors for some chosen fields, even though they haven't been explicitely stored in the index. Relates to #5184 Closes #6567	2014-07-17 23:29:05 +02:00
Peter Kim	6a25d9b7b5	[DOCS] Fixed typos	2014-07-17 15:25:34 -04:00
Simon Willnauer	f9a9348508	[DOCS] Move benchmark API to 1.4	2014-07-16 15:02:20 +02:00
Brian Murphy	d6cd2c2b73	[DOCS][FIX] Fix reference check in indexed scripts/templates doc.	2014-07-16 11:24:18 +01:00
Brian Murphy	bc570919ee	[DOCS][FIX] Fix doc parsing, broken closing block	2014-07-16 11:18:21 +01:00
Brian Murphy	cbd2a97abd	[DOCS] : Indexed scripts/templates These are the docs for the indexed scripts/templates feature. Also moved the namespace for the REST endpoints. Closes #6851	2014-07-16 10:49:02 +01:00
Nik Everett	da5fb34163	Mappings: Add transform to document before index. Closes #6566	2014-07-15 18:40:46 +02:00
mikemccand	63cab559e3	Docs: explain that SerialMergeScheduler just maps to CMS for back compat Closes #6878	2014-07-15 11:38:43 -04:00
Ryan Ernst	64ab22816c	Scripting: Add script engine for lucene expressions. These are javascript expressions, which can only access numeric fielddata, parameters, and _score. They can only be used for searches (not document updates). closes #6818	2014-07-15 07:49:01 -07:00
Areek Zillur	d0d1b98d23	Stats: Expose IndexWriter and VersionMap RAM usage to ShardStats and _cat endpoint This commit adds the RAM usage of IndexWriter and VersionMap Closes #6483	2014-07-14 19:46:12 -04:00
Areek Zillur	76343899ea	Phrase Suggester: Add collate option to PhraseSuggester The newly added collate option will let the user provide a template query/filter which will be executed for every phrase suggestions generated to ensure that the suggestion matches at least one document for the filter/query. The user can also add routing preference `preference` to route the collate query/filter and additional `params` to inject into the collate template. Closes #3482	2014-07-14 16:07:52 -04:00
Malte Schirnacher	647a2a64a1	Docs: Update query-string-syntax.asciidoc Closes #6853	2014-07-14 16:35:17 +02:00
Clinton Gormley	6e70edb0a4	Analysis: Improve Hunspell error messages The Hunspell service would throw a confusing error message if more than one affix file was present. This commit distinguishes between the two error cases: where there are no affix files and when there are too many affix files. Also implements lazy dictionary loading, which was used in the tests but not implemented. Closes #6850	2014-07-14 12:13:32 +02:00
Britta Weber	74927adced	significant terms: infrastructure for changing easily the significance heuristic This commit adds the infrastructure to allow pluging in different measures for computing the significance of a term. Significance measures can be provided externally by overriding - SignificanceHeuristic - SignificanceHeuristicBuilder - SignificanceHeuristicParser closes #6561	2014-07-14 11:00:50 +02:00
Igor Motov	60b317caa4	Snapshot/Restore: Add ability to restore indices without their aliases Closes #6457	2014-07-13 17:52:41 +09:00
Florian Hopf	3689f67a76	Docs: Fixed invalid word count in geodistance agg doc Closes #6838	2014-07-11 18:35:36 +02:00
mikemccand	6c78147f5f	Docs: remove orphan comma	2014-07-11 08:26:08 -04:00
mikemccand	b4e80999a7	Docs: fix merge docs to match the code (the max_thread_count default is 'aggressive' (favor SSDs))	2014-07-11 07:00:57 -04:00
Boaz Leskes	f480969503	[Gateway] set a default of 5m to `recover_after_time` when any to the `expectedNodes` is set The `recovery_after_time` tells the gateway to wait before starting recovery from disk. The goal here is to allow for more nodes to join the cluster and thus not start potentially unneeded replications. The `expectedNodes` setting (and friends) tells the gateway when it can start recovering even if the `recover_after_time` has not yet elapsed. However, `expectedNodes` is useless if one doesn't set `recovery_after_time`. This commit changes that by setting a sensible default of 5m for `recover_after_time` if* a `expectedNodes` setting is present. Closes #6742	2014-07-11 11:28:45 +02:00
Iulia Pasov	eed3513c37	Docs: Update plugins.asciidoc to fix typo Changed the name of the European Environment Agency (from European Environmental Agency) Closes #6807	2014-07-10 14:04:26 +02:00
Simon Willnauer	154bd0309c	[DOCS] Fix typo in reference	2014-07-10 08:47:18 +02:00
Simon Willnauer	d82a434d10	[STORE] Make a hybrid directory default using `mmapfs` and `niofs` `mmapfs` is really good for random access but can have sideeffects if memory maps are large depending on the operating system etc. A hybrid solution where only selected files are actually memory mapped but others mostly consumed sequentially brings the best of both worlds and minimizes the memory map impact. This commit mmaps only the `dvd` and `tim` file for fast random access on docvalues and term dictionaries. Closes #6636	2014-07-10 00:01:43 +02:00
Shay Banon	8910e09beb	Disable JSONP by default By default, disable the option to use JSONP in our REST layer closes #6795	2014-07-09 21:17:17 +02:00
Iulia Pasov	a79d0744d3	Docs: Update plugins.asciidoc Closes #6683	2014-07-09 16:15:59 +02:00
Clinton Gormley	b6baa4be4a	Update preference.asciidoc Clarify that `preference` is a query string parameter only and provide an example.	2014-07-09 11:13:17 +02:00
Clinton Gormley	6c30ad1ce6	Docs: Improved the docs for nested mapping Closes #1643	2014-07-08 15:54:11 +02:00
Clinton Gormley	feb81e228b	Docs: Rewrote the scroll/scan docs Closes #6774	2014-07-08 11:54:53 +02:00
Andrii Gakhov	80321d89d9	Docs: Update histogram-aggregation.asciidoc filter in a filtered query should be under "filter" key Closes #6738	2014-07-07 10:44:11 +02:00
Carsten Brandt	bd4699da7e	Docs: fixed a typo in the docs Closes: #6718	2014-07-07 10:41:36 +02:00
Clinton Gormley	e4baa56f4b	Docs: Language analyzers Clarified the use of stem_exclusion and the keyword_marker token filter Closes #6613	2014-07-07 10:06:18 +02:00
Clinton Gormley	54790eea10	Update lang-analyzer.asciidoc Clarified the use of the `stem_exclusion` token filter. Closes #6613	2014-07-04 17:50:43 +02:00
Shinsuke Sugaya	4bddb4e346	Update plugins.asciidoc	2014-07-05 00:44:02 +09:00
Shikhar Bhushan	1e894111b0	Docs: Link to eskka discovery plugin from doc Closes #6721	2014-07-04 17:06:51 +02:00
Clinton Gormley	d3f8c66e26	Updated cache.asciidoc The index level filter cache was removed a long time ago Closes #6455	2014-07-04 14:26:20 +02:00
David Pilato	162c62dbcc	[DOCS] Add information regarding _type parameter requirement for _mget Change ID to `[[mget-type]]` Closes #6670.	2014-07-03 15:38:06 +02:00
David Pilato	de48d7f94c	[DOCS] Add information regarding _type parameter requirement for _mget Closes #6670.	2014-07-03 15:23:35 +02:00
Jun Ohtani	0c6a859357	Docs: fixed ICU plugin documentation add ICU Normalization CharFilter to docs Closes #6711	2014-07-03 15:21:51 +02:00
Mikhail Korobov	955473f475	Docs: unescape regexes in Pattern Tokenizer docs Currently regexes in Pattern Tokenizer docs are escaped (it seems according to Java rules). I think it is better not to escape them because JSON escaping should be automatic in client libraries, and string escaping depends on a client language used. The default pattern is `\W+`, not `\\W+`. Closes #6615	2014-07-03 13:34:13 +02:00
hanneskaeufler	6e6f4def5d	Docs: Fix typo in timestamp-field.asciidoc Closes #6661	2014-07-03 13:27:37 +02:00
Robert Muir	2935b751e9	Fix doc formatting. Norwegian stemmers and Scandinavian normalizers were missing commas between entries.	2014-07-03 07:08:33 -04:00
Robert Muir	b9a09c2b06	Analysis: Add additional Analyzers, Tokenizers, and TokenFilters from Lucene Add `irish` analyzer Add `sorani` analyzer (Kurdish) Add `classic` tokenizer: specific to english text and tries to recognize hostnames, companies, acronyms, etc. Add `thai` tokenizer: segments thai text into words. Add `classic` tokenfilter: cleans up acronyms and possessives from classic tokenizer Add `apostrophe` tokenfilter: removes text after apostrophe and the apostrophe itself Add `german_normalization` tokenfilter: umlaut/sharp S normalization Add `hindi_normalization` tokenfilter: accounts for hindi spelling differences Add `indic_normalization` tokenfilter: accounts for different unicode representations in Indian languages Add `sorani_normalization` tokenfilter: normalizes kurdish text Add `scandinavian_normalization` tokenfilter: normalizes Norwegian, Danish, Swedish text Add `scandinavian_folding` tokenfilter: much more aggressive form of `scandinavian_normalization` Add additional languages to stemmer tokenfilter: `galician`, `minimal_galician`, `irish`, `sorani`, `light_nynorsk`, `minimal_nynorsk` Add support access to default Thai stopword set "_thai_" Fix some bugs and broken links in documentation. Closes #5935	2014-07-03 05:47:49 -04:00
Matthew L Daniel	53f2301eea	Docs: Add clarifying text about regexp and terms For the casual reader, the reference to "term queries" may be glossed over, yielding an unexpected result when using `regexp` queries. This attempts to make that distinction more prominent. Closes #6698	2014-07-03 11:39:57 +02:00
jnguyenx	1883f74cc0	Docs: Fixed missing comma in multi match query example	2014-07-03 08:17:09 +02:00

1 2 3 4 5 ...

761 Commits