OpenSearch

Commit Graph

Author	SHA1	Message	Date
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. - Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Martijn van Groningen	2928fd6ef3	Cleanup query builder for inner hits construction. * Inner hits can now only be provided and prepared via setter in the nested, has_child and has_parent query. * Also made `score_mode` a required constructor parameter. * Moved has_child's min_child/max_children validation from doToQuery(...) to a setter.	2016-04-14 14:43:21 +02:00
Nik Everett	cca3154c43	Rename isSourceEmpty to hasSource And add a test case for {} to reindex.	2016-04-13 08:19:58 -04:00
Adrien Grand	013acf9179	Remove MappedFieldType.value. #17557 This commit removes `MappedFieldType.value` and simplifies `MappedFieldType.valueforSearch`. `valueforSearch` was used to post-process values that come for stored fields (eg. to convert a long back to a string representation of a date in the case of a date field) and also values that are extracted from the source but only in the case of GET calls: it would not be called when performing source filtering on search requests. `valueforSearch` is now only called for stored fields, since values that are extracted from the source should already be formatted as expected.	2016-04-12 09:12:56 +02:00
Adrien Grand	0eb1a816c8	Allow the query cache to be disabled. #16268 This replaces the internal `index.queries.cache.type` setting with a new `index.queries.cache.enabled` setting, which is documented. Closes #15802	2016-04-11 18:06:16 +02:00
Adrien Grand	42526ac28e	Remove Settings.settingsBuilder. We have both `Settings.settingsBuilder` and `Settings.builder` that do exactly the same thing, so we should keep only one. I kept `Settings.builder` since it has my preference but also it is the one that we use in examples of the Java API.	2016-04-08 18:10:02 +02:00
Adrien Grand	bef38a4d12	Fix test bug.	2016-04-07 09:49:27 +02:00
Adrien Grand	e1bfe23c22	ExtendedStatsAggregator should also pass sigma to emtpy aggs. #17388 Because sigma is also used at reduce time, it should be passed to empty aggs. Otherwise it causes bugs when an empty aggregation is used to perform reduction is it would assume a sigma of zero. Closes #17362	2016-04-07 09:34:11 +02:00
Nik Everett	16c12afabe	Rework ScoreFunctionBuilder registration to remove PROTOTYPEs This removes PROTOTYPEs from ScoreFunctionsBuilders. To do so we rework registration so it doesn't need PROTOTYPEs and lines up with the recent changes to query registration.	2016-04-06 13:04:11 -04:00
javanna	b9f9b2e3ee	Merge branch 'master' into enhancement/discovery_node_one_getter	2016-03-30 17:22:40 +02:00
Nik Everett	1c16d63a9a	Merge pull request #17394 from camilojd/refactor/replace-getrandom Refactor: replace all ocurrences of ESTestCase.getRandom() with LuceneTestCase.random()	2016-03-30 08:58:21 -04:00
javanna	a8bbdff3bc	Remove DiscoveryNode#name in favour of existing DiscoveryNode#getName	2016-03-30 14:47:36 +02:00
Adrien Grand	068c788ec8	Disable fielddata on text fields by defaults. #17386 `text` fields will have fielddata disabled by default. Fielddata can still be enabled on an existing index by setting `fielddata=true` in the mappings.	2016-03-30 14:35:32 +02:00
Camilo Diaz Repka	7be11a36cd	Refactor: replace all ocurrences of ESTestCase.getRandom() for random(). Remove getRandom().	2016-03-29 23:18:05 -04:00
javanna	061f09d9a4	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 20:19:33 +02:00
Colin Goodheart-Smithe	ff3fd99074	Prevents exception being raised when ordering by an aggregation which wasn't collected If a terms aggregation was ordered by a metric nested in a single bucket aggregator which did not collect any documents (e.g. a filters aggregation which did not match in that term bucket) an ArrayOutOfBoundsException would be thrown when the ordering code tried to retrieve the value for the metric. This fix fixes all numeric metric aggregators so they return their default value when a bucket ordinal is requested which was not collected. Closes #17225	2016-03-29 13:28:03 +01:00
javanna	27d4994aff	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-24 18:10:11 +01:00
Areek Zillur	ed49ec437f	remove suggest transport action	2016-03-23 16:37:56 -04:00
javanna	030453d320	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-23 11:25:34 +01:00
Adrien Grand	e50eeeaffb	Refactor fielddata mappings. #17148 The fielddata settings in mappings have been refatored so that: - text and string have a `fielddata` (boolean) setting that tells whether it is ok to load in-memory fielddata. It is true by default for now but the plan is to make it default to false for text fields. - text and string have a `fielddata_frequency_filter` which contains the same thing as `fielddata.filter.frequency` used to (but validated at parsing time instead of being unchecked settings) - regex fielddata filtering is not supported anymore and will be dropped from mappings automatically on upgrade. - text, string and _parent fields have an `eager_global_ordinals` (boolean) setting that tells whether to load global ordinals eagerly on refresh. - in-memory fielddata is not supported on keyword fields anymore at all. - the `fielddata` setting is not supported on other fields that text and string and will be dropped when upgrading if specified.	2016-03-23 09:48:13 +01:00
javanna	eebd0cfccd	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-22 10:34:40 +01:00
Simon Willnauer	47f0e6e8f4	[TEST] Disable InternalClusterInfoService in messy tests, it sends IndicesStatsRequest periodically which messes with the messy test	2016-03-22 10:12:03 +01:00
javanna	bf390a935e	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-21 17:18:23 +01:00
Martijn van Groningen	e3b7e5d75a	percolator: Replace percolate api with the new percolator query Also replaced the PercolatorQueryRegistry with the new PercolatorQueryCache. The PercolatorFieldMapper stores the rewritten form of each percolator query's xcontext in a binary doc values field. This make sure that the query rewrite happens only during indexing (some queries for example fetch shapes, terms in remote indices) and the speed up the loading of the queries in the percolator query cache. Because the percolator now works inside the search infrastructure a number of features (sorting fields, pagination, fetch features) are available out of the box. The following feature requests are automatically implemented via this refactoring: Closes #10741 Closes #7297 Closes #13176 Closes #13978 Closes #11264 Closes #10741 Closes #4317	2016-03-21 12:21:50 +01:00
Simon Willnauer	e91a141233	Prevent index level setting from being configured on a node level Today we allow to set all kinds of index level settings on the node level which is error prone and difficult to get right in a consistent manner. For instance if some analyzers are setup in a yaml config file some nodes might not have these analyzers and then index creation fails. Nevertheless, this change allows some selected settings to be specified on a node level for instance: * `index.codec` which is used in a hot/cold node architecture and it's value is really per node or per index * `index.store.fs.fs_lock` which is also dependent on the filesystem a node uses All other index level setting must be specified on the index level. For existing clusters the index must be closed and all settings must be updated via the API on each of the indices. Closes #16799	2016-03-17 14:42:18 +01:00
Christoph Büscher	6ddf9ae92f	Merge branch 'master' into feature-suggest-refactoring	2016-03-16 15:27:02 +01:00
Nik Everett	7197172047	[reindex] Properly register status Without this commit fetching the status of a reindex from a node that isn't coordinating the reindex will fail. This commit properly registers reindex's status so this doesn't happen. To do so it moves all task status registration into NetworkModule and creates a method to register other statuses which the reindex plugin calls.	2016-03-16 07:40:49 -04:00
Christoph Büscher	39667b5793	Merge branch 'master' into feature-suggest-refactoring Conflicts: docs/reference/migration/migrate_5_0/java.asciidoc	2016-03-16 12:06:42 +01:00
Jason Tedor	618441aea3	Merge pull request #17088 from jasontedor/simplify-bootstrap-settings Bootstrap does not set system properties	2016-03-15 19:25:16 -04:00
Jason Tedor	66ba044ec5	Use setting in integration test cluster config	2016-03-15 17:45:17 -04:00
Yannick Welsch	7fb07a9a53	Merge pull request #17112 from ywelsch/enhance/forbid-sysout-for-tests Forbid test sources to use System.out.println and Throwable.printStackTrace	2016-03-15 16:36:34 +01:00
Yannick Welsch	f5e6db4090	Remove System.out.println and Throwable.printStackTrace from tests	2016-03-15 15:40:37 +01:00
Christoph Büscher	1f1f6861b7	Merge branch 'master' into sort-serialization-scriptsort Conflicts: core/src/test/java/org/elasticsearch/search/sort/AbstractSortTestCase.java	2016-03-15 11:03:28 +01:00
Christoph Büscher	40f3501d7f	Merge branch 'master' into feature-suggest-refactoring	2016-03-14 14:57:57 +01:00
Christoph Büscher	f0074668db	Merge branch 'master' into sort-serialization-scriptsort	2016-03-14 12:15:53 +01:00
Christoph Büscher	97638c95fc	Merge branch 'master' into feature-suggest-refactoring Conflicts: docs/reference/migration/migrate_5_0.asciidoc	2016-03-14 11:13:47 +01:00
Simon Willnauer	31740e279f	Resolve index names to Index instances early Today index names are often resolved lazily, only when they are really needed. This can be problematic especially when it gets to mapping updates etc. when a node sends a mapping update to the master but while the request is in-flight the index changes for whatever reason we would still apply the update since we use the name of the index to identify the index in the clusterstate. The problem is that index names can be reused which happens in practice and sometimes even in a automated way rendering this problem as realistic. In this change we resolve the index including it's UUID as early as possible in places where changes to the clusterstate are possible. For instance mapping updates on a node use a concrete index rather than it's name and the master will fail the mapping update iff the index can't be found by it's <name, uuid> tuple. Closes #17048	2016-03-14 11:08:48 +01:00
Jason Tedor	8a05c2a2be	Bootstrap does not set system properties Today, certain bootstrap properties are set and read via system properties. This action-at-distance way of managing these properties is rather confusing, and completely unnecessary. But another problem exists with setting these as system properties. Namely, these system properties are interpreted as Elasticsearch settings, not all of which are registered. This leads to Elasticsearch failing to startup if any of these special properties are set. Instead, these properties should be kept as local as possible, and passed around as method parameters where needed. This eliminates the action-at-distance way of handling these properties, and eliminates the need to register these non-setting properties. This commit does exactly that. Additionally, today we use the "-D" command line flag to set the properties, but this is confusing because "-D" is a special flag to the JVM for setting system properties. This creates confusion because some "-D" properties should be passed via arguments to the JVM (so via ES_JAVA_OPTS), and some should be passed as arguments to Elasticsearch. This commit changes the "-D" flag for Elasticsearch settings to "-E".	2016-03-13 20:09:15 -04:00
Ryan Ernst	5bd7da5659	Addressed PR feedback * Fix tests still referring to -E * add comment about missing classes * rename writer constant	2016-03-11 11:46:23 -08:00
Ryan Ernst	591fb8f028	Merge branch 'master' into cli-parsing	2016-03-11 10:45:05 -08:00
Christoph Büscher	bbcbba1bf5	Fixing some tests and compile problems in reindex module	2016-03-11 17:58:13 +01:00
Christoph Büscher	5107388fe9	Added enum for script sort type	2016-03-11 17:32:39 +01:00
Yannick Welsch	04e55ecf6b	Make logging message String constant to allow static checks	2016-03-11 10:30:59 +01:00
Yannick Welsch	718876a941	Fix wrong placeholder usage in logging statements	2016-03-11 10:30:59 +01:00
Simon Willnauer	e72dac91b3	Use index UUID to lookup indices on IndicesService Today we use the index name to lookup index instances on the IndicesService which applied to search reqeusts but also to index deletion etc. This commit moves the interface to expcet and `Index` instance which is a <name, uuid> tuple and looks up the index by uuid rather than by name. This prevents accidential modificaiton of the wrong index if and index is recreated or searching from the _wrong_ index in such a case. Accessing an index that has the same name but different UUID will now result in an IndexNotFoundException. Closes #17001	2016-03-09 19:42:15 +01:00
Ryan Ernst	1dafead2eb	Fix precommit	2016-03-08 22:55:24 -08:00
Christoph Büscher	5ff413074a	Adding tests for `time_zone` parameter for date range aggregation	2016-03-07 15:38:24 +01:00
Adrien Grand	c69fc008d4	Merge pull request #16972 from alexshadow007/fix-16812 Build empty extended stats aggregation if no docs collected for bucket	2016-03-07 10:44:20 +01:00
Robert Muir	54018a5d37	upgrade to lucene 6.0.0-snapshot-bea235f Closes #16964 Squashed commit of the following: commit a23f9d2d29220991aa498214530753d7a5a148c6 Merge: eec9c4e `0b0a251` Author: Robert Muir <rmuir@apache.org> Date: Mon Mar 7 04:12:02 2016 -0500 Merge branch 'master' into lucene6 commit eec9c4e5cd11e9c3e0b426f04894bb2a6dae4f21 Merge: bc67205 `675d940` Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 13:45:00 2016 -0500 Merge branch 'master' into lucene6 commit bc67205bdfe1526eae277ab7856fc050ecbdb7b2 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 09:56:31 2016 -0500 fix test bug commit a60723b007ff12d97b1810cef473bd7b553a0327 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:35:35 2016 +0100 Fix SimpleValidateQueryIT to put braces around boosted terms commit ae3a49d7ba7ced448d2a5262e5d8ec98671a9090 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:27:25 2016 +0100 fix multimatchquery commit ae23fdb88a8f6d3fb7ba60fd1aaf3fd72d899aa5 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:20:49 2016 +0100 Rewrite DecayFunctionScoreIT to be independent of the similarity used This test relied a lot on the term scoring and compared scores that are dependent on the similarity. This commit changes the base query to be a predictable constant score query. commit 366c2d518c35d31251033f1b6f6a93f6e2ae327d Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 14:06:14 2016 +0100 Fix scoring in tests due to changes to idf calculation. Lucene 6 uses a different default similarity as well as a different way to calculate IDF. In contrast to older version lucene 6 uses docCount per field to calculate the IDF not the # of docs in the index to overcome the sparse field cases. commit dac99fd64ac2fa71b8d8d106fe68825e574c49f8 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 08:21:57 2016 -0500 don't hardcoded expected termquery score commit 6e9f340ba49ab10eed512df86d52a121aa775b0f Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 08:04:45 2016 -0500 suppress deprecation warning until migrated to points commit 3ac8908424b3fdad44a90a4f7bdb3eff7efd077d Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:21:43 2016 -0500 Remove invalid test: all commits have IDs, and its illegal to do this. commit c12976288124ad1a26467e7e848fb810548e7eab Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:06:14 2016 -0500 don't test with unsupported back compat commit 18bbfe76128570bc70883bf91ff4c44c82d27817 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:02:18 2016 -0500 remove now invalid lucene 4 backcompat test commit 7e730e572886f0ef2d3faba712e4256216ff01ec Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:58:52 2016 -0500 remove now invalid lucene 4 backwards test commit 244d2ab6868ba5ac9e0bcde3c2833743751a25ec Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:47:23 2016 -0500 use 6.0 codec commit 5f64d4a431a6fdaa1234adca23f154c2a1de8284 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:43:08 2016 -0500 compile, javadocs, forbidden-apis, etc commit 1f273cd62a7fe9ca8f8944acbbfc5cbdd3d81ccb Merge: cd33921 `29e3443` Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 10:45:29 2016 +0100 Merge branch 'master' into lucene6 commit cd33921ac742ef9fb351012eff35f3c7dbda7264 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:58:37 2016 -0500 fix hunspell dictionary loading commit c7fdbd837b01f7defe9cb1c24e2ec65604b0dc96 Merge: 4d4190f `d8948ba` Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:41:53 2016 -0500 Merge branch 'master' into lucene6 commit 4d4190fd82601aaafac6b8254ccb3edf218faa34 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:39:14 2016 -0500 remove nocommit commit 77ca69e288b1a41aa9595c921ed166c272a00ea8 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:38:24 2016 -0500 clean up numericutils vs legacynumericutils commit a466d696fbaad04b647ffbc0857a9439b583d0bf Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:32:43 2016 -0500 upgrade spatial4j commit 5412c747a8cfe638bacedbc8233163cb75cc3dc5 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:19:28 2016 -0500 move to 6.0.0-snapshot-8eada27 commit b32bfe924626b87e540692375ece09e7c2edb189 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:30:09 2016 +0100 Fix some test compile errors. commit 6ccde35e9840b03c68d1a2cd47c7923a06edf64a Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:25:51 2016 +0100 Current Lucene version is 6.0.0. commit f62e1015d931b4cc04c778298a8fa1ba65e97ad9 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:20:48 2016 +0100 Fix compile errors in NGramTokenFilterFactory. commit 6837c6eabf96075f743649da9b9b52dd39611c58 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:50:59 2016 +0100 Fix the edge ngram tokenizer/filter. commit ccd7f070de5efcdfbeb34b9555c65c4990bf1ba6 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:42:44 2016 +0100 The missing value is now accessible through a getter. commit bd3b77f9b28e5b05daa3d49683a9922a6baf2963 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:41:51 2016 +0100 Remove IndexCacheableQuery. commit 05f3091c347aeae80eeb16349ac51d2b53cf86f7 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:39:43 2016 +0100 Fix compilation of function_score queries. commit 81cda79a2431ac78f56b0cc5a5765387f662d801 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:35:02 2016 +0100 Fix compile errors in BlendedTermQuery. commit 70994ce8dd1eca0b995870974a38e20f26f96a7b Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 23:33:03 2016 -0500 add bug ID commit 29d4f1a71f36f646b5a6060bed3db019564a279d Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 21:02:32 2016 -0500 easy .store changes commit 5e1a1e6fd665fa455e88d3a8987362fad5f44bb1 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 20:47:24 2016 -0500 cleanups mostly around boosting commit 333a669ec6c305ada5645d13ed1da0e19ec1d053 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 20:27:56 2016 -0500 more simple fixes commit bd5cd98a1e089c866b6b4a5e159400b110140ce6 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 19:49:38 2016 -0500 more easy fixes and removal of ancient cruft commit a68f419ee47da5f9c9ce5b372f01d707e902474c Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 19:35:02 2016 -0500 cutover numerics commit 4ca5dc1fa47dd5892db00899032133318fff3116 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:34:18 2016 -0500 fix some constants commit 88710a17817086e477c6c021ec346d0534b7fb88 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:14:25 2016 -0500 Add spatial-extras jar as a core dependency commit c8cd6726583e5ce3f546ed355d4eca037164a30d Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:03:33 2016 -0500 update to lucene 6 jars	2016-03-07 04:12:23 -05:00
Alexander Kazakov	3674774b73	Build empty extended stats aggregation if no docs are collected for bucket #16812	2016-03-05 22:50:54 +03:00

1 2 3

142 Commits