OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ryan Ernst	e817b5daa3	Plugins: Remove guice from Mapper plugins This changes adds a MapperPlugin interface which allows pull style retrieval of mappers and metadata mappers added by plugins. For now, I have kept the MapperRegistry, but this should be removed in the future as it is just a silly container for 2 maps which could themselves be passed around.	2016-06-21 22:50:39 -07:00
Ryan Ernst	a4503c2aed	Plugins: Remove name() and description() from api In 2.0 we added plugin descriptors which require defining a name and description for the plugin. However, we still have name() and description() which must be overriden from the Plugin class. This still exists for classpath plugins. But classpath plugins are mainly for tests, and even then, referring to classpath plugins with their class is a better idea. This change removes name() and description(), replacing the name for classpath plugins with the full class name.	2016-06-15 17:12:22 -07:00
Adrien Grand	864ed04059	Lessen leniency of the query dsl. #18276 This change does the following: - Queries that are currently unsupported such as prefix queries on numeric fields or term queries on geo fields now throw an error rather than returning a query that does not match anything. - Fuzzy queries on numeric, date and ip fields are now unsupported: they used to create range queries, we now expect users to use range queries directly. Fuzzy, regexp and prefix queries are now only supported on text/keyword fields (including `_all`). - The `_uid` and `_id` fields do not support prefix or range queries anymore as it would prevent us to store them more efficiently in the future, eg. by using a binary encoding. Note that it is still possible to ignore these errors by using the `lenient` option of the `match` or `query_string` queries.	2016-05-16 17:37:00 +02:00
Alexander Reelsen	f71eb0b888	Version: Set version to 5.0.0-alpha2	2016-04-26 09:30:26 +02:00
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. - Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Adrien Grand	3bf6f4076c	Do not set analyzers on numeric fields. When it comes to query parsing, either a field is tokenized and it would go through analysis with its search_analyzer. Or it is not tokenized and the raw string should be passed to termQuery(). Since numeric fields are not tokenized and also declare a search analyzer, values would currently go through analysis twice...	2016-04-12 17:47:29 +02:00
Areek Zillur	c3078f4d65	adapt tests to use index uuid as folder name	2016-03-14 23:24:24 -04:00
Simon Willnauer	fdfb0e56f6	Remove bw compat from size mapper	2016-03-07 12:48:02 +01:00
Simon Willnauer	f96900013c	Remove bw compat from murmur3 mapper	2016-03-07 12:44:53 +01:00
Adrien Grand	eef19be072	Deprecate string in favor of text/keyword. #16877 This commit removes the ability to use string fields on indices created on or after 5.0. Dynamic mappings now generate text fields by default for strings but there are plans to also add a sub keyword field (in a future PR). Most of the changes in this commit are just about replacing string with keyword or text. Some tests have been removed because they existed because of corner cases of string mappings like setting ignore-above on a text field or enabling term vectors on a keyword field which are now impossible. The plan is to remove strings entirely in 6.0.	2016-03-03 10:20:56 +01:00
Simon Willnauer	96bcb47fc9	Detach QueryShardContext from IndexShard and remove obsolete threadlocals IndexShard currently holds an arbitraritly used `getQueryShardContext` that comes out of a ThreadLocal. It's usage is undefined and arbitraty since there is also such a method with different semantics on `IndexService` This commit removes the threadLocal on IndexShard as well as on the context itself. It's types are now a member and the QueryShardContext lifecycle is managed byt SearchContext which passes the types on from the SearchRequest.	2016-02-08 15:05:03 +01:00
Daniel Mitterdorfer	e9bb3d31a3	Convert "path.*" and "pidfile" to new settings infra	2016-01-22 15:14:13 +01:00
Simon Willnauer	fbfa9f4925	Merge branch 'master' into new_index_settings	2016-01-19 10:13:48 +01:00
Ryan Ernst	ef4f0a8699	Test: Make rest test framework accept http directly for the test cluster The rest test framework, because it used to be tightly integrated with ESIntegTestCase, currently expects the addresses for the test cluster to be passed using the transport protocol port. However, it only uses this to then find the http address. This change makes ESRestTestCase extend from ESTestCase instead of ESIntegTestCase, and changes the sysprop used to tests.rest.cluster, which now takes the http address. closes #15459	2016-01-18 16:44:14 -08:00
Simon Willnauer	a8eedd0457	fix all kinds of crazy test failures	2016-01-18 09:23:34 +01:00
Simon Willnauer	ceef6d7a42	fix Murmur3FieldMapperTests	2016-01-18 09:23:34 +01:00
Nik Everett	0786c506dc	Remove a few more Xlint skips	2016-01-06 23:28:13 -05:00
Martijn van Groningen	2d6adf6428	Percolator refactoring: * Added percolator field mapper that extracts the query terms and indexes these terms with the percolator query. * At percolate time these extracted terms are used to query percolator queries that are like to be evaluated. This can significantly cut down the time it takes to percolate. Whereas before all percolator queries were evaluated if they matches with the document being percolated. * Changes made to percolator queries are no longer immediately visible, a refresh needs to happen before the changes are visible. * By default the percolate api only returns upto 10 matches instead of returning all matching percolator queries. * Made percolate more modular, so that it is easier to add unit tests. * Added unit tests for the percolator. Closes #12664 Closes #13646	2016-01-06 16:08:10 +01:00
Adrien Grand	ac393b7a31	Make mappings tests more realistic. DocumentMapperParser has both parse and parseCompressed methods. Except that the parse methods are ONLY used from the unit tests. This commit removes the parse method and moves all tests to parseCompressed so that they test more realistically how mappings are managed. Then I renamed parseCompressed to parse given that this is the only alternative anyway.	2015-12-21 10:44:00 +01:00
Ryan Ernst	4ea19995cf	Remove wildcard imports	2015-12-18 12:43:47 -08:00
Adrien Grand	50eeafa75c	Make mappings immutable. Today mappings are mutable because of two APIs: - Mapper.merge, which expects changes to be performed in-place - IncludeInAll, which allows to change whether values should be put in the `_all` field in place. This commit changes both APIs to return a modified copy instead of modifying in place so that mappings can be immutable. For now, only the type-level object is immutable, but in the future we can imagine making them immutable at the index-level so that mapping updates could be completely atomic at the index level. Close #9365	2015-12-15 10:20:28 +01:00
David Pilato	d23d8a891f	Remove "empty" licenses dir Follow up #15168 We don't need to have "fake" licenses dir anymore.	2015-12-02 10:22:52 +01:00
Simon Willnauer	65b661b1f4	[TEST] Fix MapperUpgrade tests to use a dedicated master to ensure dangeling index import works predictably When importing dangling indices on a single node that is data and master eligable the async dangling index call can still be in-flight when the cluster is checked for green / yellow. Adding a dedicated master node and a data only node that does the importing fixes this issus just like we do in OldIndexBackwardsCompatibilityIT	2015-11-27 10:32:21 +01:00
Adrien Grand	e8520bf519	Tests: For single data path for *FieldMapperUpgradeTests.	2015-11-25 11:46:19 +01:00
Adrien Grand	aad84395c9	Add a test that upgrades succeed even if a mapping contains fields that come from a plugin.	2015-11-24 19:14:19 +01:00
Adrien Grand	5f33fbdb75	Register field mappers at the node level. This moves the registration of field mappers from the index level to the node level and also ensures that mappers coming from plugins are treated no differently from core mappers.	2015-11-24 08:59:37 +01:00
Simon Willnauer	94bed42213	Simplify plugin API and fix IndexService internal allocation	2015-11-05 13:16:35 +01:00
Simon Willnauer	487af301ea	Remove guice from the index level This commit removes guice from the index level and adds a simple extension point to add class instances with an index-lifecycle scope.	2015-11-05 11:18:11 +01:00
Ryan Ernst	542522531a	Build: Remove maven pom files and supporting ant files This change removes the leftover pom files. A couple files were left for reference, namely in qa tests that have not yet been migrated (vagrant and multinode). The deb and rpm assemblies also still exist for reference when finishing their setup in gradle. See #13930	2015-10-29 23:53:49 -07:00
Ryan Ernst	c86100f636	Switch build system to Gradle See #13930	2015-10-29 11:40:19 -07:00
Simon Willnauer	66d5d0c4f2	Replace IndexSettings annotation with a full-fledged class The @IndexSettings annoationat has been used to differentiate between node-level and index level settings. It was also decoupled from realtime-updates such that the settings object that a class got injected when it was created was static and not subject to change when an update was applied. This change removes the annoation and replaces it with a full-fledged class that adds type-safety and encapsulates additional functionality as well as checks on the settings.	2015-10-22 20:43:41 +02:00
Ryan Ernst	45f757de6d	Test: Move rest-api-spec for plugins into test resources Plugin tests require having rest-api tests, and currently copy that spec from a directory in the root of the plugin source into the test resources. This change moves the rest-api-spec dir into test resources so it is like any other test resources. It also removes unnecessary configuration for resources from the shared plugin pom.	2015-09-16 03:04:53 -07:00
David Pilato	a38bcc5d62	[test] plugins simple RestIT tests don't work from IDE When running a RestIT test from the IDE, you actually start an internal node which does not automatically load the plugin you would like to test. We need to add: ```java @Override protected Collection<Class<? extends Plugin>> nodePlugins() { return pluginList(PLUGIN_HERE.class); } ``` Everything works fine when running from maven because each test basically: * installs elasticsearch * installs one plugin * starts elasticsearch with this plugin loaded * runs the test Note that this PR only fixes the fact we run an internal cluster with the expected plugin. Cloud tests will still fail when run from the IDE because is such a case you actually start an internal node with many mock plugins. And REST test suite for cloud plugins basically checks if the plugin is running by checking the output of NodesInfo API. And we check: ```yml - match: { nodes.$master.plugins.0.name: cloud-azure } - match: { nodes.$master.plugins.0.jvm: true } ``` But in that case, this condition is certainly false as we started also `mock-transport-service`, `mock-index-store`, `mock-engine-factory`, `node-mocks`, `asserting-local-transport`, `mock-search-service`. Closes #13479	2015-09-15 10:10:05 +02:00
Ryan Ernst	9e8a90a657	Add xlint ignores for warning classes, where appropriate.	2015-09-09 12:47:07 -07:00
Simon Willnauer	796701d52e	Move version to 3.0.0-SNAPSHOT	2015-09-03 10:43:28 +02:00
Ryan Ernst	54ae55662c	Remove leftover class and fix compile issues from master merge	2015-08-18 14:28:03 -07:00
David Pilato	807d35e96f	[maven] change murmur3 plugin groupId and name	2015-08-18 13:45:59 +02:00
Adrien Grand	a72adbf0b4	Fix mapper-murmur3 compatibility version.	2015-08-18 12:31:36 +02:00
Adrien Grand	a91b3fcbb9	Move the `murmur3` field to a plugin and fix defaults. This move the `murmur3` field to the `mapper-murmur3` plugin and fixes its defaults so that values will not be indexed by default, as the only purpose of this field is to speed up `cardinality` aggregations on high-cardinality string fields, which only requires doc values. I also removed the `rehash` option from the `cardinality` aggregation as it doesn't bring much value (rehashing is cheap) and allowed to remove the coupling between the `cardinality` aggregation and the `murmur3` field. Close #12874	2015-08-18 11:41:52 +02:00

39 Commits