OpenSearch

Commit Graph

Author	SHA1	Message	Date
Adrien Grand	77711508b0	Upgrade to Lucene 7.2.0. (#27910 )	2017-12-20 14:17:40 +01:00
Henrik Lindström	7a44596446	Catch InvalidPathException in IcuCollationTokenFilterFactory (#27202 ) Using custom rules in the icu_collation filter can fail on Windows. If the rules are interpreted as a file location, this leads to an InvalidPathException when trying to read the rules from a file.	2017-12-04 10:29:08 +01:00
Adrien Grand	6323bb0d97	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27619 ) This new snapshot mostly brings a change to TopFieldCollector which can now early terminate collection when trackTotalHits is `false`. As a follow-up, we should replace our usage of `EarlyTerminatingSortingCollector` with this new option.	2017-12-04 09:40:08 +01:00
Adrien Grand	996990ad1f	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27496 ) The main highlight of this new snapshot is that it introduces the opportunity for queries to opt out of caching. In case a query opts out of caching, not only will it never be cached, but also no compound query that wraps it will be cached.	2017-11-28 14:52:42 +01:00
Colin Goodheart-Smithe	c1b8140c83	Upgrade to Lucene 7.1 (#27225 )	2017-11-02 13:25:33 +00:00
Colin Goodheart-Smithe	99aca9cdfc	Enhances exists queries to reduce need for `_field_names` (#26930 ) * Enhances exists queries to reduce need for `_field_names` Before this change we wrote the name all the fields in a document to a `_field_names` field and then implemented exists queries as a term query on this field. The problem with this approach is that it bloats the index and also affects indexing performance. This change adds a new method `existsQuery()` to `MappedFieldType` which is implemented by each sub-class. For most field types if doc values are available a `DocValuesFieldExistsQuery` is used, falling back to using `_field_names` if doc values are disabled. Note that only fields where no doc values are available are written to `_field_names`. Closes #26770 * Addresses review comments * Addresses more review comments * implements existsQuery explicitly on every mapper * Reinstates ability to perform term query on `_field_names` * Added bwc depending on index created version * Review Comments * Skips tests that are not supported in 6.1.0 These values will need to be changed after backporting this PR to 6.x	2017-11-01 10:46:59 +00:00
Simon Willnauer	cdd7c1e6c2	Return List instead of an array from settings (#26903 ) Today we return a `String[]` that requires copying values for every access. Yet, we already store the setting as a list so we can also directly return the unmodifiable list directly. This makes list / array access in settings a much cheaper operation especially if lists are large.	2017-10-09 09:52:08 +02:00
Md. Abdulla-Al-Sun	a40c474e10	Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238)	2017-10-05 13:25:05 +02:00
Martijn van Groningen	dca787ed8a	upgrade to Lucene 7.1.0 snapshot version	2017-10-05 09:06:56 +02:00
Simon Willnauer	aab4655e63	Unify Settings xcontent reading and writing (#26739 ) This change adds a fromXContent method to Settings that allows to read the xcontent that is produced by toXContent. It also replaces the entire settings loader infrastructure and removes the structured map representation. Future PRs will also tackle the `getAsMap` that exposes the internal represenation of settings for better encapsulation.	2017-09-25 13:23:01 +02:00
Jason Tedor	e0db89bc35	Upgrade to Lucene 7.0.0 This commit upgrades to the GA release of Luence 7! Relates #26744	2017-09-21 19:19:33 -04:00
Adrien Grand	1adee8b5a8	Fix the MapperFieldType.rangeQuery API. (#26552 ) RangeQueryBuilder needs to perform too many `instanceof` checks in order to check for `date` or `range` fields in order to know what it should do with the shape relation, time zone and date format. This commit adds those 3 parameters to the `rangeQuery` factory method so that those instanceof checks are not necessary anymore.	2017-09-11 11:02:05 +02:00
Andy Bristol	33faf5ec70	forbid ICU Collator creation with default locale (#26476 )	2017-09-07 14:47:52 -07:00
Adrien Grand	78681bc9e5	Upgrade to lucene-7.0.0-snapshot-d94a5f0. (#26441 )	2017-08-31 09:06:40 +02:00
Andy Bristol	e00366ba95	ICU plugin: use root locale by default for collators (#26413 ) Calls to Collator.getInstance without arguments returns a collator that uses the system's default locale, which we don't want because it makes behavior harder to reproduce. Change it to always use the root locale instead. For #25587	2017-08-29 08:58:36 -07:00
Jim Ferenczi	86d97971a4	Remove the _all metadata field (#26356 ) * Remove the _all metadata field This change removes the `_all` metadata field. This field is deprecated in 6 and cannot be activated for indices created in 6 so it can be safely removed in the next major version (e.g. 7).	2017-08-28 17:43:59 +02:00
Adrien Grand	eb782492be	Remove support for lenient booleans. Closes #22298	2017-08-28 09:56:01 +02:00
Matt Weber	e89d9400c9	ICUCollationKeywordFieldMapper use SortedSetDocValuesField (#26267 ) Switch ICUCollationKeywordFieldMapper from using SortedDocValuesField to SortedSetDocValuesField so we can support fields with multiple values.	2017-08-21 10:40:56 +02:00
Adrien Grand	f0c1e30544	Upgrade to lucene-7.0.0-snapshot-a128fcb. (#26090 )	2017-08-08 13:03:19 +02:00
Simon Willnauer	82fa531ab4	Remove `_index` fielddata hack if cluster alias is present (#26082 ) We introduced a hack in #25885 to respect the cluster alias if available on the `_index` field. This is important if aggregations or other field data related operations are executed. Yet, we added a small hack that duplicated an implementation detail from the `_index` field data builder to make this work. This change adds a necessary but simple API change that allows us to remove the hack and only have a single implementation.	2017-08-08 09:24:24 +02:00
Adrien Grand	481d5d09b2	Upgrade to lucene-7.0.0-snapshot-00142c9. (#25641 ) Lucene 7.0 is feature-frozen now, so there should not be many changes until GA.	2017-07-11 13:58:55 +02:00
Adrien Grand	44e9c0b947	Upgrade to lucene-7.0.0-snapshot-ad2cb77. (#25349 ) Most notable changes: - better update concurrency: LUCENE-7868 - TopDocs.totalHits is now a long: LUCENE-7872 - QueryBuilder does not remove the boolean query around multi-term synonyms: LUCENE-7878 - removal of Fields: LUCENE-7500 For the `TopDocs.totalHits` change, this PR relies on the fact that the encoding of vInts and vLongs are compatible: you can write and read with any of them as long as the value can be represented by a positive int.	2017-06-22 12:35:33 +02:00
David Causse	ff9edb627e	[analysis-icu] Allow setting unicodeSetFilter (#20814 ) UnicodeSetFilter was only allowed in the icu_folding token filter. It seems useful to expose this setting in icu_normalizer token filter and char filter.	2017-06-16 11:08:39 +02:00
Jim Ferenczi	0036f28a6a	Upgrade icu4j for the ICU analysis plugin to 59.1 (#25243 ) * Upgrade icu4j for the ICU analysis plugin to 59.1 Lucene upgraded to 59.1 so we should use the same. Closes #21425 * Add breaking change for the icu upgrade	2017-06-15 13:26:48 +02:00
Adrien Grand	0c117145f6	Upgrade to lucene-7.0.0-snapshot-92b1783. (#25222 ) This snapshot has faster range queries on range fields (LUCENE-7828), more accurate norms (LUCENE-7730) and the ability to use fake term frequencies (LUCENE-7854).	2017-06-15 09:52:07 +02:00
Jim Ferenczi	4e70235d55	Upgrade icu4j to latest version (#24821 )	2017-05-22 09:34:50 +02:00
Nicholas Knize	deb7caf4d3	Upgrade to lucene-7.0.0-snapshot-a0aef2f This commit upgrades master to a current lucene snapshot with commit id a0aef2f.	2017-05-19 10:20:55 -05:00
Ryan Ernst	2a65bed243	Tests: Change rest test extension from .yaml to .yml (#24659 ) This commit renames all rest test files to use the .yml extension instead of .yaml. This way the extension used within all of elasticsearch for yaml is consistent.	2017-05-16 17:24:35 -07:00
Matt Weber	b24326271e	Add ICUCollationFieldMapper (#24126 ) Adds a new "icu_collation" field type that exposes lucene's ICUCollationDocValuesField. ICUCollationDocValuesField is the replacement for ICUCollationKeyFilter which has been deprecated since Lucene 5.	2017-05-10 10:35:11 +02:00
Nik Everett	bb06d8ec4f	Allow plugins to build pre-configured token filters (#24223 ) This changes the way we register pre-configured token filters so that plugins can declare them and starts to move all of the pre-configured token filters out of core. It doesn't finish the job because doing so would make the change unreviewably large. So this PR includes a shim that keeps the "old" way of registering pre-configured token filters around. The Lowercase token filter is special because there is a "special" interaction between it and the lowercase tokenizer. I'm not sure exactly what to do about it so for now I'm leaving it alone with the intent of figuring out what to do with it in a followup. This also renames these pre-configured token filters from "pre-built" to "pre-configured" because that seemed like a more descriptive name. This is a part of #23658	2017-05-09 14:50:49 -04:00
Ryan Ernst	212f24aa27	Tests: Clean up rest test file handling (#21392 ) This change simplifies how the rest test runner finds test files and removes all leniency. Previously multiple prefixes and suffixes would be tried, and tests could exist inside or outside of the classpath, although outside of the classpath never quite worked. Now only classpath tests are supported, and only one resource prefix is supported, `/rest-api-spec/tests`. closes #20240	2017-04-18 15:07:08 -07:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Jim Ferenczi	0e95c90e9f	Upgrade to Lucene 6.5.0 (#23750 )	2017-03-27 15:57:54 +02:00
Luca Cavanna	cc65a94fd4	[TEST] improve yaml test sections parsing (#23407 ) Throw error when skip or do sections are malformed, such as they don't start with the proper token (START_OBJECT). That signals bad indentation, which would be ignored otherwise. Thanks (or due to) our pull parsing code, we were still able to properly parse the sections, yet other runners weren't able to. Closes #21980 * [TEST] fix indentation in matrix_stats yaml tests * [TEST] fix indentation in painless yaml test * [TEST] fix indentation in analysis yaml tests * [TEST] fix indentation in generated docs yaml tests * [TEST] fix indentation in multi_cluster_search yaml tests	2017-03-02 12:43:20 +01:00
Jim Ferenczi	5c84640126	Upgrade to lucene-6.5.0-snapshot-d00c5ca (#23385 ) Lucene upgrade	2017-02-27 18:39:04 +01:00
Adrien Grand	709cc9ba65	Upgrade to lucene-6.5.0-snapshot-f919485. (#23087 )	2017-02-10 15:08:47 +01:00
Adrien Grand	c8496fc4f4	Upgrade to Lucene 6.4.1. (#22978 )	2017-02-06 09:28:43 +01:00
Jim Ferenczi	8028578305	Upgrade to Lucene 6.4.0 (#22724 ) * Upgrade to Lucene 6.4.0 `ValueSource`s are now converted to `DoubleValueSource`s using the Lucene adapter made for the migration to the new API in 6.4.0.	2017-01-21 04:48:01 +01:00
Jason Tedor	9781b88a38	Fix deprecation logging for lenient booleans This commit fixes an issue with deprecation logging for lenient booleans. The underlying issue is that adding deprecation logging for lenient booleans added a static deprecation logger to the Settings class. However, the Settings class is initialized very early and in CLI tools can be initialized before logging is initialized. This leads to status logger error messages. Additionally, the deprecation logging for a lot of the settings does not provide useful context (for example, in the token filter factories, the deprecation logging only produces the name of the setting, but gives no context which token filter factory it comes from). This commit addresses both of these issues by changing the call sites to push a deprecation logger through to the lenient boolean parsing. Relates #22696	2017-01-19 12:30:33 -05:00
Daniel Mitterdorfer	aece89d6a1	Make boolean conversion strict (#22200 ) This PR removes all leniency in the conversion of Strings to booleans: "true" is converted to the boolean value `true`, "false" is converted to the boolean value `false`. Everything else raises an error.	2017-01-19 07:59:18 +01:00
Adrien Grand	f8998fece5	Upgrade to lucene-6.4.0-snapshot-084f7a0. (#22413 )	2017-01-04 19:03:52 +01:00
Nik Everett	f5f2149ff2	Remove much ceremony from parsing client yaml test suites (#22311 ) * Remove a checked exception, replacing it with `ParsingException`. * Remove all Parser classes for the yaml sections, replacing them with static methods. * Remove `ClientYamlTestFragmentParser`. Isn't used any more. * Remove `ClientYamlTestSuiteParseContext`, replacing it with some static utility methods. I did not rewrite the parsers using `ObjectParser` because I don't think it is worth it right now.	2016-12-22 11:00:34 -05:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00
Adrien Grand	1fd5c47e7f	Upgrade to lucene-6.3.0. (#21464 )	2016-11-14 09:36:45 +01:00
Ryan Ernst	7a2c984bcc	Test: Remove multi process support from rest test runner (#21391 ) At one point in the past when moving out the rest tests from core to their own subproject, we had multiple test classes which evenly split up the tests to run. However, we simplified this and went back to a single test runner to have better reproduceability in tests. This change removes the remnants of that multiplexing support.	2016-11-07 15:07:34 -08:00
Adrien Grand	2a70f6e7b1	Upgrade to lucene-6.3.0-snapshot-a66a445. (#21309 ) This addresses a bug that was introduced with https://issues.apache.org/jira/browse/LUCENE-7501.	2016-11-04 10:34:04 +01:00
Adrien Grand	b3cc54cf0d	Upgrade to lucene-6.3.0-snapshot-ed102d6 (#21150 ) Lucene 6.3 is expected to be released in the next weeks so it'd be good to give it some integration testing. I had to upgrade randomized-testing too so that both Lucene and Elasticsearch are on the same version.	2016-10-28 14:47:15 +02:00
Jun Ohtani	a66c76eb44	Merge pull request #20704 from johtani/remove_request_params_in_analyze_api Removing request parameters in _analyze API	2016-10-27 17:43:18 +09:00
Tanguy Leroux	44ac5d057a	Remove empty javadoc (#20871 ) This commit removes as many as empty javadocs comments my regexp has found	2016-10-12 10:27:09 +02:00
Jun Ohtani	370f0b885e	Removing request parameters in _analyze API Remove request params in _analyze API without index param Change rest-api-test using JSON Change docs using JSON Closes #20246	2016-10-07 16:23:24 +09:00
Simon Willnauer	fe1803c957	Remove AnalysisService and reduce it to a simple name to analyzer mapping (#20627 ) Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitive. We fixed this in #19827 and can now get rid of the AnalysisService entirely and replace it with a simple map like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes #19828	2016-09-23 08:53:50 +02:00
Mike McCandless	0ccfe69789	Upgrade to Lucene 6.2.0	2016-08-24 17:26:28 -04:00
Nik Everett	9270e8b22b	Rename client yaml test infrastructure This makes it obvious that these tests are for running the client yaml suites. Now that there are other ways of running tests using the REST client against a running cluster we can't go on calling the shared client yaml tests "REST tests". They are rest tests, but they aren't the rest tests.	2016-07-26 13:53:44 -04:00
Nik Everett	a95d4f4ee7	Add Location header and improve REST testing This adds a header that looks like `Location: /test/test/1` to the response for the index/create/update API. The requirement for the header comes from https://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html https://tools.ietf.org/html/rfc7231#section-7.1.2 claims that relative URIs are OK. So we use an absolute path which should resolve to the appropriate location. Closes #19079 This makes large changes to our rest test infrastructure, allowing us to write junit tests that test a running cluster via the rest client. It does this by splitting ESRestTestCase into two classes: * ESRestTestCase is the superclass of all tests that use the rest client to interact with a running cluster. * ESClientYamlSuiteTestCase is the superclass of all tests that use the rest client to run the yaml tests. These tests are shared across all official clients, thus the `ClientYamlSuite` part of the name.	2016-07-25 17:02:40 -04:00
Ali Beyad	19d0dbcd17	Removes waiting for yellow cluster health upon index (#19460 ) creation in the REST tests, as we no longer need it due to index creation now waiting for active shard copies before returning (by default, it waits for the primary of each shard, which is the same as ensuring yellow health). Relates #19450	2016-07-15 17:18:34 -04:00
Jason Tedor	3343ceeae4	Do not catch throwable Today throughout the codebase, catch throwable is used with reckless abandon. This is dangerous because the throwable could be a fatal virtual machine error resulting from an internal error in the JVM, or an out of memory error or a stack overflow error that leaves the virtual machine in an unstable and unpredictable state. This commit removes catch throwable from the codebase and removes the temptation to use it by modifying listener APIs to receive instances of Exception instead of the top-level Throwable. Relates #19231	2016-07-04 08:41:06 -04:00
Nik Everett	71b95fb63c	Switch analysis from push to pull Instead of plugins calling `registerTokenizer` to extend the analyzer they now instead have to implement `AnalysisPlugin` and override `getTokenizer`. This lines up extending plugins in with extending scripts. This allows `AnalysisModule` to construct the `AnalysisRegistry` immediately as part of its constructor which makes testing anslysis much simpler. This also moves the default analysis configuration into `AnalysisModule` which is how search is setup. Like `ScriptModule`, `AnalysisModule` no longer extends `AbstractModule`. Instead it is only responsible for building `AnslysisRegistry`. We still bind `AnalysisRegistry` but we only do so in `Node`. This is means it is available at module construction time so we slowly remove the need to bind it in guice.	2016-06-26 07:15:42 -04:00
Adrien Grand	7ba5bceebe	Add a MultiTermAwareComponent marker interface to analysis factories. #19028 This is the same as what Lucene does for its analysis factories, and we hawe tests that make sure that the elasticsearch factories are in sync with Lucene's. This is a first step to move forward on #9978 and #18064.	2016-06-23 10:19:24 +02:00
Adrien Grand	600cbb6ab0	Upgrade to Lucene 6.1.0. #18926	2016-06-17 09:03:00 +02:00
Ryan Ernst	a4503c2aed	Plugins: Remove name() and description() from api In 2.0 we added plugin descriptors which require defining a name and description for the plugin. However, we still have name() and description() which must be overriden from the Plugin class. This still exists for classpath plugins. But classpath plugins are mainly for tests, and even then, referring to classpath plugins with their class is a better idea. This change removes name() and description(), replacing the name for classpath plugins with the full class name.	2016-06-15 17:12:22 -07:00
Adrien Grand	44c653f5a8	Upgrade to lucene-6.1.0-snapshot-3a57bea.	2016-06-10 16:18:12 +02:00
Adrien Grand	d182e171a4	Upgrade to Lucene 6.0.1.	2016-06-01 10:31:10 +02:00
Jason Tedor	9d39b05845	Remove deprecation suppression Failing the build on deprecation warnings was removed in `19b3ec88af`. This commit removes the suppressed deprecation warnings so that their use is surfaced in the build now. Relates #18582	2016-05-25 17:15:36 -04:00
Xu Zhang	3e4b470f83	Fix icu IndexScope setting	2016-04-22 15:03:02 -07:00
xuzha	cd527c5b92	Add support for customizing the rule file in ICU tokenizer Lucene allows to create a ICUTokenizer with a special config argument enabling the customization of the rule based iterator by providing custom rules files. This commit enable this feature. Users could provide a list of RBBI rule files to ICU tokenizer. closes #13146	2016-04-22 12:39:20 -07:00
Jun Ohtani	9eb242a5fe	Analyze API : Rename filters/token_filters/char_filter to filter/token_filter/char_filter Closes #15189	2016-04-21 18:05:11 +09:00
Adrien Grand	496c7fbd84	Upgrade Lucene 6 Release * upgrades numerics to new Point format * updates geo api changes * adds GeoPointDistanceRangeQuery as XGeoPointDistanceRangeQuery * cuts over to ES GeoHashUtils	2016-04-11 16:50:04 -05:00
Adrien Grand	42526ac28e	Remove Settings.settingsBuilder. We have both `Settings.settingsBuilder` and `Settings.builder` that do exactly the same thing, so we should keep only one. I kept `Settings.builder` since it has my preference but also it is the one that we use in examples of the Java API.	2016-04-08 18:10:02 +02:00
Simon Willnauer	e91a141233	Prevent index level setting from being configured on a node level Today we allow to set all kinds of index level settings on the node level which is error prone and difficult to get right in a consistent manner. For instance if some analyzers are setup in a yaml config file some nodes might not have these analyzers and then index creation fails. Nevertheless, this change allows some selected settings to be specified on a node level for instance: * `index.codec` which is used in a hot/cold node architecture and it's value is really per node or per index * `index.store.fs.fs_lock` which is also dependent on the filesystem a node uses All other index level setting must be specified on the index level. For existing clusters the index must be closed and all settings must be updated via the API on each of the indices. Closes #16799	2016-03-17 14:42:18 +01:00
Adrien Grand	5596e31068	Upgrade to lucene-6.0.0-f0aa4fc. #17075	2016-03-14 07:58:52 +01:00
Simon Willnauer	7a53a396e4	Remove Unneded @Inject annotations	2016-03-09 12:10:47 +01:00
Robert Muir	54018a5d37	upgrade to lucene 6.0.0-snapshot-bea235f Closes #16964 Squashed commit of the following: commit a23f9d2d29220991aa498214530753d7a5a148c6 Merge: eec9c4e `0b0a251` Author: Robert Muir <rmuir@apache.org> Date: Mon Mar 7 04:12:02 2016 -0500 Merge branch 'master' into lucene6 commit eec9c4e5cd11e9c3e0b426f04894bb2a6dae4f21 Merge: bc67205 `675d940` Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 13:45:00 2016 -0500 Merge branch 'master' into lucene6 commit bc67205bdfe1526eae277ab7856fc050ecbdb7b2 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 09:56:31 2016 -0500 fix test bug commit a60723b007ff12d97b1810cef473bd7b553a0327 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:35:35 2016 +0100 Fix SimpleValidateQueryIT to put braces around boosted terms commit ae3a49d7ba7ced448d2a5262e5d8ec98671a9090 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:27:25 2016 +0100 fix multimatchquery commit ae23fdb88a8f6d3fb7ba60fd1aaf3fd72d899aa5 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:20:49 2016 +0100 Rewrite DecayFunctionScoreIT to be independent of the similarity used This test relied a lot on the term scoring and compared scores that are dependent on the similarity. This commit changes the base query to be a predictable constant score query. commit 366c2d518c35d31251033f1b6f6a93f6e2ae327d Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 14:06:14 2016 +0100 Fix scoring in tests due to changes to idf calculation. Lucene 6 uses a different default similarity as well as a different way to calculate IDF. In contrast to older version lucene 6 uses docCount per field to calculate the IDF not the # of docs in the index to overcome the sparse field cases. commit dac99fd64ac2fa71b8d8d106fe68825e574c49f8 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 08:21:57 2016 -0500 don't hardcoded expected termquery score commit 6e9f340ba49ab10eed512df86d52a121aa775b0f Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 08:04:45 2016 -0500 suppress deprecation warning until migrated to points commit 3ac8908424b3fdad44a90a4f7bdb3eff7efd077d Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:21:43 2016 -0500 Remove invalid test: all commits have IDs, and its illegal to do this. commit c12976288124ad1a26467e7e848fb810548e7eab Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:06:14 2016 -0500 don't test with unsupported back compat commit 18bbfe76128570bc70883bf91ff4c44c82d27817 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:02:18 2016 -0500 remove now invalid lucene 4 backcompat test commit 7e730e572886f0ef2d3faba712e4256216ff01ec Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:58:52 2016 -0500 remove now invalid lucene 4 backwards test commit 244d2ab6868ba5ac9e0bcde3c2833743751a25ec Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:47:23 2016 -0500 use 6.0 codec commit 5f64d4a431a6fdaa1234adca23f154c2a1de8284 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:43:08 2016 -0500 compile, javadocs, forbidden-apis, etc commit 1f273cd62a7fe9ca8f8944acbbfc5cbdd3d81ccb Merge: cd33921 `29e3443` Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 10:45:29 2016 +0100 Merge branch 'master' into lucene6 commit cd33921ac742ef9fb351012eff35f3c7dbda7264 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:58:37 2016 -0500 fix hunspell dictionary loading commit c7fdbd837b01f7defe9cb1c24e2ec65604b0dc96 Merge: 4d4190f `d8948ba` Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:41:53 2016 -0500 Merge branch 'master' into lucene6 commit 4d4190fd82601aaafac6b8254ccb3edf218faa34 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:39:14 2016 -0500 remove nocommit commit 77ca69e288b1a41aa9595c921ed166c272a00ea8 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:38:24 2016 -0500 clean up numericutils vs legacynumericutils commit a466d696fbaad04b647ffbc0857a9439b583d0bf Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:32:43 2016 -0500 upgrade spatial4j commit 5412c747a8cfe638bacedbc8233163cb75cc3dc5 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:19:28 2016 -0500 move to 6.0.0-snapshot-8eada27 commit b32bfe924626b87e540692375ece09e7c2edb189 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:30:09 2016 +0100 Fix some test compile errors. commit 6ccde35e9840b03c68d1a2cd47c7923a06edf64a Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:25:51 2016 +0100 Current Lucene version is 6.0.0. commit f62e1015d931b4cc04c778298a8fa1ba65e97ad9 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:20:48 2016 +0100 Fix compile errors in NGramTokenFilterFactory. commit 6837c6eabf96075f743649da9b9b52dd39611c58 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:50:59 2016 +0100 Fix the edge ngram tokenizer/filter. commit ccd7f070de5efcdfbeb34b9555c65c4990bf1ba6 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:42:44 2016 +0100 The missing value is now accessible through a getter. commit bd3b77f9b28e5b05daa3d49683a9922a6baf2963 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:41:51 2016 +0100 Remove IndexCacheableQuery. commit 05f3091c347aeae80eeb16349ac51d2b53cf86f7 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:39:43 2016 +0100 Fix compilation of function_score queries. commit 81cda79a2431ac78f56b0cc5a5765387f662d801 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:35:02 2016 +0100 Fix compile errors in BlendedTermQuery. commit 70994ce8dd1eca0b995870974a38e20f26f96a7b Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 23:33:03 2016 -0500 add bug ID commit 29d4f1a71f36f646b5a6060bed3db019564a279d Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 21:02:32 2016 -0500 easy .store changes commit 5e1a1e6fd665fa455e88d3a8987362fad5f44bb1 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 20:47:24 2016 -0500 cleanups mostly around boosting commit 333a669ec6c305ada5645d13ed1da0e19ec1d053 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 20:27:56 2016 -0500 more simple fixes commit bd5cd98a1e089c866b6b4a5e159400b110140ce6 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 19:49:38 2016 -0500 more easy fixes and removal of ancient cruft commit a68f419ee47da5f9c9ce5b372f01d707e902474c Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 19:35:02 2016 -0500 cutover numerics commit 4ca5dc1fa47dd5892db00899032133318fff3116 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:34:18 2016 -0500 fix some constants commit 88710a17817086e477c6c021ec346d0534b7fb88 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:14:25 2016 -0500 Add spatial-extras jar as a core dependency commit c8cd6726583e5ce3f546ed355d4eca037164a30d Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:03:33 2016 -0500 update to lucene 6 jars	2016-03-07 04:12:23 -05:00
Adrien Grand	eef19be072	Deprecate string in favor of text/keyword. #16877 This commit removes the ability to use string fields on indices created on or after 5.0. Dynamic mappings now generate text fields by default for strings but there are plans to also add a sub keyword field (in a future PR). Most of the changes in this commit are just about replacing string with keyword or text. Some tests have been removed because they existed because of corner cases of string mappings like setting ignore-above on a text field or enabling term vectors on a keyword field which are now impossible. The plan is to remove strings entirely in 6.0.	2016-03-03 10:20:56 +01:00
Nik Everett	95cc3e38fc	Check test naming conventions on all modules The big win here is catching tests that are incorrectly named and will be skipped by gradle, providing a false sense of security. The whole thing takes about 10 seconds on my Macbook Air, not counting compiling the test classes, which seems worth it. Because this runs as a gradle task with propery UP-TO-DATE handling it can be skipped if the tests haven't been changed which should save some time. I chose to keep this in test:framework rather than a new subproject of buildSrc because ESIntegTestCase and doesn't inroduce any additional dependencies.	2016-02-29 16:31:49 -05:00
Jason Tedor	d94e391e71	Use System#lineSeparator and not system property This commit replaces a use of the system property "line.separator" and replaces it with a dedicated method that provides the same value. Closes #16776	2016-02-25 12:22:57 -05:00
Mike McCandless	5fffede2b0	Upgrade to Lucene 5.5.0 official release	2016-02-20 17:34:16 -05:00
Nicholas Knize	52ee4c7027	upgrade to lucene 5.5.0-snapshot-850c6c2	2016-02-11 14:28:50 -06:00
Simon Willnauer	e02d2e004e	Rewrite SettingsFilter to be immutable This change rewrites the entire settings filtering mechanism to be immutable. All filters must be registered up-front in the SettingsModule. Filters that are comma-sparated are not allowed anymore and check on registration. This commit also adds settings filtering to the default settings recently added to ensure we don't render filtered settings.	2016-02-03 20:05:55 +01:00
Robert Muir	d5dc05f69e	Upgrade to lucene 5.5.0-snapshot-1725675	2016-02-02 22:53:39 -05:00
Boaz Leskes	2a137b5548	Make index uuid available in Index, ShardRouting & ShardId In the early days Elasticsearch used to use the index name as the index identity. Around 1.0.0 we introduced a unique index uuid which is stored in the index setting. Since then we used that uuid in a few places but it is by far not the main identifier when working with indices, partially because it's not always readily available in all places. This PR start to make a move in the direction of using uuids instead of name by making sure that the uuid is available on the Index class (currently just a wrapper around the name) and as such also available via ShardRouting and ShardId. Note that this is by no means an attempt to do the right thing with the uuid in all places. In almost all places it falls back to the name based comparison that was done before. It is meant as a first step towards slowly improving the situation. Closes #16217	2016-01-28 08:40:10 +01:00
Daniel Mitterdorfer	e9bb3d31a3	Convert "path.*" and "pidfile" to new settings infra	2016-01-22 15:14:13 +01:00
Robert Muir	6e7e3a2274	Update lucene to r1725675 Adds DFI (divergence from independence) provider. Fixes test bugs passing invalid values for BM25 parameters.	2016-01-20 03:32:51 -05:00
Ryan Ernst	ef4f0a8699	Test: Make rest test framework accept http directly for the test cluster The rest test framework, because it used to be tightly integrated with ESIntegTestCase, currently expects the addresses for the test cluster to be passed using the transport protocol port. However, it only uses this to then find the http address. This change makes ESRestTestCase extend from ESTestCase instead of ESIntegTestCase, and changes the sysprop used to tests.rest.cluster, which now takes the http address. closes #15459	2016-01-18 16:44:14 -08:00
Nik Everett	81a7607256	Remove -Xlint:-deprecation from plugins Instead we suppress warnings about using deprecated stuff near the usage site with a comment about why its ok.	2016-01-07 20:44:46 -05:00
Adrien Grand	cf52e96c42	Upgrade to lucene-5.5.0-snapshot-1721183. Some files that implement or use the Scorer API had to be changed because of https://issues.apache.org/jira/browse/LUCENE-6919.	2015-12-21 17:02:08 +01:00
Ryan Ernst	4ea19995cf	Remove wildcard imports	2015-12-18 12:43:47 -08:00
Robert Muir	2741888498	Remove RuntimePermission("accessDeclaredMembers") Upgrades lucene to 5.5.0-1719088, randomizedtesting to 2.3.2, and securemock to 1.2	2015-12-10 14:26:55 -05:00
Simon Willnauer	9f6598b18d	Fix compile errors	2015-11-26 13:41:00 +01:00
Michael McCandless	e13b0d4bde	upgrade lucene to 5.4.0-snapshot-1715952	2015-11-23 17:13:49 -05:00
Ryan Ernst	c3cb1fd08c	Merge branch 'master' into javadocs	2015-11-19 10:43:43 -08:00
Michael McCandless	a0bf253d16	upgrade lucene 5.4 snapshot	2015-11-16 14:38:05 -05:00
Michael McCandless	9d7ca53022	upgrade lucene 5.4 snapshot	2015-11-16 14:35:17 -05:00
Ryan Ernst	4b17492456	Build: Add javadocs jars This change adds javadoc jars to core, test-framework and plugins. There were a couple issues which javadoc found, but doclint did not already find.	2015-11-15 01:44:42 -08:00
Boaz Leskes	ac0da91bf7	Extend usage of IndexSetting class I decided to leave external listeners (used by plugins) alone, for now. Closes #14731	2015-11-13 14:30:23 +01:00
Ryan Ernst	4b5f87cb7d	Build: Remove transitive dependencies Transitive dependencies can be confusing and hard to deal with when conflicts arise between them. This change removes transitive dependencies from elasticsearch, and forces any dependency conflicts to be resolved manually, instead of automatically by gradle. closes #14627	2015-11-10 15:01:41 -08:00
Adrien Grand	d6d7af0a6c	Upgrade to lucene-5.4.0-snapshot-1712973.	2015-11-09 15:53:27 +01:00
Ryan Ernst	b6dee6bd43	Merge pull request #14375 from rjernst/sweep_up_maven Remove maven pom files and supporting ant files	2015-10-30 18:59:11 -07:00
Areek Zillur	13b60e1b92	update to lucene-5.4.x-snapshot-1711508	2015-10-30 15:42:02 -04:00
Simon Willnauer	aa38d053d7	Simplify Analysis registration and configuration This change moves all the analysis component registration to the node level and removes the significant API overhead to register tokenfilter, tokenizer, charfilter and analyzer. All registration is done without guice interaction such that real factories via functional interfaces are passed instead of class objects that are instantiated at runtime. This change also hides the internal analyzer caching that was done previously in the IndicesAnalysisService entirely and decouples all analysis registration and creation from dependency injection.	2015-10-30 11:40:18 +01:00
Ryan Ernst	542522531a	Build: Remove maven pom files and supporting ant files This change removes the leftover pom files. A couple files were left for reference, namely in qa tests that have not yet been migrated (vagrant and multinode). The deb and rpm assemblies also still exist for reference when finishing their setup in gradle. See #13930	2015-10-29 23:53:49 -07:00
Ryan Ernst	c86100f636	Switch build system to Gradle See #13930	2015-10-29 11:40:19 -07:00
Adrien Grand	43958db10b	Upgrade to lucene-5.4-snapshot-1710880.	2015-10-28 09:34:54 +01:00
Simon Willnauer	8a9dd871d3	Make IndexSettings also own the IndexMetaData and separate node settings	2015-10-23 10:53:39 +02:00
Simon Willnauer	66d5d0c4f2	Replace IndexSettings annotation with a full-fledged class The @IndexSettings annoationat has been used to differentiate between node-level and index level settings. It was also decoupled from realtime-updates such that the settings object that a class got injected when it was created was static and not subject to change when an update was applied. This change removes the annoation and replaces it with a full-fledged class that adds type-safety and encapsulates additional functionality as well as checks on the settings.	2015-10-22 20:43:41 +02:00
Nik Everett	2cc97a0d3e	Remove and ban @Test There are three ways `@Test` was used. Way one: ```java @Test public void flubTheBlort() { ``` This way was always replaced with: ```java public void testFlubTheBlort() { ``` Or, maybe with a better method name if I was feeling generous. Way two: ```java @Test(throws=IllegalArgumentException.class) public void testFoo() { methodThatThrows(); } ``` This way of using `@Test` is actually pretty OK, but to get the tools to ban `@Test` entirely it can't be used. Instead: ```java public void testFoo() { try { methodThatThrows(); fail("Expected IllegalArgumentException"); } catch (IllegalArgumentException e ) { assertThat(e.getMessage(), containsString("something")); } } ``` This is longer but tests more than the old ways and is much more precise. Compare: ```java @Test(throws=IllegalArgumentException.class) public void testFoo() { some(); copy(); and(); pasted(); methodThatThrows(); code(); // <---- This was left here by mistake and is never called } ``` to: ```java @Test(throws=IllegalArgumentException.class) public void testFoo() { some(); copy(); and(); pasted(); try { methodThatThrows(); fail("Expected IllegalArgumentException"); } catch (IllegalArgumentException e ) { assertThat(e.getMessage(), containsString("something")); } } ``` The final use of test is: ```java @Test(timeout=1000) public void testFoo() { methodThatWasSlow(); } ``` This is the most insidious use of `@Test` because its tempting but tragically flawed. Its flaws are: 1. Hard and fast timeouts can look like they are asserting that something is faster and even do an ok job of it when you compare the timings on the same machine but as soon as you take them to another machine they start to be invalid. On a slow VM both the new and old methods fail. On a super-fast machine the slower and faster ways succeed. 2. Tests often contain slow `assert` calls so the performance of tests isn't sure to predict the performance of non-test code. 3. These timeouts are rude to debuggers because the test just drops out from under it after the timeout. Confusingly, timeouts are useful in tests because it'd be rude for a broken test to cause CI to abort the whole build after it hits a global timeout. But those timeouts should be very very long "backstop" timeouts and aren't useful assertions about speed. For all its flaws `@Test(timeout=1000)` doesn't have a good replacement __in__ __tests__. Nightly benchmarks like http://benchmarks.elasticsearch.org/ are useful here because they run on the same machine but they aren't quick to check and it takes lots of time to figure out the regressions. Sometimes its useful to compare dueling implementations but that requires keeping both implementations around. All and all we don't have a satisfactory answer to the question "what do you replace `@Test(timeout=1000)`" with. So we handle each occurrence on a case by case basis. For files with `@Test` this also: 1. Removes excess blank lines. They don't help anything. 2. Removes underscores from method names. Those would fail any code style checks we ever care to run and don't add to readability. Since I did this manually I didn't do it consistently. 3. Make sure all test method names start with `test`. Some used to end in `Test` or start with `verify` or `check` and they were picked up using the annotation. Without the annotation they always need to start with `test`. 4. Organizes imports using the rules we generate for Eclipse. For the most part this just removes `` imports which is a win all on its own. It was "required" to quickly remove `@Test`. 5. Removes unneeded casts. This is just a setting I have enabled in Eclipse and forgot to turn off before I did this work. It probably isn't hurting anything. 6. Removes trailing whitespace. Again, another Eclipse setting I forgot to turn off that doesn't hurt anything. Hopefully. 7. Swaps some tests override superclass tests to make them empty with `assumeTrue` so that the reasoning for the skips is logged in the test run and it doesn't "look like" that thing is being tested when it isn't. 8. Adds an oxford comma to an error message. The total test count doesn't change. I know. I counted. ```bash git checkout master && mvn clean && mvn install \| tee with_test git no_test_annotation master && mvn clean && mvn install \| tee not_test grep 'Tests summary' with_test > with_test_summary grep 'Tests summary' not_test > not_test_summary diff with_test_summary not_test_summary ``` These differ somewhat because some tests are skipped based on the random seed. The total shouldn't differ. But it does! ``` 1c1 < [INFO] Tests summary: 564 suites (1 ignored), 3171 tests, 31 ignored (31 assumptions) --- > [INFO] Tests summary: 564 suites (1 ignored), 3167 tests, 17 ignored (17 assumptions) ``` These are the core unit tests. So we dig further: ```bash cat with_test \| perl -pe 's/\n// if /^Suite/;s/.\n// if /IGNOR/;s/.\n// if /Assumption #/;s/.\n// if /HEARTBEAT/;s/Completed .+?,//' \| grep Suite > with_test_suites cat not_test \| perl -pe 's/\n// if /^Suite/;s/.\n// if /IGNOR/;s/.\n// if /Assumption #/;s/.*\n// if /HEARTBEAT/;s/Completed .+?,//' \| grep Suite > not_test_suites diff <(sort with_test_suites) <(sort not_test_suites) ``` The four tests with lower test numbers are all extend `AbstractQueryTestCase` and all have a method that looks like this: ```java @Override public void testToQuery() throws IOException { assumeTrue("test runs only when at least a type is registered", getCurrentTypes().length > 0); super.testToQuery(); } ``` It looks like this method was being double counted on master and isn't anymore. Closes #14028	2015-10-20 17:37:36 -04:00
Adrien Grand	5ae810991c	Upgrade to lucene-5.4-snapshot-1708254.	2015-10-16 09:41:36 +02:00
Robert Muir	b582de79ae	Merge pull request #13702 from rmuir/broke_javadocs Fix all javadocs issues, re-enable compiler warnings (but disable on java 9 where maven is broken)	2015-09-22 00:46:31 -04:00
Robert Muir	2f67cacaa3	Fix all javadocs issues, re-enable compiler warnings (but disable on java9 where maven is broken)	2015-09-21 23:35:32 -04:00
Ryan Ernst	18c519145d	Remove unnecessary copies of license and notice files We moved a lot of repositories into elasticsearch, but in their new location they retained their LICENSE.txt and NOTICE.txt files. These are all the same, and having the license and notice and the root of the repository should be sufficient.	2015-09-18 17:48:30 -07:00
Ryan Ernst	b14326d494	Merge pull request #13611 from rjernst/spec_in_resources Move rest-api-spec for plugins into test resources	2015-09-16 11:15:35 -07:00
Ryan Ernst	45f757de6d	Test: Move rest-api-spec for plugins into test resources Plugin tests require having rest-api tests, and currently copy that spec from a directory in the root of the plugin source into the test resources. This change moves the rest-api-spec dir into test resources so it is like any other test resources. It also removes unnecessary configuration for resources from the shared plugin pom.	2015-09-16 03:04:53 -07:00
Robert Muir	01e6d8e3dc	Remove java.lang.reflect.ReflectPermission "suppressAccessChecks" Closes #13603 Squashed commit of the following: commit 8799fb42d80297a79285beaf407b1bbecdb5854d Author: Robert Muir <rmuir@apache.org> Date: Wed Sep 16 03:32:29 2015 -0400 Add randomizedtesting snapshot note commit 0d874d9f0f5fddaeab8f48f9816a052dcaa691be Author: Robert Muir <rmuir@apache.org> Date: Wed Sep 16 03:11:01 2015 -0400 Add a mechanism for insecure plugins and get all tests passing commit 80540aeb9a264f6f299aaa3bc89df7f9b7923a60 Author: Robert Muir <rmuir@apache.org> Date: Tue Sep 15 22:59:29 2015 -0400 Really remove, we are killing this commit 884818c1ad44ca2e7572a6998c086580be919657 Author: Robert Muir <rmuir@apache.org> Date: Tue Sep 15 22:57:22 2015 -0400 fill in TODOs commit 34f4cb81f249edfec4d8d211da892f8c987e5948 Author: Robert Muir <rmuir@apache.org> Date: Tue Sep 15 22:31:43 2015 -0400 Publish snapshots of RR and lucene and cutover commit d68eb9d66ce059761805c64d67e41a29098c9afa Merge: f27e208 `f62da59` Author: Robert Muir <rmuir@apache.org> Date: Tue Sep 15 12:32:41 2015 -0400 Merge branch 'master' into kill-setaccessible commit f27e20855216dab6a6ad035d41018d8c67f3144c Author: Robert Muir <rmuir@apache.org> Date: Tue Sep 15 12:32:21 2015 -0400 make a real lucene snapshot	2015-09-16 04:08:31 -04:00
David Pilato	a38bcc5d62	[test] plugins simple RestIT tests don't work from IDE When running a RestIT test from the IDE, you actually start an internal node which does not automatically load the plugin you would like to test. We need to add: ```java @Override protected Collection<Class<? extends Plugin>> nodePlugins() { return pluginList(PLUGIN_HERE.class); } ``` Everything works fine when running from maven because each test basically: * installs elasticsearch * installs one plugin * starts elasticsearch with this plugin loaded * runs the test Note that this PR only fixes the fact we run an internal cluster with the expected plugin. Cloud tests will still fail when run from the IDE because is such a case you actually start an internal node with many mock plugins. And REST test suite for cloud plugins basically checks if the plugin is running by checking the output of NodesInfo API. And we check: ```yml - match: { nodes.$master.plugins.0.name: cloud-azure } - match: { nodes.$master.plugins.0.jvm: true } ``` But in that case, this condition is certainly false as we started also `mock-transport-service`, `mock-index-store`, `mock-engine-factory`, `node-mocks`, `asserting-local-transport`, `mock-search-service`. Closes #13479	2015-09-15 10:10:05 +02:00
Robert Muir	c1f2fc76c2	Upgrade lucene to r1702090 The semantics of the `boost` parameter for `function_score` changed. This is due to the fact that Lucene now requires that query boosts and top-level boosts are applied the same way.	2015-09-10 23:36:43 +02:00
Ryan Ernst	9e8a90a657	Add xlint ignores for warning classes, where appropriate.	2015-09-09 12:47:07 -07:00
Robert Muir	f216d92d19	Upgrade to lucene 5.4-snapshot r1701068	2015-09-03 15:13:33 -04:00
Simon Willnauer	796701d52e	Move version to 3.0.0-SNAPSHOT	2015-09-03 10:43:28 +02:00
Adrien Grand	c6d282f9f6	Remove extra licenses	2015-09-01 17:44:57 +02:00
Adrien Grand	3619cce53b	Update licenses for analysis plugins.	2015-09-01 15:21:49 +02:00
Ryan Ernst	c3a22e6f0e	Merge branch 'master' into construct_it_yourself	2015-08-18 09:50:47 -07:00
David Pilato	d21afc8090	[maven] rename artifactIds from `elasticsearch-something` to `something` In plugins, we are using non consistent naming. We use `elasticsearch-cloud-aws` as the artifactId, which generates a jar file called `elasticsearch-cloud-aws-VERSION.jar`. But when you want to install the plugin, you will end up with a shorter name for the plugin `cloud-aws`. ``` bin/plugin install cloud-aws ``` This commit changes that and use consistent names for `artifactId`, so `finalName`. Also changed maven names.	2015-08-18 13:38:48 +02:00
Ryan Ernst	dc1fa6736a	Merged AbstractPlugin and Plugin. Also added Settings back to indexModules and shardModules	2015-08-18 02:46:32 -07:00
Ryan Ernst	2bf84593e0	Plugins: Simplify Plugin API for constructing modules The Plugin interface currently contains 6 different methods for adding modules. Elasticsearch has 3 different levels of injectors, and for each of those, there are two methods. The first takes no arguments and returns a collection of class objects to construct. The second takes a Settings object and returns a collection of module objects already constructed. The settings argument is unecessary because the plugin can already get the settings from its constructor. Removing that, the only difference between the two versions is returning an already constructed Module, or a module Class, and there is no reason the plugin can't construct all their modules themselves. This change reduces the plugin api down to just 3 methods for adding modules. Each returns a Collection<Module>. It also removes the processModule method, which was unnecessary since onModule implementations fullfill the same requirement. And finally, it renames the modules() method to nodeModules() so it is clear these are created once for each node.	2015-08-17 20:41:45 -07:00
Ryan Ernst	2450e3ccc8	Internal: Flatten IndicesModule and add tests The IndicesModule was made up of two submodules, one which handled registering queries, and the other for registering hunspell dictionaries. This change moves those into IndicesModule. It also adds a new extension point type, InstanceMap. This is simply a Map<K,V>, where K and V are actual objects, not classes like most other extension points. I also added a test method to help testing instance map extensions. This was particularly painful because of how guice binds the key and value as separate bindings, and then reconstitutes them into a Map at injection time. In order to gain access to the object which links the key and value, I had to tweak our guice copy to not use an anonymous inner class for the Provider. Note that I also renamed the existing extension point types, since they were very redundant. For example, ExtensionPoint.MapExtensionPoint is now ExtensionPoint.ClassMap. See #12783.	2015-08-16 17:56:35 -07:00
Clinton Gormley	e143c6e460	Docs: Prepare plugin and integration docs for 2.0 * Centralised plugin docs in docs/plugins/ * Moved integrations into same docs * Moved community clients into the clients section of the docs * Removed docs/community Closes #11734 Closes #11724 Closes #11636 Closes #11635 Closes #11632 Closes #11630 Closes #12046 Closes #12438 Closes #12579	2015-08-15 18:02:43 +02:00
Simon Willnauer	b447e2ae99	Move master to [2.1.0-SNAPSHOT]	2015-08-14 23:44:06 +02:00
Ryan Ernst	be638fb6ef	Internal: Remove Environment.resolveConfig This method has multiple modes of resolving config files by first looking in the config directory, then on the classpath, and finally by prefixing with "config/" on the classpath. Most of the places taking advantage of this were tests, so they did not have to setup a real home dir with config. The only place that was really relying on it was the code which loads names.txt to randomly choose a node name. This change fixes test to setup fake home dirs with their config files. It also makes the logic for finding names.txt explicit: look in config dir, and if it doesn't exist, load /config/names.txt from the classpath.	2015-08-14 03:03:47 -07:00
Adrien Grand	28708f8013	Tests: Fix SimpleIcuAnalysisTests to not load a non-existent configuration file.	2015-08-13 14:39:10 +02:00
Simon Willnauer	605253a39f	Cut over master to 2.0.0-SNAPSHOT	2015-08-12 21:16:08 +02:00
Robert Muir	6f9a067197	Change master branch back to 2.0-beta1	2015-08-04 15:38:21 -04:00
Simon Willnauer	6753f7f03e	Cut over master to 2.0.0-SNAPSHOT	2015-08-04 10:54:12 +02:00
Ryan Ernst	1e12d03252	Tests: Rename base tests cases to use "TestCase" suffix Most of the abstract base test classes we have were previously @Ignored. However, there were also some other tests ignored. Having two ways to quiet tests is confusing, and clearly it has caused some tests to get lost in the fold. This change moves all base test classes to use the "TestCase" suffix, which is not picked up by the test class name pattern. It also removes @Ignore from (almost) all tests, and adds it to forbidden apis. And since we were renaming, I shorted base test class names to use "ES" instead of "Elasticsearch". I type this a lot of types a day, and I have heard others express a similar desire for a shorter name. closes #10659	2015-08-03 17:43:00 -07:00
Robert Muir	4040f194f5	Refactor pluginservice Closes #12367 Squashed commit of the following: commit 9453c411798121aa5439c52e95301f60a022ba5f Merge: 3511a9c `828d8c7` Author: Robert Muir <rmuir@apache.org> Date: Wed Jul 22 08:22:41 2015 -0400 Merge branch 'master' into refactor_pluginservice commit 3511a9c616503c447de9f0df9b4e9db3e22abd58 Author: Ryan Ernst <ryan@iernst.net> Date: Tue Jul 21 21:50:15 2015 -0700 Remove duplicated constant commit 4a9b5b4621b0ef2e74c1e017d9c8cf624dd27713 Author: Ryan Ernst <ryan@iernst.net> Date: Tue Jul 21 21:01:57 2015 -0700 Add check that plugin must specify at least site or jvm commit 19aef2f0596153a549ef4b7f4483694de41e101b Author: Ryan Ernst <ryan@iernst.net> Date: Tue Jul 21 20:52:58 2015 -0700 Change plugin "plugin" property to "classname" commit 07ae396f30ed592b7499a086adca72d3f327fe4c Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 23:36:05 2015 -0400 remove test with no methods commit 550e73bf3d0f94562f4dde95239409dc5a24ce25 Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 23:31:58 2015 -0400 fix loading to use classname commit 04463aed12046da0da5cac2a24c3ace51a79f799 Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 23:24:19 2015 -0400 rename to classname commit 9f3afadd1caf89448c2eb913757036da48758b2d Author: Ryan Ernst <ryan@iernst.net> Date: Tue Jul 21 20:18:46 2015 -0700 moved PluginInfo and refactored parsing from properties file commit df63ccc1b8b7cc64d3e59d23f6c8e827825eba87 Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 23:08:26 2015 -0400 fix test commit c7febd844be358707823186a8c7a2d21e37540c9 Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 23:03:44 2015 -0400 remove test commit 017b3410cf9d2b7fca1b8653e6f1ebe2f2519257 Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 22:58:31 2015 -0400 fix test commit c9922938df48041ad43bbb3ed6746f71bc846629 Merge: ad59af4 `01ea89a` Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 22:37:28 2015 -0400 Merge branch 'master' into refactor_pluginservice commit ad59af465e1f1ac58897e63e0c25fcce641148a7 Author: Areek Zillur <areek.zillur@elasticsearch.com> Date: Tue Jul 21 19:30:26 2015 -0400 [TEST] Verify expected number of nodes in cluster before issuing shardStores request commit f0f5a1e087255215b93656550fbc6bd89b8b3205 Author: Lee Hinman <lee@writequit.org> Date: Tue Jul 21 11:27:28 2015 -0600 Ignore EngineClosedException during translog fysnc When performing an operation on a primary, the state is captured and the operation is performed on the primary shard. The original request is then modified to increment the version of the operation as preparation for it to be sent to the replicas. If the request first fails on the primary during the translog sync (because the Engine is already closed due to shadow primaries closing the engine on relocation), then the operation is retried on the new primary after being modified for the replica shards. It will then fail due to the version being incorrect (the document does not yet exist but the request expects a version of "1"). Order of operations: - Request is executed against primary - Request is modified (version incremented) so it can be sent to replicas - Engine's translog is fsync'd if necessary (failing, and throwing an exception) - Modified request is retried against new primary This change ignores the exception where the engine is already closed when syncing the translog (similar to how we ignore exceptions when refreshing the shard if the ?refresh=true flag is used). commit 4ac68bb1658688550ced0c4f479dee6d8b617777 Author: Shay Banon <kimchy@gmail.com> Date: Tue Jul 21 22:37:29 2015 +0200 Replica allocator unit tests First batch of unit tests to verify the behavior of replica allocator commit 94609fc5943c8d85adc751b553847ab4cebe58a3 Author: Jason Tedor <jason@tedor.me> Date: Tue Jul 21 14:04:46 2015 -0400 Correctly list blobs in Azure storage to prevent snapshot corruption and do not unnecessarily duplicate Lucene segments in Azure Storage This commit addresses an issue that was leading to snapshot corruption for snapshots stored as blobs in Azure Storage. The underlying issue is that in cases when multiple snapshots of an index were taken and persisted into Azure Storage, snapshots subsequent to the first would repeatedly overwrite the snapshot files. This issue does render useless all snapshots except the final snapshot. The root cause of this is due to String concatenation involving null. In particular, to list all of the blobs in a snapshot directory in Azure the code would use the method listBlobsByPrefix where the prefix is null. In the listBlobsByPrefix method, the path keyPath + prefix is constructed. However, per 5.1.11, 5.4 and 15.18.1 of the Java Language Specification, the reference null is first converted to the string "null" before performing the concatenation. This leads to no blobs being returned and therefore the snapshot mechanism would operate as if it were writing the first snapshot of the index. The fix is simply to check if prefix is null and handle the concatenation accordingly. Upon fixing this issue so that subsequent snapshots would no longer overwrite earlier snapshots, it was discovered that the snapshot metadata returned by the listBlobsByPrefix method was not sufficient for the snapshot layer to detect whether or not the Lucene segments had already been copied to the Azure storage layer in an earlier snapshot. This led the snapshot layer to unnecessarily duplicate these Lucene segments in Azure Storage. The root cause of this is due to known behavior in the CloudBlobContainer.getBlockBlobReference method in the Azure API. Namely, this method does not fetch blob attributes from Azure. As such, the lengths of all the blobs appeared to the snapshot layer to be of length zero and therefore they would compare as not equal to any new blobs that the snapshot layer is going to persist. To remediate this, the method CloudBlockBlob.downloadAttributes must be invoked. This will fetch the attributes from Azure Storage so that a proper comparison of the blobs can be performed. Closes elastic/elasticsearch-cloud-azure#51, closes elastic/elasticsearch-cloud-azure#99 commit cf1d481ce5dda0a45805e42f3b2e0e1e5d028b9e Author: Lee Hinman <lee@writequit.org> Date: Mon Jul 20 08:41:55 2015 -0600 Unit tests for `nodesAndVersions` on shared filesystems With the `recover_on_any_node` setting, these unit tests check that the correct node list and versions are returned. commit 3c27cc32395c3624f7c794904d9ea4faf2eccbfb Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 14:15:59 2015 -0400 don't fail junit4 integration tests if there are no tests. instead fail the failsafe plugin, which means the external cluster will still get shut down commit 95d2756c5a8c21a157fa844273fc83dfa3c00aea Author: Alexander Reelsen <alexander@reelsen.net> Date: Tue Jul 21 17:16:53 2015 +0200 Testing: Fix help displaying tests under windows The help files are using a unix based file separator, where as the test relies on the help being based on the file system separator. This commit fixes the test to remove all `\r` characters before comparing strings. The test has also been moved into its own CliToolTestCase, as it does not need to be an integration test. commit 944f06ea36bd836f007f8eaade8f571d6140aad9 Author: Clinton Gormley <clint@traveljury.com> Date: Tue Jul 21 18:04:52 2015 +0200 Refactored check_license_and_sha.pl to accept a license dir and package path In preparation for the move to building the core zip, tar.gz, rpm, and deb as separate modules, refactored check_license_and_sha.pl to: * accept a license dir and path to the package to check on the command line * to be able to extract zip, tar.gz, deb, and rpm * all packages except rpm will work on Windows commit 2585431e8dfa5c82a2cc5b304cd03eee9bed7a4c Author: Chris Earle <pickypg@users.noreply.github.com> Date: Tue Jul 21 08:35:28 2015 -0700 Updating breaking changes - field names cannot be mapped with `.` in them - fixed asciidoc issue where the list was not recognized as a list commit de299b9d3f4615b12e2226a1e2eff5a38ecaf15f Author: Shay Banon <kimchy@gmail.com> Date: Tue Jul 21 13:27:52 2015 +0200 Replace primaryPostAllocated flag and use UnassignedInfo There is no need to maintain additional state as to if a primary was allocated post api creation on the index routing table, we hold all this information already in the UnassignedInfo class. closes #12374 commit 43080bff40f60bedce5bdbc92df302f73aeb9cae Author: Alexander Reelsen <alexander@reelsen.net> Date: Tue Jul 21 15:45:05 2015 +0200 PluginManager: Fix bin/plugin calls in scripts/bats test The release and smoke test python scripts used to install plugins in the old fashion. Also the BATS testing suite installed/removed plugins in that way. Here the marvel tests have been removed, as marvel currently does not work with the master branch. In addition documentation has been updated as well, where it was still missing. commit b81ccba48993bc13c7678e6d979fd96998499233 Author: Boaz Leskes <b.leskes@gmail.com> Date: Tue Jul 21 11:37:50 2015 +0200 Discovery: make sure NodeJoinController.ElectionCallback is always called from the update cluster state thread This is important for correct handling of the joining thread. This causes assertions to trip in our test runs. See http://build-us-00.elastic.co/job/es_g1gc_master_metal/11653/ as an example Closes #12372 commit 331853790bf29e34fb248ebc4c1ba585b44f5cab Author: Boaz Leskes <b.leskes@gmail.com> Date: Tue Jul 21 15:54:36 2015 +0200 Remove left over no commit from TransportReplicationAction It asks to double check thread pool rejection. I did and don't see problems with it. commit e5724931bbc1603e37faa977af4235507f4811f5 Author: Alexander Reelsen <alexander@reelsen.net> Date: Tue Jul 21 15:31:57 2015 +0200 CliTool: Various PluginManager fixes The new plugin manager parser was not called correctly in the scripts. In addition the plugin manager now creates a plugins/ directory in case it does not exist. Also the integration tests called the plugin manager in the deprecated way. commit 7a815a370f83ff12ffb12717ac2fe62571311279 Author: Alexander Reelsen <alexander@reelsen.net> Date: Tue Jul 21 13:54:18 2015 +0200 CLITool: Port PluginManager to use CLITool In order to unify the handling and reuse the CLITool infrastructure the plugin manager should make use of this as well. This obsolets the -i and --install options but requires the user to use `install` as the first argument of the CLI. This is basically just a port of the existing functionality, which is also the reason why this is not a refactoring of the plugin manager, which will come in a separate commit. commit 7f171eba7b71ac5682a355684b6da703ffbfccc7 Author: Martijn van Groningen <martijn.v.groningen@gmail.com> Date: Tue Jul 21 10:44:21 2015 +0200 Remove custom execute local logic in TransportSingleShardAction and TransportInstanceSingleOperationAction and rely on transport service to execute locally. (forking thread etc.) Change TransportInstanceSingleOperationAction to have shardActionHandler to, so we can execute locally without endless spinning. commit 0f38e3eca6b570f74b552e70b4673f47934442e1 Author: Ryan Ernst <ryan@iernst.net> Date: Tue Jul 21 17:36:12 2015 -0700 More readMetadata tests and pickiness commit 880b47281bd69bd37807e8252934321b089c9f8e Author: Ryan Ernst <ryan@iernst.net> Date: Tue Jul 21 14:42:09 2015 -0700 Started unit tests for plugin service commit cd7c8ddd7b8c4f3457824b493bffb19c156c7899 Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 07:21:07 2015 -0400 fix tests commit 673454f0b14f072f66ed70e32110fae4f7aad642 Author: Robert Muir <rmuir@apache.org> Date: Tue Jul 21 06:58:25 2015 -0400 refactor pluginservice	2015-07-22 10:45:45 -04:00
Alexander Reelsen	2f54b89a23	CLITool: Port PluginManager to use CLITool In order to unify the handling and reuse the CLITool infrastructure the plugin manager should make use of this as well. This obsolets the -i and --install options but requires the user to use `install` as the first argument of the CLI. This is basically just a port of the existing functionality, which is also the reason why this is not a refactoring of the plugin manager, which will come in a separate commit.	2015-07-21 14:15:39 +02:00
uboness	b40186652c	updated the elasticsearch versioning format Moving to from `X.Y.Z.beta1`/`X.Y.Z.RC1` to `X.Y.Z-beta1`/`X.Y.Z-rc1`	2015-07-13 20:26:37 +02:00
Simon Willnauer	e0708813a9	Make 2.0.0.beta1-SNAPSHOT the current version. Today everything is tight to having the next version as the latest. In order to work towards 2.0.0.beta1 we need to fix all the usage of 2.0.0-SNAPSHOT to reflect the version we will release soon. Usually we do this on the release branch but to simplify things I wanna keep this on master for now and move to 2.1.0-SNAPSHOT on master once we created a 2.0 branch. Closes #12148	2015-07-09 21:24:32 +02:00
David Pilato	4738fa04b1	[ICU] move integration tests to REST tests We can keep only unit tests in plugins instead of starting each time a local node and running tests against it. Also follow up of #12091 Tests can use more than one JVM We don't need anymore to set the number of jvm to run tests as we moved IT to Rest Tests For all plugins but cloud plugins which will require another way for running integration tests.	2015-07-08 15:19:26 +02:00
Robert Muir	c88c12c6c8	Add rest tests for analysis-icu	2015-07-07 00:15:49 -04:00
David Pilato	e7a6b51bab	[maven] change groupId / artifactId When we generate our project, we can get something like: ``` ├── dev-tools ├── elasticsearch ├── elasticsearch-parent ├── elasticsearch-plugin ├── plugin │ ├── elasticsearch-analysis-icu │ ├── elasticsearch-analysis-kuromoji │ ├── elasticsearch-analysis-phonetic │ ├── elasticsearch-analysis-smartcn │ ├── elasticsearch-analysis-stempel │ ├── elasticsearch-cloud-aws │ ├── elasticsearch-cloud-azure │ ├── elasticsearch-cloud-gce │ ├── elasticsearch-delete-by-query │ ├── elasticsearch-lang-javascript │ └── elasticsearch-lang-python ├── rest-api-spec └── securemock ``` I propose here to use a common naming for artifacts: start always with `elasticsearch-`. Also, move `elasticsearch-plugin` to `org.elasticsearch.plugin` groupId. So we could have: ``` ├── elasticsearch ├── elasticsearch-dev-tools ├── elasticsearch-parent ├── elasticsearch-rest-api-spec ├── elasticsearch-securemock ├── plugin │ ├── elasticsearch-analysis-icu │ ├── elasticsearch-analysis-kuromoji │ ├── elasticsearch-analysis-phonetic │ ├── elasticsearch-analysis-smartcn │ ├── elasticsearch-analysis-stempel │ ├── elasticsearch-cloud-aws │ ├── elasticsearch-cloud-azure │ ├── elasticsearch-cloud-gce │ ├── elasticsearch-delete-by-query │ ├── elasticsearch-lang-javascript │ ├── elasticsearch-lang-python │ └── elasticsearch-plugin ```	2015-07-06 17:17:07 +02:00
David Pilato	e429b8d190	[build] include in plugins only needed jars Follow up for https://github.com/elastic/elasticsearch-analysis-kuromoji/issues/61 We don't shade anymore elasticsearch dependencies, so plugins might include jars in the distribution ZIP file which might not be needed anymore. For example, `elasticsearch-cloud-aws` comes with: ``` Archive: cloud-aws/target/releases/elasticsearch-cloud-aws-2.0.0-SNAPSHOT.zip Length Date Time Name -------- ---- ---- ---- 1920788 05-18-15 09:42 aws-java-sdk-ec2-1.9.34.jar 503963 05-18-15 09:42 aws-java-sdk-core-1.9.34.jar 232771 01-19-15 09:24 commons-codec-1.6.jar 915096 01-19-15 09:24 jackson-databind-2.3.2.jar 252288 05-18-15 09:42 aws-java-sdk-kms-1.9.34.jar 62050 01-19-15 09:24 commons-logging-1.1.3.jar 282269 10-31-14 13:19 httpcore-4.3.2.jar 35058 01-19-15 09:24 jackson-annotations-2.3.0.jar 229998 05-29-15 12:28 jackson-core-2.5.3.jar 589289 01-19-15 09:24 joda-time-2.7.jar 562858 05-18-15 09:42 aws-java-sdk-s3-1.9.34.jar 590533 10-31-14 13:19 httpclient-4.3.5.jar 44854 06-12-15 19:22 elasticsearch-cloud-aws-2.0.0-SNAPSHOT.jar -------- ------- 6221815 13 files ``` A lot of those files are already distributed with elasticsearch itself so classes are available within the classloader. We mark all es core dependencies as provided in plugins. We also remove `groupId` as already defined in parent pom. And we remove non needed licenses files as some jars are not included anymore in plugins. Closes #11647.	2015-07-01 21:37:27 +02:00
Clinton Gormley	d8a186e121	Added LICENSE and NOTICE files for all plugins	2015-06-23 12:50:31 +02:00
Simon Willnauer	ed3cc8d034	add analysis-icu module	2015-06-05 13:12:23 +02:00

1 2 3 4 5

242 Commits