OpenSearch

Commit Graph

Author	SHA1	Message	Date
Alan Woodward	5107949402	Allow TokenFilterFactories to rewrite themselves against their preceding chain (#33702 ) We currently special-case SynonymFilterFactory and SynonymGraphFilterFactory, which need to know their predecessors in the analysis chain in order to correctly analyze their synonym lists. This special-casing doesn't work with Referring filter factories, such as the Multiplexer or Conditional filters. We also have a number of filters (eg the Multiplexer) that will break synonyms when they appear before them in a chain, because they produce multiple tokens at the same position. This commit adds two methods to the TokenFilterFactory interface. * `getChainAwareTokenFilterFactory()` allows a filter factory to rewrite itself against its preceding filter chain, or to resolve references to other filters. It replaces `ReferringFilterFactory` and `CustomAnalyzerProvider.checkAndApplySynonymFilter`, and by default returns `this`. * `getSynonymFilter()` defines whether or not a filter should be applied when building a synonym list `Analyzer`. By default it returns `true`. Fixes #33609	2018-09-19 15:52:14 +01:00
Vladimir Dolzhenko	a3e8b831ee	add elasticsearch-shard tool (#32281 ) Relates #31389	2018-09-19 10:28:22 +02:00
David Turner	c9765d5fb9	Emphasize that filesystem-level backups don't work (#33102 ) It is not obvious that a filesystem-level backup may capture an inconsistent set of files that may fail on restore, or (worse) succeed having silently discarded some data. This change spells the out, and reorganises the first page or so of the snapshot/restore docs to make this warning fit more nicely.	2018-09-19 08:36:03 +01:00
Tim Heckel	3928921a1d	[DOCS] Update scroll.asciidoc (#32530 )	2018-09-18 17:00:22 +02:00
Abdon Pijpelink	32ee6148d2	[DOCS] Clarify scoring for multi_match phrase type (#32672 ) The original statement "Runs a match_phrase query on each field and combines the _score from each field." for the phrase type is a but misleading. The phrase type behaves like the best_fields type and does not combine the scores of each fields.	2018-09-18 16:57:33 +02:00
Dan Tennery-Spalding	3596512e6a	[DOCS] Corrected several grammar errors (#33781 )	2018-09-18 16:46:22 +02:00
David Turner	421f58e172	Remove discovery-file plugin (#33257 ) In #33241 we moved the file-based discovery functionality to core Elasticsearch, but preserved the `discovery-file` plugin, and support for the existing location of the `unicast_hosts.txt` file, for BWC reasons. This commit completes the removal of this plugin.	2018-09-18 12:01:16 +01:00
markharwood	2fa09f062e	New plugin - Annotated_text field type (#30364 ) New plugin for annotated_text field type. Largely a copy of `text` field type but adds ability to include markdown-like syntax in the text. The “AnnotatedText” class parses text+markup and converts into plain text and AnnotationTokens. The annotation token values are injected unchanged alongside the regular text tokens to provide a form of additional indexed overlay useful in positional searches and highlighting. Annotated_text fields do not support fielddata as we want to phase this out. Also includes a new "annotated" highlighter type that retains annotations and merges in search hits as additional annotation markup. Closes #29467	2018-09-18 10:25:27 +01:00
Shaunak Kashyap	2aba52de8f	Implement xpack.monitoring.elasticsearch.collection.enabled setting (#33474 ) * Implement xpack.monitoring.elasticsearch.collection.enabled setting * Fixing line lengths * Updating constructor calls in test * Removing unused import * Fixing line lengths in test classes * Make monitoringService.isElasticsearchCollectionEnabled() return true for tests * Remove wrong expectation * Adding unit tests for new flag to be false * Fixing line wrapping/indentation for better readability * Adding docs * Fixing logic in ClusterStatsCollector::shouldCollect * Rebasing with master and resolving conflicts * Simplifying implementation by gating scheduling * Doc fixes / improvements * Making methods package private * Fixing wording * Fixing method access	2018-09-17 18:33:43 -07:00
ben5556	012b9c7539	Corrected aggregation name to match the example (#33786 )	2018-09-17 18:24:43 -07:00
Or Bin	a5bad4d92c	Docs: Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' (#33744 ) Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' Closes #33728	2018-09-17 15:35:54 -04:00
Ryan Ernst	3046656ab1	Scripting: Rework joda time backcompat (#33486 ) This commit switches the joda time backcompat in scripting to use augmentation over ZonedDateTime. The augmentation methods provide compatibility with the missing methods between joda's DateTime and java's ZonedDateTime. Due to getDayOfWeek returning an enum in the java API, ZonedDateTime is wrapped so that the method can return int like the joda time does. The java time api version is renamed to getDayOfWeekEnum, which will be kept through 7.x for compatibility while users switch back to getDayOfWeek once joda compatibility is removed.	2018-09-16 19:18:00 -07:00
Lisa Cawley	9706584836	[DOCS] Moves security reference to docs folder (#33643 )	2018-09-14 13:09:47 -07:00
Jay Modi	3914a980f7	Security: remove wrapping in put user response (#33512 ) This change removes the wrapping of the created field in the put user response. The created field was added as a top level field in #32332, while also still being wrapped within the `user` object of the response. Since the value is available in both formats in 6.x, we can remove the wrapped version for 7.0.	2018-09-13 14:40:36 -06:00
Costin Leau	32a22ca00e	DOC: improved wording in SQL client app section	2018-09-13 22:07:23 +03:00
Lisa Cawley	c3a817957d	[DOCS] Moves securing-communications to docs (#33640 )	2018-09-13 10:42:26 -07:00
Costin Leau	a192785fc8	DOC: Add SQL section on client applications Add setup instructions for a number of GUI SQL applications	2018-09-13 15:44:52 +03:00
Jason Tedor	c023f67c5d	Add migration note for remote cluster settings (#33632 ) The remote cluster settings search.remote.* have been renamed to cluster.remote.* and are automatically upgraded in the cluster state on gateway recovery, and on put. This commit adds a note to the migration docs for these changes.	2018-09-12 13:37:11 -04:00
Simon Willnauer	c783488e97	Add `_source`-only snapshot repository (#32844 ) This change adds a `_source` only snapshot repository that allows to wrap any existing repository as a _backend_ to snapshot only the `_source` part including live docs markers. Snapshots taken with the `source` repository won't include any indices, doc-values or points. The snapshot will be reduced in size and functionality such that it requires full re-indexing after it's successfully restored. The restore process will copy the `_source` data locally starts a special shard and engine to allow `match_all` scrolls and searches. Any other query, or get call will fail with and unsupported operation exception. The restored index is also marked as read-only. This feature aims mainly for disaster recovery use-cases where snapshot size is a concern or where time to restore is less of an issue. NOTE: The snapshot produced by this repository is still a valid lucene index. This change doesn't allow for any longer retention policies which is out of scope for this change.	2018-09-12 17:47:10 +02:00
Christoph Büscher	fe478c23b7	[Docs] Fix heading in composite-aggregation.asciidoc (#33627 ) The heading for the "Missing buckets" should be on the same level as the the "Order" section.	2018-09-12 16:56:03 +02:00
Joel Green	0b567c0eeb	[Docs] Update match-query.asciidoc (#33610 )	2018-09-12 14:35:27 +02:00
Jim Ferenczi	4561c5ee83	Clarify context suggestions filtering and boosting (#33601 ) This change clarifies the documentation of the context completion suggester regarding filtering and boosting with contexts. Unlike the suggester v1, filtering on multiple contexts works as a disjunction, a suggestion matches if it contains at least one of the provided context values and boosting selects the maximum score among the matching contexts. This commit also adapts an old test that was written for the v1 suggester and commented out for version 2 because the behavior changed.	2018-09-12 08:47:32 +02:00
Lisa Cawley	cbc6fa0ecb	[DOCS] Adds missing built-in user information (#33585 )	2018-09-11 07:56:26 -07:00
Alan Woodward	f598297f55	Add predicate_token_filter (#33431 ) This allows users to filter out tokens from a TokenStream using painless scripts, instead of having to write specialised Java code and packaging it up into a plugin. The commit also refactors the AnalysisPredicateScript.Token class so that it wraps and makes read-only an AttributeSource.	2018-09-11 09:16:39 +01:00
Tanguy Leroux	079d130d8c	[Test] Remove duplicate method in TestShardRouting (#32815 )	2018-09-10 18:29:00 +02:00
lcawl	6b780e9926	[DOCS] Fixing formatting issues in breaking changes	2018-09-07 16:53:36 -07:00
lcawl	944868908c	[DOCS] Fixes formatting error	2018-09-07 10:26:44 -07:00
Jim Ferenczi	79cd6385fe	Collapse package structure for metrics aggs (#33463 ) This change collapses all metrics aggregations classes into a single package `org.elasticsearch.aggregations.metrics`. It also restricts the visibility of some classes (aggregators and factories) that should not be used outside of the package. Relates #22868	2018-09-07 10:58:06 +02:00
Lisa Cawley	7441c0376e	[DOCS] Adds delete forecast API (#33401 )	2018-09-06 09:20:42 -07:00
Costin Leau	443f9caddd	DOC: Enhance SQL Functions documentation Split function section into multiple chapters Add String functions Add (small) section on Conversion/Cast functions Add missing aggregation functions Enable documentation testing (was disabled by accident). While at it, fix failing tests Improve spec tests to allow multi-line queries (useful for docs) Add ability to ignore a spec test (name should end with -Ignore)	2018-09-06 18:09:53 +03:00
Jim Ferenczi	7ad71f906a	Upgrade to a Lucene 8 snapshot (#33310 ) The main benefit of the upgrade for users is the search optimization for top scored documents when the total hit count is not needed. However this optimization is not activated in this change, there is another issue opened to discuss how it should be integrated smoothly. Some comments about the change: * Tests that can produce negative scores have been adapted but we need to forbid them completely: #33309 Closes #32899	2018-09-06 14:42:06 +02:00
Jason Tedor	d71ced1b00	Generalize search.remote settings to cluster.remote (#33413 ) With features like CCR building on the CCS infrastructure, the settings prefix search.remote makes less sense as the namespace for these remote cluster settings than does a more general namespace like cluster.remote. This commit replaces these settings with cluster.remote with a fallback to the deprecated settings search.remote.	2018-09-05 20:43:44 -04:00
Christoph Büscher	eafc2a5470	Don't count metadata fields towards index.mapping.total_fields.limit (#33386 ) The maximum number of fields per index is limited to 1000 by default by the `index.mapping.total_fields.limit` setting to prevent accidental mapping explosions due to too many fields. Currently all metadata fields also count towards this limit, which can lead to some confusion when using lower limits. It is not obvious for users that they cannot actually add as many fields as are specified by the limit in this case. This change takes the number of metadata fields out of the field count that we check against the field limit. It also adds tests that check that we can add fields up to the specified limit, but throw an exception for any additional field added. Closes #24096	2018-09-05 18:27:21 +02:00
Alan Woodward	636442700c	Add conditional token filter to elasticsearch (#31958 ) This allows tokenfilters to be applied selectively, depending on the status of the current token in the tokenstream. The filter takes a scripted predicate, and only applies its subfilter when the predicate returns true.	2018-09-05 14:52:43 +01:00
Paul Sanwald	c303006e6b	Add interval response parameter to AutoDateInterval histogram (#33254 ) Adds the interval used to the aggregation response.	2018-09-05 07:35:59 -04:00
Gordon Brown	cfd3fa72ed	Add user-defined cluster metadata (#33325 ) Adds a place for users to store cluster-wide data they wish to associate with the cluster via the Cluster Settings API. This is strictly for user-defined data, Elasticsearch makes no other other use of these settings.	2018-09-04 16:14:18 -06:00
Lisa Cawley	f3f8d9b833	[DOCS] Moves monitoring pages to docs folder (#33324 )	2018-09-04 10:02:13 -07:00
lcawl	c5109a54ee	[DOCS] Revert fix for broken link	2018-09-04 09:26:28 -07:00
Costin Leau	43f80fa82b	DOCS: Fix anchor and example typos	2018-09-04 19:06:44 +03:00
lcawl	303ae25a6a	[DOCS] Fixes broken link	2018-09-04 09:05:30 -07:00
Costin Leau	17c7f99343	SQL: Show/desc commands now support table ids (#33363 ) Extend SHOW TABLES, DESCRIBE and SHOW COLUMNS to support table identifiers not just SQL LIKE pattern. This allows both Elasticsearch-style multi-index patterns and SQL LIKE. To disambiguate between the two (as the " vs ' can be easy to miss), the grammar now requires LIKE keyword as a prefix for all LIKE-like patterns. Also added some docs comparing the two types of patterns. Fix #33294	2018-09-04 16:54:10 +03:00
Nikolay Vasiliev	d9f394b099	[DOCS] fix a couple of typos (#33356 )	2018-09-04 10:07:11 +02:00
Christoph Büscher	79db16f9bb	[Docs] Add search timeout caveats (#33354 ) Global search timeouts and timeouts specified in the search request body use the same internal mechanism as search cancellation. Therefore the same caveats apply, mostly around the responsiveness of the timeout which gets only checked by a running search on segment boundaries by default. Closes #31263	2018-09-03 20:56:05 +02:00
Christoph Büscher	978d1ed257	[Docs] Improve tuning for speed advice (#33315 ) This change merges two sections in the "Tune for search speed" documentation that recommend mapping numeric identifiers as keywords. Both sections contain mostly the same advice, so they can be merged. Closes #32733	2018-09-03 11:09:30 +02:00
Jim Ferenczi	713c07e14d	Add early termination support to BucketCollector (#33279 ) This commit adds the support to early terminate the collection of a leaf in the aggregation framework. This change introduces a MultiBucketCollector which handles CollectionTerminatedException exactly like the Lucene MultiCollector. Any aggregator can now throw a CollectionTerminatedException without stopping the collection of a sibling aggregator. This is useful for aggregators that can infer their result without visiting all documents (e.g.: a min/max aggregation on a match_all query).	2018-09-03 09:34:35 +02:00
Nhat Nguyen	3197a6bbdd	Merge branch 'master' into ccr * master: HLRC: ML Flush job (#33187) HLRC: Adding ML Job stats (#33183) LLREST: Drop deprecated methods (#33223) Mute testSyncerOnClosingShard [DOCS] Moves machine learning APIs to docs folder (#31118)	2018-09-02 09:30:51 -04:00
Nik Everett	f28cddf951	LLREST: Drop deprecated methods (#33223 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. In a long series of PRs I've changed all of the old style requests. This drops the deprecated methods and will be released with 7.0.	2018-09-01 11:11:25 -04:00
Lisa Cawley	b7a63f7e7d	[DOCS] Moves machine learning APIs to docs folder (#31118 )	2018-08-31 16:49:24 -07:00
Nhat Nguyen	b93507608a	Merge branch 'master' into ccr * master: Mute test watcher usage stats output [Rollup] Fix FullClusterRestart test Adjust soft-deletes version after backport into 6.5 completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194) Fix AwaitsFix issue number Mute SmokeTestWatcherWithSecurityIT testsi drop `index.shard.check_on_startup: fix` (#32279) tracked at [DOCS] Moves ml folder from x-pack/docs to docs (#33248) [DOCS] Move rollup APIs to docs (#31450) [DOCS] Rename X-Pack Commands section (#33005) TEST: Disable soft-deletes in ParentChildTestCase Fixes SecurityIntegTestCase so it always adds at least one alias (#33296) Fix pom for build-tools (#33300) Lazy evaluate java9home (#33301) SQL: test coverage for JdbcResultSet (#32813) Work around to be able to generate eclipse projects (#33295) Highlight that index_phrases only works if no slop is used (#33303) Different handling for security specific errors in the CLI. Fix for https://github.com/elastic/elasticsearch/issues/33230 (#33255) [ML] Refactor delimited file structure detection (#33233) SQL: Support multi-index format as table identifier (#33278) MINOR: Remove Dead Code from PathTrie (#33280) Enable forbiddenapis server java9 (#33245)	2018-08-31 19:03:04 -04:00
Vladimir Dolzhenko	00b272af32	completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194 ) Relates to #32279	2018-08-31 22:08:28 +02:00

1 2 3 4 5 ...

4637 Commits