OpenSearch

Commit Graph

Author	SHA1	Message	Date
Adrien Grand	b72f27a410	Core: Cut over to the Lucene filter cache. This removes Elasticsearch's filter cache and uses Lucene's instead. It has some implications: - custom cache keys (`_cache_key`) are unsupported - decisions are made internally and can't be overridden by users ('_cache`) - not only filters can be cached but also all queries that do not need scores - parent/child queries can now be cached, however cached entries are only valid for the current top-level reader so in practice it will likely only be used on read-only indices - the cache deduplicates filters, which plays nicer with large keys (eg. `terms`) - better stats: we already had ram usage and evictions, but now also hit count, miss count, lookup count, number of cached doc id sets and current number of doc id sets in the cache - dynamically changing the filter cache size is not supported anymore Internally, an important change is that it removes the NoCacheFilter infrastructure in favour of making Query.rewrite specializing the query for the current reader so that it will only be cached on this reader (look for IndexCacheableQuery). Note that consuming filters with the query API (createWeight/scorer) instead of the filter API (getDocIdSet) is important for parent/child queries because otherwise a QueryWrapperFilter(ParentQuery) would run the wrapped query per segment while relations might be cross segments.	2015-05-04 09:02:15 +02:00
Robert Muir	3a89b990ea	remove another unnecessary permission	2015-05-03 23:37:05 -04:00
Robert Muir	fc54ff5f10	remove now-unnecessary test permission	2015-05-02 15:19:01 -04:00
Robert Muir	86fc8ceac7	simplify security rules	2015-05-02 14:42:06 -04:00
Robert Muir	8c0d03c3ee	add a hack for windows	2015-05-02 12:41:38 -04:00
Robert Muir	bdd6d9c705	heisenbug	2015-05-02 12:27:38 -04:00
Robert Muir	e1238c5e4c	add 2 more x	2015-05-02 12:11:33 -04:00
Robert Muir	ff44f45af1	log this	2015-05-02 12:08:46 -04:00
Robert Muir	db003a0b32	remove hack	2015-05-02 10:52:27 -04:00
Karel Minarik	9a1f11da6e	Trimmed the main `elasticsearch.yml` configuration file The main `elasticsearch.yml` file mixed configuration, documentation and advice together. Due to a much improved documentation at <http://www.elastic.co/guide/>, the content has been trimmed, and only the essential settings have been left, to prevent the urge to excessive over-configuration. Related: 8d0f1a7d123f579fc772e82ef6b9aae08f6d13fd	2015-05-02 12:54:22 +02:00
Robert Muir	50a785c546	add a hack to see if this fixes windows issues	2015-05-01 16:41:49 -04:00
Zachary Tong	e3ae1df6f0	[DOCS] Restructure Aggs documentation	2015-05-01 16:04:55 -04:00
Robert Muir	dbcdb40f68	fix sigar policy line that cannot be really working	2015-05-01 16:02:00 -04:00
Robert Muir	6e6949d3f4	Add debugging when security init screws up (or at trace level if you wish)	2015-05-01 15:47:50 -04:00
Clinton Gormley	df1914cb21	Java API docs: Removed mlt-field	2015-05-01 21:32:02 +02:00
Robert Muir	5e87801a4b	Merge branch 'master' into eight_point_three	2015-05-01 15:24:06 -04:00
Robert Muir	09ff11812e	add debugging	2015-05-01 14:58:18 -04:00
Clinton Gormley	c28bf3bb3f	Docs: Updated elasticsearch.org links to elastic.co	2015-05-01 20:46:12 +02:00
Shay Banon	b2e022bd94	Merge pull request #10924 from kimchy/exclude_jackson_ann Exclude jackson-databind dependency	2015-05-01 19:09:35 +02:00
Shay Banon	16c7689355	Exclude jackson-databind dependency the jackson yaml data format pulls in the databind dependency, its important that we exclude it so we won't use any of its classes by mistake	2015-05-01 19:03:14 +02:00
Robert Muir	c15f3eff3a	Merge branch 'die_cwd_die' of github.com:elastic/elasticsearch	2015-05-01 12:06:02 -04:00
Robert Muir	260f5eb9ff	Merge pull request #10913 from rmuir/spanspanspanspanspan Add span within/containing queries.	2015-05-01 11:26:08 -04:00
Robert Muir	dfe1d1463c	fix doc typo	2015-04-30 23:46:37 -04:00
Robert Muir	aade6194b7	Add span within/containing queries. Expose new span queries from https://issues.apache.org/jira/browse/LUCENE-6083 Within returns matches from 'little' that are enclosed inside of a match from 'big'. Containing returns matches from 'big' that enclose matches from 'little'.	2015-04-30 23:31:31 -04:00
Igor Motov	c165afb4d5	Logging: Add logging of slow cluster state tasks Closes #10874	2015-04-30 19:13:49 -04:00
Jack Conradson	aa968f6b65	Scripting: Add Field Methods Added infrastructure to allow basic member methods in the expressions language to be called. The methods must have a signature with no arguments. Also added the following member methods for date fields (and it should be easy to add more) * getYear * getMonth * getDayOfMonth * getHourOfDay * getMinutes * getSeconds Allow fields to be accessed without using the member variable [value]. (Note that both ways can be used to access fields for back-compat.) closes #10890	2015-04-30 15:36:46 -07:00
Ryan Ernst	d2b12e4fc2	Mappings: Remove docs for type level analyzer defaults These settings were removed in #9430.	2015-04-30 13:57:55 -07:00
Ryan Ernst	4ef9f3ca63	Mappings: Remove file based default mappings Using files that must be specified on each node is an anti-pattern from the API based goal of ES. This change removes the ability to specify the default mapping with a file on each node. closes #10620	2015-04-30 13:50:35 -07:00
Ryan Ernst	a0451a37cc	Upgrade lucene snapshot to r1677039	2015-04-30 13:43:19 -07:00
Boaz Leskes	d596f5cc45	Decouple recoveries from engine flush In order to safely complete recoveries / relocations we have to keep all operation done since the recovery start at available for replay. At the moment we do so by preventing the engine from flushing and thus making sure that the operations are kept in the translog. A side effect of this is that the translog keeps on growing until the recovery is done. This is not a problem as we do need these operations but if the another recovery starts concurrently it may have an unneededly long translog to replay. Also, if we shutdown the engine for some reason at this point (like when a node is restarted) we have to recover a long translog when we come back. To void this, the translog is changed to be based on multiple files instead of a single one. This allows recoveries to keep hold to the files they need while allowing the engine to flush and do a lucene commit (which will create a new translog files bellow the hood). Change highlights: - Refactor Translog file management to allow for multiple files. - Translog maintains a list of referenced files, both by outstanding recoveries and files containing operations not yet committed to Lucene. - A new Translog.View concept is introduced, allowing recoveries to get a reference to all currently uncommitted translog files plus all future translog files created until the view is closed. They can use this view to iterate over operations. - Recovery phase3 is removed. That phase was replaying operations while preventing new writes to the engine. This is unneeded as standard indexing also send all operations from the start of the recovery to the recovering shard. Replay all ops in the view acquired in recovery start is enough to guarantee no operation is lost. - IndexShard now creates the translog together with the engine. The translog is closed by the engine on close. ShadowIndexShards do not open the translog. - Moved the ownership of translog fsyncing to the translog it self, changing the responsible setting to `index.translog.sync_interval` (was `index.gateway.local.sync`) Closes #10624	2015-04-30 23:42:50 +03:00
Adrien Grand	12c19508fb	Tests: Fix test bug in aggregations' EquivalenceTests due to the change of the default min_doc_count.	2015-04-30 19:14:56 +02:00
Martijn van Groningen	7a6fe809d0	inner_hits: Don't use bitset cache for children filters. Only parent filters should use bitset filter cache, to avoid memory being wasted. Also in case of object fields inline the field name into the nested object, instead of creating an additional (dummy) nested identity. Closes #10662 Closes #10629	2015-04-30 16:56:43 +02:00
Adrien Grand	82ad074dfe	Merge pull request #10904 from jpountz/enhancement/histogram_min_doc_count_0 Aggs: Change the default `min_doc_count` to 0 on histograms.	2015-04-30 15:54:02 +02:00
Adrien Grand	e5be85d586	Aggs: Change the default `min_doc_count` to 0 on histograms. The assumption is that gaps in histogram are generally undesirable, for instance if you want to build a visualization from it. Additionally, we are building new aggregations that require that there are no gaps to work correctly (eg. derivatives).	2015-04-30 15:48:23 +02:00
Colin Goodheart-Smithe	969f53e399	fix typo in Min bucket aggregation docs	2015-04-30 14:41:01 +01:00
Colin Goodheart-Smithe	d16bf992a9	Aggregations: min_bucket aggregation An aggregation to calculate the minimum value in a set of buckets. Closes #9999	2015-04-30 13:34:21 +01:00
javanna	77ac4528fb	Transport: read/writeGenericValue to support BytesRef Add support for BytesRef to existing StreamInput#readGenericValue and StreamOutput#writeGenericValue Closes #10878	2015-04-30 10:59:01 +02:00
Ryan Ernst	2fd387d378	fix dumb test copy/paste mistake	2015-04-29 23:33:43 -07:00
Ryan Ernst	79a1c38ed6	Mappings: Fix _field_names to not have doc values When doc values were turned on a by default, most meta fields had it explicitly disabled. However, _field_names was missed. This change forces doc values to be off always for _field_names and removes the unnecessary support when creating index fields. closes #10892	2015-04-29 23:12:16 -07:00
Ryan Ernst	d8fed71fd4	Merge branch 'master' into die_cwd_die	2015-04-29 16:48:13 -07:00
Ryan Ernst	5236a3d226	fix some tests, all fast tests now pass	2015-04-29 16:32:54 -07:00
Zachary Tong	351a4d3315	[DOCS] Fix movavg images and naming	2015-04-29 13:33:54 -04:00
Igor Motov	8e5543dea0	Test: ignore cluster state differences on the nodes that disconnected from the master	2015-04-29 12:49:41 -04:00
Igor Motov	bac135261c	Test: make sure that tests are not affected by changing in address resolution settings	2015-04-29 12:49:40 -04:00
Igor Motov	38be1e8a1a	Test: remove reference to the local node before comparing cluster states in ensureClusterStateConsistency	2015-04-29 12:49:40 -04:00
Simon Willnauer	9828e955f3	[TEST] enable host name resolving to gain consistent transport addresses in clusterstate	2015-04-29 12:49:40 -04:00
Simon Willnauer	58eed45ee5	[TEST] Move XContentTestUtils.java into o.e.test folder Classes referenced by the Test base classes must be under this package otherwise the test jar can't be used in a 3rd party application.	2015-04-29 12:49:40 -04:00
Igor Motov	478c253f89	Add support for cluster state diffs Adds support for calculating and sending diffs instead of full cluster state of the most frequently changing elements - cluster state, meta data and routing table. Closes #6295	2015-04-29 12:49:40 -04:00
Lee Hinman	6e1c995741	Clarify logging about disk thresholds in DiskThresholdDecider	2015-04-29 10:43:00 -06:00
David Pilato	3c3e9b63a7	fix: query string time zone not working If you define exactly the same date range query using either `DATE+0200` notation or `DATE` and set `timezone: +0200`, elasticsearch gives back different results: ``` DELETE foo PUT /foo { "mapping": { "tweets": { "properties": { "tweet_date": { "type": "date" } } } } } POST /foo/tweets/1/ { "tweet_date": "2015-04-05T23:00:00+0000" } POST /foo/tweets/2/ { "tweet_date": "2015-04-06T00:00:00+0000" } GET /foo/tweets/_search?pretty { "query": { "query_string": { "query": "tweet_date:[2015-04-06T00:00:00+0200 TO 2015-04-06T23:00:00+0200]" } } } GET /foo/tweets/_search?pretty { "query": { "query_string": { "query": "tweet_date:[2015-04-06T00:00:00 TO 2015-04-06T23:00:00]", "time_zone": "+0200" } } } ``` This PR fixes it and will also allow us to add the same feature to simple_query_string as well in another PR. Closes #10477. (cherry picked from commit 880f4a0)	2015-04-29 18:10:02 +02:00

11785 Commits
