Mid-term we should switch from `BytesValues` to Lucene's doc values APIs, in
particular the `SortedSetDocValues` class. While `BytesValues.WithOrdinals` and
`SortedSetDocValues` expose the same functionality, `BytesValues.WithOrdinals`
exposes its ordinals via a separate `Ordinals.Docs` object, whereas
`SortedSetDocValues` exposes them on the same object that holds the values.
This commit merges ordinals into `BytesValues.WithOrdinals` in order to bring
the two classes closer together.
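For reference, a minimal sketch of reading per-document ordinals and values from the same object with the Lucene 4.x `SortedSetDocValues` API; the class, field name and document id below are hypothetical:
```
import java.io.IOException;

import org.apache.lucene.index.AtomicReader;
import org.apache.lucene.index.SortedSetDocValues;
import org.apache.lucene.util.BytesRef;

class SortedSetExample {
    // Ordinals and values are exposed on the same object (Lucene 4.x API);
    // the "tags" field name is only an example.
    static void collectTerms(AtomicReader reader, int docId) throws IOException {
        SortedSetDocValues dv = reader.getSortedSetDocValues("tags");
        if (dv == null) {
            return; // the field has no sorted-set doc values
        }
        BytesRef scratch = new BytesRef();
        dv.setDocument(docId); // position on the current document
        for (long ord = dv.nextOrd(); ord != SortedSetDocValues.NO_MORE_ORDS; ord = dv.nextOrd()) {
            dv.lookupOrd(ord, scratch); // resolve the ordinal to its term bytes
            // ... use scratch ...
        }
    }
}
```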
Global ordinals were a bit tricky to migrate, so I just changed them to use
Lucene's OrdinalMap, which will soon (LUCENE-5767, scheduled for 4.9) have the
same optimizations as our global ordinals.
Close#6524
The `exists` and `missing` filters need to merge postings lists of all existing
terms, which can be very costly, especially on high-cardinality fields. This
commit indexes the field names of a document under `_field_names` and reuses it
to speed up the `exists` and `missing` filters.
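As an illustration (not the actual implementation), an `exists` check then boils down to a single term lookup on `_field_names` instead of a union over every term of the target field; `TermFilter` is the Lucene queries-module filter and the field name is hypothetical:
```
import org.apache.lucene.index.Term;
import org.apache.lucene.queries.TermFilter;
import org.apache.lucene.search.Filter;

class FieldNamesSketch {
    // Sketch only: one postings-list lookup against _field_names replaces
    // merging the postings lists of all terms of the target field.
    static Filter existsFilter(String field) {
        return new TermFilter(new Term("_field_names", field));
    }
}
```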
This is only enabled for indices that are created on or after Elasticsearch
1.3.0.
Close#5659
`VersionFieldMapper.defaultDocValuesFormat` claims that the default is `disk`.
This value is not used to choose the doc values format in the index, but for
mappings serialization, in order to know when the `_version` doc values format
differs from the default and needs to be serialized. This made it impossible
to use the `disk` doc values format for `_version`, since mappings would never
retain that information at serialization time.
Close#6523
The TTL, size, timestamp and index meta properties could be lost on an
update of a single field mapping due to a wrong comparison in the merge
method, which was caused by a wrong initialization that marked an update
as explicitly disabled instead of unset.
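A minimal illustration of the underlying problem, with hypothetical names rather than the actual mapper code: a merge has to tell "explicitly disabled" apart from "never set", otherwise an update that says nothing about the property looks like a request to disable it.
```
// Hypothetical tri-state flag: initializing an unset value as DISABLED makes
// the merge below drop the existing setting even though the update never
// mentioned the property.
enum EnabledState {
    ENABLED, DISABLED, UNSET;

    static EnabledState merge(EnabledState current, EnabledState update) {
        // keep the current value unless the update states something explicitly
        return update == UNSET ? current : update;
    }
}
```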
Closes#5053
Added the `http.jsonp.enable` setting to allow disabling JSONP responses, as those
might pose a security risk and can be turned off if unused.
This also fixes two bugs in NettyHttpChannel (sketched below):
* JSONP responses were never setting application/javascript as the content-type
* The content-type and content-length headers were being overwritten even if they were set before
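A sketch of the header guard, assuming the Netty 3.x HTTP codec used at the time; the method and the mime types are placeholders rather than the actual NettyHttpChannel code:
```
import org.jboss.netty.handler.codec.http.HttpHeaders;
import org.jboss.netty.handler.codec.http.HttpResponse;

class ContentTypeGuard {
    // Sketch only: pick the JSONP content type when needed, but never
    // overwrite a content-type header that was set earlier on the response.
    static void setContentType(HttpResponse response, boolean isJsonp) {
        String contentType = isJsonp ? "application/javascript" : "application/json; charset=UTF-8";
        if (!response.headers().contains(HttpHeaders.Names.CONTENT_TYPE)) {
            response.headers().set(HttpHeaders.Names.CONTENT_TYPE, contentType);
        }
    }
}
```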
Closes#6164
Commit fbd7c9aa5d introduced a regression that caused
the min_doc_count to be equal to the number of documents in the
background set. As a result no buckets were built when the
response for significant terms was created.
This only affected the final XContent response.
closes#6535
Currently we send relocation & flush actions based on all assigned ShardRoutings. During the final stage of relocation, we may fail to refresh/flush a shard if the coordinating node has not yet processed the cluster state update indicating that a relocation is completed *and* the relocation target node has already processed it (i.e., started the shard and is accepting new indexing requests).
Closes#6545
Percentile Rank Aggregation is the reverse of the Percentiles aggregation. It determines the percentile rank (the proportion of values less than a given value) of each value in the provided array.
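A toy illustration of the metric itself, not the aggregation's implementation (which works on an approximate digest rather than an exact scan); the helper below is hypothetical:
```
class PercentileRankSketch {
    // Toy example: exact percentile rank of `value` over a small array.
    static double percentileRank(double[] values, double value) {
        if (values.length == 0) {
            return Double.NaN;
        }
        int below = 0;
        for (double v : values) {
            if (v < value) {
                below++;
            }
        }
        return 100.0 * below / values.length; // proportion of values less than `value`
    }
}
```
For example, over the values [1, 3, 5, 7] the percentile rank of 5 would be 50, since two of the four values are smaller.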
Closes#6386
Nested documents were indexed as separate documents, but it was never checked
whether the hits represent nested documents or not. Therefore, nested objects
could match non-nested queries and nested queries could also match non-nested
documents.
Examples are in issue #6540.
closes #6540
closes #6544
Client, ClusterAdminClient and IndicesAdminClient had corresponding
intermediate `internal` interfaces that are unnecessary and cause
a lot of casting. This commit removes the intermediate interfaces
and uses the super interfaces directly.
This commit also adds `Releasable` to `Node` and `Client` so that they can be
used with constructs like try-with-resources.
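A usage sketch of what this enables, assuming `Releasable` makes `Node` and `Client` usable as try-with-resources targets; the local node setup and the health call are only placeholders:
```
import org.elasticsearch.client.Client;
import org.elasticsearch.node.Node;
import org.elasticsearch.node.NodeBuilder;

class TryWithResourcesSketch {
    // Sketch only, assuming Node and Client can act as AutoCloseable resources:
    // both are released automatically when the block exits.
    static void run() {
        try (Node node = NodeBuilder.nodeBuilder().local(true).node();
             Client client = node.client()) {
            client.admin().cluster().prepareHealth().get();
        }
    }
}
```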
Closes #4355
Closes #6517
This commit renames `TestCluster` -> `InternalTestCluster` and
`ImmutableTestCluster` to `TestCluster` for consistency. This also
makes `ExternalTestCluster` and `InternalTestCluster` consistent
with respect to their execution environment.
Closes#6510
This commit adds a basic infrastructure as well as primitive tests
to ensure version backwards compatibility between the current
development trunk and an arbitrary previous version. The compatibility
tests are simple unit tests derived from a base class that starts
and manages nodes from a provided elasticsearch release package.
The following command line executes all backwards compatibility tests
in isolation:
```
mvn test -Dtests.bwc=true -Dtests.bwc.version=1.2.1 -Dtests.class=org.elasticsearch.bwcompat.*
```
These tests run basic checks like rolling upgrades and
routing/searching/get etc. against the specified version. The version
must be present in the `./backwards` folder as
`./backwards/elasticsearch-x.y.z`.
The alias -> (index -> alias) map, specifically the index -> alias one, typically just holds one entry, yet we eagerly initialize it to the number of indices. When there are many indices, each with many aliases, this is a very large overhead per alias.
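A minimal sketch of the sizing issue, with hypothetical names rather than the actual metadata code:
```
import java.util.HashMap;
import java.util.Map;

class AliasMapSizing {
    // Hypothetical illustration: the inner index -> alias map usually holds a
    // single entry, so pre-sizing it to the number of indices reserves a large
    // backing array for every alias in the cluster state.
    static Map<String, Object> innerAliasMap(int numberOfIndices) {
        // before: new HashMap<>(numberOfIndices)  -- capacity proportional to the index count
        // after: start small and let it grow in the rare multi-index case
        return new HashMap<>(1);
    }
}
```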
closes#6504
Our field data currently exposes hashes of the bytes values. That takes roughly
4 bytes per unique value, which is definitely not negligible on high-cardinality
fields.
These hashes have been used for 3 different purposes:
- term-based aggregations,
- parent/child queries,
- the percolator _id -> Query cache.
Both aggregations and parent/child queries have been moved to ordinals which
provide a greater speedup and lower memory usage. In the case of the percolator,
the hash is used in conjunction with HashedBytesRef to avoid recomputing the hash
value when resolving a query given its ID. However, removing this has no impact
on PercolatorStressBenchmark.
Close#6500
The `ordinals` execution mode was how terms aggregations initially managed not
to be too slow: it cached reads into the terms dictionary using ordinals.
However, this doesn't behave nicely on high-cardinality fields, since the reads
into the terms dict are random and this execution mode loads all unique terms
into memory. The `global_ordinals` execution mode (the default since 1.2) is
expected to be better in all cases.
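For reference, a sketch of setting an execution hint explicitly through the Java API; the aggregation and field names are hypothetical, and with `global_ordinals` being the default the hint is normally unnecessary:
```
import org.elasticsearch.search.aggregations.AggregationBuilders;
import org.elasticsearch.search.aggregations.bucket.terms.TermsBuilder;

class ExecutionHintSketch {
    // Sketch only: request the global_ordinals execution mode explicitly
    // ("categories" and "category" are hypothetical names).
    static TermsBuilder categoriesAgg() {
        return AggregationBuilders.terms("categories")
                .field("category")
                .executionHint("global_ordinals");
    }
}
```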
Close#6499
Under some rare circumstances:
- local transport,
- the range aggregation has both a parent and a child aggregation,
- the range aggregation got no documents on at least one shard and several
documents on at least one other shard,
the range aggregation could return incorrect counts and sub-aggregations.
The root cause is that since the reduce happens in-place and since the range
aggregation uses the same instance for all sub-aggregation in case of an
empty bucket, sometimes non-empty buckets would have been reduced into this
shared instance.
In order to avoid similar bugs in the future, aggregations have been updated
to return a new instance when reducing instead of doing it in-place.
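The shape of the change, with hypothetical types rather than the actual aggregation classes: reduction builds and returns a fresh instance instead of mutating one of its inputs, so a shared empty-bucket instance can never accumulate counts that belong to other buckets.
```
import java.util.List;

class ReduceSketch {
    // Hypothetical bucket type illustrating copy-on-reduce.
    static final class Bucket {
        final long docCount;

        Bucket(long docCount) {
            this.docCount = docCount;
        }

        // Combine shard-level buckets into a brand-new instance, leaving the
        // inputs (possibly a shared "empty bucket") untouched.
        static Bucket reduce(List<Bucket> shardBuckets) {
            long total = 0;
            for (Bucket b : shardBuckets) {
                total += b.docCount;
            }
            return new Bucket(total);
        }
    }
}
```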
Close#6435
Moved BulkProcessor tests from BulkTests to the newly added BulkProcessorTests class.
Strengthened BulkProcessorTests by adding randomizations to existing tests and new tests for concurrent requests and exceptions.
Also made sure that afterBulk is called only once per request if concurrentRequests==0.
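A usage sketch of the case the fix covers: a `BulkProcessor` built with `setConcurrentRequests(0)` executes bulks synchronously, and `afterBulk` must then be invoked exactly once per executed request (the listener bodies are left empty for brevity):
```
import org.elasticsearch.action.bulk.BulkProcessor;
import org.elasticsearch.action.bulk.BulkRequest;
import org.elasticsearch.action.bulk.BulkResponse;
import org.elasticsearch.client.Client;

class BulkProcessorSketch {
    // With concurrentRequests == 0, bulks run on the calling thread and the
    // listener must see exactly one afterBulk call per executed request.
    static BulkProcessor build(Client client) {
        return BulkProcessor.builder(client, new BulkProcessor.Listener() {
            @Override
            public void beforeBulk(long executionId, BulkRequest request) { }

            @Override
            public void afterBulk(long executionId, BulkRequest request, BulkResponse response) { }

            @Override
            public void afterBulk(long executionId, BulkRequest request, Throwable failure) { }
        }).setConcurrentRequests(0).setBulkActions(100).build();
    }
}
```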
Closes#5038