4196 Commits

Author SHA1 Message Date
Martijn van Groningen
43b21719f5 [TEST] size should start from 1, top_hits aggregation doesn't support size <= 0 2014-06-02 13:21:13 +02:00
Simon Willnauer
3b31f25624 [TEST] Ensure cluster size reflected in the cluster state
We perform some management operations that require the cluster to be
consistent with respect to the number of nodes in the cluster state
/ visible to the master in order to rely on the ack mechanism. This
only applies to the test infrastructure when nodes are not explicitly
started / stopped as well as while tearing down the cluster and wiping
indices after the tests.
2014-06-02 11:57:32 +02:00
mikemccand
7552b69b1f Core: reuse Lucene's TermsEnum for faster _uid/version lookup during
Reusing Lucene's TermsEnum for _uid/version lookups gives a small
indexing (updates) speedup and brings us closer to not having
to spend RAM on bloom filters.

Closes #6212
2014-05-31 17:38:48 -04:00
Martijn van Groningen
f51a09d8f7 Core: Protects against: 'from + size > scoreDocs.length' in case only single shard response 2014-05-31 20:30:11 +02:00
javanna
e8995ecaa7 [TEST] speed up HighlightSearchTests a bit
Randomize rewrite methods instead of trying them all when highlighting multi term queries with postings highlighter
Rely on search type randomization and remove all the explicit setSearchType calls as they are not needed anymore
Remove explicit `.from`, `.size` and `.explain`, not needed and might slow tests down (especially explain)
2014-05-31 16:29:53 +02:00
Martijn van Groningen
01ca8491cf Core: apply 'from' if there is one shard result. 2014-05-31 13:35:11 +02:00
Martijn van Groningen
b8366a3213 Aggregations: apply 'from' if there is one shard result. 2014-05-31 13:34:49 +02:00
Clinton Gormley
46a67b638d Parent/Child: Added min_children/max_children to has_child query/filter
Added support for min_children and max_children parameters to
the has_child query and filter. A parent document will only
be considered a match if the number of matching children
falls between the min/max bounds.

Closes #6019
2014-05-30 19:38:39 +02:00
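The min/max rule described in 46a67b638d can be illustrated with a small sketch. This is a Python model of the counting rule only, not the Elasticsearch implementation: the function name `filter_parents`, the data layout, the inclusive bounds, and the convention that a bound of 0 means "no bound" are all assumptions made for illustration.

```python
from collections import Counter

def filter_parents(matching_children, min_children=0, max_children=0):
    """Return parent ids whose matching-child count is within the bounds.

    matching_children: iterable of (child_id, parent_id) pairs that matched
    the inner query. A bound of 0 is treated as "no bound" (an assumption).
    """
    counts = Counter(parent_id for _, parent_id in matching_children)
    parents = []
    for parent_id, count in sorted(counts.items()):
        if min_children and count < min_children:
            continue  # too few matching children
        if max_children and count > max_children:
            continue  # too many matching children
        parents.append(parent_id)
    return parents

# Hypothetical data: p1 has 2 matching children, p2 has 1, p3 has 3.
matches = [("c1", "p1"), ("c2", "p1"), ("c3", "p2"),
           ("c4", "p3"), ("c5", "p3"), ("c6", "p3")]
result = filter_parents(matches, min_children=2, max_children=2)
```

With both bounds set to 2, only the parent with exactly two matching children is kept.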
mikemccand
48ccb06160 remove stale nocommit 2014-05-30 13:22:48 -04:00
Martijn van Groningen
760cee7c24 Aggregations: Take the 'from' into account when getting a fetched hit (InternalSearchHit). Hits before the 'from' are included in each shard result. 2014-05-30 16:23:28 +02:00
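Why hits before the 'from' must be included in each shard result can be sketched as follows. This is a simplified Python model under stated assumptions, not the actual reduce code: `paginate` and the `(score, doc_id)` tuples are hypothetical, and each shard is assumed to return its top `from + size` hits sorted by descending score.

```python
import heapq

def paginate(shard_results, from_, size):
    """Merge per-shard hit lists and return the requested page.

    shard_results: list of lists of (score, doc_id), each sorted by
    descending score and already truncated to from_ + size entries.
    """
    merged = list(heapq.merge(*shard_results, key=lambda hit: -hit[0]))
    # Clamp the window so from_ + size never runs past the merged hits.
    start = min(from_, len(merged))
    end = min(from_ + size, len(merged))
    return merged[start:end]

shard_a = [(9.0, "a1"), (5.0, "a2"), (1.0, "a3")]
shard_b = [(8.0, "b1"), (7.0, "b2"), (2.0, "b3")]
page = paginate([shard_a, shard_b], from_=1, size=2)
```

The offset is applied once, after the merge. If each shard skipped its own first hit instead, `b1` would be dropped even though it belongs on the requested page.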
Shay Banon
9c98bb3554 Have a dedicated join timeout that is higher than ping.timeout for node join
Using ping.timeout, which defaults to 3s, as the timeout on the join request a node makes to the master once it's discovered can be too small, specifically when there is a large cluster state involved (and by definition, all the buffers and such on the nio layer will be "cold"). Introduce a dedicated join.timeout setting that by default is 10x the ping.timeout (so 30s by default).
closes #6342
2014-05-30 12:42:08 +02:00
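The default described in 9c98bb3554 amounts to a simple fallback rule. The sketch below is a Python illustration only: the `resolve_join_timeout` helper and the plain dictionary of settings are hypothetical, the keys are copied from the commit message rather than from the real settings registry, and values are in seconds.

```python
def resolve_join_timeout(settings):
    """Return the join timeout in seconds.

    Falls back to 10x the ping timeout when no explicit join timeout
    is configured; the ping timeout itself defaults to 3 seconds.
    """
    ping_timeout = settings.get("ping.timeout", 3.0)
    return settings.get("join.timeout", 10 * ping_timeout)

default_timeout = resolve_join_timeout({})
scaled_timeout = resolve_join_timeout({"ping.timeout": 5.0})
explicit_timeout = resolve_join_timeout({"join.timeout": 60.0})
```

An explicit join timeout always wins; otherwise the value scales with whatever ping timeout is in effect.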
Martijn van Groningen
0e2d33b4a4 [BUILD] Fix compile error 2014-05-30 12:24:11 +02:00
Martijn van Groningen
aab38fb2e6 Aggregations: added pagination support to top_hits aggregation by adding from option.
Closes #6299
2014-05-30 11:45:31 +02:00
Martijn van Groningen
35755cd8a4 Aggregations: Fixed bug in top_hits aggregation to not fail with NPE when shard results are empty.
The top_hits aggregation returned an empty InternalTopHits instance with no fields set when there were no results, causing reduce and serialization errors down the road. This is fixed by setting all required fields when there are no results.

Closes #6346
2014-05-30 11:40:45 +02:00
Igor Motov
8c903f4787 [TESTS] Add get snapshot status test for partial snapshots 2014-05-29 19:07:04 -04:00
Boaz Leskes
93e0ce0c5b [Test] added search trace logging to IndexActionTests.testAutoGenerateIdNoDuplicates 2014-05-28 22:12:23 +02:00
Boaz Leskes
dc34ccebfe [Tests] assert indexRandom's deletion of injection dummy docs find them 2014-05-28 22:06:38 +02:00
Adrien Grand
4ff511000e [TESTS] There might be several live BigArrays instances at the same time. 2014-05-28 16:55:26 +02:00
Adrien Grand
cc9a7bd454 Recycling: change the default type of the page recycler to CONCURRENT instead of SOFT_CONCURRENT.
This default type has been inherited from its ancestor, the (non-paged) recycler whose memory
usage was unbounded and required soft references to make sure it could release memory eventually.
On the contrary, the page cache recycler memory usage is bounded so we could remove soft
references in order to remove load on the garbage collector.

Note: the cache type is already randomized in integration tests.

Close #6320
2014-05-28 15:23:18 +02:00
Simon Willnauer
a5866e226e Mustache: Ensure internal scope extractors are always operating on a Map
Mustache extracts the key/value pairs for parameter substitution from
objects and maps but it's decided on the first execution. We need to
make sure if the params are null we pass an empty map to ensure we
bind the map based extractor

Closes #6318
2014-05-28 13:29:21 +02:00
Mathias Fussenegger
82e9a4e80a Serialization: Add support for Byte to the XContentBuilder.
Close #6127
2014-05-28 12:19:44 +02:00
Adrien Grand
be29138962 [BUILD] Remember to use AtomicReader.addCoreClosedListener when upgrading to Lucene 4.9. 2014-05-28 09:35:00 +02:00
mateusz_kaczynski
e97a381db2 Highlighting: Plain highlighter to use analyzer defined on a document level when available.
At the moment the plain highlighter only uses an analyzer defined on the type
level. However, during the indexing stage it is possible to define an analyzer on
a per-document level, for example by mapping '_analyzer' to another field containing
the required name. This commit attempts to make sure that highlighting works
correctly in this scenario.

Closes #5497
2014-05-28 08:27:14 +02:00
Shay Banon
13f49237df [Test] make sure to close the file at the end of the test 2014-05-27 11:08:29 +02:00
Shay Banon
cd94af2c9e [Test] make sure we test writeTo(Channel) in BytesReference
also introduce proper randomization of content in the bytes
2014-05-26 13:32:52 +02:00
Alex Brasetvik
15ff3df243 Fix MatchQueryParser not parsing fuzzy_transpositions 2014-05-23 22:02:21 +02:00
Martijn van Groningen
3f2f1f088d Set the sortValues on SearchHit post aggregation instead of during the reduce. 2014-05-23 19:05:30 +02:00
Lee Hinman
65ce5acfb4 Explicitly clean up fielddata cache when clearing entire cache 2014-05-23 16:29:26 +02:00
Robert Muir
2cbe9371d2 Improve error when mlockall fails (closes #6288) 2014-05-23 10:16:26 -04:00
Martijn van Groningen
5fafd2451a Added top_hits aggregation that keeps track of the most relevant document being aggregated per bucket.
Closes #6124
2014-05-23 16:01:18 +02:00
Adrien Grand
2d417cf5b6 [TESTS] Left-over from 14420d7c4e15df9b565b50ef5beab797f756c3ac. 2014-05-23 10:10:00 +02:00
Adrien Grand
14420d7c4e [TESTS] Fix test to use index-level doc IDs instead of segment-level doc IDs. 2014-05-23 01:20:41 +02:00
Adrien Grand
0d3410a837 [TESTS] Fix test bug in SimpleValidateQueryTests. 2014-05-23 00:52:56 +02:00
Nik Everett
0ff0985e01 Limit guava caches to 31.9GB
Guava's caches have overflow issues around 32GB with our default segment
count of 16 and weight of 1 unit per byte.  We give them 100MB of headroom
so 31.9GB.

This limits the sizes of both the field data and filter caches, the two
large guava caches.

Closes #6268
2014-05-23 00:20:12 +02:00
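The arithmetic behind the 31.9GB figure in 0ff0985e01 can be checked with a few lines. This is a back-of-the-envelope sketch: it assumes binary units (1GB = 1024^3 bytes) and assumes the overflow comes from a per-segment weight held in a signed 32-bit integer, which the commit message does not state explicitly.

```python
GB = 1024 ** 3
MB = 1024 ** 2
SEGMENTS = 16            # default segment count named in the commit
INT_MAX = 2 ** 31 - 1    # largest signed 32-bit value (assumed limit)

# At 1 weight unit per byte, a 32GB cache puts 2^31 units in each segment.
per_segment_at_32gb = 32 * GB // SEGMENTS

# With 100MB of headroom the per-segment weight stays below the limit.
cap_bytes = 32 * GB - 100 * MB
per_segment_at_cap = cap_bytes // SEGMENTS
cap_in_gb = round(cap_bytes / GB, 1)
```

Under these assumptions, a 32GB cache exceeds the per-segment limit by one unit, while the capped size rounds to 31.9GB.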
Adrien Grand
a836496e57 [TESTS] Randomly disable the filter cache.
Close #6280
2014-05-22 23:13:29 +02:00
Adrien Grand
6e49256fa8 Nested: Make sure queries/filters/aggs get a FixedBitSet when they expect one.
Close #6279
2014-05-22 23:13:13 +02:00
Adrien Grand
b3274bd770 Aggregations: Fix ReverseNestedAggregator to compute the parent document correctly.
Close #6278
2014-05-22 23:13:13 +02:00
Martijn van Groningen
cbdd11777f [TEST] Just start two nodes 2014-05-22 21:13:52 +02:00
Martijn van Groningen
41bcb3e0d3 [TEST] Don't stop master node. 2014-05-22 19:17:54 +02:00
Nik Everett
3573822b7e Highlight fields in request order
Because json objects are unordered this also adds an explicit order syntax
that looks like
    "highlight": {
        "fields": [
            {"title":{ /*params*/ }},
            {"text":{ /*params*/ }}
        ]
    }

This is not useful for any of the builtin highlighters but will be useful
in plugins.

Closes #4649
2014-05-22 16:44:14 +02:00
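How a consumer might read the explicit order syntax from 3573822b7e is sketched below. This is a Python illustration, not the actual request parser: `highlight_field_order` is a hypothetical helper, and the `/*params*/` placeholders from the commit message are replaced by empty objects because comments are not valid JSON.

```python
import json

def highlight_field_order(highlight):
    """Return highlight field names in the order they should be processed."""
    fields = highlight["fields"]
    if isinstance(fields, list):
        # Array form: each element is a single-key object, so the
        # order is explicit in the request itself.
        return [name for entry in fields for name in entry]
    # Object form: whatever order the parser happened to produce.
    return list(fields)

request = json.loads(
    '{"highlight": {"fields": [{"title": {}}, {"text": {}}]}}'
)
order = highlight_field_order(request["highlight"])
```

The array form carries the ordering in the document structure, so it does not depend on how a particular JSON library treats object keys.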
Alex Ksikes
2546c06131 More Like This Query: allow for both 'like_text' and 'docs/ids' to be specified.
Closes #6246
2014-05-22 13:50:17 +02:00
Martijn van Groningen
a717af505a [TEST] Use _uid sort field as tie, so that hits with the same score are sorted in the same way in both search responses. 2014-05-22 12:10:03 +02:00
Colin Goodheart-Smithe
cabd2340dd Aggregations: Fixed conversion of date field values when using multiple date formats
When multiple date formats are specified using the || syntax in the field mappings the date_histogram aggregation breaks.  This is because we are getting a parser rather than a printer from the date formatter for the object we use to convert the DateTime values back into Strings.  Simple fix to get the printer from the date format and test to back it up

Closes #6239
2014-05-22 10:21:50 +01:00
Martijn van Groningen
e8e684c6c4 Add number of shards statistic to PercolateContext instead of throwing exception.
Certain features like significant_terms aggregation rely on this statistic for sizing heuristics.

Closes #6037
Closes #6123
2014-05-22 10:44:50 +02:00
Martijn van Groningen
16e5cdf8d0 Cut over to Lucene's TopDocs#merge for shard topdocs sorting.
Closes #6197
2014-05-22 10:40:56 +02:00
Martijn van Groningen
157d511061 [TEST] Use SuiteScopeTest annotation instead of ClusterScope(scope = ElasticsearchIntegrationTest.Scope.SUITE, numDataNodes = 1) 2014-05-21 22:08:59 +02:00
Alex Ksikes
a29b4a800d More Like This Query: replaced 'exclude' with 'include' to avoid double negation when set.
Closes #6248
2014-05-21 18:45:03 +02:00
Britta Weber
8cca9b28df Percolator: Fix assertion in percolation with nested docs
Assertion was triggered for percolating documents with nested object
in mapping if the document did not actually contain a nested object.
Reason:
MultiDocumentPercolatorIndex checks if the number of documents is
actually >1. Instead we can just use the SingleDocumentPercolatorIndex
in this case.

closes #6263
2014-05-21 18:17:36 +02:00
Simon Willnauer
17d34d5c97 Fix FieldDataWeighter generics to accept RamUsage instead of AtomicFieldData
The `FieldDataWeighter` allowed a concrete subclass of the cache's
generic type to be used, which causes a ClassCastException and also
prevents the CircuitBreaker from being decremented appropriately.

This was tripped by settings randomization also part of this commit.

Closes #6260
2014-05-21 17:50:45 +02:00
Lee Hinman
03402c7ed8 [TEST] prevent dummy documents from being indexed in testSimpleQueryString() since scores are compared 2014-05-21 17:37:54 +02:00