8671 Commits

Author SHA1 Message Date
mikemccand
7552b69b1f Core: reuse Lucene's TermsEnum for faster _uid/version lookup during
Reusing Lucene's TermsEnum for _uid/version lookups gives a small
indexing (updates) speedup and brings us closer to not having
to spend RAM on bloom filters.

Closes #6212
2014-05-31 17:38:48 -04:00
Martijn van Groningen
f51a09d8f7 Core: Protects against: 'from + size > scoreDocs.length' in case only single shard response 2014-05-31 20:30:11 +02:00
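
The condition named in this commit title ('from + size > scoreDocs.length') can be illustrated with a small bounds clamp. This is only a sketch and not the code from the commit: the class and method names are hypothetical, and it assumes `from` and `size` are non-negative and `available` stands for `scoreDocs.length`.

```java
// Hypothetical sketch: clamp a requested page to the hits a single shard returned.
final class PageBoundsSketch {
    /**
     * Returns {start, end} (end exclusive) so that 0 <= start <= end <= available,
     * even when from + size exceeds the number of available hits.
     */
    static int[] pageBounds(int from, int size, int available) {
        int start = Math.min(Math.max(from, 0), available);
        // Use long arithmetic so start + size cannot overflow an int.
        int end = (int) Math.min((long) start + size, (long) available);
        return new int[] { start, end };
    }

    public static void main(String[] args) {
        int[] bounds = pageBounds(8, 5, 10);
        System.out.println(bounds[0] + ".." + bounds[1]);
    }
}
```

With 10 available hits, a request for `from=8, size=5` is trimmed to the last two hits instead of reading past the end of the array.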
javanna
e8995ecaa7 [TEST] speed up HighlightSearchTests a bit
Randomize rewrite methods instead of trying them all when highlighting multi term queries with postings highlighter
Rely on search type randomization and remove all the explicit setSearchType calls as they are not needed anymore
Remove explicit `.from`, `.size` and `.explain`, not needed and might slow tests down (especially explain)
2014-05-31 16:29:53 +02:00
Martijn van Groningen
01ca8491cf Core: apply 'from' if there is one shard result. 2014-05-31 13:35:11 +02:00
Martijn van Groningen
b8366a3213 Aggregations: apply 'from' if there is one shard result. 2014-05-31 13:34:49 +02:00
Clinton Gormley
46a67b638d Parent/Child: Added min_children/max_children to has_child query/filter
Added support for min_children and max_children parameters to
the has_child query and filter. A parent document will only
be considered a match if the number of matching children
falls between the min/max bounds.

Closes #6019
2014-05-30 19:38:39 +02:00
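
The min/max rule described in the commit above can be sketched as a simple predicate. This is not the has_child implementation: the class and method names are hypothetical, and treating both bounds as inclusive is an assumption, since the commit message does not say.

```java
// Hypothetical sketch of the min_children/max_children check for one parent document.
final class ChildCountBoundsSketch {
    /**
     * Returns true when the number of matching children lies within the bounds.
     * Assumption: both bounds are inclusive.
     */
    static boolean parentMatches(int matchingChildren, int minChildren, int maxChildren) {
        return matchingChildren >= minChildren && matchingChildren <= maxChildren;
    }

    public static void main(String[] args) {
        System.out.println(parentMatches(3, 2, 5));
        System.out.println(parentMatches(1, 2, 5));
    }
}
```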
mikemccand
48ccb06160 remove stale nocommit 2014-05-30 13:22:48 -04:00
Martijn van Groningen
760cee7c24 Aggregations: Take the 'from' into account when getting a fetched hit (InternalSearchHit). Hits before the 'from' are included in each shard result. 2014-05-30 16:23:28 +02:00
Shay Banon
9c98bb3554 Have a dedicated join timeout that is higher than ping.timeout for node join
Using ping.timeout, which defaults to 3s, as the timeout on the join request a node makes to the master once it's discovered can be too small, specifically when there is a large cluster state involved (and by definition, all the buffers and such on the nio layer will be "cold"). Introduce a dedicated join.timeout setting that by default is 10x the ping.timeout (so 30s by default).
closes #6342
2014-05-30 12:42:08 +02:00
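
The default described in the commit above (join.timeout is 10x ping.timeout unless set explicitly) can be sketched as follows. This is an illustration only: the class, method, and constant names are hypothetical and do not come from the commit, and the real settings lookup is not shown.

```java
// Hypothetical sketch: derive the join timeout from the ping timeout.
final class JoinTimeoutSketch {
    static final long DEFAULT_PING_TIMEOUT_MILLIS = 3_000L; // ping.timeout default of 3s
    static final long JOIN_TIMEOUT_FACTOR = 10L;            // join.timeout defaults to 10x

    /**
     * Returns the explicit join timeout when one is configured,
     * otherwise ten times the ping timeout.
     */
    static long resolveJoinTimeoutMillis(long pingTimeoutMillis, Long explicitJoinTimeoutMillis) {
        if (explicitJoinTimeoutMillis != null) {
            return explicitJoinTimeoutMillis;
        }
        return pingTimeoutMillis * JOIN_TIMEOUT_FACTOR;
    }

    public static void main(String[] args) {
        // No explicit join timeout: 3s * 10 = 30s.
        System.out.println(resolveJoinTimeoutMillis(DEFAULT_PING_TIMEOUT_MILLIS, null));
    }
}
```

Deriving the default from ping.timeout means that anyone who already raised ping.timeout for a slow network gets a proportionally larger join timeout without extra configuration.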
Martijn van Groningen
0e2d33b4a4 [BUILD] Fix compile error 2014-05-30 12:24:11 +02:00
Clinton Gormley
7fff6f1f43 Docs: Tidied percolate.asciidoc 2014-05-30 11:56:06 +02:00
Martijn van Groningen
aab38fb2e6 Aggregations: added pagination support to top_hits aggregation by adding from option.
Closes #6299
2014-05-30 11:45:31 +02:00
Martijn van Groningen
35755cd8a4 Aggregations: Fixed bug in top_hits aggregation to not fail with NPE when shard results are empty.
The top_hits aggregation returned an empty InternalTopHits instance with no fields set when there were no results, causing reduce and serialization errors down the road. This is fixed by setting all required fields when there are no results.

Closes #6346
2014-05-30 11:40:45 +02:00
javanna
74eff87dd6 [DOCS] Java 7 is required since 1.2.0 2014-05-30 10:45:22 +02:00
Igor Motov
8c903f4787 [TESTS] Add get snapshot status test for partial snapshots 2014-05-29 19:07:04 -04:00
Shay Banon
b2c7c8b0e7 Upgrade to netty 3.9.1
closes #6331
2014-05-30 00:20:37 +02:00
Clinton Gormley
bec51a618f Test: Python doesn't support atomic groups in regexes. Removing from cat.recovery 2014-05-29 16:57:09 +02:00
Boaz Leskes
93e0ce0c5b [Test] added search trace logging to IndexActionTests.testAutoGenerateIdNoDuplicates 2014-05-28 22:12:23 +02:00
Boaz Leskes
dc34ccebfe [Tests] assert indexRandom's deletion of injection dummy docs finds them 2014-05-28 22:06:38 +02:00
Adrien Grand
328a7e513c [DOCS] Document filtered query strategies. 2014-05-28 17:57:43 +02:00
Adrien Grand
4ff511000e [TESTS] There might be several live BigArrays instances at the same time. 2014-05-28 16:55:26 +02:00
Adrien Grand
cc9a7bd454 Recycling: change the default type of the page recycler to CONCURRENT instead of SOFT_CONCURRENT.
This default type has been inherited from its ancestor, the (non-paged) recycler whose memory
usage was unbounded and required soft references to make sure it could release memory eventually.
In contrast, the page cache recycler's memory usage is bounded, so we can remove soft
references in order to reduce load on the garbage collector.

Note: the cache type is already randomized in integration tests.

Close #6320
2014-05-28 15:23:18 +02:00
Simon Willnauer
a5866e226e Mustache: Ensure internal scope extractors are always operating on a Map
Mustache extracts the key/value pairs for parameter substitution from
objects and maps, but which one is used is decided on the first execution.
We need to make sure that if the params are null we pass an empty map, to
ensure we bind the map-based extractor.

Closes #6318
2014-05-28 13:29:21 +02:00
Mathias Fussenegger
82e9a4e80a Serialization: Add support for Byte to the XContentBuilder.
Close #6127
2014-05-28 12:19:44 +02:00
Colin Goodheart-Smithe
3534ffcd1d Added plugins to .gitignore
Since plugins should never be committed to the core codebase, and it is useful to be able to add plugins to the development environment, adding the plugins folder to the .gitignore file will stop it from appearing in the unstaged changes.
2014-05-28 09:38:58 +01:00
Adrien Grand
be29138962 [BUILD] Remember to use AtomicReader.addCoreClosedListener when upgrading to Lucene 4.9. 2014-05-28 09:35:00 +02:00
mateusz_kaczynski
e97a381db2 Highlighting: Plain highlighter to use analyzer defined on a document level when available.
At the moment the plain highlighter only uses an analyzer defined on the type
level. However, during the indexing stage it is possible to define an analyzer on
a per-document level, for example by mapping '_analyzer' to another field containing
the required name. This commit attempts to make sure that highlighting works
correctly in this scenario.

Closes #5497
2014-05-28 08:27:14 +02:00
Alexander Reelsen
5fdb35fc54 Startup: Reset locale to en_US.UTF-8 in bin/elasticsearch
Because the NetworkExceptionHelper class relies on the English language in
order to extract information and decide whether a certain exception is a
network problem, we need to set the English locale on startup in order
to prevent other locales from circumventing this check.
2014-05-27 17:31:09 +02:00
David Pilato
1dc186a595 [DOCS] fix typo 2014-05-27 15:57:39 +02:00
Itamar Syn-Hershko
ac812f72b7 Docs: Adding Hebrew analyzer
Closes #6306
2014-05-27 13:40:53 +02:00
Shay Banon
13f49237df [Test] make sure to close the file at the end of the test 2014-05-27 11:08:29 +02:00
Shay Banon
cd94af2c9e [Test] make sure we test writeTo(Channel) in BytesReference
also introduce proper randomization of content in the bytes
2014-05-26 13:32:52 +02:00
Alex Brasetvik
15ff3df243 Fix MatchQueryParser not parsing fuzzy_transpositions 2014-05-23 22:02:21 +02:00
Martijn van Groningen
3f2f1f088d Set the sortValues on SearchHit post aggregation instead of during the reduce. 2014-05-23 19:05:30 +02:00
Karel Minarik
0e920c17dd [SPEC] Added the percolator_format URL parameter to the "Percolate" API spec 2014-05-23 18:26:17 +02:00
Lee Hinman
65ce5acfb4 Explicitly clean up fielddata cache when clearing entire cache 2014-05-23 16:29:26 +02:00
Robert Muir
2cbe9371d2 Improve error when mlockall fails (closes #6288) 2014-05-23 10:16:26 -04:00
Martijn van Groningen
5fafd2451a Added top_hits aggregation that keeps track of the most relevant document being aggregated per bucket.
Closes #6124
2014-05-23 16:01:18 +02:00
Adrien Grand
2d417cf5b6 [TESTS] Left-over from 14420d7c4e15df9b565b50ef5beab797f756c3ac. 2014-05-23 10:10:00 +02:00
Adrien Grand
14420d7c4e [TESTS] Fix test to use index-level doc IDs instead of segment-level doc IDs. 2014-05-23 01:20:41 +02:00
Adrien Grand
0d3410a837 [TESTS] Fix test bug in SimpleValidateQueryTests. 2014-05-23 00:52:56 +02:00
Nik Everett
0ff0985e01 Limit guava caches to 31.9GB
Guava's caches have overflow issues around 32GB with our default segment
count of 16 and weight of 1 unit per byte.  We give them 100MB of headroom,
hence the 31.9GB limit.

This limits the sizes of both the field data and filter caches, the two
large guava caches.

Closes #6268
2014-05-23 00:20:12 +02:00
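
The arithmetic behind the 31.9GB figure in the commit above can be sketched as below. This is an illustration, not the committed code: the class and method names are hypothetical, and the explanation that each of the 16 segments tracks its weight in a signed 32-bit int (so 16 x 2^31 bytes = 32GB) is an assumption consistent with the numbers in the message.

```java
// Hypothetical sketch: cap a requested cache size below the overflow threshold.
final class CacheSizeCapSketch {
    static final long MB = 1024L * 1024L;
    static final long GB = 1024L * MB;

    static final int SEGMENT_COUNT = 16; // default segment count from the commit message
    // Assumption: per-segment weight is a signed 32-bit int, at 1 unit per byte.
    static final long OVERFLOW_THRESHOLD_BYTES = SEGMENT_COUNT * (1L << 31); // 32GB
    static final long HEADROOM_BYTES = 100L * MB;                            // 100MB of headroom
    static final long MAX_CACHE_BYTES = OVERFLOW_THRESHOLD_BYTES - HEADROOM_BYTES;

    /** Returns the requested size, limited to the threshold minus the headroom. */
    static long capCacheSize(long requestedBytes) {
        return Math.min(requestedBytes, MAX_CACHE_BYTES);
    }

    public static void main(String[] args) {
        // A 64GB request is capped; the result is roughly 31.9GB.
        System.out.println(capCacheSize(64L * GB) / (double) GB);
    }
}
```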
Adrien Grand
a836496e57 [TESTS] Randomly disable the filter cache.
Close #6280
2014-05-22 23:13:29 +02:00
Adrien Grand
6e49256fa8 Nested: Make sure queries/filters/aggs get a FixedBitSet when they expect one.
Close #6279
2014-05-22 23:13:13 +02:00
Adrien Grand
b3274bd770 Aggregations: Fix ReverseNestedAggregator to compute the parent document correctly.
Close #6278
2014-05-22 23:13:13 +02:00
Martijn van Groningen
cbdd11777f [TEST] Just start two nodes 2014-05-22 21:13:52 +02:00
Martijn van Groningen
41bcb3e0d3 [TEST] Don't stop master node. 2014-05-22 19:17:54 +02:00
Nik Everett
3573822b7e Highlight fields in request order
Because JSON objects are unordered, this also adds an explicit order syntax
that looks like
    "highlight": {
        "fields": [
            {"title":{ /*params*/ }},
            {"text":{ /*params*/ }}
        ]
    }

This is not useful for any of the builtin highlighters but will be useful
in plugins.

Closes #4649
2014-05-22 16:44:14 +02:00
Karel Minarik
81cddacffa [SPEC] Removed the URL parameter indices_boost from the "Search" API
Related: elasticsearch/elasticsearch-ruby#29
2014-05-22 15:29:46 +02:00
Simon Willnauer
9d5507047f Update Documentation Feature Flags [1.2.0] 2014-05-22 15:06:42 +02:00