Commit Graph

4731 Commits

Author SHA1 Message Date
Shay Banon 9ded2405a0 Use Lucene Version that was used to create the index in Analysis
Lucene ships with a version constant that is mainly used to provide consistent behaviour across lucene release versions. Lucene's Analysis capabilities are commonly applied at index and search time such that the search-time behaviour should be identical to the index-time behaviour in most of the cases. Currently ElasticSearch always uses the latest version from Lucene which can break backwards compatibility with the index for users that rely on behaviour that changed in new Lucene version.

Users should always use the version the index was created with unless it's explicitly configured.

closes #2945
2013-04-29 13:18:51 +02:00
Simon Willnauer bd7ff6946e Added X Versions of NGramTokenFilter and NGramTokenizer to ElasticSearch. These versions
don't produce broken positions anymore and prevent certain highlighter bugs that fail with
StringArrayOutOfBoundsExceptions as in #2931

This commit breaks backwards compatibility in terms of highlighting when NGramTokenFilter is used.
The highlighter will highlight the entire terms as produced by the tokenizer instead of the individual
sub-gram. To do sub-gram highlighting, the ngram tokenizer should be used. This behavior was based on
broken NGramTokenFilter behavior which will be fixed in Lucene 4.4 but was ported in this commit
to elasticsearch 0.90. The broken behavior can still be used if a version < LUCENE_42 is used
in the token filter mapping.

Closes #2931
2013-04-27 16:48:25 +02:00
Shay Banon f09ad507a4 open context stats
- rename to open_contexts from open, we might have other open stats in the future related to search (lucene index searchers?)
- add a test to verify it works
2013-04-27 15:09:47 +02:00
Simon Willnauer 8a7f81104f Remove XSimpleFragmentsBuilder and XScoreOrderFragmentsBuilder since the only difference
to the lucene version is that `discreteMultiValueHighlighting` does default to `true`. Yet
we set this anyway in the HighlightingPhase such that the classes are obsolet.
2013-04-26 20:04:38 +02:00
Simon Willnauer 355f80adc9 Added temporary fix for LUCENE-4899 where FastVectorHighlihgter failed with StringIndexOutOfBoundsException
if a single highlight phrase or term was greater than the fragCharSize producing negative string offsets

The fixed BaseFragListBuilder was added as XSimpleFragListBuilder which triggers an assert once Elasticsearch
upgrades to Lucene 4.3
2013-04-26 19:48:48 +02:00
Simon Willnauer 2ed2fab904 Add assert that fails one Elasticsearch upgrades to Lucene 4.3 in order to remove the duplicated class 2013-04-26 19:16:21 +02:00
Alexander Reelsen 90353ceb79 Fixing possible NoClassDefFoundError when trying to load nonexisting classes
In order to handle exceptions correctly, when classes are not found, one
needs to handle ClassNotFoundException as well as NoClassDefFoundError
in order to be sure to have caught every possible case. We did not cater
for the latter in ImmutableSettings yet.

This fix is just executing the same logic for both exceptions instead of
simply bubbling up NoClassDefFoundError.
2013-04-26 10:34:10 +02:00
Alexander Reelsen 22e25cc165 Added stolen time to OsStats output 2013-04-25 10:46:24 +02:00
Shay Banon c4968d7d65 no longer support snappy... 2013-04-25 09:38:58 +02:00
Igor Motov 982b570037 Fix serialization of sync/async replication type 2013-04-25 08:25:31 +02:00
Martijn van Groningen dd12e0b86c If searchContext not set, abort parsing and throw ISE 2013-04-24 10:24:15 +02:00
Simon Willnauer c884304753 Fall back to local statistics if global statistics are not availalbe for a field or term
Closes #2926
2013-04-23 13:32:35 +02:00
Simon Willnauer f372f7c109 Cut over StringScriptDataComparator to use BytesRef instead of Strings
Closes #2920
2013-04-23 13:29:19 +02:00
Simon Willnauer 7a36bed031 Remove per-doc ord collector callback infavor of an iterator 2013-04-23 10:35:40 +02:00
Martijn van Groningen c390f9b1a9 Added more test assertions 2013-04-19 22:16:42 +02:00
Simon Willnauer 7ea6cd6888 use Double/Float.compare for stable and correct float sort order 2013-04-19 21:40:01 +02:00
Clinton Gormley 1483a3a0e5 Added tests for multi_match with minimum_should_match 2013-04-19 21:40:01 +02:00
Clinton Gormley e508b27203 Apply minimum_should_match to inner clauses of multi_match query
When specifying minimum_should_match in a multi_match query it was being applied
to the outer bool query instead of to each of the inner field-specific bool queries.

Closes #2918
2013-04-19 21:39:54 +02:00
Simon Willnauer 3ab56e16b7 Support empty string in FSTBytesAtomicFieldData 2013-04-19 12:49:06 +02:00
Simon Willnauer a1c62759c9 remove size bound from cache recycler for performance reasons 2013-04-19 12:36:12 +02:00
Simon Willnauer 2d13aa29f8 s/ES.RECYCLE/es.cache.recycle 2013-04-19 11:48:28 +02:00
Simon Willnauer 05b6c46bec allow CacheRecycler to be cleared via the REST API 2013-04-19 11:45:33 +02:00
Simon Willnauer 79db1bfbf0 make object caching optional 2013-04-18 19:14:19 +02:00
Florian Schilling 54cb4b9615 # Response for Cluster Settings Update API
If cluster settings are update the REST API returns the accepted values. For
example, updating the `cluster.routing.allocation.disable_allocation` via
cluster settings:

```curl -XPUT http://localhost:9200/_cluster/settings -d '{
    "transient":{
        "cluster.routing.allocation.disable_allocation":"true"
    }
}'```

will respond:

```{
    "persistent":{},
    "transient":{
        "cluster.routing.allocation.disable_allocation":"true"
    }
}```

Closes #2907
2013-04-18 11:34:58 +02:00
Lucas Ward 99c101c37e If a value/field is a Calendar, it will be converted to a Date using getTime()
Closes #2911
2013-04-18 10:57:08 +02:00
Shay Banon 0eb298fe64 use more aggressive concurrency levels for CHM
- long running ones with high update rates
- also expose a *system* property of es.useConcurrentHashMapV8 to use the new non blocking Java8 CHM impl
2013-04-17 14:28:38 -07:00
Shay Banon 271305d5eb Search Stats: Add current open searches
closes #2906
2013-04-16 18:08:57 -07:00
Simon Willnauer efc9e8fe7b only return primary if it is active in PlainOperationRounting
Closes #2896
2013-04-16 17:20:22 +02:00
Martijn van Groningen bcc16654d2 Better error messaging when postings_format can be resolved or when a custom postings_format type can't be instantiated.
Relates to #2893
2013-04-16 16:29:54 +02:00
Martijn van Groningen 9a1c03408b Added support for the `_cache` and` _cache_key` options to the `has_child` and `has_parent` filters.
Closes #2900
2013-04-16 14:42:45 +02:00
Florian Schilling ef5b7412e6 Allow PolygonBuilder to create polygons with hole
Closes #2899
2013-04-16 11:22:48 +02:00
Simon Willnauer 30f9f278c3 Added UNICODE_CHARACTER_CLASS support to Regex flags. This flag is only supported in Java7 and is ignored if set on a java 6 JVM
Closes #2895
2013-04-16 10:06:53 +02:00
uboness eb21526552 Added missing support for lat, lats, lon, lons for doc notation in scripts 2013-04-13 13:58:30 -07:00
uboness 20e6df9f34 Optimization in fielddata cache where ordinals are used instead of flat arrays when number of unique values is low 2013-04-13 12:42:53 -07:00
Igor Motov e7b49d8936 Add more dynamic settings validation 2013-04-12 20:55:45 -04:00
Shay Banon d385e1b356 Clear Cache API: Streamline option names
closes #2890
2013-04-12 15:58:24 -07:00
Shay Banon a2d72697eb Expose field level field data statistics
closes #2889
2013-04-12 15:51:08 -07:00
David Pilato 3b7a195f6f Add toString() for FilterBuilders
Closes #2887.
2013-04-12 22:27:51 +02:00
Martijn van Groningen bf21466291 CacheTests test fix. 2013-04-12 19:14:38 +02:00
Martijn van Groningen 80dbca0809 Field data: Try to load short values as byte values and load int values as short or byte values to reduce the size they take in memory. 2013-04-12 19:11:18 +02:00
Shay Banon 5fbd4a12a0 fix memory computation for int field data 2013-04-12 08:38:52 -07:00
Martijn van Groningen 5c90e5f940 If no options are specified with the clear cache api then all caches should be cleared.
Closes #2886
2013-04-12 15:24:50 +02:00
Igor Motov 00c035f88c Make sure that settings are propagated to all nodes 2013-04-11 10:59:14 -04:00
Martijn van Groningen 2dfcc3c740 Test that size is actually computed.
Relates to #2882
2013-04-11 10:22:48 +02:00
Simon Willnauer 9a2d27a035 rename prefix_length to prefix_len for consistency
Closes #2883
2013-04-10 17:39:32 +02:00
Martijn van Groningen 4fd8c2c6d2 Ordinals were omitted from fielddata cache size calculation if field has more than one term.
Closes #2882
2013-04-10 14:50:07 +02:00
Martijn van Groningen 637eeacb20 Better error description if field(s) (statistical facet) and value_field (term_stats facet) are not a numeric field 2013-04-10 11:11:52 +02:00
Martijn van Groningen 6a3c53ef44 Should prevent OOM 2013-04-10 10:00:51 +02:00
Martijn van Groningen b8b28041e5 Fix for extended facets test. 2013-04-10 00:47:00 +02:00
Shay Banon 80fc55a01d upgrade to netty 3.6.5 2013-04-09 09:44:49 -07:00