OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-13 16:35:45 +00:00

Author	SHA1	Message	Date
Christoph Büscher	f927a235b3	Merge branch 'master' into feature/rank-eval	2016-10-17 14:50:10 +02:00
Jason Tedor	c1bdaaf80f	Fix connection keep-alive header handling This commit fixes an issue with the handling of the value "keep-alive" on the Connection header in the Netty 4 HTTP implementation while handling an HTTP 1.0 request. The issue was using the wrong equals method to compare an AsciiString instance and a String instance (they could never be equal). This commit fixes this to use the correct equals method to compare for content equality.	2016-10-16 19:51:00 -04:00
Jason Tedor	cd5777593a	Fix connection close header handling This commit fixes an issue with the handling of the value "close" on the Connection header in the Netty 4 HTTP implementation. The issue was using the wrong equals method to compare an AsciiString instance and a String instance (they could never be equal). This commit fixes this to use the correct equals method to compare for content equality. Relates #20956	2016-10-16 13:18:09 -04:00
Christoph Büscher	dfc6d1f369	Remove unknown docs from EvalQueryQuality The unknown document section in the response for each query can be rendered using the rated hits that are now also part of the response by just filtering the documents without a rating.	2016-10-14 17:14:05 +02:00
Christoph Büscher	9e394b0644	Pull common operations into RankedListQualityMetric interface Currently each implementation of RankedListQualityMetric does some initial joining operation that links the input search hits with a rated document rating, if available. Also all metrics collect unknown docs and now also need to add the list of rated search hits to the partial query evaluation. This change centralizes this work in some new helper methods in RankedListQualityMetric.	2016-10-14 17:14:05 +02:00
Christoph Büscher	ebe13100df	Add `hits` section to response for each ranking evaluation query This change adds a `hits` section to the response part for each ranking evaluation query, containing a list of documents (index/type/id) and ratings (if the document was rated in the request). This section can be used to better understand the calculation of the ranking quality of this particular query, but it can also be used to identify the "unknown" (that is unrated) documents that were part of the seach hits, for example because a UI later wants to present those documents to the user to get a rating for them. If the user specifies a set of field names using a parameter called `summary_fields` in the request, those fields are also included as part of the response in addition to "_index", "_type", "_id".	2016-10-14 17:14:05 +02:00
Christoph Büscher	cd9d07b91b	Merge branch 'master' into feature/rank-eval	2016-10-14 17:03:30 +02:00
Jason Tedor	595ec8c948	Remove artificial default processors limit Today Elasticsearch limits the number of processors used in computing thread counts to 32. This was from a time when Elasticsearch created more threads than it does now and users would run into out of memory errors. It appears the real cause of these out of memory errors was not well understood (it's often due to ulimit settings) and so users were left hitting these out of memory errors on boxes with high core counts. Today Elasticsearch creates less threads (but still a lot) and we have a bootstrap check in place to ensure that the relevant ulimit is not too low. There are some caveats still to having too many concurrent indexing threads as it can lead to too many little segments, and it's not a magical go faster knob if indexing is already bottlenecked by disk, but this limitation is artificial and surprising to users and so it should be removed. This commit also increases the lower bound of the max processes ulimit, to prepare for a world where Elasticsearch instances might be running with more the previous cap of 32 processors. With the current settings, Elasticsearch wants to create roughly 576 + 25 * p / 2 threads, where p is the number of processors. Add in roughly 7 * p / 8 threads for the GC threads and a fudge factor, and 4096 should cover us pretty well up to 256 cores. Relates #20874	2016-10-14 05:47:26 -04:00
Tanguy Leroux	e71c30c71d	Mustache: Add {{#url}}{{/url}} function to URL encode strings (#20838 ) This commit adds a new Mustache function (codename: url) and a new URLEncoder that can be used to URL encode strings in mustache templates.	2016-10-13 16:17:28 +02:00
Simon Willnauer	12392b5425	Ensure port range is readable in the exception message (#20893 ) Both netty3 and netty4 http implementation printed the default toString representation of PortRange if ports couldn't be bound. This commit adds a better default toString method to PortRange and uses the string representation for the error message in the http implementations.	2016-10-12 22:33:47 +02:00
Areek Zillur	133be6631d	Merge branch 'master' into cleanup/transport_bulk	2016-10-12 13:09:29 -04:00
Isabel Drost-Fromm	20c1e25609	Merge branch 'master' into feature/rank-eval	2016-10-12 14:41:17 +02:00
Tanguy Leroux	44ac5d057a	Remove empty javadoc (#20871 ) This commit removes as many as empty javadocs comments my regexp has found	2016-10-12 10:27:09 +02:00
Areek Zillur	481f7909ae	Merge branch 'master' into cleanup/transport_bulk	2016-10-11 16:04:47 -04:00
Areek Zillur	0e8b6532ec	rename DocumentRequest to DocWriteRequest	2016-10-11 16:00:10 -04:00
Tanguy Leroux	e4c7d8183e	XContentBuilder: Avoid building self-referencing objects (#20550 ) Some objects like maps, iterables or arrays of objects can self-reference themselves. This is mostly due to a bug in code but the XContentBuilder should be able to detect such situations and throws an IllegalArgumentException instead of building objects over and over until a stackoverflow occurs. closes #20540 closes #19475	2016-10-11 11:41:54 +02:00
Simon Willnauer	37ca38df3d	Expose `ctx._now` in update scripts (#20835 ) Update scripts might want to update the documents `_timestamp` but need a notion of `now()`. Painless doesn't support any notion of now() since it would make scripts non-pure functions. Yet, in the update case this is a valid value and we can pass it with the context together to allow the script to record the timestamp the document was updated. Relates to #17895	2016-10-10 21:14:14 +02:00
Jim Ferenczi	c80a563a71	Replace org.elasticsearch.common.lucene.search.MatchNoDocsQuery with its Lucene version (org.apache.lucene.search.MatchNoDocsQuery) (#20832 ) * Replace org.elasticsearch.common.lucene.search.MatchNoDocsQuery with its Lucene version (org.apache.lucene.search.MatchNoDocsQuery) This change removes the ES version of the match no docs query and replaces it with the Lucene version. relates #18030 * Add missing change	2016-10-10 17:45:19 +02:00
Simon Willnauer	4fd1276542	Prevent AbstractArrays from release bytes more than once (#20819 ) Today we throw an assertion error if we release an AbstractArray more than once. Yet, it's recommended to implement close methods such that they can be invoked more than once. Guaranteed single release calls are hard to implement and some situations might not be tested causing for instance `CircuitBreaker` to operate on corrupted memory stats.	2016-10-10 17:30:37 +02:00
Christoph Büscher	c3380863be	Adapting RestRankEvalAction to changes in API on master	2016-10-10 12:57:18 +02:00
Christoph Büscher	0c25cfbd16	Merge branch 'master' into feature/rank-eval	2016-10-10 12:10:34 +02:00
Areek Zillur	c747085b70	Merge branch 'master' into cleanup/transport_bulk	2016-10-07 17:52:53 -04:00
Areek Zillur	396f80c963	Revert "rename DocumentRequest to DocumentWriteRequest" This reverts commit b5079ce0092e2dfd742fbe3aed8a8f95931a378d.	2016-10-07 17:50:07 -04:00
Nik Everett	cf4038b668	DeGuice some of IndicesModule UpdateHelper, MetaDataIndexUpgradeService, and some recovery stuff. Move ClusterSettings to nullable ctor parameter of TransportService so it isn't forgotten.	2016-10-07 11:14:38 -04:00
Simon Willnauer	7452028e50	Simplify TransportAddress (#20798 ) since TransportAddress is now final we can simplify it's interface a bit and remove methods that are only used in tests or are plain delegates.	2016-10-07 15:56:54 +02:00
Simon Willnauer	194a6b1df0	Remove LocalTransport in favor of MockTcpTransport (#20695 ) This change proposes the removal of all non-tcp transport implementations. The mock transport can be used by default to run tests instead of local transport that has roughly the same performance compared to TCP or at least not noticeably slower. This is a master only change, deprecation notice in 5.x will be committed as a separate change.	2016-10-07 11:27:47 +02:00
Simon Willnauer	9c9afe3f01	Remove SearchContext#current and all it's threadlocals (#20778 ) Today SearchContext expose the current context as a thread local which makes any kind of sane interface design very very hard. This PR removes the thread local entirely and instead passes the relevant context anywhere needed. This simplifies state management dramatically and will allow for a much leaner SearchContext interface down the road.	2016-10-06 19:51:54 +02:00
Areek Zillur	b5079ce009	rename DocumentRequest to DocumentWriteRequest	2016-10-06 05:05:59 -04:00
Areek Zillur	9b691f0d93	Merge branch 'master' into cleanup/transport_bulk	2016-10-05 17:51:39 -04:00
Simon Willnauer	ce21b607bb	move test to a single node test	2016-10-05 21:55:50 +02:00
Simon Willnauer	50ca10378e	Merge branch 'master' into dont_cache_scripts	2016-10-05 20:43:52 +02:00
Simon Willnauer	838c28eeb4	add percolate with script query test	2016-10-05 20:43:46 +02:00
Jack Conradson	ba88d9af57	Remove all date 'now' methods from the Painless whitelist to ensure Painless scripts are pure functions.	2016-10-05 09:47:20 -07:00
Simon Willnauer	57afbadf33	PercolateQuery is never cacheable	2016-10-05 16:38:47 +02:00
Colin Goodheart-Smithe	7bffe95025	Fix percolator queries to not be cacheable	2016-10-05 15:03:29 +01:00
Simon Willnauer	587bdcef38	add extra safety when accessing scripts or now and reqeusts are cached	2016-10-05 09:41:48 +02:00
Simon Willnauer	80bf44f709	Merge branch 'master' into dont_cache_scripts	2016-10-04 21:09:40 +02:00
Areek Zillur	bd4a03a426	Merge branch 'master' into cleanup/transport_bulk	2016-10-04 14:06:17 -04:00
Jason Tedor	51d53791fe	Remove lenient URL parameter parsing Today when parsing a request, Elasticsearch silently ignores incorrect (including parameters with typos) or unused parameters. This is bad as it leads to requests having unintended behavior (e.g., if a user hits the _analyze API and misspell the "tokenizer" then Elasticsearch will just use the standard analyzer, completely against intentions). This commit removes lenient URL parameter parsing. The strategy is simple: when a request is handled and a parameter is touched, we mark it as such. Before the request is actually executed, we check to ensure that all parameters have been consumed. If there are remaining parameters yet to be consumed, we fail the request with a list of the unconsumed parameters. An exception has to be made for parameters that format the response (as opposed to controlling the request); for this case, handlers are able to provide a list of parameters that should be excluded from tripping the unconsumed parameters check because those parameters will be used in formatting the response. Additionally, some inconsistencies between the parameters in the code and in the docs are corrected. Relates #20722	2016-10-04 12:45:29 -04:00
Simon Willnauer	94b7873b49	Add a #markAsNotCachable() method to context to mark requests as not cachable	2016-10-04 18:05:00 +02:00
Areek Zillur	248ac240ed	Merge branch 'master' into cleanup/transport_bulk	2016-10-03 16:12:11 -04:00
Martijn van Groningen	55dce523c2	docs: marked `foreach` processor as experimental Closes #19602	2016-09-30 12:23:42 +02:00
Tal Levy	33b9e2065b	no null values in ingest configuration error messages (#20616 ) The invalid ingest configuration field name used to show itself, even when it was null, in error messages. Sometimes this does not make sense. e.g. ```[null] Only one of [file], [id], or [inline] may be configure``` vs. ```Only one of [file], [id], or [inline] may be configure``` The above deals with three fields, therefore this no one property responsible.	2016-09-29 11:34:52 +02:00
Tal Levy	92ab44d35c	[fix] JSON Processor was not properly added (#20613 )	2016-09-28 23:04:22 +02:00
Christoph Büscher	92baf14c0a	Merge branch 'master' into feature/rank-eval	2016-09-27 11:59:35 +02:00
Nik Everett	370afa371b	Make reindex-from-remote ignore unknown fields reindex-from-remote should ignore unknown fields so it is mostly future compatible. This makes it ignore unknown fields by adding an option to `ObjectParser` and `ConstructingObjectParser` that, if enabled, causes them to ignore unknown fields. Closes #20504	2016-09-26 00:55:46 +02:00
Simon Willnauer	fe1803c957	Remove AnalysisService and reduce it to a simple name to analyzer mapping (#20627 ) Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitive. We fixed this in #19827 and can now get rid of the AnalysisService entirely and replace it with a simple map like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes #19828	2016-09-23 08:53:50 +02:00
Christoph Büscher	1909877728	Merge branch 'master' into feature/rank-eval	2016-09-22 12:21:20 +02:00
Christoph Büscher	29402a28e0	RankEval: Adding details section to response (#20497 ) In order to understand how well particular queries in a joint ranking evaluation request work we want to break down the overall metric into its components, each contributed by a particular query. The response structure now has a `details` section under which we can summarize this information. Each sub-section is keyed by the query-id and currently only contains the partial metric and the unknown_docs section for each query.	2016-09-22 12:17:10 +02:00
Jack Conradson	5755dd256d	Fix String concatentation bug.	2016-09-21 15:49:56 -07:00

1 2 3 4 5 ...

3756 Commits