Merging changes from 6.7.0 section into 7.0.0

Anshum Gupta 2017-08-29 14:55:31 -07:00
parent de5c68b5f8
commit f4b2be1282
2 changed files with 135 additions and 176 deletions


@@ -87,6 +87,9 @@ New Features
* LUCENE-7838: Knn classifier based on fuzzified term queries (Tommaso Teofili)
* LUCENE-7855: Added advanced options of the Wikipedia tokenizer to its factory.
(Juan Pedro via Adrien Grand)
API Changes
* LUCENE-2605: Classic QueryParser no longer splits on whitespace by default.
@@ -178,6 +181,15 @@ Bug Fixes
functions (Operations.isFinite and Operations.topsortState) to prevent
large automaton to overflow the stack (Robert Muir, Adrien Grand, Jim Ferenczi)
* LUCENE-7864: IndexMergeTool is not using intermediate hard links (even
if possible). (Dawid Weiss)
* LUCENE-7869: Changed MemoryIndex to sort 1d points. In case of 1d points, the PointInSetQuery.MergePointVisitor expects
that these points are visited in ascending order. The memory index doesn't do this and this can result in document
with multiple points that should match to not match. (Martijn van Groningen)
* LUCENE-7878: Fix query builder to keep the SHOULD clause that wraps multi-word synonyms. (Jim Ferenczi)
Improvements
* LUCENE-7489: Better storage of sparse doc-values fields with the default
@@ -189,6 +201,8 @@ Improvements
* LUCENE-7901: Original Highlighter now eagerly throws an exception if you
provide components that are null. (Jason Gerlowski, David Smiley)
* LUCENE-7841: Normalize ґ to г in Ukrainian analyzer. (Andriy Rysin via Dawid Weiss)
Optimizations
* LUCENE-7416: BooleanQuery optimizes queries that have queries that occur both
@@ -213,6 +227,10 @@ Optimizations
* LUCENE-7874: DisjunctionMaxQuery rewrites to a BooleanQuery when tiebreaker is set to 1. (Jim Ferenczi)
* LUCENE-7828: Speed up range queries on range fields by improving how we
compute the relation between the query and inner nodes of the BKD tree.
(Adrien Grand)
Other
* LUCENE-7923: Removed FST.Arc.node field (unused). (Dawid Weiss)
@@ -250,26 +268,6 @@ Other
* LUCENE-7773: Remove unused/deprecated token types from StandardTokenizer.
(Ahmet Arslan via Steve Rowe)
======================= Lucene 6.7.0 =======================
New Features
* LUCENE-7855: Added advanced options of the Wikipedia tokenizer to its factory.
(Juan Pedro via Adrien Grand)
Bug Fixes
* LUCENE-7864: IndexMergeTool is not using intermediate hard links (even
if possible). (Dawid Weiss)
* LUCENE-7869: Changed MemoryIndex to sort 1d points. In case of 1d points, the PointInSetQuery.MergePointVisitor expects
that these points are visited in ascending order. The memory index doesn't do this and this can result in document
with multiple points that should match to not match. (Martijn van Groningen)
* LUCENE-7878: Fix query builder to keep the SHOULD clause that wraps multi-word synonyms. (Jim Ferenczi)
Other
* LUCENE-7800: Remove code that potentially rethrows checked exceptions
from methods that don't declare them ("sneaky throw" hack). (Robert Muir,
Uwe Schindler, Dawid Weiss)
@@ -278,16 +276,6 @@ Other
that are trivially replaced by LeafReader.terms() and MultiFields.getTerms()
(David Smiley)
Improvements
* LUCENE-7841: Normalize ґ to г in Ukrainian analyzer. (Andriy Rysin via Dawid Weiss)
Optimizations
* LUCENE-7828: Speed up range queries on range fields by improving how we
compute the relation between the query and inner nodes of the BKD tree.
(Adrien Grand)
======================= Lucene 6.6.0 =======================
New Features


@@ -325,6 +325,14 @@ Upgrading from Solr 6.x
detect Java 9 correctly and setup Garbage Collector logging. If the configuration file contains
logging options that are no longer supported with Java 9, startup will fail.
* SOLR-10307: If starting Jetty without the Solr start script, you must now pass keystore and truststore
passwords via the env variables SOLR_SSL_KEY_STORE_PASSWORD and SOLR_SSL_TRUST_STORE_PASSWORD rather
than system properties.
* SOLR-10379: ManagedSynonymFilterFactory has been deprecated in favor of ManagedSynonymGraphFilterFactory.
* SOLR-10503: CurrencyField has been deprecated in favor of new CurrencyFieldType.
New Features
----------------------
* SOLR-9857, SOLR-9858: Collect aggregated metrics from nodes and shard leaders in overseer. (ab)
@@ -406,6 +414,59 @@ New Features
* SOLR-11173 TermsComponent support for Points fields. (yonik)
* SOLR-10849: MoreLikeThisComponent should expose setMaxDocFreqPct (maxDoc
frequency percentage). (Dawid Weiss)
* SOLR-10307: Allow Passing SSL passwords through environment variables. (Mano Kovacs, Michael Suzuki via Mark Miller)
* SOLR-10379: Add ManagedSynonymGraphFilterFactory, deprecate ManagedSynonymFilterFactory. (Steve Rowe)
* SOLR-10479: Adds support for HttpShardHandlerFactory.loadBalancerRequests(MinimumAbsolute|MaximumFraction)
configuration. (Ramsey Haddad, Daniel Collins, Christine Poerschke)
* SOLR-3702: concat(...) function query (Andrey Kudryavtsev via Mikhail Khludnev)
* SOLR-10767: Add movingAvg Stream Evaluator (Joel Bernstein)
* SOLR-10813: Add arraySort Stream Evaluator (Joel Bernstein)
* SOLR-10696: Add cumulative probability function (Joel Bernstein)
* SOLR-10765: Add anova Stream Evaluator (Joel Bernstein)
* SOLR-10754: Add hist Stream Evaluator (Joel Bernstein)
* SOLR-10753: Add array Stream Evaluator (Joel Bernstein)
* SOLR-10747: Allow /stream handler to execute Stream Evaluators directly (Joel Bernstein)
* SOLR-10743: Add sequence StreamEvaluator (Joel Bernstein)
* SOLR-10684: Add finddelay Stream Evaluator (Joel Bernstein)
* SOLR-10731: Add knn Streaming Expression (Joel Bernstein)
* SOLR-10724: Add describe Stream Evaluator (Joel Bernstein)
* SOLR-10693: Add copyOfRange Stream Evaluator (Joel Bernstein)
* SOLR-10623: Add sql Streaming Expression (Joel Bernstein)
* SOLR-10661: Add copyOf Stream Evaluator (Joel Bernstein)
* SOLR-10663: Add distance Stream Evaluator (Joel Bernstein)
* SOLR-10664: Add scale Stream Evaluator (Joel Bernstein)
* SOLR-10666: Add rank transformation Stream Evaluator (Joel Bernstein)
* SOLR-10662: Add length Stream Evaluator (Joel Bernstein)
* SOLR-10660: Add reverse Stream Evaluator (Joel Bernstein)
* SOLR-9910: Add solr/solr.cmd parameter to append jetty parameters to the start script.
(Mano Kovacs via Mark Miller)
Bug Fixes
----------------------
* SOLR-9262: Connection and read timeouts are being ignored by UpdateShardHandler after SOLR-4509.
@@ -504,6 +565,49 @@ Bug Fixes
* SOLR-8689: Fix bin/solr.cmd so it can run properly on Java 9 (Uwe Schindler, hossman)
* SOLR-10723 JSON Facet API: resize() implemented incorrectly for CountSlotAcc, HllAgg.NumericAcc
resulting in exceptions when using a hashing faceting method and sorting by hll(numeric_field).
(yonik)
* SOLR-10719: Creating a core.properties fails if the parent of core.properties is a symlinked directory
(Erick Erickson)
* SOLR-10360: Solr HDFS snapshot export fails due to FileNotFoundException error when using MR1 instead of
yarn. (Hrishikesh via Mark Miller)
* SOLR-10137: Ensure that ConfigSets created via API are mutable. (Hrishikesh via Mark Miller)
* SOLR-10829: Fixed IndexSchema to enforce that uniqueKey can not be Points based for correctness (hossman)
* SOLR-10836: The query parsers igain, significantTerms, and tlogit (used by streaming expressions by
the same name) might throw a NullPointerException if the referenced field had no indexed data in some
shards. The fix included an optimization to use Solr's cached AtomicReader instead of re-calculating.
(David Smiley)
* SOLR-10715: /v2/ should not be an alias for /v2/collections (Cao Manh Dat)
* SOLR-10835: Add support for point fields in Export Handler (Tomás Fernández Löbbe)
* SOLR-10704: REPLACENODE may cause data loss when replicationFactor is 1. (ab, shalin)
* SOLR-10833: Point numeric fields should throw SolrException(BAD_REQUEST) for malformed numbers in queries.
Trie numeric fields should throw SolrException(BAD_REQUEST) for malformed docValues range queries.
(hossman, Tomás Fernández Löbbe)
* SOLR-10832: Fixed VersionInfo.getMaxVersionFromIndex when using PointsField with indexed="true" (hossman)
* SOLR-10763: Admin UI replication tab sometimes empty when failed replications (janhoy, Bojan Vitnik)
* SOLR-10824: fix NPE ExactSharedStatsCache, fixing maxdocs skew for terms which are absent at one of shards
when using one of Exact*StatsCache (Mikhail Khludnev)
* SOLR-10963: Fix example json in MultipleAdditiveTreesModel javadocs.
(Stefan Langenmaier via Christine Poerschke)
* SOLR-10914: RecoveryStrategy's sendPrepRecoveryCmd can get stuck for 5 minutes if leader is unloaded. (shalin)
* SOLR-11198: downconfig downloads empty file as folder (Erick Erickson)
Optimizations
----------------------
@@ -527,6 +631,13 @@ Optimizations
* SOLR-11070: Make docValues range queries behave the same as Trie/Point fields for Double/Float Infinity cases
(Tomás Fernández Löbbe, Andrey Kudryavtsev)
* SOLR-10634: JSON Facet API: When a field/terms facet will retrieve all buckets (i.e. limit:-1)
and there are no nested facets, aggregations are computed in the first collection phase
so that the second phase which would normally involve calculating the domain for the bucket
can be skipped entirely, leading to large performance improvements. (yonik)
* SOLR-10722: Speed up Solr's use of the UnifiedHighlighter by re-using FieldInfos. (David Smiley)
Other Changes
----------------------
* SOLR-10236: Removed FieldType.getNumericType(). Use getNumberType() instead. (Tomás Fernández Löbbe)
@@ -715,146 +826,6 @@ Other Changes
* SOLR-11183: V2 APIs are now available at /api endpoint. (Ishan Chattopadhyaya)
================== 6.7.0 ==================
Consult the LUCENE_CHANGES.txt file for additional, low level, changes in this release.
Versions of Major Components
---------------------
Apache Tika 1.13
Carrot2 3.15.0
Velocity 1.7 and Velocity Tools 2.0
Apache UIMA 2.3.1
Apache ZooKeeper 3.4.10
Jetty 9.3.14.v20161028
Detailed Change List
----------------------
Upgrade Notes
----------------------
* SOLR-10307: If starting Jetty without the Solr start script, you must now pass keystore and truststore
passwords via the env variables SOLR_SSL_KEY_STORE_PASSWORD and SOLR_SSL_TRUST_STORE_PASSWORD rather
than system properties.
* SOLR-10379: ManagedSynonymFilterFactory has been deprecated in favor of ManagedSynonymGraphFilterFactory.
* SOLR-10503: CurrencyField has been deprecated in favor of new CurrencyFieldType.
New Features
----------------------
* SOLR-10849: MoreLikeThisComponent should expose setMaxDocFreqPct (maxDoc
frequency percentage). (Dawid Weiss)
* SOLR-10307: Allow Passing SSL passwords through environment variables. (Mano Kovacs, Michael Suzuki via Mark Miller)
* SOLR-10379: Add ManagedSynonymGraphFilterFactory, deprecate ManagedSynonymFilterFactory. (Steve Rowe)
* SOLR-10479: Adds support for HttpShardHandlerFactory.loadBalancerRequests(MinimumAbsolute|MaximumFraction)
configuration. (Ramsey Haddad, Daniel Collins, Christine Poerschke)
* SOLR-3702: concat(...) function query (Andrey Kudryavtsev via Mikhail Khludnev)
* SOLR-10767: Add movingAvg Stream Evaluator (Joel Bernstein)
* SOLR-10813: Add arraySort Stream Evaluator (Joel Bernstein)
* SOLR-10696: Add cumulative probability function (Joel Bernstein)
* SOLR-10765: Add anova Stream Evaluator (Joel Bernstein)
* SOLR-10754: Add hist Stream Evaluator (Joel Bernstein)
* SOLR-10753: Add array Stream Evaluator (Joel Bernstein)
* SOLR-10747: Allow /stream handler to execute Stream Evaluators directly (Joel Bernstein)
* SOLR-10743: Add sequence StreamEvaluator (Joel Bernstein)
* SOLR-10684: Add finddelay Stream Evaluator (Joel Bernstein)
* SOLR-10731: Add knn Streaming Expression (Joel Bernstein)
* SOLR-10724: Add describe Stream Evaluator (Joel Bernstein)
* SOLR-10693: Add copyOfRange Stream Evaluator (Joel Bernstein)
* SOLR-10623: Add sql Streaming Expression (Joel Bernstein)
* SOLR-10661: Add copyOf Stream Evaluator (Joel Bernstein)
* SOLR-10663: Add distance Stream Evaluator (Joel Bernstein)
* SOLR-10664: Add scale Stream Evaluator (Joel Bernstein)
* SOLR-10666: Add rank transformation Stream Evaluator (Joel Bernstein)
* SOLR-10662: Add length Stream Evaluator (Joel Bernstein)
* SOLR-10660: Add reverse Stream Evaluator (Joel Bernstein)
* SOLR-9910: Add solr/solr.cmd parameter to append jetty parameters to the start script.
(Mano Kovacs via Mark Miller)
Bug Fixes
----------------------
* SOLR-10723 JSON Facet API: resize() implemented incorrectly for CountSlotAcc, HllAgg.NumericAcc
resulting in exceptions when using a hashing faceting method and sorting by hll(numeric_field).
(yonik)
* SOLR-10719: Creating a core.properties fails if the parent of core.properties is a symlinked directory
(Erick Erickson)
* SOLR-10360: Solr HDFS snapshot export fails due to FileNotFoundException error when using MR1 instead of
yarn. (Hrishikesh via Mark Miller)
* SOLR-10137: Ensure that ConfigSets created via API are mutable. (Hrishikesh via Mark Miller)
* SOLR-10829: Fixed IndexSchema to enforce that uniqueKey can not be Points based for correctness (hossman)
* SOLR-10836: The query parsers igain, significantTerms, and tlogit (used by streaming expressions by
the same name) might throw a NullPointerException if the referenced field had no indexed data in some
shards. The fix included an optimization to use Solr's cached AtomicReader instead of re-calculating.
(David Smiley)
* SOLR-10715: /v2/ should not be an alias for /v2/collections (Cao Manh Dat)
* SOLR-10835: Add support for point fields in Export Handler (Tomás Fernández Löbbe)
* SOLR-10704: REPLACENODE may cause data loss when replicationFactor is 1. (ab, shalin)
* SOLR-10833: Point numeric fields should throw SolrException(BAD_REQUEST) for malformed numbers in queries.
Trie numeric fields should throw SolrException(BAD_REQUEST) for malformed docValues range queries.
(hossman, Tomás Fernández Löbbe)
* SOLR-10832: Fixed VersionInfo.getMaxVersionFromIndex when using PointsField with indexed="true" (hossman)
* SOLR-10763: Admin UI replication tab sometimes empty when failed replications (janhoy, Bojan Vitnik)
* SOLR-10824: fix NPE ExactSharedStatsCache, fixing maxdocs skew for terms which are absent at one of shards
when using one of Exact*StatsCache (Mikhail Khludnev)
* SOLR-10963: Fix example json in MultipleAdditiveTreesModel javadocs.
(Stefan Langenmaier via Christine Poerschke)
* SOLR-10914: RecoveryStrategy's sendPrepRecoveryCmd can get stuck for 5 minutes if leader is unloaded. (shalin)
* SOLR-11198: downconfig downloads empty file as folder (Erick Erickson)
Optimizations
----------------------
* SOLR-10634: JSON Facet API: When a field/terms facet will retrieve all buckets (i.e. limit:-1)
and there are no nested facets, aggregations are computed in the first collection phase
so that the second phase which would normally involve calculating the domain for the bucket
can be skipped entirely, leading to large performance improvements. (yonik)
* SOLR-10722: Speed up Solr's use of the UnifiedHighlighter by re-using FieldInfos. (David Smiley)
Other Changes
----------------------
* SOLR-10617: JDBCStream accepts columns of type TIME, DATE & TIMESTAMP as well as CLOBs and decimal
numeric types (James Dyer)