# Apache Lucene Migration Guide

## Query.hashCode and Query.equals are now abstract methods (LUCENE-7277)
Custom Query subclasses must now implement equals and hashCode themselves,
defining equivalence in terms of the subclass's own state. See the existing
core Lucene query classes for the code patterns to follow.
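
As a rough illustration of that pattern, the sketch below shows a hypothetical `FieldMarkerQuery` (the class, its fields, and its output format are invented for this example). It assumes the `sameClassAs` and `classHash` helpers that `Query` provides in the 6.x API; later releases may require additional overrides.

```java
import java.util.Objects;
import org.apache.lucene.search.Query;

/** Hypothetical custom query, used only to illustrate equals/hashCode. */
final class FieldMarkerQuery extends Query {
    private final String field;
    private final int threshold;

    FieldMarkerQuery(String field, int threshold) {
        this.field = Objects.requireNonNull(field);
        this.threshold = threshold;
    }

    @Override
    public String toString(String defaultField) {
        return "FieldMarkerQuery(" + field + ", " + threshold + ")";
    }

    @Override
    public boolean equals(Object other) {
        // sameClassAs() rejects null and instances of any other class.
        return sameClassAs(other) && equalsTo(getClass().cast(other));
    }

    private boolean equalsTo(FieldMarkerQuery other) {
        // Compare every piece of state that affects which documents match.
        return field.equals(other.field) && threshold == other.threshold;
    }

    @Override
    public int hashCode() {
        // Start from the class hash so different query types rarely collide.
        int h = classHash();
        h = 31 * h + field.hashCode();
        h = 31 * h + threshold;
        return h;
    }
}
```

Any field that changes the set of matching documents or their scores should take part in both methods, so that query caches treat two instances as interchangeable only when they really are.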

## The way the number of documents is calculated has changed (LUCENE-6711)
The number of documents (numDocs) is used to calculate term specificity (idf) and average document length (avdl).
Prior to LUCENE-6711, collectionStats.maxDoc() was used for these statistics.
Now, collectionStats.docCount() is used whenever it is available; otherwise maxDoc() is used.

Assume that a collection contains 100 documents, and 50 of them have a "keywords" field.
In this example, maxDoc is 100 while docCount is 50 for the "keywords" field.
The total number of tokens in the "keywords" field is divided by docCount to obtain avdl.
Therefore docCount, which is the total number of documents that have at least one term for the field, is a more precise metric for optional fields.
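
To make the arithmetic concrete, here is a minimal sketch in plain Java with no Lucene dependency. The token total of 500 is an assumed figure chosen for illustration, and the class and method names are hypothetical.

```java
/** Illustration only: plain arithmetic, not Lucene's Similarity code. */
final class AvdlExample {

    /** Average field length: total tokens in the field divided by the number of documents counted. */
    static double averageFieldLength(long totalTokens, long numDocs) {
        return (double) totalTokens / numDocs;
    }

    public static void main(String[] args) {
        long maxDoc = 100;       // all documents in the collection
        long docCount = 50;      // documents that have the "keywords" field
        long totalTokens = 500;  // assumed total number of "keywords" tokens

        System.out.println("avdl using maxDoc:   " + averageFieldLength(totalTokens, maxDoc));
        System.out.println("avdl using docCount: " + averageFieldLength(totalTokens, docCount));
    }
}
```

Under these assumptions the average is 5 tokens when divided by maxDoc and 10 tokens when divided by docCount. The latter matches the typical length of the documents that actually carry the field.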

DefaultSimilarity does not use avdl, so this change should have a relatively minor effect on its result lists, because the relative idf values of terms remain the same.
However, when idf is combined with other factors such as term frequency, the relative ranking of documents could change.
Similarity implementations that take avdl into account (such as BM25 and those instantiated with NormalizationH2) may show a notable change in the ranked list,
especially for collections of documents with varying lengths, because NormalizationH2 tends to penalize documents longer than avdl.
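
The sensitivity to avdl can be sketched with the textbook BM25 term-frequency component. This is not Lucene's BM25Similarity code, which also encodes document lengths lossily. The values k1 = 1.2 and b = 0.75 are the commonly used defaults. The avdl values of 5 and 10 assume 500 "keywords" tokens spread over 100 and 50 documents respectively, and the class and method names are hypothetical.

```java
/** Illustration only: textbook BM25 term-frequency weight, not Lucene's implementation. */
final class Bm25LengthNormalization {

    /** tf * (k1 + 1) / (tf + k1 * (1 - b + b * docLength / avdl)) */
    static double tfWeight(double tf, double docLength, double avdl, double k1, double b) {
        double lengthNorm = 1.0 - b + b * (docLength / avdl);
        return tf * (k1 + 1.0) / (tf + k1 * lengthNorm);
    }

    public static void main(String[] args) {
        double k1 = 1.2, b = 0.75;  // commonly used BM25 parameters
        double tf = 2.0;            // term occurs twice in the document
        double docLength = 10.0;    // the document has 10 tokens in the field

        // avdl computed with maxDoc (assumed 500 tokens / 100 docs)
        System.out.println("avdl = 5:  " + tfWeight(tf, docLength, 5.0, k1, b));
        // avdl computed with docCount (assumed 500 tokens / 50 docs)
        System.out.println("avdl = 10: " + tfWeight(tf, docLength, 10.0, k1, b));
    }
}
```

With avdl = 5 the 10-token document counts as twice the average length and its weight is roughly 1.07. With avdl = 10 it is exactly average and the weight rises to roughly 1.38, which is why rankings can shift for fields that many documents lack.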

## FunctionValues.exists() Behavior Changes due to ValueSource bug fixes (LUCENE-5961)

Bugs fixed in several ValueSource functions may result in different behavior in 
situations where some documents do not have values for fields wrapped in other 
ValueSources.  Users who want to preserve the previous behavior may need to wrap 
their ValueSources in a "DefFunction" along with a ConstValueSource of "0.0".
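
A minimal sketch of that wrapping, assuming the `DefFunction` and `ConstValueSource` classes from the queries module (`org.apache.lucene.queries.function.valuesource`). The helper class and method names are hypothetical.

```java
import java.util.Arrays;
import org.apache.lucene.queries.function.ValueSource;
import org.apache.lucene.queries.function.valuesource.ConstValueSource;
import org.apache.lucene.queries.function.valuesource.DefFunction;

/** Hypothetical helper that restores a 0.0 default for documents without a value. */
final class ExistsMigration {

    /** Returns a source that yields the first existing value, falling back to the constant 0.0. */
    static ValueSource withDefaultZero(ValueSource source) {
        return new DefFunction(Arrays.asList(source, new ConstValueSource(0.0f)));
    }
}
```

Because the constant always exists, the wrapped source reports a value for every document, which approximates the behavior from before the fix.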

## Removal of Filter and FilteredQuery (LUCENE-6301, LUCENE-6583)

Filter and FilteredQuery have been removed. Regular queries can be used instead
of filters, as they have been optimized for the filtering case. To get behaviour
similar to FilteredQuery, construct a BooleanQuery with one MUST clause for the
query and one FILTER clause for the filter.
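
As a sketch of that replacement, the helper below combines a scoring query with a non-scoring filter. The class name, method name, field names, and terms are hypothetical.

```java
import org.apache.lucene.index.Term;
import org.apache.lucene.search.BooleanClause.Occur;
import org.apache.lucene.search.BooleanQuery;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.TermQuery;

/** Hypothetical helper showing a FilteredQuery-style combination. */
final class FilteredQueryMigration {

    /** Scores with {@code query}; {@code filter} only restricts the matches and does not affect scores. */
    static BooleanQuery filtered(Query query, Query filter) {
        return new BooleanQuery.Builder()
            .add(query, Occur.MUST)
            .add(filter, Occur.FILTER)
            .build();
    }

    public static void main(String[] args) {
        Query query = new TermQuery(new Term("body", "lucene"));        // hypothetical field and term
        Query filter = new TermQuery(new Term("status", "published"));  // hypothetical field and term
        System.out.println(filtered(query, filter));
    }
}
```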

## PhraseQuery and BooleanQuery made immutable (LUCENE-6531, LUCENE-6570)

PhraseQuery and BooleanQuery are now immutable and have a builder API to help
construct them. For instance, a BooleanQuery that used to be constructed like
this:

  BooleanQuery bq = new BooleanQuery();
  bq.add(q1, Occur.SHOULD);
  bq.add(q2, Occur.SHOULD);
  bq.add(q3, Occur.MUST);
  bq.setMinimumNumberShouldMatch(1);

can now be constructed this way using its builder:

  BooleanQuery bq = new BooleanQuery.Builder()
      .add(q1, Occur.SHOULD)
      .add(q2, Occur.SHOULD)
      .add(q3, Occur.MUST)
      .setMinimumNumberShouldMatch(1)
      .build();

## AttributeImpl now requires that reflectWith() is implemented (LUCENE-6651)

The default, reflection-based implementation of
reflectWith(AttributeReflector) has been removed from AttributeImpl, and the
method is now abstract. If you have implemented your own attribute, make sure
to implement this method. See the Javadocs for an example.
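
A minimal sketch of such an implementation follows. `WeightAttribute` and `WeightAttributeImpl` are hypothetical names invented for this example. In real code both types would normally be public, each in its own file.

```java
import org.apache.lucene.util.Attribute;
import org.apache.lucene.util.AttributeImpl;
import org.apache.lucene.util.AttributeReflector;

/** Hypothetical custom attribute carrying a single int value. */
interface WeightAttribute extends Attribute {
    int getWeight();
    void setWeight(int weight);
}

/** Hypothetical implementation of the attribute above. */
class WeightAttributeImpl extends AttributeImpl implements WeightAttribute {
    private int weight;

    @Override
    public int getWeight() { return weight; }

    @Override
    public void setWeight(int weight) { this.weight = weight; }

    @Override
    public void clear() { weight = 0; }

    @Override
    public void copyTo(AttributeImpl target) {
        ((WeightAttribute) target).setWeight(weight);
    }

    @Override
    public void reflectWith(AttributeReflector reflector) {
        // Report each property as (attribute interface, key, value).
        reflector.reflect(WeightAttribute.class, "weight", weight);
    }
}
```

Each property that the old reflection-based default would have exposed needs its own `reflect` call.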

## Query.setBoost() and Query.clone() are removed (LUCENE-6590)

Query.setBoost has been removed. In order to apply a boost to a Query, you now
need to wrap it inside a BoostQuery. For instance,

  Query q = ...;
  float boost = ...;
  q = new BoostQuery(q, boost);

would be equivalent to the following code with the old setBoost API:

  Query q = ...;
  float boost = ...;
  q.setBoost(q.getBoost() * boost);

## PointValues replaces NumericField (LUCENE-6917)

PointValues provides faster indexing and searching, a smaller
index size, and less heap used at search time. See org.apache.lucene.index.PointValues
for an introduction. 

Legacy numeric encodings from previous versions of Lucene are
deprecated as LegacyIntField, LegacyFloatField, LegacyLongField, and LegacyDoubleField,
and can be searched with LegacyNumericRangeQuery.
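
As a rough sketch of the point-based API, the helper below indexes an int value with `IntPoint` and builds a range query over it. The class name, method names, and the "year" field are hypothetical.

```java
import org.apache.lucene.document.Document;
import org.apache.lucene.document.IntPoint;
import org.apache.lucene.document.StoredField;
import org.apache.lucene.search.Query;

/** Hypothetical helper showing point-based indexing and searching of an int field. */
final class PointMigration {

    /** Builds a document whose "year" value can be both searched and retrieved. */
    static Document yearDocument(int year) {
        Document doc = new Document();
        doc.add(new IntPoint("year", year));     // indexed for exact and range queries
        doc.add(new StoredField("year", year));  // points are not stored; add this to retrieve the value
        return doc;
    }

    /** Matches documents whose "year" lies in [from, to], both bounds inclusive. */
    static Query yearRange(int from, int to) {
        return IntPoint.newRangeQuery("year", from, to);
    }
}
```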