Lucene contrib change Log
======================= Trunk (not yet released) =======================
Changes in runtime behavior
(None)
API Changes
(None)
Bug fixes
1. LUCENE-1423: InstantiatedTermEnum#skipTo(Term) throws ArrayIndexOutOfBoundsException on empty index.
(Karl Wettin)
2. LUCENE-1462: InstantiatedIndexWriter did not reset pre-analyzed TokenStreams the
same way IndexWriter does. Parts of InstantiatedIndex were not Serializable.
(Karl Wettin)
3. LUCENE-1510: InstantiatedIndexReader#norms methods throw a NullPointerException on empty index.
(Karl Wettin, Robert Newson)
4. LUCENE-1514: ShingleMatrixFilter#next(Token) easily throws a StackOverflowError
due to recursive invocation. (Karl Wettin)
5. LUCENE-1548: Fix distance normalization in LevenshteinDistance to
not produce negative distances (Thomas Morton via Mike McCandless)
6. LUCENE-1490: Fix latin1 conversion of HALFWIDTH_AND_FULLWIDTH_FORMS
characters to only apply to the correct subset (Daniel Cheng via
Mike McCandless)
7. LUCENE-1576: Fix BrazilianAnalyzer to downcase tokens after
StandardTokenizer so that stop words with mixed case are filtered
out. (Rafael Cunha de Almeida, Douglas Campos via Mike McCandless)
New features
1. LUCENE-1531: Added support for BoostingTermQuery to XML query parser. (Karl Wettin)
2. LUCENE-1435: Added contrib/collation, a CollationKeyFilter
allowing you to convert tokens into CollationKeys encoded using
IndexableBinaryStringTools. This allows for faster RangeQuery when
a field needs to use a custom Collator (see the collation sketch
after this list). (Steven Rowe via Mike McCandless)
3. LUCENE-1591: EnWikiDocMaker, LineDocMaker, WriteLineDoc can now
read/write bz2 using Apache commons compress library. This means
you can download the .bz2 export from http://wikipedia.org and
immediately index it. (Shai Erera via Mike McCandless)
4. LUCENE-1629: Add SmartChineseAnalyzer to contrib/analyzers. It
improves on CJKAnalyzer and ChineseAnalyzer by handling Chinese
sentences properly. SmartChineseAnalyzer uses a Hidden Markov
Model to tokenize Chinese words in a more intelligent way (see the
SmartChineseAnalyzer sketch after this list).
(Xiaoping Gao via Mike McCandless)
5. LUCENE-1676: Added DelimitedPayloadTokenFilter class for automatically adding payloads "in-stream"
(see the payload sketch after this list). (Grant Ingersoll)
6. LUCENE-1578: Added support for passing unoptimized readers to the
constructor of InstantiatedIndex. (Karl Wettin)
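
The following is a minimal sketch of how the new collation support might be wired into
an Analyzer, assuming the CollationKeyFilter(TokenStream, Collator) constructor described
in LUCENE-1435; the class name FrenchCollationAnalyzer and the use of KeywordTokenizer
are illustrative choices, not part of the contrib API:

  import java.io.Reader;
  import java.text.Collator;
  import java.util.Locale;
  import org.apache.lucene.analysis.Analyzer;
  import org.apache.lucene.analysis.KeywordTokenizer;
  import org.apache.lucene.analysis.TokenStream;
  import org.apache.lucene.collation.CollationKeyFilter;

  public class FrenchCollationAnalyzer extends Analyzer {
    private final Collator collator = Collator.getInstance(Locale.FRENCH);

    public TokenStream tokenStream(String fieldName, Reader reader) {
      // Treat the whole field value as a single token, then replace it with its
      // collation key (binary-encoded via IndexableBinaryStringTools), so that
      // range queries over the field follow the Collator's sort order.
      return new CollationKeyFilter(new KeywordTokenizer(reader), collator);
    }
  }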
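
A minimal sketch of SmartChineseAnalyzer in use, assuming its no-argument constructor;
the RAMDirectory, field name, and sample sentence are illustrative only:

  import org.apache.lucene.analysis.cn.smart.SmartChineseAnalyzer;
  import org.apache.lucene.document.Document;
  import org.apache.lucene.document.Field;
  import org.apache.lucene.index.IndexWriter;
  import org.apache.lucene.store.RAMDirectory;

  public class SmartChineseExample {
    public static void main(String[] args) throws Exception {
      // The analyzer segments whole Chinese sentences into words via its HMM model.
      IndexWriter writer = new IndexWriter(new RAMDirectory(),
          new SmartChineseAnalyzer(), IndexWriter.MaxFieldLength.UNLIMITED);
      Document doc = new Document();
      doc.add(new Field("body", "我是中国人。", Field.Store.YES, Field.Index.ANALYZED));
      writer.addDocument(doc);
      writer.close();
    }
  }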
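
A minimal sketch of the payload filter, assuming the
DelimitedPayloadTokenFilter(TokenStream, char, PayloadEncoder) constructor and the
FloatEncoder from the payloads package; the helper method and input format are illustrative:

  import java.io.StringReader;
  import org.apache.lucene.analysis.TokenStream;
  import org.apache.lucene.analysis.WhitespaceTokenizer;
  import org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter;
  import org.apache.lucene.analysis.payloads.FloatEncoder;

  public class PayloadExample {
    // Tokens arriving as "term|weight", e.g. "quick|2.0 brown|0.5", keep only the term
    // text while the weight is encoded as a float payload on the token.
    public static TokenStream payloadStream(String text) {
      TokenStream tokens = new WhitespaceTokenizer(new StringReader(text));
      return new DelimitedPayloadTokenFilter(tokens, '|', new FloatEncoder());
    }
  }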
Optimizations
1. LUCENE-1643: Re-use the collation key (RawCollationKey) for
better performance, in ICUCollationKeyFilter. (Robert Muir via
Mike McCandless)
Documentation
(None)
Build
(None)
Test Cases
(None)
======================= Release 2.4.0 2008-10-06 =======================
Changes in runtime behavior
(None)
API Changes
(None)
Bug fixes
1. LUCENE-1312: Added full support for InstantiatedIndexReader#getFieldNames()
and tests that assert that deleted documents behave as they should (they did).
(Jason Rutherglen, Karl Wettin)
2. LUCENE-1318: InstantiatedIndexReader.norms(String, byte[], int) did not handle
the array offset correctly. (Jason Rutherglen via Karl Wettin)
New features
1. LUCENE-1320: ShingleMatrixFilter, multidimensional shingle token filter. (Karl Wettin)
2. LUCENE-1142: Updated Snowball package, org.tartarus distribution revision 500.
Introducing Hungarian, Turkish and Romanian support, updated older stemmers
and optimized (reflectionless) SnowballFilter (see the stemming sketch after this list).
IMPORTANT NOTICE ON BACKWARDS COMPATIBILITY: an index created using version 2.3.2 (or older)
might not be compatible with these updated classes, as some algorithms have changed.
(Karl Wettin)
3. LUCENE-1016: TermVectorAccessor, transparent vector space access via stored vectors
or by resolving the inverted index. (Karl Wettin)
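
A minimal sketch of stemming with one of the newly added Snowball languages, assuming the
SnowballFilter(TokenStream, String) constructor from contrib/snowball; the analyzer class
name and the choice of LowerCaseTokenizer are illustrative only:

  import java.io.Reader;
  import org.apache.lucene.analysis.Analyzer;
  import org.apache.lucene.analysis.LowerCaseTokenizer;
  import org.apache.lucene.analysis.TokenStream;
  import org.apache.lucene.analysis.snowball.SnowballFilter;

  public class HungarianStemmingAnalyzer extends Analyzer {
    public TokenStream tokenStream(String fieldName, Reader reader) {
      // "Hungarian" selects one of the org.tartarus stemmers introduced in LUCENE-1142.
      return new SnowballFilter(new LowerCaseTokenizer(reader), "Hungarian");
    }
  }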
Documentation
(None)
Build
(None)
Test Cases
(None)