mirror of https://github.com/apache/lucene.git
synchronized contrib/CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1059936 13f79535-47bb-0310-9956-ffa450edef68
This commit is contained in:
parent
140101dc38
commit
9aefaf7b2b
|
@ -4,53 +4,39 @@ Lucene contrib change Log
|
||||||
|
|
||||||
Build
|
Build
|
||||||
|
|
||||||
* LUCENE-2413: Moved the demo out of lucene core and into contrib/demo.
|
|
||||||
(Robert Muir)
|
|
||||||
|
|
||||||
* LUCENE-2845: Moved contrib/benchmark to modules.
|
* LUCENE-2845: Moved contrib/benchmark to modules.
|
||||||
|
|
||||||
New Features
|
New Features
|
||||||
|
|
||||||
* LUCENE-2604: Added RegexpQuery support to contrib/queryparser.
|
* LUCENE-2604: Added RegexpQuery support to contrib/queryparser.
|
||||||
(Simon Willnauer, Robert Muir)
|
(Simon Willnauer, Robert Muir)
|
||||||
|
|
||||||
* LUCENE-2500: Added DirectIOLinuxDirectory, a Linux-specific
|
* LUCENE-2373: Added a Codec implementation that works with append-only
|
||||||
Directory impl that uses the O_DIRECT flag to bypass the buffer
|
filesystems (such as e.g. Hadoop DFS). SegmentInfos writing/reading
|
||||||
cache. This is useful to prevent segment merging from evicting
|
code is refactored to support append-only FS, and to allow for future
|
||||||
pages from the buffer cache, since fadvise/madvise do not seem.
|
customization of per-segment information. (Andrzej Bialecki)
|
||||||
(Michael McCandless)
|
|
||||||
|
|
||||||
* LUCENE-2373: Added a Codec implementation that works with append-only
|
* LUCENE-2479: Added ability to provide a sort comparator for spelling suggestions along
|
||||||
filesystems (such as e.g. Hadoop DFS). SegmentInfos writing/reading
|
with two implementations. The existing comparator (score, then frequency) is the default (Grant Ingersoll)
|
||||||
code is refactored to support append-only FS, and to allow for future
|
|
||||||
customization of per-segment information. (Andrzej Bialecki)
|
|
||||||
|
|
||||||
* LUCENE-2479: Added ability to provide a sort comparator for spelling suggestions along
|
* LUCENE-2608: Added the ability to specify the accuracy at method time in the SpellChecker. The per class
|
||||||
with two implementations. The existing comparator (score, then frequency) is the default (Grant Ingersoll)
|
method is also still available. (Grant Ingersoll)
|
||||||
|
|
||||||
* LUCENE-2608: Added the ability to specify the accuracy at method time in the SpellChecker. The per class
|
|
||||||
method is also still available. (Grant Ingersoll)
|
|
||||||
|
|
||||||
* LUCENE-2507: Added DirectSpellChecker, which retrieves correction candidates directly
|
* LUCENE-2507: Added DirectSpellChecker, which retrieves correction candidates directly
|
||||||
from the term dictionary using levenshtein automata. (Robert Muir)
|
from the term dictionary using levenshtein automata. (Robert Muir)
|
||||||
|
|
||||||
* LUCENE-2791: Added WindowsDirectory, a Windows-specific Directory impl
|
* LUCENE-2836: Add FieldCacheRewriteMethod, which rewrites MultiTermQueries
|
||||||
that doesn't synchronize on the file handle. This can be useful to
|
using the FieldCache's TermsEnum. (Robert Muir)
|
||||||
avoid the performance problems of SimpleFSDirectory and NIOFSDirectory.
|
|
||||||
(Robert Muir, Simon Willnauer, Uwe Schindler, Michael McCandless)
|
|
||||||
|
|
||||||
* LUCENE-2836: Add FieldCacheRewriteMethod, which rewrites MultiTermQueries
|
|
||||||
using the FieldCache's TermsEnum. (Robert Muir)
|
|
||||||
|
|
||||||
API Changes
|
API Changes
|
||||||
|
|
||||||
* LUCENE-2606: Changed RegexCapabilities interface to fix thread
|
* LUCENE-2606: Changed RegexCapabilities interface to fix thread
|
||||||
safety, serialization, and performance problems. If you have
|
safety, serialization, and performance problems. If you have
|
||||||
written a custom RegexCapabilities it will need to be updated
|
written a custom RegexCapabilities it will need to be updated
|
||||||
to the new API. (Robert Muir, Uwe Schindler)
|
to the new API. (Robert Muir, Uwe Schindler)
|
||||||
|
|
||||||
* LUCENE-2638 MakeHighFreqTerms.TermStats public to make it more useful
|
* LUCENE-2638 MakeHighFreqTerms.TermStats public to make it more useful
|
||||||
for API use. (Andrzej Bialecki)
|
for API use. (Andrzej Bialecki)
|
||||||
|
|
||||||
======================= Lucene 3.x (not yet released) =======================
|
======================= Lucene 3.x (not yet released) =======================
|
||||||
|
|
||||||
|
@ -100,6 +86,11 @@ Changes in runtime behavior
|
||||||
|
|
||||||
Bug fixes
|
Bug fixes
|
||||||
|
|
||||||
|
* LUCENE-2855: contrib queryparser was using CharSequence as key in some internal
|
||||||
|
Map instances, which was leading to incorrect behaviour, since some CharSequence
|
||||||
|
implementors do not override hashcode and equals methods. Now the internal Maps
|
||||||
|
are using String instead. (Adriano Crestani)
|
||||||
|
|
||||||
* LUCENE-2068: Fixed ReverseStringFilter which was not aware of supplementary
|
* LUCENE-2068: Fixed ReverseStringFilter which was not aware of supplementary
|
||||||
characters. During reverse the filter created unpaired surrogates, which
|
characters. During reverse the filter created unpaired surrogates, which
|
||||||
will be replaced by U+FFFD by the indexer, but not at query time. The filter
|
will be replaced by U+FFFD by the indexer, but not at query time. The filter
|
||||||
|
@ -117,41 +108,45 @@ Bug fixes
|
||||||
default. (Robert Muir, Uwe Schindler, Simon Willnauer)
|
default. (Robert Muir, Uwe Schindler, Simon Willnauer)
|
||||||
|
|
||||||
* LUCENE-2184: Fixed bug with handling best fit value when the proper best fit value is
|
* LUCENE-2184: Fixed bug with handling best fit value when the proper best fit value is
|
||||||
not an indexed field. Note, this change affects the APIs. (Grant Ingersoll)
|
not an indexed field. Note, this change affects the APIs. (Grant Ingersoll)
|
||||||
|
|
||||||
* LUCENE-2359: Fix bug in CartesianPolyFilterBuilder related to handling of behavior around
|
* LUCENE-2359: Fix bug in CartesianPolyFilterBuilder related to handling of behavior around
|
||||||
the 180th meridian (Grant Ingersoll)
|
the 180th meridian (Grant Ingersoll)
|
||||||
|
|
||||||
* LUCENE-2404: Fix bugs with position increment and empty tokens in ThaiWordFilter.
|
* LUCENE-2404: Fix bugs with position increment and empty tokens in ThaiWordFilter.
|
||||||
For matchVersion >= 3.1 the filter also no longer lowercases. ThaiAnalyzer
|
For matchVersion >= 3.1 the filter also no longer lowercases. ThaiAnalyzer
|
||||||
will use a separate LowerCaseFilter instead. (Uwe Schindler, Robert Muir)
|
will use a separate LowerCaseFilter instead. (Uwe Schindler, Robert Muir)
|
||||||
|
|
||||||
* LUCENE-2615: Fix DirectIOLinuxDirectory to not assign bogus
|
* LUCENE-2615: Fix DirectIOLinuxDirectory to not assign bogus
|
||||||
permissions to newly created files, and to not silently hardwire
|
permissions to newly created files, and to not silently hardwire
|
||||||
buffer size to 1 MB. (Mark Miller, Robert Muir, Mike McCandless)
|
buffer size to 1 MB. (Mark Miller, Robert Muir, Mike McCandless)
|
||||||
|
|
||||||
* LUCENE-2629: Fix gennorm2 task for generating ICUFoldingFilter's .nrm file. This allows
|
* LUCENE-2629: Fix gennorm2 task for generating ICUFoldingFilter's .nrm file. This allows
|
||||||
you to customize its normalization/folding, by editing the source data files in src/data
|
you to customize its normalization/folding, by editing the source data files in src/data
|
||||||
and regenerating a new .nrm with 'ant gennorm2'. (David Bowen via Robert Muir)
|
and regenerating a new .nrm with 'ant gennorm2'. (David Bowen via Robert Muir)
|
||||||
|
|
||||||
* LUCENE-2653: ThaiWordFilter depends on the JRE having a Thai dictionary, which is not
|
* LUCENE-2653: ThaiWordFilter depends on the JRE having a Thai dictionary, which is not
|
||||||
always the case. If the dictionary is unavailable, the filter will now throw
|
always the case. If the dictionary is unavailable, the filter will now throw
|
||||||
UnsupportedOperationException in the constructor. (Robert Muir)
|
UnsupportedOperationException in the constructor. (Robert Muir)
|
||||||
|
|
||||||
* LUCENE-589: Fix contrib/demo for international documents.
|
* LUCENE-589: Fix contrib/demo for international documents.
|
||||||
(Curtis d'Entremont via Robert Muir)
|
(Curtis d'Entremont via Robert Muir)
|
||||||
|
|
||||||
* LUCENE-2246: Fix contrib/demo for Turkish html documents.
|
* LUCENE-2246: Fix contrib/demo for Turkish html documents.
|
||||||
(Selim Nadi via Robert Muir)
|
(Selim Nadi via Robert Muir)
|
||||||
|
|
||||||
* LUCENE-590: Demo HTML parser gives incorrect summaries when title is repeated as a heading
|
* LUCENE-590: Demo HTML parser gives incorrect summaries when title is repeated as a heading
|
||||||
(Curtis d'Entremont via Robert Muir)
|
(Curtis d'Entremont via Robert Muir)
|
||||||
|
|
||||||
* LUCENE-591: The demo indexer now indexes meta keywords.
|
* LUCENE-591: The demo indexer now indexes meta keywords.
|
||||||
(Curtis d'Entremont via Robert Muir)
|
(Curtis d'Entremont via Robert Muir)
|
||||||
|
|
||||||
API Changes
|
API Changes
|
||||||
|
|
||||||
|
* LUCENE-2867: Some contrib queryparser methods that receives CharSequence as
|
||||||
|
identifier, such as QueryNode#unsetTag(CharSequence), were deprecated and
|
||||||
|
will be removed on version 4. (Adriano Crestani)
|
||||||
|
|
||||||
* LUCENE-2147: Spatial GeoHashUtils now always decode GeoHash strings
|
* LUCENE-2147: Spatial GeoHashUtils now always decode GeoHash strings
|
||||||
with full precision. GeoHash#decode_exactly(String) was merged into
|
with full precision. GeoHash#decode_exactly(String) was merged into
|
||||||
GeoHash#decode(String). (Chris Male, Simon Willnauer)
|
GeoHash#decode(String). (Chris Male, Simon Willnauer)
|
||||||
|
@ -192,6 +187,12 @@ API Changes
|
||||||
|
|
||||||
New features
|
New features
|
||||||
|
|
||||||
|
* LUCENE-2500: Added DirectIOLinuxDirectory, a Linux-specific
|
||||||
|
Directory impl that uses the O_DIRECT flag to bypass the buffer
|
||||||
|
cache. This is useful to prevent segment merging from evicting
|
||||||
|
pages from the buffer cache, since fadvise/madvise do not seem.
|
||||||
|
(Michael McCandless)
|
||||||
|
|
||||||
* LUCENE-2306: Add NumericRangeFilter and NumericRangeQuery support to XMLQueryParser.
|
* LUCENE-2306: Add NumericRangeFilter and NumericRangeQuery support to XMLQueryParser.
|
||||||
(Jingkei Ly, via Mark Harwood)
|
(Jingkei Ly, via Mark Harwood)
|
||||||
|
|
||||||
|
@ -281,6 +282,11 @@ New features
|
||||||
BooleanModifiersQueryNodeProcessor, for example instead of GroupQueryNodeProcessor.
|
BooleanModifiersQueryNodeProcessor, for example instead of GroupQueryNodeProcessor.
|
||||||
(Adriano Crestani via Robert Muir)
|
(Adriano Crestani via Robert Muir)
|
||||||
|
|
||||||
|
* LUCENE-2791: Added WindowsDirectory, a Windows-specific Directory impl
|
||||||
|
that doesn't synchronize on the file handle. This can be useful to
|
||||||
|
avoid the performance problems of SimpleFSDirectory and NIOFSDirectory.
|
||||||
|
(Robert Muir, Simon Willnauer, Uwe Schindler, Michael McCandless)
|
||||||
|
|
||||||
* LUCENE-2842: Add analyzer for Galician. Also adds the RSLP (Orengo) stemmer
|
* LUCENE-2842: Add analyzer for Galician. Also adds the RSLP (Orengo) stemmer
|
||||||
for Portuguese. (Robert Muir)
|
for Portuguese. (Robert Muir)
|
||||||
|
|
||||||
|
@ -305,7 +311,10 @@ Build
|
||||||
(Robert Muir)
|
(Robert Muir)
|
||||||
|
|
||||||
* LUCENE-2833: Upgrade contrib/ant's jtidy jar file to r938 (Robert Muir)
|
* LUCENE-2833: Upgrade contrib/ant's jtidy jar file to r938 (Robert Muir)
|
||||||
|
|
||||||
|
* LUCENE-2413: Moved the demo out of lucene core and into contrib/demo.
|
||||||
|
(Robert Muir)
|
||||||
|
|
||||||
Optimizations
|
Optimizations
|
||||||
|
|
||||||
* LUCENE-2157: DelimitedPayloadTokenFilter no longer copies the buffer
|
* LUCENE-2157: DelimitedPayloadTokenFilter no longer copies the buffer
|
||||||
|
|
Loading…
Reference in New Issue