LUCENE-1576: fix BrazilianAnalyzer to downcase before filtering stop words

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@759307 13f79535-47bb-0310-9956-ffa450edef68
This commit is contained in:
Michael McCandless 2009-03-27 19:04:25 +00:00
parent 2d221f79ad
commit c73712d1bb
2 changed files with 5 additions and 2 deletions

View File

@ -32,6 +32,10 @@ Bug fixes
characters to only apply to the correct subset (Daniel Cheng via characters to only apply to the correct subset (Daniel Cheng via
Mike McCandless) Mike McCandless)
7. LUCENE-1576: Fix BrazilianAnalyzer to downcase tokens after
StandardTokenizer so that stop words with mixed case are filtered
out. (Rafael Cunha de Almeida, Douglas Campos via Mike McCandless)
New features New features
1. LUCENE-1470: Added TrieRangeQuery, a much faster implementation of 1. LUCENE-1470: Added TrieRangeQuery, a much faster implementation of

View File

@ -130,11 +130,10 @@ public final class BrazilianAnalyzer extends Analyzer {
*/ */
public final TokenStream tokenStream(String fieldName, Reader reader) { public final TokenStream tokenStream(String fieldName, Reader reader) {
TokenStream result = new StandardTokenizer( reader ); TokenStream result = new StandardTokenizer( reader );
result = new LowerCaseFilter( result );
result = new StandardFilter( result ); result = new StandardFilter( result );
result = new StopFilter( result, stoptable ); result = new StopFilter( result, stoptable );
result = new BrazilianStemFilter( result, excltable ); result = new BrazilianStemFilter( result, excltable );
// Convert to lowercase after stemming!
result = new LowerCaseFilter( result );
return result; return result;
} }
} }