doc - address tokenstream reusability and fold in previous TokenFilterFactory comment

git-svn-id: https://svn.apache.org/repos/asf/lucene/solr/trunk@823695 13f79535-47bb-0310-9956-ffa450edef68
Yonik Seeley 2009-10-09 21:23:04 +00:00
parent 362ad79804
commit 478f419816
1 changed files with 7 additions and 5 deletions


@@ -30,6 +30,13 @@ There is a new default faceting algorithm for multiValued fields that should be
faster for most cases. One can revert to the previous algorithm (which has
also been improved somewhat) by adding facet.method=enum to the request.
If you use custom Tokenizer or TokenFilter components in a chain specified in
schema.xml, they must support reusability. If your Tokenizer or TokenFilter
maintains state, it should implement reset(). If your TokenFilterFactory does
not return a subclass of TokenFilter, then the stream it returns should
implement reset() and call reset() on its input TokenStream. TokenizerFactory implementations must
now return a Tokenizer rather than a TokenStream.
New users of Solr 1.4 will have omitTermFreqAndPositions enabled for non-text
indexed fields by default, which avoids indexing term frequency, positions, and
payloads, making the index smaller and faster. If you are upgrading from an
@@ -47,11 +54,6 @@ requests. (The simplest way to do this is by specifying it as a default param
for your request handlers in solrconfig.xml, see the example solrconfig.xml for
sample syntax.)
The TokenizerFactory API has changed to explicitly return a Tokenizer rather than
a TokenStream (that may or may not be a Tokenizer). This change is required
to take advantage of the Token reuse improvements in Lucene 2.9. For more
information, see SOLR-1377.
If spellcheck.extendedResults=true, the response format for suggestions
has changed, see SOLR-1071.
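
The reusability note in this change says that a stateful Tokenizer or TokenFilter should implement reset(), and that reset() should be passed on to the input stream. The Java sketch below illustrates why. It deliberately does not use the Lucene classes: SketchStream, ListSource, DropRepeatsFilter and ReuseDemo are hypothetical stand-ins invented for this illustration, and the String-returning next() method is a simplification, not the Lucene 2.9 TokenStream API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for a reusable token stream (NOT Lucene's TokenStream).
abstract class SketchStream {
    /** Returns the next token, or null when the stream is exhausted. */
    abstract String next();

    /** Clears per-document state so the same instance can be reused. */
    void reset() {}
}

// Stand-in source whose input is swapped between documents.
class ListSource extends SketchStream {
    private Iterator<String> it = new ArrayList<String>().iterator();

    void setInput(List<String> tokens) { it = tokens.iterator(); }

    @Override String next() { return it.hasNext() ? it.next() : null; }
}

// A stateful filter: drops a token that repeats the one emitted just before it.
class DropRepeatsFilter extends SketchStream {
    private final SketchStream input;
    private String previous; // per-document state that must not leak across documents

    DropRepeatsFilter(SketchStream input) { this.input = input; }

    @Override String next() {
        String t;
        while ((t = input.next()) != null) {
            if (!t.equals(previous)) {
                previous = t;
                return t;
            }
        }
        return null;
    }

    @Override void reset() {
        input.reset();   // pass the reset down the chain
        previous = null; // forget the last token of the previous document
    }
}

class ReuseDemo {
    /** Collects every remaining token from the stream. */
    static List<String> drain(SketchStream s) {
        List<String> out = new ArrayList<String>();
        for (String t = s.next(); t != null; t = s.next()) {
            out.add(t);
        }
        return out;
    }

    public static void main(String[] args) {
        ListSource source = new ListSource();
        DropRepeatsFilter filter = new DropRepeatsFilter(source);

        source.setInput(Arrays.asList("a", "a", "b"));
        System.out.println(drain(filter)); // [a, b]

        // Reuse the same chain for a second document.
        source.setInput(Arrays.asList("b", "c"));
        filter.reset();
        System.out.println(drain(filter)); // [b, c]
    }
}
```

If the call to filter.reset() is left out, the filter still remembers "b" from the first document and the second document yields only [c]. That is the kind of cross-document leak the reset() requirement is meant to prevent.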