From 5e3cd6711a66ab19d5bc06056037a7ec9a585afb Mon Sep 17 00:00:00 2001 From: Michael McCandless Date: Fri, 5 Dec 2008 10:16:50 +0000 Subject: [PATCH] LUCENE-1470: describe TrieRangeFilter/Query in README git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723701 13f79535-47bb-0310-9956-ffa450edef68 --- contrib/queries/README.txt | 21 ++++++++++++--------- 1 file changed, 12 insertions(+), 9 deletions(-) diff --git a/contrib/queries/README.txt b/contrib/queries/README.txt index b4355cd55fe..0dcd18cfc73 100644 --- a/contrib/queries/README.txt +++ b/contrib/queries/README.txt @@ -1,10 +1,18 @@ This module contains a number of filter and query objects that add to core lucene. -The "MoreLikeThis" class from the "similarity" module has been copied into here. +==== TrieRangeFilter/TrieRangeQuery - This package provides fast numeric range queries/filters on long, +double or Date fields based on trie structures. Numerical values are stored in a special string-encoded +format with variable precision (all numerical values like doubles, longs, and timestamps are converted +to lexicographic sortable string representations). A range is then divided recursively into multiple +intervals for searching: The center of the range is searched only with the lowest possible precision +in the trie, the boundaries are matched more exactly. This reduces the number of terms and thus improves +dramatically the performance of range queries, on the cost of larger index sizes. + +==== The "MoreLikeThis" class from the "similarity" module has been copied into here. If people are generally happy with this move then the similarity module can be deleted, or at least a "Moved to queries module..." note left in its place. -==== FuzzyLikeThis - mixes the behaviour of FuzzyQuery and MoreLikeThis but with special consideration +==== FuzzyLikeThis - mixes the behaviour of FuzzyQuery and MoreLikeThis but with special consideration of fuzzy scoring factors. This generally produces good results for queries where users may provide details in a number of fields and have no knowledge of boolean query syntax and also want a degree of fuzzy matching. The query is fast because, like MoreLikeThis, it optimizes the query to only the most distinguishing terms. @@ -13,15 +21,10 @@ MoreLikeThis, it optimizes the query to only the most distinguishing terms. Unlike the "NOT" clause, this still selects documents that contain undesirable terms, but reduces the overall score of docs containing these terms. - ==== TermsFilter - Unlike a RangeFilter this can be used for filtering on multiple terms that are not necessarily in a sequence. An example might be a collection of primary keys from a database query result or perhaps a choice of "category" labels picked by the end user. - - -Mark Harwood -25/02/2006 - - +Mark Harwood & Uwe Schindler +05/12/2008