mirror of https://github.com/apache/lucene.git
SOLR-11894: Ref guide: remove 'side' param from Edge N-Gram Tokenizer documentation
This commit is contained in:
parent
a09126d926
commit
6ca04b62a9
|
@ -223,8 +223,6 @@ Reads the field text and generates edge n-gram tokens of sizes in the given rang
|
|||
|
||||
`maxGramSize`: (integer, default is 1) The maximum n-gram size, must be >= `minGramSize`.
|
||||
|
||||
`side`: ("front" or "back", default is "front") Whether to compute the n-grams from the beginning (front) of the text or from the end (back).
|
||||
|
||||
*Example:*
|
||||
|
||||
Default behavior (min and max default to 1):
|
||||
|
@ -255,21 +253,6 @@ Edge n-gram range of 2 to 5
|
|||
|
||||
**Out:**"ba", "bab", "baba", "babal"
|
||||
|
||||
*Example:*
|
||||
|
||||
Edge n-gram range of 2 to 5, from the back side:
|
||||
|
||||
[source,xml]
|
||||
----
|
||||
<analyzer>
|
||||
<tokenizer class="solr.EdgeNGramTokenizerFactory" minGramSize="2" maxGramSize="5" side="back"/>
|
||||
</analyzer>
|
||||
----
|
||||
|
||||
*In:* "babaloo"
|
||||
|
||||
*Out:* "oo", "loo", "aloo", "baloo"
|
||||
|
||||
== ICU Tokenizer
|
||||
|
||||
This tokenizer processes multilingual text and tokenizes it appropriately based on its script attribute.
|
||||
|
|
Loading…
Reference in New Issue