From 3bdbe02922fe9bdf098d50ed07bfc189bd94f700 Mon Sep 17 00:00:00 2001
From: Yonik Seeley
There is a handy analysis + debugging page where you can see how a text value is broken down into words, + and shows the resulting tokens after they pass through each filter in the chain. +
+
+ This
+ shows how "Canon PowerShot SD500
" would be indexed as a value in the name field. Each row of
+ the table shows the resulting tokens after having passed through the next TokenFilter in the Analyzer for the name
field.
+ Notice how both powershot
and power
, shot
are indexed. Tokens generated at the same position
+ are shown in the same column, in this case shot
and powershot
.
+
Selecting verbose output + will show more details, such as the name of each analyzer component in the chain, token positions, and the start and end positions + of the token in the original text. +
+Selecting highlight matches + when both index and query values are provided, will take the resulting terms from the query value and highlight + all matches during the index value analysis. +
+Here + is an example of stemming and stop-words at work. +
+