From 3bdbe02922fe9bdf098d50ed07bfc189bd94f700 Mon Sep 17 00:00:00 2001
From: Yonik Seeley
Date: Tue, 28 Feb 2006 17:25:42 +0000
Subject: [PATCH] analysis debugging examples

git-svn-id: https://svn.apache.org/repos/asf/incubator/solr/trunk@381717 13f79535-47bb-0310-9956-ffa450edef68
---
 .../documentation/content/xdocs/tutorial.xml | 26 +++++++++++++++++++
 1 file changed, 26 insertions(+)

diff --git a/src/site/src/documentation/content/xdocs/tutorial.xml b/src/site/src/documentation/content/xdocs/tutorial.xml
index bd3360b9fcd..1b042d300e8 100755
--- a/src/site/src/documentation/content/xdocs/tutorial.xml
+++ b/src/site/src/documentation/content/xdocs/tutorial.xml
@@ -288,6 +288,32 @@ Go ahead and edit the existing XML files to change some of the data, and re-run

+
+ Analysis Debugging +

There is a handy analysis
+ debugging page where you can see how a text value is broken down into words,
+ and see the resulting tokens after they pass through each filter in the chain.
+

+

+ This
+ shows how "Canon PowerShot SD500" would be indexed as a value in the name field. Each row of
+ the table shows the resulting tokens after having passed through the next TokenFilter in the Analyzer for the name field.
+ Notice how both powershot and power, shot are indexed. Tokens generated at the same position
+ are shown in the same column, in this case shot and powershot.
+
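To make the table easier to picture, here is a minimal Python sketch of a filter chain that records the tokens after every stage. It does not use Solr or Lucene. The function names (`whitespace_tokenize`, `split_word_parts`, `lowercase`, `analyze`, `columns`) and the splitting rule are assumptions made for illustration, not the actual filters configured for the name field.

```python
import re
from typing import Dict, List, Tuple

Token = Tuple[str, int]  # (token text, position); positions start at 1


def whitespace_tokenize(text: str) -> List[Token]:
    """Split on whitespace, one position per word."""
    return [(word, pos) for pos, word in enumerate(text.split(), start=1)]


def split_word_parts(tokens: List[Token]) -> List[Token]:
    """Hypothetical filter: split on case changes and letter/digit boundaries.

    When a token splits into several purely alphabetic parts, the joined
    form is also emitted at the position of the last part.
    """
    out: List[Token] = []
    pos = 0
    for text, _ in tokens:
        parts = re.findall(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])|\d+", text)
        for part in parts:
            pos += 1
            out.append((part, pos))
        if len(parts) > 1 and all(part.isalpha() for part in parts):
            out.append(("".join(parts), pos))
    return out


def lowercase(tokens: List[Token]) -> List[Token]:
    """Lowercase every token, leaving positions untouched."""
    return [(text.lower(), pos) for text, pos in tokens]


def analyze(text: str) -> List[Tuple[str, List[Token]]]:
    """Return the token list after each stage, one entry per table row."""
    tokens = whitespace_tokenize(text)
    stages = [("whitespace_tokenize", tokens)]
    for token_filter in (split_word_parts, lowercase):
        tokens = token_filter(tokens)
        stages.append((token_filter.__name__, tokens))
    return stages


def columns(tokens: List[Token]) -> Dict[int, List[str]]:
    """Group tokens by position, mirroring the columns of the table."""
    cols: Dict[int, List[str]] = {}
    for text, pos in tokens:
        cols.setdefault(pos, []).append(text)
    return cols


if __name__ == "__main__":
    for name, tokens in analyze("Canon PowerShot SD500"):
        print(name, columns(tokens))
```

In this sketch, the last stage places shot and powershot in the same column because they share a position, which is the behavior the paragraph above describes.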

+

Selecting verbose output
+ will show more details, such as the name of each analyzer component in the chain, token positions, and the start and end positions
+ of the token in the original text.
+
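As a rough illustration of the extra per-token details, the following Python sketch records a position and start/end offsets for each whitespace-separated token. `TokenInfo` and `tokenize_verbose` are hypothetical names, and treating the end offset as one past the last character is an assumption of this sketch rather than a statement about the page's output.

```python
import re
from typing import List, NamedTuple


class TokenInfo(NamedTuple):
    text: str
    position: int  # 1-based token position
    start: int     # offset of the first character in the original text
    end: int       # offset one past the last character (assumed convention)


def tokenize_verbose(text: str) -> List[TokenInfo]:
    """Whitespace tokenizer that keeps position and character offsets."""
    return [
        TokenInfo(match.group(), pos, match.start(), match.end())
        for pos, match in enumerate(re.finditer(r"\S+", text), start=1)
    ]


if __name__ == "__main__":
    for info in tokenize_verbose("Canon PowerShot SD500"):
        print(info)
```

With offsets kept this way, slicing the original text with `text[info.start:info.end]` recovers the token.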

+

Selecting highlight matches
+ when both index and query values are provided takes the resulting terms from the query value and highlights
+ all matching tokens in the index value analysis.
+
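The idea behind match highlighting can be sketched in a few lines of Python: analyze the query value, then flag every token of the index value that equals one of the query terms. `simple_analyze` is a stand-in analyzer and `highlight_matches` a hypothetical helper. Both are assumptions for illustration, not Solr code.

```python
from typing import List, Tuple


def simple_analyze(text: str) -> List[str]:
    """Stand-in analyzer (an assumption): lowercase, then split on whitespace."""
    return text.lower().split()


def highlight_matches(index_value: str, query_value: str) -> List[Tuple[str, bool]]:
    """Pair each index token with True when it matches an analyzed query term."""
    query_terms = set(simple_analyze(query_value))
    return [(token, token in query_terms) for token in simple_analyze(index_value)]


if __name__ == "__main__":
    for token, matched in highlight_matches("Canon PowerShot SD500", "powershot sd500"):
        print(("*" if matched else " "), token)
```

The comparison runs on analyzed tokens, not raw text, so a query only matches when both sides produce the same term after analysis.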

+

Here
+ is an example of stemming and stop-words at work.
+
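For readers who want to see the two steps in isolation, here is a small Python sketch that removes stop words and then applies a deliberately naive suffix-stripping stemmer. The stop-word list, the sample sentence, and `naive_stem` are all assumptions for illustration. This is not the Porter stemmer or any stemmer shipped with Solr.

```python
from typing import List, Tuple

# Hypothetical stop-word list, kept tiny for the example.
STOP_WORDS = {"a", "an", "and", "are", "is", "of", "the"}


def naive_stem(word: str) -> str:
    """Strip one common suffix if at least three characters remain."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def analyze(text: str) -> List[Tuple[str, int]]:
    """Lowercase, drop stop words, then stem; original positions are kept."""
    tokens = [(word, pos) for pos, word in enumerate(text.lower().split(), start=1)]
    kept = [(word, pos) for word, pos in tokens if word not in STOP_WORDS]
    return [(naive_stem(word), pos) for word, pos in kept]


if __name__ == "__main__":
    print(analyze("The dogs are jumping"))
```

Keeping the original positions leaves gaps where stop words were removed, which is one design choice among several. A real analyzer may handle positions differently.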

+
+