LUCENE-1882: improved package level docs for smartcn

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@810247 13f79535-47bb-0310-9956-ffa450edef68
2025-02-18 07:55:29 +00:00 · 2009-09-01 21:31:18 +00:00 · 2009-09-01 21:31:18 +00:00 · 29e6be94c3
commit 29e6be94c3
parent e5cb7f668a
2 changed files with 27 additions and 6 deletions
--- a/contrib/analyzers/smartcn/src/java/org/apache/lucene/analysis/cn/smart/hhmm/package.html
+++ b/contrib/analyzers/smartcn/src/java/org/apache/lucene/analysis/cn/smart/hhmm/package.html
@ -15,14 +15,16 @@
 See the License for the specific language governing permissions and
 limitations under the License.
 -->
-<html><head></head>
+<html><head>
 <META http-equiv="Content-Type" content="text/html; charset=UTF-8">
 </head>
 <body>
 <div>
-SmartChineseAnalyzer Hidden Markov Model package
+SmartChineseAnalyzer Hidden Markov Model package.
 </div>
 <div>
 <font color="#FF0000">
-WARNING: The status of the analyzers/smartcn <b>analysis.cn</b> package is experimental. The APIs
+WARNING: The status of the analyzers/smartcn <b>analysis.cn.smart</b> package is experimental. The APIs
 and file formats introduced here might change in the future and will not be supported anymore
 in such a case.
 </font>
--- a/contrib/analyzers/smartcn/src/java/org/apache/lucene/analysis/cn/smart/package.html
+++ b/contrib/analyzers/smartcn/src/java/org/apache/lucene/analysis/cn/smart/package.html
@ -15,17 +15,36 @@
 See the License for the specific language governing permissions and
 limitations under the License.
 -->
-<html><head></head>
+<html>
 <head>
 <META http-equiv="Content-Type" content="text/html; charset=UTF-8">
 </head>
 <body>
 <div>
-SmartChineseAnalyzer Tokenizers and TokenFilters
+Analyzer for Simplified Chinese, which indexes words.
 </div>
 <div>
 <font color="#FF0000">
-WARNING: The status of the analyzers/smartcn <b>analysis.cn</b> package is experimental. The APIs
+WARNING: The status of the analyzers/smartcn <b>analysis.cn.smart</b> package is experimental. The APIs
 and file formats introduced here might change in the future and will not be supported anymore
 in such a case.
 </font>
 </div>
 <div>
 Three analyzers are provided for Chinese, each of which treats Chinese text in a different way.
 <ul>
 	<li>ChineseAnalyzer (in the analyzers/cn package): Index unigrams (individual Chinese characters) as a token.
 	<li>CJKAnalyzer (in the analyzers/cjk package): Index bigrams (overlapping groups of two adjacent Chinese characters) as tokens.
 	<li>SmartChineseAnalyzer (in this package): Index words (attempt to segment Chinese text into words) as tokens.
 </ul>
 Example phrase： "我是中国人"
 <ol>
 	<li>ChineseAnalyzer: 我－是－中－国－人</li>
 	<li>CJKAnalyzer: 我是－是中－中国－国人</li>
 	<li>SmartChineseAnalyzer: 我－是－中国－人</li>
 </ol>
 </div>
 </body>
 </html>