2010-05-05 12:27:58 -04:00
|
|
|
Apache Lucene
|
2011-03-06 13:22:48 -05:00
|
|
|
Copyright 2011 The Apache Software Foundation
|
2010-05-05 12:27:58 -04:00
|
|
|
|
|
|
|
This product includes software developed by
|
|
|
|
The Apache Software Foundation (http://www.apache.org/).
|
|
|
|
|
2010-06-23 07:25:17 -04:00
|
|
|
Includes software from other Apache Software Foundation projects,
|
|
|
|
including, but not limited to:
|
|
|
|
- Apache Commons
|
|
|
|
|
2010-05-05 12:27:58 -04:00
|
|
|
The snowball stemmers in
|
|
|
|
common/src/java/net/sf/snowball
|
|
|
|
were developed by Martin Porter and Richard Boulton.
|
|
|
|
The snowball stopword lists in
|
|
|
|
common/src/resources/org/apache/lucene/analysis/snowball
|
|
|
|
were developed by Martin Porter and Richard Boulton.
|
|
|
|
The full snowball package is available from
|
|
|
|
http://snowball.tartarus.org/
|
|
|
|
|
2011-06-02 08:58:22 -04:00
|
|
|
The KStem stemmer in
|
|
|
|
common/src/org/apache/lucene/analysis/en
|
|
|
|
was developed by Bob Krovetz and Sergio Guzman-Lara (CIIR-UMass Amherst)
|
|
|
|
under the BSD-license.
|
|
|
|
|
2010-07-14 08:10:34 -04:00
|
|
|
The Arabic,Persian,Romanian,Bulgarian, and Hindi analyzers (common) come with a default
|
|
|
|
stopword list that is BSD-licensed created by Jacques Savoy. These files reside in:
|
|
|
|
common/src/resources/org/apache/lucene/analysis/ar/stopwords.txt,
|
|
|
|
common/src/resources/org/apache/lucene/analysis/fa/stopwords.txt,
|
|
|
|
common/src/resources/org/apache/lucene/analysis/ro/stopwords.txt,
|
|
|
|
common/src/resources/org/apache/lucene/analysis/bg/stopwords.txt,
|
|
|
|
common/src/resources/org/apache/lucene/analysis/hi/stopwords.txt
|
2010-05-05 12:27:58 -04:00
|
|
|
See http://members.unine.ch/jacques.savoy/clef/index.html.
|
|
|
|
|
2010-07-14 08:10:34 -04:00
|
|
|
The German,Spanish,Finnish,French,Hungarian,Italian,Portuguese,Russian and Swedish light stemmers
|
|
|
|
(common) are based on BSD-licensed reference implementations created by Jacques Savoy and
|
|
|
|
Ljiljana Dolamic. These files reside in:
|
|
|
|
common/src/java/org/apache/lucene/analysis/de/GermanLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/de/GermanMinimalStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/es/SpanishLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/fi/FinnishLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/fr/FrenchLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/fr/FrenchMinimalStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/hu/HungarianLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/it/ItalianLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/pt/PortugueseLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/ru/RussianLightStemmer.java
|
|
|
|
common/src/java/org/apache/lucene/analysis/sv/SwedishLightStemmer.java
|
2010-05-05 12:27:58 -04:00
|
|
|
|
|
|
|
The Stempel analyzer (stempel) includes BSD-licensed software developed
|
|
|
|
by the Egothor project http://egothor.sf.net/, created by Leo Galambos, Martin Kvapil,
|
|
|
|
and Edmond Nolan.
|
|
|
|
|
|
|
|
The Polish analyzer (stempel) comes with a default
|
|
|
|
stopword list that is BSD-licensed created by the Carrot2 project. The file resides
|
|
|
|
in stempel/src/resources/org/apache/lucene/analysis/pl/stopwords.txt.
|
|
|
|
See http://project.carrot2.org/license.html.
|
|
|
|
|
|
|
|
The SmartChineseAnalyzer source code (smartcn) was
|
|
|
|
provided by Xiaoping Gao and copyright 2009 by www.imdict.net.
|
2010-09-28 02:16:16 -04:00
|
|
|
|
|
|
|
WordBreakTestUnicode_*.java (under modules/analysis/common/src/test/)
|
|
|
|
is derived from Unicode data such as the Unicode Character Database.
|
|
|
|
See http://unicode.org/copyright.html for more details.
|
|
|
|
|
2011-06-30 15:12:54 -04:00
|
|
|
The Morfologik analyzer (morfologik) includes BSD-licensed software
|
|
|
|
developed by Dawid Weiss and Marcin Miłkowski (http://morfologik.blogspot.com/).
|
|
|
|
|
|
|
|
Morfologik uses data from Polish ispell/myspell dictionary
|
|
|
|
(http://www.sjp.pl/slownik/en/) licenced on the terms of (inter alia)
|
|
|
|
LGPL and Creative Commons ShareAlike.
|
|
|
|
|
|
|
|
Morfologic includes data from BSD-licensed dictionary of Polish (SGJP)
|
|
|
|
(http://sgjp.pl/morfeusz/)
|