Apache Lucene
Copyright 2011 The Apache Software Foundation

This product includes software developed by
The Apache Software Foundation (http://www.apache.org/).

Includes software from other Apache Software Foundation projects,
including, but not limited to:
- Apache Commons

The snowball stemmers in
common/src/java/net/sf/snowball
were developed by Martin Porter and Richard Boulton.
The snowball stopword lists in
common/src/resources/org/apache/lucene/analysis/snowball
were developed by Martin Porter and Richard Boulton.
The full snowball package is available from
http://snowball.tartarus.org/

The KStem stemmer in
common/src/org/apache/lucene/analysis/en
was developed by Bob Krovetz and Sergio Guzman-Lara (CIIR-UMass Amherst)
under the BSD-license.

The Arabic, Persian, Romanian, Bulgarian, and Hindi analyzers (common) come with a default
stopword list that is BSD-licensed created by Jacques Savoy. These files reside in:
common/src/resources/org/apache/lucene/analysis/ar/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/fa/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/ro/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/bg/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/hi/stopwords.txt
See http://members.unine.ch/jacques.savoy/clef/index.html.

The German, Spanish, Finnish, French, Hungarian, Italian, Portuguese, Russian and Swedish light stemmers
(common) are based on BSD-licensed reference implementations created by Jacques Savoy and
Ljiljana Dolamic. These files reside in:
common/src/java/org/apache/lucene/analysis/de/GermanLightStemmer.java
common/src/java/org/apache/lucene/analysis/de/GermanMinimalStemmer.java
common/src/java/org/apache/lucene/analysis/es/SpanishLightStemmer.java
common/src/java/org/apache/lucene/analysis/fi/FinnishLightStemmer.java
common/src/java/org/apache/lucene/analysis/fr/FrenchLightStemmer.java
common/src/java/org/apache/lucene/analysis/fr/FrenchMinimalStemmer.java
common/src/java/org/apache/lucene/analysis/hu/HungarianLightStemmer.java
common/src/java/org/apache/lucene/analysis/it/ItalianLightStemmer.java
common/src/java/org/apache/lucene/analysis/pt/PortugueseLightStemmer.java
common/src/java/org/apache/lucene/analysis/ru/RussianLightStemmer.java
common/src/java/org/apache/lucene/analysis/sv/SwedishLightStemmer.java

The Stempel analyzer (stempel) includes BSD-licensed software developed
by the Egothor project http://egothor.sf.net/, created by Leo Galambos, Martin Kvapil,
and Edmond Nolan.

The Polish analyzer (stempel) comes with a default
stopword list that is BSD-licensed created by the Carrot2 project. The file resides
in stempel/src/resources/org/apache/lucene/analysis/pl/stopwords.txt.
See http://project.carrot2.org/license.html.

The SmartChineseAnalyzer source code (smartcn) was
provided by Xiaoping Gao and copyright 2009 by www.imdict.net.

WordBreakTestUnicode_*.java (under modules/analysis/common/src/test/)
is derived from Unicode data such as the Unicode Character Database.
See http://unicode.org/copyright.html for more details.

The Morfologik analyzer (morfologik) includes BSD-licensed software
developed by Dawid Weiss and Marcin Miłkowski (http://morfologik.blogspot.com/).

Morfologik uses data from Polish ispell/myspell dictionary
(http://www.sjp.pl/slownik/en/) licenced on the terms of (inter alia)
LGPL and Creative Commons ShareAlike.

Morfologik includes data from BSD-licensed dictionary of Polish (SGJP)
(http://sgjp.pl/morfeusz/)

===========================================================================
Kuromoji Japanese Morphological Analyzer - Apache Lucene Integration
===========================================================================

This software includes a binary and/or source version of data from

mecab-ipadic-2.7.0-20070801

which can be obtained from

http://atilika.com/releases/mecab-ipadic/mecab-ipadic-2.7.0-20070801.tar.gz

or

http://jaist.dl.sourceforge.net/project/mecab/mecab-ipadic/2.7.0-20070801/mecab-ipadic-2.7.0-20070801.tar.gz

===========================================================================
mecab-ipadic-2.7.0-20070801 Notice
===========================================================================

Nara Institute of Science and Technology (NAIST),
the copyright holders, disclaims all warranties with regard to this
software, including all implied warranties of merchantability and
fitness, in no event shall NAIST be liable for
any special, indirect or consequential damages or any damages
whatsoever resulting from loss of use, data or profits, whether in an
action of contract, negligence or other tortuous action, arising out
of or in connection with the use or performance of this software.

A large portion of the dictionary entries
originate from ICOT Free Software. The following conditions for ICOT
Free Software applies to the current dictionary as well.

Each User may also freely distribute the Program, whether in its
original form or modified, to any third party or parties, PROVIDED
that the provisions of Section 3 ("NO WARRANTY") will ALWAYS appear
on, or be attached to, the Program, which is distributed substantially
in the same form as set out herein and that such intended
distribution, if actually made, will neither violate or otherwise
contravene any of the laws and regulations of the countries having
jurisdiction over the User or the intended distribution itself.

NO WARRANTY

The program was produced on an experimental basis in the course of the
research and development conducted during the project and is provided
to users as so produced on an experimental basis. Accordingly, the
program is provided without any warranty whatsoever, whether express,
implied, statutory or otherwise. The term "warranty" used herein
includes, but is not limited to, any warranty of the quality,
performance, merchantability and fitness for a particular purpose of
the program and the nonexistence of any infringement or violation of
any right of any third party.

Each user of the program will agree and understand, and be deemed to
have agreed and understood, that there is no warranty whatsoever for
the program and, accordingly, the entire risk arising from or
otherwise connected with the program is assumed by the user.

Therefore, neither ICOT, the copyright holder, or any other
organization that participated in or was otherwise related to the
development of the program and their respective officials, directors,
officers and other employees shall be held liable for any and all
damages, including, without limitation, general, special, incidental
and consequential damages, arising out of or otherwise in connection
with the use or inability to use the program or any product, material
or result produced or otherwise obtained by using the program,
regardless of whether they have been advised of, or otherwise had
knowledge of, the possibility of such damages at any time during the
project or thereafter. Each user will be deemed to have agreed to the
foregoing by his or her commencement of use of the program. The term
"use" as used herein includes, but is not limited to, the use,
modification, copying and distribution of the program and the
production of secondary products from the program.

In the case where the program, whether in its original form or
modified, was distributed or delivered to or received by a user from
any person, organization or entity other than ICOT, unless it makes or
grants independently of ICOT any specific warranty to the user in
writing, such person, organization or entity, will also be exempted
from and not be held liable to the user for any such damages as noted
above as far as the program is concerned.