blob: 61d1022ff7a71e6681942ccb5464a0587594a61c [file] [log] [blame]
Apache Lucene
Copyright 2011 The Apache Software Foundation
This product includes software developed by
The Apache Software Foundation (http://www.apache.org/).
Includes software from other Apache Software Foundation projects,
including, but not limited to:
- Apache Commons
The snowball stemmers in
common/src/java/net/sf/snowball
were developed by Martin Porter and Richard Boulton.
The snowball stopword lists in
common/src/resources/org/apache/lucene/analysis/snowball
were developed by Martin Porter and Richard Boulton.
The full snowball package is available from
http://snowball.tartarus.org/
The KStem stemmer in
common/src/org/apache/lucene/analysis/en
was developed by Bob Krovetz and Sergio Guzman-Lara (CIIR-UMass Amherst)
under the BSD-license.
The Arabic,Persian,Romanian,Bulgarian, and Hindi analyzers (common) come with a default
stopword list that is BSD-licensed created by Jacques Savoy. These files reside in:
common/src/resources/org/apache/lucene/analysis/ar/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/fa/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/ro/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/bg/stopwords.txt,
common/src/resources/org/apache/lucene/analysis/hi/stopwords.txt
See http://members.unine.ch/jacques.savoy/clef/index.html.
The German,Spanish,Finnish,French,Hungarian,Italian,Portuguese,Russian and Swedish light stemmers
(common) are based on BSD-licensed reference implementations created by Jacques Savoy and
Ljiljana Dolamic. These files reside in:
common/src/java/org/apache/lucene/analysis/de/GermanLightStemmer.java
common/src/java/org/apache/lucene/analysis/de/GermanMinimalStemmer.java
common/src/java/org/apache/lucene/analysis/es/SpanishLightStemmer.java
common/src/java/org/apache/lucene/analysis/fi/FinnishLightStemmer.java
common/src/java/org/apache/lucene/analysis/fr/FrenchLightStemmer.java
common/src/java/org/apache/lucene/analysis/fr/FrenchMinimalStemmer.java
common/src/java/org/apache/lucene/analysis/hu/HungarianLightStemmer.java
common/src/java/org/apache/lucene/analysis/it/ItalianLightStemmer.java
common/src/java/org/apache/lucene/analysis/pt/PortugueseLightStemmer.java
common/src/java/org/apache/lucene/analysis/ru/RussianLightStemmer.java
common/src/java/org/apache/lucene/analysis/sv/SwedishLightStemmer.java
The Stempel analyzer (stempel) includes BSD-licensed software developed
by the Egothor project http://egothor.sf.net/, created by Leo Galambos, Martin Kvapil,
and Edmond Nolan.
The Polish analyzer (stempel) comes with a default
stopword list that is BSD-licensed created by the Carrot2 project. The file resides
in stempel/src/resources/org/apache/lucene/analysis/pl/stopwords.txt.
See http://project.carrot2.org/license.html.
The SmartChineseAnalyzer source code (smartcn) was
provided by Xiaoping Gao and copyright 2009 by www.imdict.net.
WordBreakTestUnicode_*.java (under modules/analysis/common/src/test/)
is derived from Unicode data such as the Unicode Character Database.
See http://unicode.org/copyright.html for more details.
The Morfologik analyzer (morfologik) includes BSD-licensed software
developed by Dawid Weiss and Marcin MiƂkowski (http://morfologik.blogspot.com/).
Morfologik uses data from Polish ispell/myspell dictionary
(http://www.sjp.pl/slownik/en/) licenced on the terms of (inter alia)
LGPL and Creative Commons ShareAlike.
Morfologic includes data from BSD-licensed dictionary of Polish (SGJP)
(http://sgjp.pl/morfeusz/)