| <html><head> |
| <title>Thesaurus Development</title> |
| <meta HTTP-EQUIV="content-type" CONTENT="text/html; charset=UTF-8"> |
| </head> |
| <body> |
| <h2>Lingucomponent Sub-Project: Thesaurus Development </h2> |
| |
| <p>The goal of this project is to improve existing thesauri for OpenOffice.org |
| and to create new thesauri for languages that don't have one yet.</p> |
| |
| <p>This project began by finding a synonym list for English (US) whose license |
| was compatible with the OpenOffice.org licensing. That list and some simple |
| software were then used to develop a thesaurus for OpenOffice.org 1.x. |
| OpenOffice.org 2.x now uses a thesaurus built automatically |
| from the data in <a href="http://wordnet.princeton.edu">WordNet</a>. |
| The internal file format has also changed to a text-based one.</p> |
| |
| <h4>TODO</h4> |
| |
| <ul> |
| <li><a href="http://lingucomponent.openoffice.org/issues/buglist.cgi?Submit+query=Submit+query&component=lingucomponent&subcomponent=thesaurus&issue_status=UNCONFIRMED&issue_status=NEW&issue_status=STARTED&issue_status=REOPENED&email1=&emailtype1=exact&emailassigned_to1=1&email2=&emailtype2=exact&emailreporter2=1&issueidtype=include&issue_id=&changedin=&votes=&chfieldfrom=&chfieldto=Now&chfieldvalue=&short_desc=&short_desc_type=substring&long_desc=&long_desc_type=substring&issue_file_loc=&issue_file_loc_type=substring&status_whiteboard=&status_whiteboard_type=substring&keywords=&keywords_type=anytokens&field0-0-0=noop&type0-0-0=noop&value0-0-0=&cmdtype=doit&order=Reuse+same+sort+as+last+time">See |
| the list of all open thesaurus issues</a></li> |
| <li>Create new thesauri (see below)</li> |
| </ul> |
| |
| <h4>Downloads</h4> |
| |
| <ul> |
| <li><a href="MyThes-1.zip">MyThes-1.zip (4.5 MB)</a> - standalone version of the MyThes thesaurus code. |
| It includes an en_US thesaurus in the new format for OOo 2.0 (but not yet the WordNet-based |
| thesaurus).</li> |
| <li><a href="http://www.danielnaber.de/wn2ooo">wn2ooo</a>, the script used to create the OOo |
| thesaurus from WordNet data.</li> |
| </ul> |
| |
| <h4>Creating a new thesaurus</h4> |
| |
| <p>We need your help if you are willing to maintain a website that collects and coordinates a |
| community-developed synonym list for any language. Please send an e-mail to |
| dev@lingucomponent.openoffice.org describing your skills and your interest in getting |
| involved in this project. One web-based tool for building a new thesaurus is |
| <a href="http://www.openthesaurus.de">OpenThesaurus</a>, which is already used successfully |
| to maintain the German, Polish, and other thesauri. All you need is some knowledge of |
| MySQL and Java-enabled server space to run your own instance of OpenThesaurus.</p> |
| |
| <hr /> |
| |
| <br /> |
| </body> |
| </html> |