<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<link rel="stylesheet" type="text/css" media="screen,print" href="joshua.css" />
<title>Joshua Decoder | An open-source statistical machine translation decoder</title>
</head>
<body>
<div id="navbar">
<img src="images/joshua-logo-small.png" width="130"
alt="Joshua logo (picture of a Joshua tree)" />
<p class="infobox">
<b>Stable version</b><br />
5.0<br/><br/>
<b>Release date</b><br />
7 June 2013
</p>
<div id="download" class="infobox">
<a href="https://github.com/joshua-decoder/joshua/archive/v5.0.tar.gz">Download</a>
</div>
<div class="infobox">
<b>Documentation</b><br />
<ul>
<li> <a href="5.0/index.html">End user</a> </li>
<li> <a href="http://cs.jhu.edu/~post/joshua-docs/index.html">Developer</a> </li>
</ul>
</div>
<div class="infobox">
<b>Links</b><br />
<ul>
<li> <a href="#lists">Mailing lists</a> </li>
<li> <a href="#contributors">Contributors</a> </li>
<li> <a href="#acks">Support</a> </li>
</ul>
</div>
<div class="infobox">
<b>Resources</b><br />
<ul>
<li> <a href="indian-parallel-corpora/">Indian Parallel Corpora</a> </li>
</ul>
</div>
</div>
<div id="main">
<div id="title">
<h1>Joshua Decoder</h1>
</div>
<div id="content">
<p>
Joshua is an open-source statistical machine translation decoder for
hierarchical and syntax-based machine translation, written in Java.
It is developed by <a href="http://www.cs.jhu.edu/~ccb/">Chris
Callison-Burch</a>'s research group at
the <a href="http://clsp.jhu.edu">Center for Language and Speech
Processing</a> and the <a href="http://web.jhu.edu/hltcoe">Human
Language Technology Center of Excellence</a> at Johns Hopkins
University.
</p>
<p>
<a href="https://github.com/joshua-decoder/joshua/blob/5.5.0HANGELOG">Click here</a> for a list of changes from the previous version (4.0). Some of these features are described in more detail in the following paper:
</p>
<blockquote>
<a href="http://aclweb.org/anthology-new/W/W12/W12-3134.pdf">Joshua 4.0: Packing, PRO, and
Paraphrases</a>
<a class="bibtex" href="http://aclweb.org/anthology-new/W/W12/W12-3134.bib">BibTeX</a>
<br />
Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post, and Chris Callison-Burch<br />
<a href="http://statmt.org/wmt12/">WMT 2012</a><br/>
</blockquote>
You might also be interested in older papers describing Joshua
<a href="http://aclweb.org/anthology/W/W09/W09-0424.pdf">1.0</a>,
<a href="http://aclweb.org/anthology-new/P/P09/P09-4007.pdf">the Joshua demo paper</a>,
<a href="http://aclweb.org/anthology-new/W/W10/W10-1718.pdf">2.0</a> and
<a href="http://aclweb.org/anthology-new/W/W11/W11-2160.pdf">3.0</a>.
<a name="download"/>
<h3>Download</h3>
<p>
To download, click the big green download button to the left. This gives you a
tarball of our latest release, version 5.0.
</p>
<a name="usage"/>
<h3>Usage</h3>
<p>
Check out our <a href="5.0/index.html">large collection of user documentation</a> for
information on downloading and setup, a quick start guide, detailed information on
different Joshua components, common problems, and more.
</p>
<p>
The easiest way to use Joshua is to use
<a href="https://github.com/joshua-decoder/joshua/wiki/Joshua-Pipeline">the
pipeline script</a> which is included with the source code. This
script supports multiple use cases; see the documentation for more
information.
For more detail, Chris Callison-Burch
describes <a href="http://www.cs.jhu.edu/~ccb/joshua/">how to manually
run the steps of the machine translation pipeline</a>.
(This information is a bit outdated, and much of it is now
automated by the pipeline, but the broad steps still apply.)
</p>
<p>
If you run into difficulty, feel free to email
the <a href="https://groups.google.com/forum/#!forum/joshua_support">Joshua
Technical Support Group</a>, or search its archives.
</p>
<a name="lists" />
<h3>Mailing lists</h3>
<ul>
<li>Technical support (for end users with questions and technical
difficulties): <a href="mailto:joshua_support@googlegroups.com">joshua_support@googlegroups.com</a>
</li>
<li>Developers: <a href="mailto:joshua_developers@googlegroups.com">joshua_developers@googlegroups.com</a>
</li>
</ul>
<a name="contributors" />
<h3>Contributors</h3>
<p>
Joshua was originally ported
from <a href="http://www.isi.edu/~chiang/">David Chiang's</a>
Python implementation of Hiero
by <a href="http://www.cs.jhu.edu/~zfli/">Zhifei Li</a>.
Since then, there have been a number of contributors to the
project (listed here in alphabetical order).
<ul>
<li><a href="http://www.cs.cmu.edu/~aphillips/">Aaron Phillips</a></li>
<li><a href="http://www.eecs.berkeley.edu/~adpauls/">Adam Pauls</a></li>
<li><a href="http://cs.jhu.edu/~ccb/">Chris Callison-Burch</a></li>
<li><a href="http://www.cs.cmu.edu/~cdyer/">Chris Dyer</a></li>
<li><a href="http://comp.ling.utexas.edu/~bannard/">Colin Bannard</a></li>
<li><a href="http://cs.jhu.edu/~jonny/">Jonathan Weese</a></li>
<li>Josh Schroeder</li>
<li><a href="http://cs.jhu.edu/~juri/">Juri Ganitkevitch</a></li>
<li><a href="http://kheafield.com/">Kenneth Heafield</a></li>
<li><a href="http://dowobeha.github.com">Lane Schwartz</a></li>
<li><a href="https://plus.google.com/107547504293849335564">Luke</a> <a href="https://github.com/lukeorland">Orland</a></li>
<li><a href="http://cs.jhu.edu/~post/">Matt Post</a></li>
<li><a href="http://cs.jhu.edu/~ozaidan/">Omar F. Zaidan</a></li>
<li>Wren Ng Thornton</li>
<li>Yuan Cao</li>
</ul>
Please email if you know of anyone who has been left off this list.
</p>
<a name="others" />
<h3>Other decoders</h3>
<p>
Many other open-source decoders are publicly available. Some of them are:
</p>
<ul>
<li>
<a href="http://cdec-decoder.org/index.php?title=Main_Page">cdec</a>, an SCFG decoder</li>
<li>
<a href="">Moses</a>, the de facto phrase-based translation system, which also supports
hierarchical decoding</li>
<li>
<a href="http://www-i6.informatik.rwth-aachen.de/jane/">Jane</a>, a hierarchical phrase-based system</li>
</ul>
<a name="acks" />
<h3>Acknowledgments</h3>
<div>
We gratefully acknowledge support provided by the following institutions.
<div>
<img class="sponsor" alt="DARPA logo" src="images/sponsors/darpa-logo.jpg" />
</div>
<div>
<img class="sponsor" alt="NSF logo" src="images/sponsors/NSF-logo.jpg" />
</div>
<div>
<a href="http://www.euromatrixplus.org/">
<img class="sponsor" alt="EuroMatrix+" src="images/sponsors/euromatrix.png" />
</a>
</div>
<div>
<img class="sponsor" alt="Human Language Technology Center of Excellence logo" src="images/sponsors/hltcoe-logo3.png" />
</div>
</div>
</div>
</div>
<div id="twitterbar">
<b>Twitter feed</b>:
<script charset="utf-8" src="http://widgets.twimg.com/j/2/widget.js"></script>
<script>
new TWTR.Widget({
version: 2,
type: 'profile',
rpp: 4,
interval: 30000,
width: 250,
height: 300,
theme: {
shell: {
background: '#ffffff',
color: '#000000'
},
tweets: {
background: '#ffffff',
color: '#000000',
links: '#072deb'
}
},
features: {
scrollbar: true,
loop: false,
live: true,
behavior: 'all'
}
}).render().setUser('joshuadecoder').start();
</script>
</div>
<!-- Start of StatCounter Code for Default Guide -->
<script type="text/javascript">
var sc_project=8264132;
var sc_invisible=1;
var sc_security="4b97fe2d";
</script>
<script type="text/javascript"
src="http://www.statcounter.com/counter/counter.js"></script>
<noscript>
<div class="statcounter">
<a title="hit counter joomla"
href="http://statcounter.com/joomla/"
target="_blank">
<img class="statcounter"
src="http://c.statcounter.com/8264132/0/4b97fe2d/1/"
alt="hit counter joomla" />
</a>
</div>
</noscript>
<!-- End of StatCounter Code for Default Guide -->
</body>
</html>