example/rnn/bucketing/README.md

RNN Example

This folder contains RNN examples that use the high-level mxnet.rnn interface.
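For readers new to bucketing, the idea can be sketched in plain Python: sentences are grouped by length into a small set of buckets and padded up to the bucket size, so every sequence in a bucket has the same length. This is an illustration only, not the mxnet.rnn implementation; the helper name `assign_to_buckets`, the bucket sizes, and the padding value are assumptions made for the example.

```python
from bisect import bisect_left


def assign_to_buckets(sentences, buckets, pad=0):
    """Group token-id sequences by length bucket, padding each to its bucket size.

    Hypothetical helper for illustration. Sentences longer than the largest
    bucket are dropped.
    """
    buckets = sorted(buckets)
    grouped = {size: [] for size in buckets}
    for sent in sentences:
        # Index of the smallest bucket that can hold the whole sentence.
        idx = bisect_left(buckets, len(sent))
        if idx == len(buckets):
            continue  # longer than every bucket
        size = buckets[idx]
        grouped[size].append(list(sent) + [pad] * (size - len(sent)))
    return grouped


# Made-up token ids; the last sentence is too long for any bucket.
sentences = [[1, 2], [3, 4, 5], [6], [7, 8, 9, 10, 11], [1] * 12]
grouped = assign_to_buckets(sentences, buckets=[3, 5])
```

Here the three short sentences land in the size-3 bucket, the five-token sentence in the size-5 bucket, and the twelve-token sentence is discarded.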

Gluon Implementation

An improved Gluon implementation is available in gluon-nlp; its largest LSTM model reaches a perplexity of 65.62.
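For context, perplexity is the exponential of the average per-token negative log-likelihood, so lower values are better. A minimal sketch with made-up probabilities follows; the function name and the inputs are illustrative and are not taken from gluon-nlp.

```python
import math


def perplexity(token_log_probs):
    """Return exp of the mean negative log-likelihood (natural log) per token."""
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Assumed values: a model that assigns probability 0.25 to each of four tokens.
log_probs = [math.log(0.25)] * 4
ppl = perplexity(log_probs)
```

A model that gives every token probability 0.25 has a perplexity of about 4, meaning it is as uncertain as a uniform choice among four tokens.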

Data

Review the license for the Sherlock Holmes dataset and ensure that you agree to it. Then uncomment the lines in the ‘get_sherlockholmes_data.sh’ script that download the dataset.
Run get_sherlockholmes_data.sh to download the Sherlock Holmes data.

Python

Train the Sherlock Holmes language model with an LSTM:
For Python2 (CPU support): can take 2+ hours on AWS-EC2-p2.16xlarge
```
$ python lstm_bucketing.py
```
For Python3 (CPU support): can take 2+ hours on AWS-EC2-p2.16xlarge
```
$ python3 lstm_bucketing.py
```
If your machine has 4 GPUs and you want to use all of them:
For Python2 (GPU support only): can take 50+ minutes on AWS-EC2-p2.16xlarge
```
$ python cudnn_rnn_bucketing.py --gpus 0,1,2,3
```
For Python3 (GPU support only): can take 50+ minutes on AWS-EC2-p2.16xlarge
```
$ python3 cudnn_rnn_bucketing.py --gpus 0,1,2,3
```

Performance Note:

Increasing MXNET_GPU_WORKER_NTHREADS may improve performance. For how to set it, refer to the Environment Variables documentation.
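One way to apply the setting is to place the variable in the environment of the training process before it starts. The sketch below builds that environment and the command line for the 4-GPU run above; the helper name `training_command` and the thread count of 4 are assumptions chosen for illustration, not recommended values.

```python
import os
import sys


def training_command(gpus, worker_threads, python=sys.executable):
    """Build the environment and argv for a multi-GPU training run.

    Hypothetical helper; worker_threads is an illustrative value only.
    """
    env = dict(os.environ)
    # Set the variable in the child's environment so it is in place
    # before MXNet initializes in that process.
    env["MXNET_GPU_WORKER_NTHREADS"] = str(worker_threads)
    gpu_list = ",".join(str(g) for g in gpus)
    argv = [python, "cudnn_rnn_bucketing.py", "--gpus", gpu_list]
    return env, argv


env, argv = training_command(gpus=[0, 1, 2, 3], worker_threads=4)
# To launch the run: subprocess.run(argv, env=env, check=True)
```

Timing a short run at a few different thread counts is a reasonable way to check whether a higher value actually helps on a given machine.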