example/bayesian-methods/README.md - mxnet - Git at Google

 <!--- Licensed to the Apache Software Foundation (ASF) under one -->
 <!--- or more contributor license agreements.  See the NOTICE file -->
 <!--- distributed with this work for additional information -->
 <!--- regarding copyright ownership.  The ASF licenses this file -->
 <!--- to you under the Apache License, Version 2.0 (the -->
 <!--- "License"); you may not use this file except in compliance -->
 <!--- with the License.  You may obtain a copy of the License at -->

 <!---   http://www.apache.org/licenses/LICENSE-2.0 -->

 <!--- Unless required by applicable law or agreed to in writing, -->
 <!--- software distributed under the License is distributed on an -->
 <!--- "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY -->
 <!--- KIND, either express or implied.  See the License for the -->
 <!--- specific language governing permissions and limitations -->
 <!--- under the License. -->

 Bayesian Methods
 ================

 This folder contains examples related to Bayesian Methods.

 We currently have *Stochastic Gradient Langevin Dynamics (SGLD)* [<cite>(Welling and Teh, 2011)</cite>](http://www.icml-2011.org/papers/398_icmlpaper.pdf)
 and *Bayesian Dark Knowledge (BDK)* [<cite>(Balan, Rathod, Murphy and Welling, 2015)</cite>](http://papers.nips.cc/paper/5965-bayesian-dark-knowledge).

 **sgld.ipynb** shows how to use MXNet to repeat the toy experiment in the original SGLD paper.

 **bdk.ipynb** shows how to use MXNet to implement the DistilledSGLD algorithm in Bayesian Dark Knowledge.

 **bdk_demo.py** contains scripts (more than the notebook) related to Bayesian Dark Knowledge. Use `python bdk_demo.py -d 1 -l 2 -t 50000` to run classification on MNIST.

 View parameters we can use with the following command.

 ```shell
 python bdk_demo.py -h


 usage: bdk_demo.py [-h] [-d DATASET] [-l ALGORITHM] [-t TRAINING] [--gpu GPU]

 Examples in the paper [NIPS2015]Bayesian Dark Knowledge and [ICML2011]Bayesian
 Learning via Stochastic Gradient Langevin Dynamics

 optional arguments:
   -h, --help            show this help message and exit
   -d DATASET, --dataset DATASET
                         Dataset to use. 0 --> TOY, 1 --> MNIST, 2 -->
                         Synthetic Data in the SGLD paper
   -l ALGORITHM, --algorithm ALGORITHM
                         Type of algorithm to use. 0 --> SGD, 1 --> SGLD,
                         other-->DistilledSGLD
   -t TRAINING, --training TRAINING
                         Number of training samples
   --gpu GPU             0 to use GPU, not set to use CPU
 ```
	<!--- Licensed to the Apache Software Foundation (ASF) under one -->
	<!--- or more contributor license agreements. See the NOTICE file -->
	<!--- distributed with this work for additional information -->
	<!--- regarding copyright ownership. The ASF licenses this file -->
	<!--- to you under the Apache License, Version 2.0 (the -->
	<!--- "License"); you may not use this file except in compliance -->
	<!--- with the License. You may obtain a copy of the License at -->

	<!--- http://www.apache.org/licenses/LICENSE-2.0 -->

	<!--- Unless required by applicable law or agreed to in writing, -->
	<!--- software distributed under the License is distributed on an -->
	<!--- "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY -->
	<!--- KIND, either express or implied. See the License for the -->
	<!--- specific language governing permissions and limitations -->
	<!--- under the License. -->

	Bayesian Methods
	================

	This folder contains examples related to Bayesian Methods.

	We currently have Stochastic Gradient Langevin Dynamics (SGLD) [<cite>(Welling and Teh, 2011)</cite>](http://www.icml-2011.org/papers/398_icmlpaper.pdf)
	and Bayesian Dark Knowledge (BDK) [<cite>(Balan, Rathod, Murphy and Welling, 2015)</cite>](http://papers.nips.cc/paper/5965-bayesian-dark-knowledge).

	sgld.ipynb shows how to use MXNet to repeat the toy experiment in the original SGLD paper.

	bdk.ipynb shows how to use MXNet to implement the DistilledSGLD algorithm in Bayesian Dark Knowledge.

	bdk_demo.py contains scripts (more than the notebook) related to Bayesian Dark Knowledge. Use `python bdk_demo.py -d 1 -l 2 -t 50000` to run classification on MNIST.

	View parameters we can use with the following command.

	```shell
	python bdk_demo.py -h


	usage: bdk_demo.py [-h] [-d DATASET] [-l ALGORITHM] [-t TRAINING] [--gpu GPU]

	Examples in the paper [NIPS2015]Bayesian Dark Knowledge and [ICML2011]Bayesian
	Learning via Stochastic Gradient Langevin Dynamics

	optional arguments:
	-h, --help show this help message and exit
	-d DATASET, --dataset DATASET
	Dataset to use. 0 --> TOY, 1 --> MNIST, 2 -->
	Synthetic Data in the SGLD paper
	-l ALGORITHM, --algorithm ALGORITHM
	Type of algorithm to use. 0 --> SGD, 1 --> SGLD,
	other-->DistilledSGLD
	-t TRAINING, --training TRAINING
	Number of training samples
	--gpu GPU 0 to use GPU, not set to use CPU
	```