docs/api/python/model.md - mxnet - Git at Google

 <!--- Licensed to the Apache Software Foundation (ASF) under one -->
 <!--- or more contributor license agreements.  See the NOTICE file -->
 <!--- distributed with this work for additional information -->
 <!--- regarding copyright ownership.  The ASF licenses this file -->
 <!--- to you under the Apache License, Version 2.0 (the -->
 <!--- "License"); you may not use this file except in compliance -->
 <!--- with the License.  You may obtain a copy of the License at -->

 <!---   http://www.apache.org/licenses/LICENSE-2.0 -->

 <!--- Unless required by applicable law or agreed to in writing, -->
 <!--- software distributed under the License is distributed on an -->
 <!--- "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY -->
 <!--- KIND, either express or implied.  See the License for the -->
 <!--- specific language governing permissions and limitations -->
 <!--- under the License. -->

 # Model API

 The model API provides a simplified way to train neural networks using common best practices.
 It's a thin wrapper built on top of the [ndarray](../python/ndarray/ndarray.md) and [symbolic](../python/symbol/symbol.md)
 modules that make neural network training easy.

 Topics:

 * [Train a Model](#train-a-model)
 * [Save the Model](#save-the-model)
 * [Periodic Checkpoint](#periodic-checkpointing)
 * [Initializer API Reference](#initializer-api-reference)
 * [Evaluation Metric API Reference](#evaluation-metric-api-reference)
 * [Optimizer API Reference](#optimizer-api-reference)
 * [Model API Reference](#model-api-reference)

 ## Train the Model

 To train a model, perform two steps: configure the model using the symbol parameter,
 then call ```model.Feedforward.create``` to create the model.
 The following example creates a two-layer neural network.

 ```python
     # configure a two layer neuralnetwork
     data = mx.symbol.Variable('data')
     fc1 = mx.symbol.FullyConnected(data, name='fc1', num_hidden=128)
     act1 = mx.symbol.Activation(fc1, name='relu1', act_type='relu')
     fc2 = mx.symbol.FullyConnected(act1, name='fc2', num_hidden=64)
     softmax = mx.symbol.SoftmaxOutput(fc2, name='sm')
     # create a model
     model = mx.model.FeedForward.create(
          softmax,
          X=data_set,
          num_epoch=num_epoch,
          learning_rate=0.01)
 ```
 You can also use the `scikit-learn-style` construct and `fit` function to create a model.

 ```python
     # create a model using sklearn-style two-step way
     model = mx.model.FeedForward(
          softmax,
          num_epoch=num_epoch,
          learning_rate=0.01)

     model.fit(X=data_set)
 ```
 For more information, see [Model API Reference](#model-api-reference).

 ## Save the Model

 After the job is done, save your work.
 To save the model, you can directly pickle it with Python.
 We also provide `save` and `load` functions.

 ```python
     # save a model to mymodel-symbol.json and mymodel-0100.params
     prefix = 'mymodel'
     iteration = 100
     model.save(prefix, iteration)

     # load model back
     model_loaded = mx.model.FeedForward.load(prefix, iteration)
 ```
 The advantage of these two `save` and `load` functions are that they are language agnostic.
 You should be able to save and load directly into cloud storage, such as Amazon S3 and HDFS.

 ##  Periodic Checkpointing

 We recommend checkpointing your model after each iteration.
 To do this, add a checkpoint callback ```do_checkpoint(path)``` to the function.
 The training process automatically checkpoints the specified location after
 each iteration.

 ```python
     prefix='models/chkpt'
     model = mx.model.FeedForward.create(
          softmax,
          X=data_set,
          iter_end_callback=mx.callback.do_checkpoint(prefix),
          ...)
 ```
 You can load the model checkpoint later using ```Feedforward.load```.

 ## Use Multiple Devices

 Set ```ctx``` to the list of devices that you want to train on.

 ```python
     devices = [mx.gpu(i) for i in range(num_device)]
     model = mx.model.FeedForward.create(
          softmax,
          X=dataset,
          ctx=devices,
          ...)
 ```
 Training occurs in parallel on the GPUs that you specify.

 ```eval_rst
     .. raw:: html

         <script type="text/javascript" src='../../../_static/js/auto_module_index.js'></script>
 ```


 ## Initializer API Reference


 ```eval_rst
     .. automodule:: mxnet.initializer
         :members:

     .. raw:: html

         <script>auto_index("initializer-api-reference");</script>
 ```

 ## Evaluation Metric API Reference


 ```eval_rst
     .. automodule:: mxnet.metric
         :members:

     .. raw:: html

         <script>auto_index("evaluation-metric-api-reference");</script>
 ```

 ## Optimizer API Reference


 ```eval_rst
     .. automodule:: mxnet.optimizer
         :members:

     .. raw:: html

         <script>auto_index("optimizer-api-reference");</script>
 ```

 ## Model API Reference


 ```eval_rst
     .. automodule:: mxnet.model
         :members:

     .. raw:: html

         <script>auto_index("model-api-reference");</script>
 ```

 ## Next Steps
 * See [Symbolic API](../python/symbol/symbol.md) for operations on NDArrays that assemble neural networks from layers.
 * See [IO Data Loading API](../python/io/io.md) for parsing and loading data.
 * See [NDArray API](../python/ndarray/ndarray.md) for vector/matrix/tensor operations.
 * See [KVStore API](../python/kvstore/kvstore.md) for multi-GPU and multi-host distributed training.
	<!--- Licensed to the Apache Software Foundation (ASF) under one -->
	<!--- or more contributor license agreements. See the NOTICE file -->
	<!--- distributed with this work for additional information -->
	<!--- regarding copyright ownership. The ASF licenses this file -->
	<!--- to you under the Apache License, Version 2.0 (the -->
	<!--- "License"); you may not use this file except in compliance -->
	<!--- with the License. You may obtain a copy of the License at -->

	<!--- http://www.apache.org/licenses/LICENSE-2.0 -->

	<!--- Unless required by applicable law or agreed to in writing, -->
	<!--- software distributed under the License is distributed on an -->
	<!--- "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY -->
	<!--- KIND, either express or implied. See the License for the -->
	<!--- specific language governing permissions and limitations -->
	<!--- under the License. -->

	# Model API

	The model API provides a simplified way to train neural networks using common best practices.
	It's a thin wrapper built on top of the [ndarray](../python/ndarray/ndarray.md) and [symbolic](../python/symbol/symbol.md)
	modules that make neural network training easy.

	Topics:

	* [Train a Model](#train-a-model)
	* [Save the Model](#save-the-model)
	* [Periodic Checkpoint](#periodic-checkpointing)
	* [Initializer API Reference](#initializer-api-reference)
	* [Evaluation Metric API Reference](#evaluation-metric-api-reference)
	* [Optimizer API Reference](#optimizer-api-reference)
	* [Model API Reference](#model-api-reference)

	## Train the Model

	To train a model, perform two steps: configure the model using the symbol parameter,
	then call ```model.Feedforward.create``` to create the model.
	The following example creates a two-layer neural network.

	```python
	# configure a two layer neuralnetwork
	data = mx.symbol.Variable('data')
	fc1 = mx.symbol.FullyConnected(data, name='fc1', num_hidden=128)
	act1 = mx.symbol.Activation(fc1, name='relu1', act_type='relu')
	fc2 = mx.symbol.FullyConnected(act1, name='fc2', num_hidden=64)
	softmax = mx.symbol.SoftmaxOutput(fc2, name='sm')
	# create a model
	model = mx.model.FeedForward.create(
	softmax,
	X=data_set,
	num_epoch=num_epoch,
	learning_rate=0.01)
	```
	You can also use the `scikit-learn-style` construct and `fit` function to create a model.

	```python
	# create a model using sklearn-style two-step way
	model = mx.model.FeedForward(
	softmax,
	num_epoch=num_epoch,
	learning_rate=0.01)

	model.fit(X=data_set)
	```
	For more information, see [Model API Reference](#model-api-reference).

	## Save the Model

	After the job is done, save your work.
	To save the model, you can directly pickle it with Python.
	We also provide `save` and `load` functions.

	```python
	# save a model to mymodel-symbol.json and mymodel-0100.params
	prefix = 'mymodel'
	iteration = 100
	model.save(prefix, iteration)

	# load model back
	model_loaded = mx.model.FeedForward.load(prefix, iteration)
	```
	The advantage of these two `save` and `load` functions are that they are language agnostic.
	You should be able to save and load directly into cloud storage, such as Amazon S3 and HDFS.

	## Periodic Checkpointing

	We recommend checkpointing your model after each iteration.
	To do this, add a checkpoint callback ```do_checkpoint(path)``` to the function.
	The training process automatically checkpoints the specified location after
	each iteration.

	```python
	prefix='models/chkpt'
	model = mx.model.FeedForward.create(
	softmax,
	X=data_set,
	iter_end_callback=mx.callback.do_checkpoint(prefix),
	...)
	```
	You can load the model checkpoint later using ```Feedforward.load```.

	## Use Multiple Devices

	Set ```ctx``` to the list of devices that you want to train on.

	```python
	devices = [mx.gpu(i) for i in range(num_device)]
	model = mx.model.FeedForward.create(
	softmax,
	X=dataset,
	ctx=devices,
	...)
	```
	Training occurs in parallel on the GPUs that you specify.

	```eval_rst
	.. raw:: html

	<script type="text/javascript" src='../../../_static/js/auto_module_index.js'></script>
	```


	## Initializer API Reference


	```eval_rst
	.. automodule:: mxnet.initializer
	:members:

	.. raw:: html

	<script>auto_index("initializer-api-reference");</script>
	```

	## Evaluation Metric API Reference


	```eval_rst
	.. automodule:: mxnet.metric
	:members:

	.. raw:: html

	<script>auto_index("evaluation-metric-api-reference");</script>
	```

	## Optimizer API Reference


	```eval_rst
	.. automodule:: mxnet.optimizer
	:members:

	.. raw:: html

	<script>auto_index("optimizer-api-reference");</script>
	```

	## Model API Reference


	```eval_rst
	.. automodule:: mxnet.model
	:members:

	.. raw:: html

	<script>auto_index("model-api-reference");</script>
	```

	## Next Steps
	* See [Symbolic API](../python/symbol/symbol.md) for operations on NDArrays that assemble neural networks from layers.
	* See [IO Data Loading API](../python/io/io.md) for parsing and loading data.
	* See [NDArray API](../python/ndarray/ndarray.md) for vector/matrix/tensor operations.
	* See [KVStore API](../python/kvstore/kvstore.md) for multi-GPU and multi-host distributed training.