tree: 63014e58613596963b295647f2e6d1271e1327f3 [path history] [tgz]
  1. build_model.py
  2. bundle.cc
  3. demo.cc
  4. Makefile
  5. README.md
  6. runtime.cc
apps/bundle_deploy/README.md

How to Bundle TVM Modules

This folder contains an example on how to bundle a TVM module (with the required interpreter runtime modules such as runtime::GraphRuntime, the graph JSON, and the params) into a single, self-contained shared object (bundle.so) which exposes a C API wrapping the appropriate runtime::GraphRuntime instance.

This is useful for cases where we'd like to avoid deploying the TVM runtime components to the target host in advance - instead, we simply deploy the bundled shared-object to the host, which embeds both the model and the runtime components. The bundle should only depend on libc/libc++.

It also contains an example code (demo.cc) to load this shared object and invoke the packaged TVM model instance. This is a dependency-free binary that uses the functionality packaged in bundle.so (which means that bundle.so can be deployed lazily at runtime, instead of at compile time) to invoke TVM functionality.

Type the following command to run the sample code under the current folder, after building TVM first.

make demo

This will:

  • Download the mobilenet0.25 model from the MXNet Gluon Model Zoo
  • Compile the model with NNVM
  • Build a bundle.so shared object containing the model specification and parameters
  • Build a demo executable that dlopen's bundle.so, instantiates the contained graph runtime, and invokes the GraphRuntime::Run function on a random input, then prints the output tensor to stderr.