commit | 6c41083fd1f7b256a34dc1e591bb98a97e44479e | [log] [tgz] |
---|---|---|
author | Thierry Moreau <moreau@uw.edu> | Mon Jul 22 08:31:37 2019 -0700 |
committer | Tianqi Chen <tqchen@users.noreply.github.com> | Mon Jul 22 08:31:37 2019 -0700 |
tree | 5842d8bd7b8db7bcb9a69567b2a1d8a26645688a | |
parent | 8a27e115197f5961edde7054761e3503fa40e3fb [diff] |
[VTA] Runtime refactor to allow for non-shared memory FPGAs (e.g. F1) (#3554) * updated runtime to support non-shared memory FPGAs for instruction and micro-op kernels * adding driver-defined memcpy function to handle F1 cases * refactor to include flush/invalidate in memcpy driver function * update tsim driver * bug fixes * cleanup * pre-allocate fpga readable buffers to improve perf * fix * remove instruction stream address rewrite pass for micro op kernels * fix: * white spaces * fix lint * avoid signed/unsigned compilation warning * avoid signed/unsigned compilation warning * fix * fix * addressing comments * whitespace * moving flush/invalidate out of memmove * clearnup * fix * cosmetic * rename API * comment fix
VTA (versatile tensor accelerator) is an open-source deep learning accelerator complemented with an end-to-end TVM-based compiler stack.
The key features of VTA include:
Learn more about VTA here.