TVM Stack

  • Once a custom codegen is registered with the TVM backend, any supported subgraph (a group of operations) is offloaded to the accelerator.
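The offloading rule above can be sketched as a toy registry. This is not TVM's API: `register_codegen`, `assign_targets`, the target name `my_accel`, and the operator names are all hypothetical, chosen only to show that registering a codegen is what makes its supported operations eligible for the accelerator, with everything else left on a fallback target.

```python
# Toy model of "register a codegen, then offload what it supports".
# Every name here is hypothetical; this is not TVM's registration API.

SUPPORTED_BY_TARGET = {}  # target name -> set of operator names it can run


def register_codegen(target, supported_ops):
    """Record which operators a custom codegen claims to support."""
    SUPPORTED_BY_TARGET[target] = frozenset(supported_ops)


def assign_targets(ops, target, fallback="cpu"):
    """Pair each operator with the accelerator if supported, else the fallback."""
    supported = SUPPORTED_BY_TARGET.get(target, frozenset())
    return [(op, target if op in supported else fallback) for op in ops]


register_codegen("my_accel", ["conv2d", "relu"])
placement = assign_targets(["conv2d", "relu", "softmax"], "my_accel")
print(placement)
```

An unregistered target has an empty supported set in this sketch, so every operation stays on the fallback.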

Bring Your Own Codegen

  • The Relay graph is partitioned into subgraphs of one or more operations according to predefined patterns.
  • The compiler emits instructions according to the accelerator's ISA and tags the operations that will run on the accelerator.
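The two steps above can be illustrated with a minimal sketch that treats the graph as a linear sequence of operator names. This is a simplification under stated assumptions, not how Relay represents or partitions graphs: the `PATTERNS` table, the `partition` helper, and the `accel`/`host` tags are hypothetical, and instruction emission for the accelerator's ISA is left out.

```python
# Toy pattern-based partitioning over a linear list of operator names.
# PATTERNS, partition(), and the "accel"/"host" tags are hypothetical.

PATTERNS = {
    "accel.conv_bias_relu": ("conv2d", "bias_add", "relu"),
    "accel.dense": ("dense",),
}


def partition(ops, patterns):
    """Greedily match patterns (longest first) and tag each region."""
    ordered = sorted(patterns.items(), key=lambda kv: -len(kv[1]))
    regions = []
    i = 0
    while i < len(ops):
        for name, pattern in ordered:
            if tuple(ops[i:i + len(pattern)]) == pattern:
                # Matched a predefined pattern: tag the region for the accelerator.
                regions.append({"target": "accel", "pattern": name, "ops": list(pattern)})
                i += len(pattern)
                break
        else:
            # No pattern starts here: the operation stays on the host.
            regions.append({"target": "host", "pattern": None, "ops": [ops[i]]})
            i += 1
    return regions


graph = ["conv2d", "bias_add", "relu", "max_pool2d", "dense", "softmax"]
regions = partition(graph, PATTERNS)
for region in regions:
    print(region["target"], region["ops"])
```

Longer patterns are tried first so that a fused sequence such as convolution, bias, and ReLU is claimed as one region rather than split by a shorter match.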

By Vinod Ganesan