- Once a custom codegen is registered with the TVM backend, any supported subgraph (group of operations) is offloaded to the accelerator.
- The Relay graph is partitioned into subgraphs of one or more operations according to predefined patterns.
- The compiler emits instructions according to the accelerator's ISA and tags the offloaded operations to run on the accelerator.
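
The partition-and-tag step above can be illustrated with a toy sketch. This is not TVM code and calls no TVM API: the operator set `SUPPORTED_OPS`, the `partition` helper, and the flat list standing in for a Relay graph are all hypothetical, chosen only to show how supported operations get grouped into regions tagged for the accelerator while everything else stays on the host.

```python
from itertools import groupby

# Hypothetical set of operators the custom codegen claims to support.
SUPPORTED_OPS = {"conv2d", "relu", "add"}


def partition(ops, supported=SUPPORTED_OPS):
    """Split a linear op sequence into maximal runs, each tagged with a target.

    Consecutive supported ops form one "accelerator" region; consecutive
    unsupported ops form one "host" region.
    """
    regions = []
    for on_accel, run in groupby(ops, key=lambda op: op in supported):
        target = "accelerator" if on_accel else "host"
        regions.append((target, list(run)))
    return regions


# A flat list of operator names standing in for a real graph (assumption).
graph = ["conv2d", "relu", "softmax", "add", "argmax"]
for target, run in partition(graph):
    print(target, run)
```

A real graph is a DAG rather than a flat list, so an actual partitioner also has to respect data dependencies between regions; the sketch only shows the tagging idea.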
By Vinod Ganesan