Modelling Irregular Architecture
FP 16, 32, 64 and vector registers
Improving Runtime Performance
Data Structure
Array
initialize Cost Vector
How does Constraints affects Cost vector?
What is reduce graph ?
reduce1, reduce2, reduceN
Data Structure
Singly Linked List for each Degree
Why to go for Separate Lists ?
Populate virtual registers in appropriate Degree List
Add Vector Constraints
While No DegreeList in Empty
Perform reduce1()
Perform reduce2()
Perform reduceN()
Propagate Solution
For each Node X in degree0 List
For each Node X in degree0 List
BHUSHAN SONAWANE
SIDDHARTH KUMAR
Internal guide: Prof. M. V. Kulkarni
External guide: Shekhar Divekar
Understanding Algorithm And NVIDIA Compiler Infrastructure
Modelling Vector Instructions
Modelling FP16 and 64 bit registers
Implementation of Basic Register Allocator for sm50
Perf Tuning
What Next