Harshavardhan Kamarthi
Presented at Data Seminar 10 Dec 2021
CNN for cell classification [Oei+ Plos One 2019]
Cancer detection [Zhang+ Nature 2019]
Predicting DNA Specificities [Alipanahi+ Nature 2019]
Training
1. Levergaes evolutionarily related sequences from Protein Data bank and perform multiple sequence alignment
2. Pass these set of sequences through transformer based architecture (Evoformer) and directly predict the 3D sequence
1. Transformer Architecture learns interpretable intermediate folding stages, especially for complex proteins.
2. Intermediate Loss functions: Local protein structure from data bank, full sequence using torsion angles.
2. Achieves SOTA performance