Estimating effects of mutations to SARS-CoV-2 proteins from natural sequences

Jesse Bloom & Richard Neher

Fred Hutch Cancer Center / HHMI

Slides: https://slides.com/jbloom/sars2-mut-fitness

@jbloom_lab

Estimating effects of mutations to all SARS-CoV-2 proteins from actual versus expected mutation counts in natural sequences

Determining effects of viral mutations is important

The traditional way to determine the effects of mutations is through experiments

My group tries to do such experiments at large scale via deep mutational scanning

Limitations of using experiments to understand mutation effects

Nature is "testing" effects of viral mutations in humans all the time

The average neutral single-nucleotide mutation has occurred ~15,000 independent times in human-transmitted SARS-CoV-2

We can use publicly available human SARS-CoV-2 sequences to "read out" effects of viral mutations on human transmission

First calculate how often each mutation is expected to be observed without selection by analyzing 4-fold degenerate sites
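
At a 4-fold degenerate site every nucleotide change is synonymous, so counts there can serve as a baseline for what is seen without selection on the protein. The following is a minimal sketch of that idea, not the authors' pipeline: it averages the observed counts for each mutation type (such as C to T) over the 4-fold degenerate sites that carry the parent nucleotide. The function name, the input layout, and the toy numbers are all hypothetical.

```python
from collections import Counter


def expected_counts_by_type(fourfold_sites, observed_counts):
    """Mean number of occurrences per site for each mutation type.

    fourfold_sites: dict mapping site -> reference nucleotide (hypothetical layout)
    observed_counts: dict mapping (site, ref, alt) -> independent occurrences
    """
    # Number of 4-fold degenerate sites carrying each reference nucleotide
    n_sites = Counter(fourfold_sites.values())

    # Total occurrences of each (ref, alt) type, restricted to those sites
    totals = Counter()
    for (site, ref, alt), n in observed_counts.items():
        if fourfold_sites.get(site) == ref:
            totals[(ref, alt)] += n

    # Average per site; mutation types never observed get an expectation of 0
    return {
        (ref, alt): totals[(ref, alt)] / n_sites[ref]
        for ref in n_sites
        for alt in "ACGT"
        if alt != ref
    }


# Toy data, invented for illustration only
fourfold_sites = {10: "C", 20: "C", 30: "A"}
observed_counts = {
    (10, "C", "T"): 400,
    (20, "C", "T"): 600,
    (10, "C", "A"): 20,
    (30, "A", "G"): 50,
    (99, "C", "T"): 5,  # not a 4-fold degenerate site, so ignored
}
expected = expected_counts_by_type(fourfold_sites, observed_counts)
```

Under this sketch, the expected count for any C to T mutation elsewhere in the genome would be the C to T average from the 4-fold degenerate sites. Grouping by mutation type matters because different nucleotide changes arise at very different rates.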

We count unique occurrences of each mutation, not the number of sequences with the mutation
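
The difference between the two ways of counting can be illustrated with a toy example. The sketch below assumes a hypothetical representation in which each tree branch lists the mutations that arose on it and the number of sampled sequences descending from it. The branch records and mutation labels are invented for illustration.

```python
from collections import Counter

# Hypothetical toy tree: one record per branch
branches = [
    {"mutations": ["C100T"], "n_descendants": 1000},
    {"mutations": ["G200T"], "n_descendants": 1},
    {"mutations": ["G200T", "A300G"], "n_descendants": 3},
    {"mutations": ["G200T"], "n_descendants": 2},
]


def count_occurrences(branches):
    """Count how many separate branches each mutation arose on."""
    counts = Counter()
    for branch in branches:
        counts.update(branch["mutations"])
    return counts


def count_sequences(branches):
    """Count how many sampled sequences descend from a branch with each mutation."""
    counts = Counter()
    for branch in branches:
        for mutation in branch["mutations"]:
            counts[mutation] += branch["n_descendants"]
    return counts


occurrences = count_occurrences(branches)
sequences = count_sequences(branches)
```

Here `C100T` is carried by 1,000 sequences but arose only once, while `G200T` is carried by just 6 sequences but arose three separate times. Counting occurrences keeps a single lucky lineage from dominating the estimate.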

Mutations expected to be observed ~8 to ~500 times in absence of selection

There are enough sequences to calculate effects on a per-mutation basis
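
One simple way to turn actual and expected counts into a per-mutation score is a log ratio with a small pseudocount. This is a sketch under stated assumptions: the function name and the pseudocount of 0.5 are choices made for this example, not values taken from the slides.

```python
import math


def mutation_effect(actual, expected, pseudocount=0.5):
    """Log ratio of actual to expected counts.

    Negative values mean the mutation is seen less often than expected
    without selection; the pseudocount (an assumed value) keeps the
    ratio finite when the actual count is zero.
    """
    return math.log((actual + pseudocount) / (expected + pseudocount))


neutral = mutation_effect(100, 100)        # seen as often as expected
absent_common = mutation_effect(0, 500)    # never seen, ~500 expected
absent_rare = mutation_effect(0, 8)        # never seen, ~8 expected
```

With this definition, a mutation that is never observed scores roughly -6.9 when about 500 occurrences were expected but only about -2.8 when about 8 were expected. The expected count therefore sets how strongly deleterious an effect can be resolved for each mutation.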

Distribution of effects of all mutations

We can see which genes are under strong purifying selection

Among accessory genes, ORF3a is under strongest selection against stop codons

We can also look in detail at mutation level

These maps can identify constrained sites

Estimated mutation effects are robust to sequence sampling location

Estimated mutation effects are robust to viral clade identity

Estimated mutation effects correlate well with deep mutational scanning

Maps of mutation effects for all viral proteins
