Biggest question: How does the cell recognize the Scan-RNA independant IES ?
2 independant hypothesis:
- Inter-Pulse Duration (IPD) ⇈ if n6mA
- Compared with Control or in-sillico control --> IpdRatio
- Each molecule --> analyzed independantly
- Each nucleotide --> Analyzed independantly
- Each strand --> analyzed independantly (then paired)
> 99% accuracy
~15% error
1:200
Random sampling
PacBio SMRT
Only a few remaining: ~ 10 to ~100 sequences
If 100% carry a methylation pattern, this is enough
Sorting
Deduced origin
MIC DNA
Alignment of consensus
MAC, MIC,
Mito...
Analysis
Report n6mA
5mC, 4mC
Re-alignment
Wild type:
Silencing of methylase candidates:
Found:
$$p= [ Se \cdot \pi + (1 - Sp) \ (1 - \pi) ] \cdot N$$
True positives
False positives
p : Number of D+
N: Number tested
$$\pi = \frac{\frac{p}{N} - 1 + Sp}{Se-1+Sp}$$
--> From imperfect detections, allows to estimate the fraction of nucleotides truly methylated
$$\pi$$ is the true proportion of n6mA
Mostly in AT sites (~90%)
"Lots" are symmetrical (~80%)
HTVEG
MT2
etc...
--> Some molecules carry all the detections, in sym-A*T
Very likely to be sequences comming from the MAC
In the MIC:
In the MAC: Hemimethylation
Thanks !
Output example
Enfin !
~95% of the methylation locates in AT dinucleotides in the MAC(*)
slightly lower in the MIC (5 to 5 points less)
True in any condition
75% of the methylation in an AT dinucleotide is actually symetrically modified, independantly from being in the MAC or the MIC(*)
Kept in MIC and MAC (All conditions)
(*) Linear equation / idQv20
Kept in the MAC for all experimental conditions
Impossible to tell in the MIC (not enough sequences)
In the MAC
42% GC
38% GC
~ 100% TA bounded
Small-RNA (~30% of IESs)
Output example