19BIO201
Intelligence in Biological Systems - 3
Linear Neighbourhood Propagation method for predicting Long Non-Coding RNA-Protein Interactions
19BIO201
Intelligence in Biological Systems - 3
Aadharsh Aadhithya - CB.EN.U4AIE20001
Anirudh Edpuganti - CB.EN.U4AIE20005
Madhav Kishor - CB.EN.U4AIE20033
Onteddu Chaitanya Reddy - CB.EN.U4AIE20045
Pillalamarri Akshaya - CB.EN.U4AIE20049
Team-1
Non Coding DNA
Non Coding DNA
- Non coding regions in dna are regions which do not code for proteins
- only about 1% of DNA is estimated to be coding
- However, It is of importance as it contains sequences that can act as regulatory elements
Non Coding DNA
- Promoters
- Enhancers
- Silencers
- Insulators
Non Coding DNA
Other Regions Code for RNA's like tRNA's and rRNA's
Non Coding DNA
- Recent studies have proved that only one-fifth of transcription across the human genome is associated with protein-coding genes
- Expressed in Tissue Specific contexts
- long non-coding RNAs, which consist of more than 200 nucleotides, have gained wide attention because of their large number and their essential functions
Non Coding DNA
- lncRNAs play an important role in regulating many biological processes, such as transcription, splicing and gene expression
- Because of the high cost of experimental identification for lncRNA– protein interactions, a great number of computational methods have been developed.
Features
Features
Features
Interaction
Profile of l1
Interaction
Profile of p1
Expression Profiles
24 Types of Human Tissues
Expression Profile vector
Long Noncoding RNA Sequence Composition
Features for Proteins
- Interaction Profile
- Composition,Transition,Destruction Vectors
- Interaction Profile
- Expression Profile
- Sequence Composition
- Interaction Profile
- CTD
lncRNA
Proteins
Linear Neighbourhood Similarity
Linear Neighbourhood Similarity
- Local Neighbourhood of a manifold in feature space can have a linear approximation
- Hence, A datapoint can be a linear combination of its neighbouring points
Linear Neighbourhood Similarity
Reconstructive
Contribution of j
Linear Neighbourhood Similarity
Linear Neighbourhood Similarity
Linear Neighbourhood Similarity
Linear Neighbourhood Similarity
Linear Neighbourhood Similarity
Linear Neighbourhood Similarity
The number of nearest neighbors is a hyperparameter
In Matrix Representation,
In Matrix Representation,
To avoid over-fitting, penalty parameter can be used
Linear Neighbourhood Similarity
Label Propagation
Label Propagation
Label Propagation
If a protein Pi Is known to be Interacting
with lncRNA l1
Label Propagation
If a protein Pi Is known to be Interacting
with lncRNA l1
if l2 is Similar to l1
It is more likely that Pi interacts with l2 as well
Label Propagation
That depends on the Similarity Score
Label Propagation
That depends on the Similarity Score
How to use what we know, to infer about what we don't?
Label Propagation
How to use what we know, to infer about what we don't?
What do we know?
Protein pi interacts with
lncRNA l1
Label Propagation
How to use what we know, to infer about what we don't?
What can we Infer?
Label Propagation
How to use what we know, to infer about what we don't?
Protein pi interacts with
lncRNA l2
Select Information
About lncRNAs similar to
l1
Label Propagation
How to use what we know, to infer about what we don't?
Protein pi interacts with
lncRNA l2
Select Information
About lncRNA similar to
l1
If l2 is similar l1, more likely
it should interact with pi
Label Propagation
How to use what we know, to infer about what we don't?
Protein pi interacts with
lncRNA l2
Information About
Closeness of l1 to l2 is
incorporated
If l2 is similar l1, more likely
it should interact with pi
Label Propagation
How to use what we know, to infer about what we don't?
Protein pi interacts with
lncRNA l2
Information About
Closeness of l1 to l2 is
incorporated
If l2 is similar l1, more likely
it should interact with pi
However, Our Initial Knowledge that Pi intreacts with l1 is lost
Label Propagation
How to use what we know, to infer about what we don't?
Protein pi interacts with
lncRNA l2
If l2 is similar l1, more likely
it should interact with pi
However, Our Initial Knowledge that Pi intreacts with l1 is lost
We can Impart this knowledge by
Adding the Ground Truth
Label Propagation
How to use what we know, to infer about what we don't?
Protein pi interacts with
lncRNA l2
If l2 is similar l1, more likely
it should interact with pi
However, Our Initial Knowledge that Pi intreacts with l1 is lost
We can Impart this knowledge by
Adding the Ground Truth
Label Propagation
How to use what we know, to infer about what we don't?
What we Infer?
Protein pi interacts with
lncRNA l2
If l2 is similar l1, more likely
it should interact with pi
Can be used to select
between inference and ground
truth
Label Propagation
How to use what we know, to infer about what we don't?
What we Infer?
Protein pi interacts with
lncRNA l2
If l2 is similar l1, more likely
it should interact with pi
Label Propagation
How to use what we know, to infer about what we don't?
What we Infer?
Protein pi interacts with
lncRNA l2
If l2 is similar l1, more likely
it should interact with pi
Label Propagation
How to use what we know, to infer about what we don't?
What we Infer?
Protein pi interacts with
lncRNA l2
If l2 is similar l1, more likely
it should interact with pi
In Matrix Form
Label Propagation
How to use what we know, to infer about what we don't?
What do we know?
Protein pi interacts with
lncRNA l2
Select Information
About lncRNA similar to
l2
Label Propagation
How to use what we know, to infer about what we don't?
What do we know?
Protein pi interacts with
lncRNA l2
Label Propagation
How to use what we know, to infer about what we don't?
What do we know?
Protein pi interacts with
lncRNA l2
Label Propagation
If a protein Pi Is known to be Interacting
with lncRNA l1
if l2 is Similar to l1
It is more likely that Pi interacts with l2 as well
Thank you sir!
References
Text
[1] Zhang, Wen; Qu, Qianlong; Zhang, Yunqiu; Wang, Wei (2017).
The linear neighborhood propagation method for
predicting long non-coding RNA–protein interactions. Neurocomputing
, (), S0925231217313899–. doi:10.1016/j.neucom.2017.07.065
IBS3_Proj
By Incredeble us
IBS3_Proj
- 46