We use sequence conservation patterns to determine the significance of variants in genetic diseases and cancer. In another project, we compare protein sequences to look for unique sequences which render species resistant to cancer.
Phylogenetic profiling (PP) follows the conservation pattern for each gene across thousands of genomes and identifies genes with similar evolutionary patterns to infer functional interactions. We aim to use PP as a discovery tool to characterize gene function, understand biological pathways, and identify disease causing genes.
Through the years we’ve published many papers optimizing PP, and adding clade-wise analysis to improve predictability. Recently we employed PP to look for the genetic network of the gene MECP2 which is responsible for RETT syndrome, and ACE2 which is the target of the COVID19 virus.
Currently we are searching for disease causing genes in patients with hereditary diseases, and building gene networks to identify novel biological pathways in humans and plants.
- Decoding genetics & treatment of rare diseases
- Mapping novel DNA repair genes
- Decoding genetics & treatment of Rett syndrome
- Controlling aging
- Enhancing the co-evolution signal in thousands of species (Phylogenetic profiling)
- Annotating the plant genome using thousands of genomes
- Hidden gene pathways
Omics (Data Integration)
There is an exponentially growing amount of data freely available today. These data include genomic sequences, clinical data, mutations data, protein-protein interactions, gene and protein expression, phenotypes of different model species, diseases etc.
We take advantage of these enormous data to perform functional genomics, and use machine learning to integrate all these sources to uncover new functions for uncharacterized genes.