1.Identifying Novel Drug Targets by iDTPnd: A Case Study of Kinase Inhibitors
Naveed HAMMAD ; Reglin CORINNA ; Schubert THOMAS ; Gao XIN ; T.Arold STEFAN ; L.Maitland MICHAEL
Genomics, Proteomics & Bioinformatics 2021;19(6):986-997
Current FDA-approved kinase inhibitors cause diverse adverse effects, some of which are due to the me-chanism-independent effects of these drugs. Identifying these mechanism-independent interactions could improve drug safety and support drug repurposing. Here, we develop iDTPnd (integrated Drug Target Predictor with negative dataset), a computational approach for large-scale discovery of novel targets for known drugs. For a given drug, we construct a positive structural signature as well as a negative structural signature that captures the weakly conserved structural features of drug-binding sites. To facilitate assessment of unintended targets, iDTPnd also provides a docking-based interaction score and its statistical significance. We confirm the interactions of sorafenib, imatinib, dasatinib, sunitinib, and pazopanib with their known targets at a sensitivity of 52%and a specificity of 55%. We also validate 10 predicted novel targets by using in vitro experiments. Our results suggest that proteins other than kinases, such as nuclear receptors, cytochrome P450, and MHC class I molecules, can also be physiologically relevant targets of kinase inhibitors. Our method is general and broadly applicable for the identification of protein–small molecule interactions, when sufficient drug–target 3D data are available. The code for constructing the structural signatures is available at https://sfb.kaust.edu.sa/Documents/iDTP.zip.
2.QAUST: Protein Function Prediction Using Structure Similarity, Protein Interaction, and Functional Motifs
Smaili Zohra FATIMA ; Tian SHUYE ; Roy AMBRISH ; Alazmi MESHARI ; T.Arold STEFAN ; Mukherjee SRAYANTA ; Hefty P.SCOTT ; Chen WEI ; Gao XIN
Genomics, Proteomics & Bioinformatics 2021;19(6):998-1011
The number of available protein sequences in public databases is increasing exponentially. However, a sig-nificant percentage of these sequences lack functional annotation, which is essential for the understanding of how bio-logical systems operate. Here, we propose a novel method, Quantitative Annotation of Unknown STructure (QAUST), to infer protein functions, specifically Gene Ontology (GO) terms and Enzyme Commission (EC) numbers. QAUST uses three sources of information: structure information encoded by global and local structure similarity search, biological network information inferred by protein–protein interaction data, and sequence information extracted from functionally discriminative sequence motifs. These three pieces of information are combined by consensus averaging to make the final prediction. Our approach has been tested on 500 protein targets from the Critical Assessment of Functional Annotation (CAFA) benchmark set. The results show that our method provides accurate functional annotation and outperforms other prediction methods based on sequence similarity search or threading. We further demonstrate that a previously unknown function of human tripartite motif-containing 22 (TRIM22) protein predicted by QAUST can be experimentally validated.