Adaptive multi-view learning method for enhanced drug repurposing using chemical-induced transcriptional profiles,knowledge graphs,and large language models

Yudong YAN; Yinqi YANG; Zhuohao TONG; Yu WANG; Fan YANG; Zupeng PAN; Chuan LIU; Mingze BAI; Yongfang XIE; Yuefei LI; Kunxian SHU; Yinghong LI

Return

Adaptive multi-view learning method for enhanced drug repurposing using chemical-induced transcriptional profiles,knowledge graphs,and large language models

Author: Yudong YAN ¹ ; Yinqi YANG ; Zhuohao TONG ; Yu WANG ; Fan YANG ; Zupeng PAN ; Chuan LIU ; Mingze BAI ; Yongfang XIE ; Yuefei LI ; Kunxian SHU ; Yinghong LI
Author Information

1. Chongqing Key Laboratory of Big Data for Bio Intelligence,Chongqing University of Posts and Telecommunications,Chongqing,400065,China
Publication Type:Journal Article
Keywords: Drug repurposing; Multi-view learning; Chemical-induced transcriptional profile; Knowledge graph; Large language model; Heterogeneous network
From: Journal of Pharmaceutical Analysis 2025;15(6):1354-1369
CountryChina
Language:English
Abstract: Drug repurposing offers a promising alternative to traditional drug development and significantly re-duces costs and timelines by identifying new therapeutic uses for existing drugs.However,the current approaches often rely on limited data sources and simplistic hypotheses,which restrict their ability to capture the multi-faceted nature of biological systems.This study introduces adaptive multi-view learning(AMVL),a novel methodology that integrates chemical-induced transcriptional profiles(CTPs),knowledge graph(KG)embeddings,and large language model(LLM)representations,to enhance drug repurposing predictions.AMVL incorporates an innovative similarity matrix expansion strategy and leverages multi-view learning(MVL),matrix factorization,and ensemble optimization techniques to integrate heterogeneous multi-source data.Comprehensive evaluations on benchmark datasets(Fdata-set,Cdataset,and Ydataset)and the large-scale iDrug dataset demonstrate that AMVL outperforms state-of-the-art(SOTA)methods,achieving superior accuracy in predicting drug-disease associations across multiple metrics.Literature-based validation further confirmed the model's predictive capabilities,with seven out of the top ten predictions corroborated by post-2011 evidence.To promote transparency and reproducibility,all data and codes used in this study were open-sourced,providing resources for pro-cessing CTPs,KG,and LLM-based similarity calculations,along with the complete AMVL algorithm and benchmarking procedures.By unifying diverse data modalities,AMVL offers a robust and scalable so-lution for accelerating drug discovery,fostering advancements in translational medicine and integrating multi-omics data.We aim to inspire further innovations in multi-source data integration and support the development of more precise and efficient strategies for advancing drug discovery and translational medicine.