Establishment of interpretable cytotoxicity prediction models using machine learning analysis of transcriptome features.
10.1016/j.apsb.2025.02.009
- Author:
You WU
1
;
Ke TANG
1
;
Chunzheng WANG
1
;
Hao SONG
1
;
Fanfan ZHOU
1
;
Ying GUO
1
Author Information
1. Beijing Key Laboratory of New Drug Mechanisms and Pharmacological Evaluation Study, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China.
- Publication Type:Journal Article
- Keywords:
Cell viability;
Cytotoxicity Signature genes;
Drug safety;
Interpretable model;
Machine learning;
Narrow therapeutic index drugs;
Transcriptome;
Weak cytotoxicity
- From:
Acta Pharmaceutica Sinica B
2025;15(3):1344-1358
- CountryChina
- Language:English
-
Abstract:
Cytotoxicity, usually represented by cell viability, is a crucial parameter for evaluating drug safety in vitro. Accurate prediction of cell viability/cytotoxicity could accelerate drug development in the early stage. In this study, by integrating cellular transcriptome and cell viability data using four machine learning algorithms (support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM)) and two ensemble algorithms (voting and stacking), highly accurate prediction models of 50% and 80% cell viability were developed with area under the receiver operating characteristic curve (AUROC) of 0.90 and 0.84, respectively; these models also showed good performance when utilized for diverse cell lines. Concerning the characterization of the employed Feature Genes, the models were interpreted, and the mechanisms of bioactive compounds with a narrow therapeutic index (NTI) can also be analyzed. In summary, the models established in this research exhibit superior capacity to those of previous studies; these models enable accurate high-safety substance screening via cytotoxicity prediction across cell lines. Moreover, for the first time, Cytotoxicity Signature (CTS) genes were identified, which could provide additional clues for further study of mechanisms of action (MOA), especially for NTI compounds.