Research on machine learning-based activity prediction models for KRAS inhibitors
10.11665/j.issn.1000-5048.2024031102
- VernacularTitle:基于机器学习的KRAS抑制剂活性预测模型研究
- Author:
Ke DU
1
;
Danqi RONG
;
Rui LU
;
Xiaoya ZHANG
;
Hongping ZHAO
Author Information
1. 中国药科大学理学院
- Publication Type:Journal Article
- Keywords:
KRAS inhibitors / mutual information / principal component analysis / random forest / support vector machine / extreme gradient boosting
- From:
Journal of China Pharmaceutical University
2024;55(3):306-315
- CountryChina
- Language:Chinese
-
Abstract:
Abstract: Kirsten rat sarcoma viral oncogene homolog (KRAS) gene is one of the most commonly mutated oncogenes. It has been found that KRAS inhibitors have the potential therapeutic effect on cancer patients with this gene mutation. In this study, machine learning was applied to develop a QSAR(quantitative structure-activity relationship) model for KRAS small molecule inhibitors. A total of 1857data points of IC50 and SMILES(simplified molecular input line entry system) for KRAS inhibitors were collected from three databases: ChEMBL, BindingDB, and PubChem. And nine different classifiers were constructed using three different feature screening methods combined with three machine learning models, namely, random forest, support vector machine, and extreme gradient boosting machine. The results showed that the SVM model combined with mutual information feature selection exhibited the best performance: AUCtest=0.912, ACCtest=0.859, F1test=0.890. Moreover, it also demonstrated good predictive performance on the external validation set(AUCExt=0.944, RecallExt=0.856, FPRExt=0.111). This study provides a new technical route for KRAS inhibitor screening in natural product databases using artificial intelligence methods.