Radiogenomics-based prediction of KRAS and EGFR gene mutation in non-small cell lung cancer patients.
10.11817/j.issn.1672-7347.2025.250026
- Author:
Jianing LIN
1
,
2
;
Zhihang YAN
3
;
Longyu HE
4
;
Hao ZHANG
4
;
Mingxuan XIE
1
,
5
,
6
Author Information
1. Department of Geriatric Pulmonary and Critical Care Medicine, Xiangya Hospital, Central South University; National Clinical Research Center for Geriatric Disorders (Xiangya Hospital), Changsha
2. voicelin0777@gmail.com.
3. School of Electronic Information, Central South University, Changsha 410075, China. 3393771542@qq.com.
4. School of Electronic Information, Central South University, Changsha 410075, China.
5. xiemx2015@
6. com.
- Publication Type:Journal Article
- Keywords:
imaging;
machine learning;
non-small cell lung cancer;
radiogenomics;
targeted therapy
- MeSH:
Humans;
Carcinoma, Non-Small-Cell Lung/diagnostic imaging*;
Lung Neoplasms/diagnostic imaging*;
Mutation;
Proto-Oncogene Proteins p21(ras)/genetics*;
ErbB Receptors/genetics*;
Machine Learning;
Positron Emission Tomography Computed Tomography;
Female;
Male;
Neural Networks, Computer;
Middle Aged;
Aged
- From:
Journal of Central South University(Medical Sciences)
2025;50(5):805-814
- CountryChina
- Language:English
-
Abstract:
OBJECTIVES:Non-small cell lung cancer (NSCLC) is associated with poor prognosis, with 30% of patients diagnosed at an advanced stage. Mutations in the EGFR and KRAS genes are important prognostic factors for NSCLC, and targeted therapies can significantly improve survival in these patients. Although tissue biopsy remains the gold standard for detecting gene mutations, it has limitations, including invasiveness, sampling errors due to tumor heterogeneity, and poor reproducibility. This study aims to develop machine learning models based on radiomic features to predict EGFR and KRAS gene mutation status in NSCLC patients, thereby providing a reference for precision oncology.
METHODS:Imaging and mutation data from eligible NSCLC patients were obtained from the publicly available Lung-PET-CT-Dx dataset in The Cancer Imaging Archive (TCIA). A three-dimensional-convolutional neural network (3D-CNN) was used to extract imaging features from the regions of interest (ROI). The LightGBM algorithm was employed to build classification models for predicting EGFR and KRAS gene mutation status. Model performance was evaluated using 5-fold cross-validation, with receiver operator characteristic (ROC) curves, area under the curve (AUC), accuracy, sensitivity, and specificity used for validation.
RESULTS:The models effectively predicted EGFR and KRAS mutations in NSCLC patients, achieving an AUC of 0.95 for EGFR mutations and 0.90 for KRAS. The models also demonstrated high accuracy (EGFR 89.66%; KRAS 87.10%), sensitivity (EGFR 93.33%; KRAS 87.50%), and specificity (EGFR 85.71%; KRAS 86.67%).
CONCLUSIONS:A radiogenomics-machine learning predictive model can serve as a non-invasive tool for anticipating EGFR and KRAS gene mutation status in NSCLC patients.