Prediction of pulmonary function test parameters by parameter response mapping parameters based on random forest regression model
10.3760/cma.j.cn112149-20210907-00829
- VernacularTitle:基于参数响应图参数的随机森林模型预测胸部疾病患者的肺功能
- Author:
Xiuxiu ZHOU
1
;
Yu PU
;
Di ZHANG
;
Yu GUAN
;
Yi XIA
;
Wenting TU
;
Shiyuan LIU
;
Li FAN
Author Information
1. 海军军医大学第二附属医院(上海长征医院)放射诊断科,上海 200003
- Keywords:
Pulmonary disease, chronic obstructive;
Tomography, X-ray computed;
Pulmonary function test;
Random forest
- From:
Chinese Journal of Radiology
2022;56(9):1001-1008
- CountryChina
- Language:Chinese
-
Abstract:
Objective:To explore the predictive value of random forest regression model for pulmonary function test.Methods:From August 2018 to December 2019, 615 subjects who underwent screening for three major chest diseases in Shanghai Changzheng Hospital were analyzed retrospectively. According to the ratio of forced expiratory volume in the first second to forced vital capacity (FEV 1/FVC) and the percentage of forced expiratory volume in the first second to the predicted value (FEV 1%), the subjects were divided into normal group, high risk group and chronic obstructive pulmonary disease (COPD) group. The CT quantitative parameter of small airway was parameter response mapping (PRM) parameters, including lung volume, the volume of functional small airways disease (PRMV fSAD), the volume of emphysema (PRMV Emph), the volume of normal lung tissue (PRMV Normal), the volume of uncategorized lung tissue (PRMV Uncategorized) and the percentage of the latter four volumes to the whole lung (%). ANOVA or Kruskal Wallis H was used to test the differences of basic clinical characteristics (age, sex, height, body mass), pulmonary function parameters and small airway CT quantitative parameters among the three groups; Spearman test was used to evaluate the correlation between PRM parameters and pulmonary function parameters. Finally, a random forest regression model based on PRM combined with four basic clinical characteristics was constructed to predict lung function. Results:There were significant differences in the parameters of whole lung PRM among the three groups ( P<0.001). Quantitative CT parameters PRMV Emph, PRMV Emph%, and PRMV Normal% showed a moderate correlation with FEV 1/FVC ( P<0.001). Whole lung volume, PRMV Normal,PRMV Uncategorized and PRMV Uncategorized% were strongly or moderately positively correlated with FVC ( P<0.001), other PRM parameters were weakly or very weakly correlated with pulmonary function parameters. Based on the above parameters, a random forest model for predicting FEV 1/FVC and a random forest model for predicting FEV 1% were established. The random forest model for predicting FEV 1/FVC predicted FEV 1/FVC and actual value was R 2=0.864 in the training set and R 2=0.749 in the validation set. The random forest model for predicting FEV 1% predicted FEV 1% and the actual value in the training set was R 2=0.888, and the validation set was R 2=0.792. The sensitivity, specificity and accuracy of predicting FEV 1% random forest model for the classification of normal group from high-risk group were 0.85(34/40), 0.90(65/72) and 0.88(99/112), respectively; and the sensitivity, specificity and accuracy of predicting FEV 1/FVC random forest model for differentiating non COPD group from COPD group were 0.89(8/9), 1.00 (112/112) and 0.99(120/121), respectively. While the accuracy of two models combination for subclassification of COPD [global initiative for chronic obstructive lung disease (GOLD) Ⅰ, GOLDⅡ and GOLD Ⅲ+Ⅳ] was only 0.44. Conclusions:Small airway CT quantitative parameter PRM can distinguish the normal population, high-risk and COPD population. The comprehensive regression prediction model combined with clinical characteristics based on PRM parameter show good performance differentiating normal group from high risk group, and differentiating non-COPD group from COPD group. Therefore, one-stop CT scan can evaluate the functional small airway and PFT simultaneously.