1.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
2.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
3.Innovation and application of traditional Chinese medicine dispensing promoted through integration of whole-process data elements.
Huan-Fei YANG ; Si-Yu LI ; Chen-Qian YU ; Jian-Kun WU ; Fang LIU ; Li-Bin JIANG ; Chun-Jin LI ; Xiang-Fei SU ; Wei-Guo BAI ; Hua-Qiang ZHAI ; Shi-Yuan JIN ; Yong-Yan WANG
China Journal of Chinese Materia Medica 2025;50(11):3189-3196
As a new type of production factor that can empower the development of new quality productivity, the data element is an important engine to promote the high quality development of the industry. Traditional Chinese medicine(TCM) dispensing is the most basic work of TCM clinical pharmacy, and its quality directly affects the clinical efficacy of TCM. The integration of data elements and TCM dispensing can stimulate the innovation and vitality of the TCM dispensing industry and promote the high-quality and sustainable development of the industry. A large-scale, detailed, and systematic study on TCM dispensing was conducted. The innovative practice path of data fusion construction in the whole process of TCM dispensing was investigated by integrating the digital resources "nine full activities" of TCM dispensing, creating the digital dictionary of "TCM clinical information data elements", and exploring innovative applications of TCM dispensing driven by data and technology, so as to promote the standardized, digital, and intelligent development of TCM dispensing in medical health services. The research content of this project was successfully selected as the second batch of "Data element×" typical cases of National Data Administration in 2024, which is the only selected case in the field of TCM.
Medicine, Chinese Traditional/methods*
;
Drugs, Chinese Herbal
;
Humans
4.Quality evaluation of Xinjiang Rehmannia glutinosa and Rehmannia glutinosa based on fingerprint and multi-component quantification combined with chemical pattern recognition.
Pan-Ying REN ; Wei ZHANG ; Xue LIU ; Juan ZHANG ; Cheng-Fu SU ; Hai-Yan GONG ; Chun-Jing YANG ; Jing-Wei LEI ; Su-Qing ZHI ; Cai-Xia XIE
China Journal of Chinese Materia Medica 2025;50(16):4630-4640
The differences in chemical quality characteristics between Xinjiang Rehmannia glutinosa and R. glutinosa were analyzed to provide a theoretical basis for the introduction and quality control of R. glutinosa. In this study, the high performance liquid chromatography(HPLC) fingerprints of 6 batches of Xinjiang R. glutinosa and 10 batches of R. glutinosa samples were established. The content of iridoid glycosides, phenylethanoid glycosides, monosaccharides, oligosaccharides, and polysaccharides in Xinjiang R. glutinosa and R. glutinosa was determined by high performance liquid chromatography-diode array detection(HPLC-DAD), high performance liquid chromatography-evaporative light scattering detection(HPLC-ELSD), and ultraviolet-visible spectroscopy(UV-Vis). The determination results were analyzed with by chemical pattern recognition and entropy weight TOPSIS method. The results showed that there were 19 common peaks in the HPLC fingerprints of the 16 batches of R. glutinosa, and catalpol, aucubin, rehmannioside D, rehmannioside A, hydroxytyrosol, leonuride, salidroside, cistanoside A, and verbascoside were identified. Hierarchical cluster analysis(HCA) and principal component analysis(PCA) showed that Qinyang R. glutinosa, Mengzhou R. glutinosa, and Xinjiang R. glutinosa were grouped into three different categories, and eight common components causing the chemical quality difference between Xinjiang R. glutinosa and R. glutinosa in Mengzhou and Qinyang of Henan province were screened out by orthogonal partial least squares discriminant analysis(OPLS-DA). The results of content determination showed that there were glucose, sucrose, raffinose, stachyose, polysaccharides, and nine glycosides in Xinjiang R. glutinosa and R. glutinosa samples, and the content of catalpol, rehmannioside A, leonuride, cistanoside A, verbascoside, sucrose, and glucose was significantly different between Xinjiang R. glutinosa and R. glutinosa. The analysis with entropy weight TOPSIS method showed that the comprehensive quality of R. glutinosa in Mengzhou and Qinyang of Henan province was better than that of Xinjiang R. glutinosa. In conclusion, the types of main chemical components of R. glutinosa and Xinjiang R. glutinosa were the same, but their content was different. The chemical quality of R. glutinosa was better than Xinjiang R. glutinosa, and other components in R. glutinosa from two producing areas and their effects need further study.
Rehmannia/classification*
;
Drugs, Chinese Herbal/chemistry*
;
Chromatography, High Pressure Liquid/methods*
;
Quality Control
5.Analysis of fungal composition in different layers of Bantou agarwood-forming trunk of Aquilaria sinensis revealing presence of Aspergillus-inhibiting substances in agarwood sites.
Xuyu CHEN ; Yun YANG ; Yangyang LIU ; Chun SUI ; Jianhe WEI
Chinese Herbal Medicines 2025;17(2):315-321
OBJECTIVE:
The objective of this study was to analyse fungal composition and exploit application potential in the Bantou (BT) agarwood-forming trunk of Aquilaria sinensis.
METHODS:
BT agarwood is a naturally formed agarwood that was collected after cutting. Total genomic DNA of the fungi in BT agarwood was extracted by the hexadecyltrimethy ammonium bromide (CTAB) method, followed by PCR amplification and library construction. The effective tags were obtained by the HiSeq2500 platform, and the data were subjected to bioinformatics and statistical analyses.
RESULTS:
A total of 7 850 040 effective tags were obtained, Ascomycota was the most abundant fungus at the phylum level, with a relative abundance of 56.36%-61.44%, followed by Basidiomycota, with a relative abundance of 10.49%-20.39%. Dothideomycetes, Agaricomycetes and Sordariomycetes were dominant at the class level, accounting for 26.21%-33.88%, 8.40%-17.66%, and 18.41%-24.11%, respectively. Lignosphaeria, Phaeoacremonium and Hermatomyces were dominant at the genus level, with relative abundances of 6.25%-7.64%, 1.95%-9.05% and 1.5%-5.4%, respectively. Diversity and richness analysis showed that the fungal composition in the agarwood formation sites (agarwood layer, upper agarwood layer and lower agarwood layer) were significantly lower than those in the decomposing layer and the healthy layer. That is, the fungal diversity and richness were significantly reduced during agarwood formation by the action of open wounds. The fungal community structure in the decomposing layer and agarwood formation sites obviously differed from that in the healthy layer. The number of Aspergillus taxa in agarwood formation sites decreased significantly (healthy layer is 0.5%, decomposing layer is 0.022%, upper agarwood layer is 0.012%, agarwood layer is 0.01%, and lower agarwood layer is 0.013%), indicating that agarwood may contain potential substances to inhibit the growth of Aspergillus.
CONCLUSION
Agarwood from agarwood formation sites contains potential substances that inhibit Aspergillus, which provides valuable information for the control of the genus of Aspergillus.
6.Predictive Modeling of Symptomatic Intracranial Hemorrhage Following Endovascular Thrombectomy: Insights From the Nationwide TREAT-AIS Registry
Jia-Hung CHEN ; I-Chang SU ; Yueh-Hsun LU ; Yi-Chen HSIEH ; Chih-Hao CHEN ; Chun-Jen LIN ; Yu-Wei CHEN ; Kuan-Hung LIN ; Pi-Shan SUNG ; Chih-Wei TANG ; Hai-Jui CHU ; Chuan-Hsiu FU ; Chao-Liang CHOU ; Cheng-Yu WEI ; Shang-Yih YAN ; Po-Lin CHEN ; Hsu-Ling YEH ; Sheng-Feng SUNG ; Hon-Man LIU ; Ching-Huang LIN ; Meng LEE ; Sung-Chun TANG ; I-Hui LEE ; Lung CHAN ; Li-Ming LIEN ; Hung-Yi CHIOU ; Jiunn-Tay LEE ; Jiann-Shing JENG ;
Journal of Stroke 2025;27(1):85-94
Background:
and Purpose Symptomatic intracranial hemorrhage (sICH) following endovascular thrombectomy (EVT) is a severe complication associated with adverse functional outcomes and increased mortality rates. Currently, a reliable predictive model for sICH risk after EVT is lacking.
Methods:
This study used data from patients aged ≥20 years who underwent EVT for anterior circulation stroke from the nationwide Taiwan Registry of Endovascular Thrombectomy for Acute Ischemic Stroke (TREAT-AIS). A predictive model including factors associated with an increased risk of sICH after EVT was developed to differentiate between patients with and without sICH. This model was compared existing predictive models using nationwide registry data to evaluate its relative performance.
Results:
Of the 2,507 identified patients, 158 developed sICH after EVT. Factors such as diastolic blood pressure, Alberta Stroke Program Early CT Score, platelet count, glucose level, collateral score, and successful reperfusion were associated with the risk of sICH after EVT. The TREAT-AIS score demonstrated acceptable predictive accuracy (area under the curve [AUC]=0.694), with higher scores being associated with an increased risk of sICH (odds ratio=2.01 per score increase, 95% confidence interval=1.64–2.45, P<0.001). The discriminatory capacity of the score was similar in patients with symptom onset beyond 6 hours (AUC=0.705). Compared to existing models, the TREAT-AIS score consistently exhibited superior predictive accuracy, although this difference was marginal.
Conclusions
The TREAT-AIS score outperformed existing models, and demonstrated an acceptable discriminatory capacity for distinguishing patients according to sICH risk levels. However, the differences between models were only marginal. Further research incorporating periprocedural and postprocedural factors is required to improve the predictive accuracy.
7.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
8.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
9.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
10.Predictive Modeling of Symptomatic Intracranial Hemorrhage Following Endovascular Thrombectomy: Insights From the Nationwide TREAT-AIS Registry
Jia-Hung CHEN ; I-Chang SU ; Yueh-Hsun LU ; Yi-Chen HSIEH ; Chih-Hao CHEN ; Chun-Jen LIN ; Yu-Wei CHEN ; Kuan-Hung LIN ; Pi-Shan SUNG ; Chih-Wei TANG ; Hai-Jui CHU ; Chuan-Hsiu FU ; Chao-Liang CHOU ; Cheng-Yu WEI ; Shang-Yih YAN ; Po-Lin CHEN ; Hsu-Ling YEH ; Sheng-Feng SUNG ; Hon-Man LIU ; Ching-Huang LIN ; Meng LEE ; Sung-Chun TANG ; I-Hui LEE ; Lung CHAN ; Li-Ming LIEN ; Hung-Yi CHIOU ; Jiunn-Tay LEE ; Jiann-Shing JENG ;
Journal of Stroke 2025;27(1):85-94
Background:
and Purpose Symptomatic intracranial hemorrhage (sICH) following endovascular thrombectomy (EVT) is a severe complication associated with adverse functional outcomes and increased mortality rates. Currently, a reliable predictive model for sICH risk after EVT is lacking.
Methods:
This study used data from patients aged ≥20 years who underwent EVT for anterior circulation stroke from the nationwide Taiwan Registry of Endovascular Thrombectomy for Acute Ischemic Stroke (TREAT-AIS). A predictive model including factors associated with an increased risk of sICH after EVT was developed to differentiate between patients with and without sICH. This model was compared existing predictive models using nationwide registry data to evaluate its relative performance.
Results:
Of the 2,507 identified patients, 158 developed sICH after EVT. Factors such as diastolic blood pressure, Alberta Stroke Program Early CT Score, platelet count, glucose level, collateral score, and successful reperfusion were associated with the risk of sICH after EVT. The TREAT-AIS score demonstrated acceptable predictive accuracy (area under the curve [AUC]=0.694), with higher scores being associated with an increased risk of sICH (odds ratio=2.01 per score increase, 95% confidence interval=1.64–2.45, P<0.001). The discriminatory capacity of the score was similar in patients with symptom onset beyond 6 hours (AUC=0.705). Compared to existing models, the TREAT-AIS score consistently exhibited superior predictive accuracy, although this difference was marginal.
Conclusions
The TREAT-AIS score outperformed existing models, and demonstrated an acceptable discriminatory capacity for distinguishing patients according to sICH risk levels. However, the differences between models were only marginal. Further research incorporating periprocedural and postprocedural factors is required to improve the predictive accuracy.

Result Analysis
Print
Save
E-mail