1.YOLOX-SwinT algorithm improves the accuracy of AO/OTA classification of intertrochanteric fractures by orthopedic trauma surgeons.
Xue-Si LIU ; Rui NIE ; Ao-Wen DUAN ; Li YANG ; Xiang LI ; Le-Tian ZHANG ; Guang-Kuo GUO ; Qing-Shan GUO ; Dong-Chu ZHAO ; Yang LI ; He-Hua ZHANG
Chinese Journal of Traumatology 2025;28(1):69-75
PURPOSE:
Intertrochanteric fracture (ITF) classification is crucial for surgical decision-making. However, orthopedic trauma surgeons have shown lower accuracy in ITF classification than expected. The objective of this study was to utilize an artificial intelligence (AI) method to improve the accuracy of ITF classification.
METHODS:
We trained a network called YOLOX-SwinT, which is based on the You Only Look Once X (YOLOX) object detection network with Swin Transformer (SwinT) as the backbone architecture, using 762 radiographic ITF examinations as the training set. Subsequently, we recruited 5 senior orthopedic trauma surgeons (SOTS) and 5 junior orthopedic trauma surgeons (JOTS) to classify the 85 original images in the test set, as well as the images with the prediction results of the network model in sequence. Statistical analysis was performed using the SPSS 20.0 (IBM Corp., Armonk, NY, USA) to compare the differences among the SOTS, JOTS, SOTS + AI, JOTS + AI, SOTS + JOTS, and SOTS + JOTS + AI groups. All images were classified according to the AO/OTA 2018 classification system by 2 experienced trauma surgeons and verified by another expert in this field. Based on the actual clinical needs, after discussion, we integrated 8 subgroups into 5 new subgroups, and the dataset was divided into training, validation, and test sets by the ratio of 8:1:1.
RESULTS:
The mean average precision at the intersection over union (IoU) of 0.5 (mAP50) for subgroup detection reached 90.29%. The classification accuracy values of SOTS, JOTS, SOTS + AI, and JOTS + AI groups were 56.24% ± 4.02%, 35.29% ± 18.07%, 79.53% ± 7.14%, and 71.53% ± 5.22%, respectively. The paired t-test results showed that the difference between the SOTS and SOTS + AI groups was statistically significant, as well as the difference between the JOTS and JOTS + AI groups, and the SOTS + JOTS and SOTS + JOTS + AI groups. Moreover, the difference between the SOTS + JOTS and SOTS + JOTS + AI groups in each subgroup was statistically significant, with all p < 0.05. The independent samples t-test results showed that the difference between the SOTS and JOTS groups was statistically significant, while the difference between the SOTS + AI and JOTS + AI groups was not statistically significant. With the assistance of AI, the subgroup classification accuracy of both SOTS and JOTS was significantly improved, and JOTS achieved the same level as SOTS.
CONCLUSION
In conclusion, the YOLOX-SwinT network algorithm enhances the accuracy of AO/OTA subgroups classification of ITF by orthopedic trauma surgeons.
Humans
;
Hip Fractures/diagnostic imaging*
;
Orthopedic Surgeons
;
Algorithms
;
Artificial Intelligence
2.Explainable machine learning model for predicting septic shock in critically sepsis patients based on coagulation indexes: A multicenter cohort study.
Qing-Bo ZENG ; En-Lan PENG ; Ye ZHOU ; Qing-Wei LIN ; Lin-Cui ZHONG ; Long-Ping HE ; Nian-Qing ZHANG ; Jing-Chun SONG
Chinese Journal of Traumatology 2025;28(6):404-411
PURPOSE:
Septic shock is associated with high mortality and poor outcomes among sepsis patients with coagulopathy. Although traditional statistical methods or machine learning (ML) algorithms have been proposed to predict septic shock, these potential approaches have never been systematically compared. The present work aimed to develop and compare models to predict septic shock among patients with sepsis.
METHODS:
It is a retrospective cohort study based on 484 patients with sepsis who were admitted to our intensive care units between May 2018 and November 2022. Patients from the 908th Hospital of Chinese PLA Logistical Support Force and Nanchang Hongdu Hospital of Traditional Chinese Medicine were respectively allocated to training (n=311) and validation (n=173) sets. All clinical and laboratory data of sepsis patients characterized by comprehensive coagulation indexes were collected. We developed 5 models based on ML algorithms and 1 model based on a traditional statistical method to predict septic shock in the training cohort. The performance of all models was assessed using the area under the receiver operating characteristic curve and calibration plots. Decision curve analysis was used to evaluate the net benefit of the models. The validation set was applied to verify the predictive accuracy of the models. This study also used Shapley additive explanations method to assess variable importance and explain the prediction made by a ML algorithm.
RESULTS:
Among all patients, 37.2% experienced septic shock. The characteristic curves of the 6 models ranged from 0.833 to 0.962 and 0.630 to 0.744 in the training and validation sets, respectively. The model with the best prediction performance was based on the support vector machine (SVM) algorithm, which was constructed by age, tissue plasminogen activator-inhibitor complex, prothrombin time, international normalized ratio, white blood cells, and platelet counts. The SVM model showed good calibration and discrimination and a greater net benefit in decision curve analysis.
CONCLUSION
The SVM algorithm may be superior to other ML and traditional statistical algorithms for predicting septic shock. Physicians can better understand the reliability of the predictive model by Shapley additive explanations value analysis.
Humans
;
Shock, Septic/blood*
;
Machine Learning
;
Male
;
Female
;
Retrospective Studies
;
Middle Aged
;
Aged
;
Sepsis/complications*
;
ROC Curve
;
Cohort Studies
;
Adult
;
Intensive Care Units
;
Algorithms
;
Blood Coagulation
;
Critical Illness
3.Early prediction and warning of MODS following major trauma via identification of cytokine storm: A prospective cohort study.
Panpan CHANG ; Rui LI ; Jiahe WEN ; Guanjun LIU ; Feifei JIN ; Yongpei YU ; Yongzheng LI ; Guang ZHANG ; Tianbing WANG
Chinese Journal of Traumatology 2025;28(6):391-398
PURPOSE:
Early mortality in major trauma has decreased, but MODS remains a leading cause of poor outcomes, driven by trauma-induced cytokine storms that exacerbate injuries and organ damage.
METHODS:
This prospective cohort study included 79 major trauma patients (ISS >15) treated in the National Center for Trauma Medicine, Peking University People's Hospital, from September 1, 2021, to July 31, 2023. Patients (1) with ISS >15 (according to AIS 2015), (2) aged 15-80 years, (3) admitted within 6 h of injury, (4) having no prior treatment before admission, were included. Exclusion criteria were (1) GCS score <9 or AIS score ≥3 for TBI, (2) confirmed infection, infectious disease, or high infection risk, (3) pregnancy, (4) severe primary diseases affecting survival, (5) recent use of immunosuppressive or cytotoxic drugs within the past 6 months, (6) psychiatric patients, (7) participation in other clinical trials within the past 30 days, (8) patients with incomplete data or missing blood samples. Admission serum inflammatory cytokines and pathophysiological data were analyzed to develop machine learning models predicting MODS within 7 days. LR, DR, RF, SVM, NB, and XGBoost were evaluated based on the area under the AUROC. The SHAP method was used to interpret results.
RESULTS:
This study enrolled 79 patients with major trauma, and the median (Q1, Q3) age was 51 (35, 59) years (52 males, 65.8%). The inflammatory cytokine data were collected for all participants. Among these patients, 35 (44.3%) developed MODS, and 44 (55.7%) did not. Additionally, 2 patients (2.5%) from the MODS group succumbed. The logistic regression model showed strong performance in predicting MODS. Ten key cytokines, IL-18, Eotaxin, MCP-4, IP-10, CXCL12, MIP-3α, MCP-1, IL-1RA, Cystatin C, and MRP8/14 were identified as critical to the trauma-induced cytokine storm and MODS development. Early elevation of these cytokines achieved high predictive accuracy, with an AUROC of 0.887 (95% CI 0.813-0.976).
CONCLUSION
Trauma-induced cytokine storms are strongly associated with MODS. Early identification of inflammatory cytokine changes enables better prediction and timely interventions to improve outcomes.
Humans
;
Prospective Studies
;
Middle Aged
;
Male
;
Female
;
Adult
;
Aged
;
Cytokine Release Syndrome/etiology*
;
Adolescent
;
Young Adult
;
Aged, 80 and over
;
Wounds and Injuries/complications*
;
Cytokines/blood*
;
Multiple Organ Failure/diagnosis*
;
Machine Learning
4.Predictability of varicocele repair success: preliminary results of a machine learning-based approach.
Andrea CRAFA ; Marco RUSSO ; Rossella CANNARELLA ; Murat GÜL ; Michele COMPAGNONE ; Laura M MONGIOÌ ; Vittorio CANNARELLA ; Rosita A CONDORELLI ; Sandro La VIGNERA ; Aldo E CALOGERO
Asian Journal of Andrology 2025;27(1):52-58
Varicocele is a prevalent condition in the infertile male population. However, to date, which patients may benefit most from varicocele repair is still a matter of debate. The purpose of this study was to evaluate whether certain preintervention sperm parameters are predictive of successful varicocele repair, defined as an improvement in total motile sperm count (TMSC). We performed a retrospective study on 111 patients with varicocele who had undergone varicocele repair, collected from the Department of Endocrinology, Metabolic Diseases and Nutrition, University of Catania (Catania, Italy), and the Unit of Urology at the Selcuk University School of Medicine (Konya, Türkiye). The predictive analysis was conducted through the use of the Brain Project, an innovative tool that allows a complete and totally unbiased search of mathematical expressions that relate the object of study to the various parameters available. Varicocele repair was considered successful when TMSC increased by at least 50% of the preintervention value. For patients with preintervention TMSC below 5 × 10 6 , improvement was considered clinically relevant when the increase exceeded 50% and the absolute TMSC value was >5 × 10 6 . From the preintervention TMSC alone, we found a model that predicts patients who appear to benefit little from varicocele repair with a sensitivity of 50.0% and a specificity of 81.8%. Varicocele grade and serum follicle-stimulating hormone (FSH) levels did not play a predictive role, but it should be noted that all patients enrolled in this study were selected with intermediate- or high-grade varicocele and normal FSH levels. In conclusion, preintervention TMSC is predictive of the success of varicocele repair in terms of TMSC improvement in patients with intermediate- or high-grade varicoceles and normal FSH levels.
Humans
;
Varicocele/complications*
;
Male
;
Retrospective Studies
;
Machine Learning
;
Adult
;
Treatment Outcome
;
Sperm Count
;
Infertility, Male/etiology*
;
Sperm Motility
;
Follicle Stimulating Hormone/blood*
;
Young Adult
5.Recent advances in the management of male infertility.
Rashed ROWAIEE ; Omar ALMIDANI ; Omer A RAHEEM
Asian Journal of Andrology 2025;27(6):669-672
Male factor infertility has been rising, which accounts for up to 30% of infertility cases and contributes to 50% of overall cases. The aim of this review is to explore the recent advances that have emerged in the field through a narrative review. A comprehensive literature search was conducted using multiple databases, including the Cochrane Library, PubMed, Scopus, and Web of Science. Gray literature was also reviewed through ClinicalTrials.gov and the World Health Organization International Clinical Trials Registry Platform. The findings were presented narratively to encompass the extensive range of published data on male infertility. Significant strides have been made in the field of male infertility, particularly with biomarkers, shear wave elastography, 3-dimensional (3D) bioprinting, artificial intelligence (AI), and robotic and microsurgical treatment, offering promising avenues for diagnosis and treatment. Continued research and technological innovation are essential to further improve outcomes for patients facing male factor infertility.
Humans
;
Male
;
Infertility, Male/diagnosis*
;
Artificial Intelligence
;
Elasticity Imaging Techniques
;
Bioprinting
;
Microsurgery
;
Biomarkers
;
Robotic Surgical Procedures
6.DeepSeek perspective on managing Kawasaki disease in Chinese children.
Chinese Journal of Contemporary Pediatrics 2025;27(5):524-528
Clinical management of Kawasaki disease faces several challenges, including difficulties in early diagnosis, insufficient personalized treatment, delayed access to information, and inefficient multidisciplinary collaboration. This paper explores the application of the DeepSeek AI model in the management of Kawasaki disease: (1) Enhancing early diagnosis accuracy through the integration and analysis of multimodal data (imaging, laboratory, and clinical data); (2) Dynamically adjusting treatment plans to achieve personalized medicine; (3) Integrating the latest global guidelines and research findings in real-time to optimize clinical processes; (4) Providing personalized health education content to enhance parental involvement; (5) Establishing a platform for sharing clinical data to support intelligent decision-making and multidisciplinary collaboration.
Humans
;
Mucocutaneous Lymph Node Syndrome/diagnosis*
;
Child
;
Artificial Intelligence
;
Precision Medicine
;
East Asian People
8.A machine learning approach for the diagnosis of obstructive sleep apnoea using oximetry, demographic and anthropometric data.
Zhou Hao LEONG ; Shaun Ray Han LOH ; Leong Chai LEOW ; Thun How ONG ; Song Tar TOH
Singapore medical journal 2025;66(4):195-201
INTRODUCTION:
Obstructive sleep apnoea (OSA) is a serious but underdiagnosed condition. Demand for the gold standard diagnostic polysomnogram (PSG) far exceeds its availability. More efficient diagnostic methods are needed, even in tertiary settings. Machine learning (ML) models have strengths in disease prediction and early diagnosis. We explored the use of ML with oximetry, demographic and anthropometric data to diagnose OSA.
METHODS:
A total of 2,996 patients were included for modelling and divided into test and training sets. Seven commonly used supervised learning algorithms were trained with the data. Sensitivity (recall), specificity, positive predictive value (PPV) (precision), negative predictive value, area under the receiver operating characteristic curve (AUC) and F1 measure were reported for each model.
RESULTS:
In the best performing four-class model (neural network model predicting no, mild, moderate or severe OSA), a prediction of moderate and/or severe disease had a combined PPV of 94%; one out of 335 patients had no OSA and 19 had mild OSA. In the best performing two-class model (logistic regression model predicting no-mild vs. moderate-severe OSA), the PPV for moderate-severe OSA was 92%; two out of 350 patients had no OSA and 26 had mild OSA.
CONCLUSION
Our study showed that the prediction of moderate-severe OSA in a tertiary setting with an ML approach is a viable option to facilitate early identification of OSA. Prospective studies with home-based oximeters and analysis of other oximetry variables are the next steps towards formal implementation.
Humans
;
Oximetry/methods*
;
Sleep Apnea, Obstructive/diagnosis*
;
Male
;
Female
;
Middle Aged
;
Machine Learning
;
Polysomnography
;
Adult
;
Anthropometry
;
ROC Curve
;
Aged
;
Algorithms
;
Predictive Value of Tests
;
Sensitivity and Specificity
;
Neural Networks, Computer
;
Demography
9.Deploying artificial intelligence in the detection of adult appendicular and pelvic fractures in the Singapore emergency department after hours: efficacy, cost savings and non-monetary benefits.
John Jian Xian QUEK ; Oliver James NICKALLS ; Bak Siew Steven WONG ; Min On TAN
Singapore medical journal 2025;66(4):202-207
INTRODUCTION:
Radiology plays an integral role in fracture detection in the emergency department (ED). After hours, when there are fewer reporting radiologists, most radiographs are interpreted by ED physicians. A minority of these interpretations may miss diagnoses, which later require the callback of patients for further management. Artificial intelligence (AI) has been viewed as a potential solution to augment the shortage of radiologists after hours. We explored the efficacy of an AI solution in the detection of appendicular and pelvic fractures for adult radiographs performed after hours at a general hospital ED in Singapore, and estimated the potential monetary and non-monetary benefits.
METHODS:
One hundred and fifty anonymised abnormal radiographs were retrospectively collected and fed through an AI fracture detection solution. The radiographs were re-read by two radiologist reviewers and their consensus was established as the reference standard. Cases were stratified based on the concordance between the AI solution and the reviewers' findings. Discordant cases were further analysed based on the nature of the discrepancy into overcall and undercall subgroups. Statistical analysis was performed to evaluate the accuracy, sensitivity and inter-rater reliability of the AI solution.
RESULTS:
Ninety-two examinations were included in the final study radiograph set. The AI solution had a sensitivity of 98.9%, an accuracy of 85.9% and an almost perfect agreement with the reference standard.
CONCLUSION
An AI fracture detection solution has similar sensitivity to human radiologists in the detection of fractures on ED appendicular and pelvic radiographs. Its implementation offers significant potential measurable cost, manpower and time savings.
Humans
;
Singapore
;
Emergency Service, Hospital
;
Fractures, Bone/diagnostic imaging*
;
Artificial Intelligence
;
Retrospective Studies
;
Adult
;
Male
;
Female
;
Cost Savings
;
Middle Aged
;
Pelvic Bones/diagnostic imaging*
;
Reproducibility of Results
;
Aged
;
Sensitivity and Specificity
;
Radiography
10.Use of deep learning model for paediatric elbow radiograph binomial classification: initial experience, performance and lessons learnt.
Mark Bangwei TAN ; Yuezhi Russ CHUA ; Qiao FAN ; Marielle Valerie FORTIER ; Peiqi Pearlly CHANG
Singapore medical journal 2025;66(4):208-214
INTRODUCTION:
In this study, we aimed to compare the performance of a convolutional neural network (CNN)-based deep learning model that was trained on a dataset of normal and abnormal paediatric elbow radiographs with that of paediatric emergency department (ED) physicians on a binomial classification task.
METHODS:
A total of 1,314 paediatric elbow lateral radiographs (patient mean age 8.2 years) were retrospectively retrieved and classified based on annotation as normal or abnormal (with pathology). They were then randomly partitioned to a development set (993 images); first and second tuning (validation) sets (109 and 100 images, respectively); and a test set (112 images). An artificial intelligence (AI) model was trained on the development set using the EfficientNet B1 network architecture. Its performance on the test set was compared to that of five physicians (inter-rater agreement: fair). Performance of the AI model and the physician group was tested using McNemar test.
RESULTS:
The accuracy of the AI model on the test set was 80.4% (95% confidence interval [CI] 71.8%-87.3%), and the area under the receiver operating characteristic curve (AUROC) was 0.872 (95% CI 0.831-0.947). The performance of the AI model vs. the physician group on the test set was: sensitivity 79.0% (95% CI: 68.4%-89.5%) vs. 64.9% (95% CI: 52.5%-77.3%; P = 0.088); and specificity 81.8% (95% CI: 71.6%-92.0%) vs. 87.3% (95% CI: 78.5%-96.1%; P = 0.439).
CONCLUSION
The AI model showed good AUROC values and higher sensitivity, with the P-value at nominal significance when compared to the clinician group.
Humans
;
Deep Learning
;
Child
;
Retrospective Studies
;
Male
;
Female
;
Radiography/methods*
;
ROC Curve
;
Elbow/diagnostic imaging*
;
Neural Networks, Computer
;
Child, Preschool
;
Elbow Joint/diagnostic imaging*
;
Emergency Service, Hospital
;
Adolescent
;
Infant
;
Artificial Intelligence


Result Analysis
Print
Save
E-mail