1.Performance of Digital Mammography-Based Artificial Intelligence Computer-Aided Diagnosis on Synthetic Mammography From Digital Breast Tomosynthesis
Kyung Eun LEE ; Sung Eun SONG ; Kyu Ran CHO ; Min Sun BAE ; Bo Kyoung SEO ; Soo-Yeon KIM ; Ok Hee WOO
Korean Journal of Radiology 2025;26(3):217-229
Objective:
To test the performance of an artificial intelligence-based computer-aided diagnosis (AI-CAD) designed for fullfield digital mammography (FFDM) when applied to synthetic mammography (SM).
Materials and Methods:
We analyzed 501 women (mean age, 57 ± 11 years) who underwent preoperative mammography and breast cancer surgery. This cohort consisted of 1002 breasts, comprising 517 with cancer and 485 without. All patients underwent digital breast tomosynthesis (DBT) and FFDM during the preoperative workup. The SM is routinely reconstructed using DBT. Commercial AI-CAD (Lunit Insight MMG, version 1.1.7.2) was retrospectively applied to SM and FFDM to calculate the abnormality scores for each breast. The median abnormality scores were compared for the 517 breasts with cancer using the Wilcoxon signed-rank test. Calibration curves of abnormality scores were evaluated. The discrimination performance was analyzed using the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity using a 10% preset threshold. Sensitivity and specificity were further analyzed according to the mammographic and pathological characteristics.The results of SM and FFDM were compared.
Results:
AI-CAD demonstrated a significantly lower median abnormality score (71% vs. 96%, P < 0.001) and poorer calibration performance for SM than for FFDM. SM exhibited lower sensitivity (76.2% vs. 82.8%, P < 0.001), higher specificity (95.5% vs.91.8%, P < 0.001), and comparable AUC (0.86 vs. 0.87, P = 0.127) than FFDM. SM showed lower sensitivity than FFDM in asymptomatic breasts, dense breasts, ductal carcinoma in situ, T1, N0, and hormone receptor-positive/human epidermal growth factor receptor 2-negative cancers but showed higher specificity in non-cancerous dense breasts.
Conclusion
AI-CAD showed lower abnormality scores and reduced calibration performance for SM than for FFDM.Furthermore, the 10% preset threshold resulted in different discrimination performances for the SM. Given these limitations, off-label application of the current AI-CAD to SM should be avoided.
2.Characteristics and outcomes of portal vein thrombosis in patients with inflammatory bowel disease in Korea
Ki Jin KIM ; Su-Bin SONG ; Jung-Bin PARK ; June Hwa BAE ; Ji Eun BAEK ; Ga Hee KIM ; Min-Jun KIM ; Seung Wook HONG ; Sung Wook HWANG ; Dong-Hoon YANG ; Byong Duk YE ; Jeong-Sik BYEON ; Seung-Jae MYUNG ; Suk-Kyun YANG ; Chang Sik YU ; Yong-Sik YOON ; Jong-Lyul LEE ; Min Hyun KIM ; Ho-Su LEE ; Sang Hyoung PARK
The Korean Journal of Internal Medicine 2025;40(2):243-250
Background/Aims:
Portal vein thrombosis (PVT) frequently occurs in patients with inflammatory bowel disease (IBD), particularly when influenced by factors such as abdominal infections, IBD flare-ups, or surgical procedures. The implications of PVT range from immediate issues such as intestinal ischemia to long-term concerns including portal hypertension and its complications. However, there is a notable gap in comprehensive studies on PVT in IBD, especially with the increasing incidence of IBD in Asia. This research aimed to evaluate the clinical features and outcomes of PVT in patients with IBD at a leading hospital in South Korea.
Methods:
This retrospective analysis reviewed adult patients diagnosed with both IBD and PVT from 1989 to 2021 at a renowned South Korean medical center. The study focused on patient characteristics, specifics of PVT, administered treatments, and outcomes, all confirmed through enhanced CT scans.
Results:
A total of 78 patients met the study’s criteria. Notably, only 20.5% (16/78) were treated with oral anticoagulants; however, a vast majority (96.2%; 75/78) achieved complete radiographic resolution (CRR). When comparing patients receiving anticoagulants to those who did not, a significant preference for anticoagulant use was observed in cases where the main portal vein was affected, as opposed to just the left or right veins (p = 0.006). However, multivariable analysis indicated that neither anticoagulant use nor previous surgeries significantly impacted CRR.
Conclusions
Patients with IBD and PVT generally had favorable outcomes, regardless of anticoagulant use.
3.Performance of Digital Mammography-Based Artificial Intelligence Computer-Aided Diagnosis on Synthetic Mammography From Digital Breast Tomosynthesis
Kyung Eun LEE ; Sung Eun SONG ; Kyu Ran CHO ; Min Sun BAE ; Bo Kyoung SEO ; Soo-Yeon KIM ; Ok Hee WOO
Korean Journal of Radiology 2025;26(3):217-229
Objective:
To test the performance of an artificial intelligence-based computer-aided diagnosis (AI-CAD) designed for fullfield digital mammography (FFDM) when applied to synthetic mammography (SM).
Materials and Methods:
We analyzed 501 women (mean age, 57 ± 11 years) who underwent preoperative mammography and breast cancer surgery. This cohort consisted of 1002 breasts, comprising 517 with cancer and 485 without. All patients underwent digital breast tomosynthesis (DBT) and FFDM during the preoperative workup. The SM is routinely reconstructed using DBT. Commercial AI-CAD (Lunit Insight MMG, version 1.1.7.2) was retrospectively applied to SM and FFDM to calculate the abnormality scores for each breast. The median abnormality scores were compared for the 517 breasts with cancer using the Wilcoxon signed-rank test. Calibration curves of abnormality scores were evaluated. The discrimination performance was analyzed using the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity using a 10% preset threshold. Sensitivity and specificity were further analyzed according to the mammographic and pathological characteristics.The results of SM and FFDM were compared.
Results:
AI-CAD demonstrated a significantly lower median abnormality score (71% vs. 96%, P < 0.001) and poorer calibration performance for SM than for FFDM. SM exhibited lower sensitivity (76.2% vs. 82.8%, P < 0.001), higher specificity (95.5% vs.91.8%, P < 0.001), and comparable AUC (0.86 vs. 0.87, P = 0.127) than FFDM. SM showed lower sensitivity than FFDM in asymptomatic breasts, dense breasts, ductal carcinoma in situ, T1, N0, and hormone receptor-positive/human epidermal growth factor receptor 2-negative cancers but showed higher specificity in non-cancerous dense breasts.
Conclusion
AI-CAD showed lower abnormality scores and reduced calibration performance for SM than for FFDM.Furthermore, the 10% preset threshold resulted in different discrimination performances for the SM. Given these limitations, off-label application of the current AI-CAD to SM should be avoided.
4.Characteristics and outcomes of portal vein thrombosis in patients with inflammatory bowel disease in Korea
Ki Jin KIM ; Su-Bin SONG ; Jung-Bin PARK ; June Hwa BAE ; Ji Eun BAEK ; Ga Hee KIM ; Min-Jun KIM ; Seung Wook HONG ; Sung Wook HWANG ; Dong-Hoon YANG ; Byong Duk YE ; Jeong-Sik BYEON ; Seung-Jae MYUNG ; Suk-Kyun YANG ; Chang Sik YU ; Yong-Sik YOON ; Jong-Lyul LEE ; Min Hyun KIM ; Ho-Su LEE ; Sang Hyoung PARK
The Korean Journal of Internal Medicine 2025;40(2):243-250
Background/Aims:
Portal vein thrombosis (PVT) frequently occurs in patients with inflammatory bowel disease (IBD), particularly when influenced by factors such as abdominal infections, IBD flare-ups, or surgical procedures. The implications of PVT range from immediate issues such as intestinal ischemia to long-term concerns including portal hypertension and its complications. However, there is a notable gap in comprehensive studies on PVT in IBD, especially with the increasing incidence of IBD in Asia. This research aimed to evaluate the clinical features and outcomes of PVT in patients with IBD at a leading hospital in South Korea.
Methods:
This retrospective analysis reviewed adult patients diagnosed with both IBD and PVT from 1989 to 2021 at a renowned South Korean medical center. The study focused on patient characteristics, specifics of PVT, administered treatments, and outcomes, all confirmed through enhanced CT scans.
Results:
A total of 78 patients met the study’s criteria. Notably, only 20.5% (16/78) were treated with oral anticoagulants; however, a vast majority (96.2%; 75/78) achieved complete radiographic resolution (CRR). When comparing patients receiving anticoagulants to those who did not, a significant preference for anticoagulant use was observed in cases where the main portal vein was affected, as opposed to just the left or right veins (p = 0.006). However, multivariable analysis indicated that neither anticoagulant use nor previous surgeries significantly impacted CRR.
Conclusions
Patients with IBD and PVT generally had favorable outcomes, regardless of anticoagulant use.
5.Consensus Statements on Tinnitus Assessment and Treatment Outcome Evaluation: A Delphi Study by the Korean Tinnitus Study Group
Oak-Sung CHOO ; Jung Mee PARK ; Euyhyun PARK ; Jiwon CHANG ; Min Young LEE ; Ho Yun LEE ; In Seok MOON ; Jae-Jun SONG ; Kyu-Yup LEE ; Jae-Jin SONG ; Eui-Cheol NAM ; Shi Nae PARK ; Hyun Joon SHIM ; Yoon Chan RAH ; Jae-Hyun SEO
Journal of Korean Medical Science 2025;40(7):e93-
Background:
Tinnitus is a multifactorial condition with no universally accepted assessment guidelines. The Korean Tinnitus Study Group previously established consensus statements on the definition, classification, and diagnostic tests for tinnitus. As a continuation of this effort, this study aims to establish expert consensus on tinnitus assessment and treatment outcome evaluation, specifically tailored to the Korean clinical context.
Methods:
A modified Delphi method involving 26 otology experts from across Korea was used. A two-round Delphi survey was conducted to evaluate statements related to tinnitus assessment before and after treatment. Statements were rated on a scale of 1 to 9 for the level of agreement. Consensus was defined as ≥ 70% agreement (score of 7–9) and ≤ 15% disagreement (score of 1–3). Statistical measures such as content validity ratio and Kendall’s coefficient of concordance (W) were calculated to assess agreement levels.
Results:
Of the 46 assessment-related statements, 17 (37%) reached consensus, though overall pre-treatment assessments showed weak agreement (Kendall’s W = 0.319). Key areas of agreement included the use of the visual analogue scale, numeric rating scale, and validated questionnaires for pre-treatment evaluation. Five statements, such as the use of computed tomography, magnetic resonance imaging, and angiography for diagnosing pulsatile tinnitus, achieved over 90% agreement. For treatment outcome measurements, 8 of 12 statements (67%) reached a consensus, with moderate agreement (Kendall’s W = 0.513). Validated questionnaires and psychoacoustic tests were recommended for evaluating treatment effects within 12 weeks. While standardized imaging for pulsatile tinnitus and additional clinical tests were strongly recommended, full consensus was not achieved across all imaging modalities.
Conclusion
This study provides actionable recommendations for tinnitus assessment and treatment evaluation, emphasizing the use of standardized tools and individualized approaches based on patient needs. These findings offer a practical framework to enhance consistency and effectiveness in tinnitus management within Korean clinical settings.
6.Explainability Enhanced Machine Learning Model for Classifying Intellectual Disability and AttentionDeficit/Hyperactivity Disorder With Psychological Test Reports
Tong Min KIM ; Young-Hoon KIM ; Sung-Hee SONG ; In-Young CHOI ; Dai-Jin KIM ; Taehoon KO
Journal of Korean Medical Science 2025;40(11):e26-
Background:
Psychological test reports are essential in assessing intellectual functioning, aiding in diagnosing and treating intellectual disability (ID) and attention-deficit/ hyperactivity disorder (ADHD). However, these reports can have several problems because they are diverse, unstructured, subjective, and involve human errors. Additionally, physicians often do not read the entire report, and the number of reports is lower than that of diagnoses.
Methods:
We developed explainable predictive models for classifying IDs and ADHDs based on written reports to address these issues. The reports of 1,475 patients with IDs and ADHDs who underwent intelligence tests were used for the models. These models were developed by analyzing reports using natural language processing (NLP) and incorporating the physician’s diagnosis for each report. We selected n-gram features from the models’ results by extracting important features using SHapley Additive exPlanations and permutation importance to make the models explainable. Developing the n-gram feature-based original text search system compensated for the lack of human readability caused by NLP and enabled the reconstruction of human-readable texts from the selected n-gram features.
Results:
The maximum model accuracy was 0.92, and the 80 human-readable texts were restored from four models.
Conclusion
The results showed that the models could accurately classify IDs and ADHDs, even with a few reports. The models were also able to explain their predictions. The explainability-enhanced model can help physicians understand the classification process of IDs and ADHDs and provide evidence-based insights.
7.Consensus Statements on Tinnitus Assessment and Treatment Outcome Evaluation: A Delphi Study by the Korean Tinnitus Study Group
Oak-Sung CHOO ; Jung Mee PARK ; Euyhyun PARK ; Jiwon CHANG ; Min Young LEE ; Ho Yun LEE ; In Seok MOON ; Jae-Jun SONG ; Kyu-Yup LEE ; Jae-Jin SONG ; Eui-Cheol NAM ; Shi Nae PARK ; Hyun Joon SHIM ; Yoon Chan RAH ; Jae-Hyun SEO
Journal of Korean Medical Science 2025;40(7):e93-
Background:
Tinnitus is a multifactorial condition with no universally accepted assessment guidelines. The Korean Tinnitus Study Group previously established consensus statements on the definition, classification, and diagnostic tests for tinnitus. As a continuation of this effort, this study aims to establish expert consensus on tinnitus assessment and treatment outcome evaluation, specifically tailored to the Korean clinical context.
Methods:
A modified Delphi method involving 26 otology experts from across Korea was used. A two-round Delphi survey was conducted to evaluate statements related to tinnitus assessment before and after treatment. Statements were rated on a scale of 1 to 9 for the level of agreement. Consensus was defined as ≥ 70% agreement (score of 7–9) and ≤ 15% disagreement (score of 1–3). Statistical measures such as content validity ratio and Kendall’s coefficient of concordance (W) were calculated to assess agreement levels.
Results:
Of the 46 assessment-related statements, 17 (37%) reached consensus, though overall pre-treatment assessments showed weak agreement (Kendall’s W = 0.319). Key areas of agreement included the use of the visual analogue scale, numeric rating scale, and validated questionnaires for pre-treatment evaluation. Five statements, such as the use of computed tomography, magnetic resonance imaging, and angiography for diagnosing pulsatile tinnitus, achieved over 90% agreement. For treatment outcome measurements, 8 of 12 statements (67%) reached a consensus, with moderate agreement (Kendall’s W = 0.513). Validated questionnaires and psychoacoustic tests were recommended for evaluating treatment effects within 12 weeks. While standardized imaging for pulsatile tinnitus and additional clinical tests were strongly recommended, full consensus was not achieved across all imaging modalities.
Conclusion
This study provides actionable recommendations for tinnitus assessment and treatment evaluation, emphasizing the use of standardized tools and individualized approaches based on patient needs. These findings offer a practical framework to enhance consistency and effectiveness in tinnitus management within Korean clinical settings.
8.Explainability Enhanced Machine Learning Model for Classifying Intellectual Disability and AttentionDeficit/Hyperactivity Disorder With Psychological Test Reports
Tong Min KIM ; Young-Hoon KIM ; Sung-Hee SONG ; In-Young CHOI ; Dai-Jin KIM ; Taehoon KO
Journal of Korean Medical Science 2025;40(11):e26-
Background:
Psychological test reports are essential in assessing intellectual functioning, aiding in diagnosing and treating intellectual disability (ID) and attention-deficit/ hyperactivity disorder (ADHD). However, these reports can have several problems because they are diverse, unstructured, subjective, and involve human errors. Additionally, physicians often do not read the entire report, and the number of reports is lower than that of diagnoses.
Methods:
We developed explainable predictive models for classifying IDs and ADHDs based on written reports to address these issues. The reports of 1,475 patients with IDs and ADHDs who underwent intelligence tests were used for the models. These models were developed by analyzing reports using natural language processing (NLP) and incorporating the physician’s diagnosis for each report. We selected n-gram features from the models’ results by extracting important features using SHapley Additive exPlanations and permutation importance to make the models explainable. Developing the n-gram feature-based original text search system compensated for the lack of human readability caused by NLP and enabled the reconstruction of human-readable texts from the selected n-gram features.
Results:
The maximum model accuracy was 0.92, and the 80 human-readable texts were restored from four models.
Conclusion
The results showed that the models could accurately classify IDs and ADHDs, even with a few reports. The models were also able to explain their predictions. The explainability-enhanced model can help physicians understand the classification process of IDs and ADHDs and provide evidence-based insights.
9.Erratum: Korean Gastric Cancer Association-Led Nationwide Survey on Surgically Treated Gastric Cancers in 2023
Dong Jin KIM ; Jeong Ho SONG ; Ji-Hyeon PARK ; Sojung KIM ; Sin Hye PARK ; Cheol Min SHIN ; Yoonjin KWAK ; Kyunghye BANG ; Chung-sik GONG ; Sung Eun OH ; Yoo Min KIM ; Young Suk PARK ; Jeesun KIM ; Ji Eun JUNG ; Mi Ran JUNG ; Bang Wool EOM ; Ki Bum PARK ; Jae Hun CHUNG ; Sang-Il LEE ; Young-Gil SON ; Dae Hoon KIM ; Sang Hyuk SEO ; Sejin LEE ; Won Jun SEO ; Dong Jin PARK ; Yoonhong KIM ; Jin-Jo KIM ; Ki Bum PARK ; In CHO ; Hye Seong AHN ; Sung Jin OH ; Ju-Hee LEE ; Hayemin LEE ; Seong Chan GONG ; Changin CHOI ; Ji-Ho PARK ; Eun Young KIM ; Chang Min LEE ; Jong Hyuk YUN ; Seung Jong OH ; Eunju LEE ; Seong-A JEONG ; Jung-Min BAE ; Jae-Seok MIN ; Hyun-dong CHAE ; Sung Gon KIM ; Daegeun PARK ; Dong Baek KANG ; Hogoon KIM ; Seung Soo LEE ; Sung Il CHOI ; Seong Ho HWANG ; Su-Mi KIM ; Moon Soo LEE ; Sang Hyun KIM ; Sang-Ho JEONG ; Yusung YANG ; Yonghae BAIK ; Sang Soo EOM ; Inho JEONG ; Yoon Ju JUNG ; Jong-Min PARK ; Jin Won LEE ; Jungjai PARK ; Ki Han KIM ; Kyung-Goo LEE ; Jeongyeon LEE ; Seongil OH ; Ji Hun PARK ; Jong Won KIM ;
Journal of Gastric Cancer 2025;25(2):400-402
10.Prospective Multicenter Observational Study on Postoperative Quality of Life According to Type of Gastrectomy for Gastric Cancer
Sung Eun OH ; Yun-Suhk SUH ; Ji Yeong AN ; Keun Won RYU ; In CHO ; Sung Geun KIM ; Ji-Ho PARK ; Hoon HUR ; Hyung-Ho KIM ; Sang-Hoon AHN ; Sun-Hwi HWANG ; Hong Man YOON ; Ki Bum PARK ; Hyoung-Il KIM ; In Gyu KWON ; Han-Kwang YANG ; Byoung-Jo SUH ; Sang-Ho JEONG ; Tae-Han KIM ; Oh Kyoung KWON ; Hye Seong AHN ; Ji Yeon PARK ; Ki Young YOON ; Myoung Won SON ; Seong-Ho KONG ; Young-Gil SON ; Geum Jong SONG ; Jong Hyuk YUN ; Jung-Min BAE ; Do Joong PARK ; Sol LEE ; Jun-Young YANG ; Kyung Won SEO ; You-Jin JANG ; So Hyun KANG ; Bang Wool EOM ; Joongyub LEE ; Hyuk-Joon LEE ;
Journal of Gastric Cancer 2025;25(2):382-399
Purpose:
This study evaluated the postoperative quality of life (QoL) after various types of gastrectomy for gastric cancer.
Materials and Methods:
A multicenter prospective observational study was conducted in Korea using the Korean Quality of Life in Stomach Cancer Patients Study (KOQUSS)-40, a new QoL assessment tool focusing on postgastrectomy syndrome. Overall, 496 patients with gastric cancer were enrolled, and QoL was assessed at 5 time points: preoperatively and at 1, 3, 6, and 12 months after surgery.
Results:
Distal gastrectomy (DG) and pylorus-preserving gastrectomy (PPG) showed significantly better outcomes than total gastrectomy (TG) and proximal gastrectomy (PG) with regard to total score, indigestion, and dysphagia. DG, PPG, and TG also showed significantly better outcomes than PG in terms of dumping syndrome and worry about cancer. Postoperative QoL did not differ significantly according to anastomosis type in DG, except for Billroth I anastomosis, which achieved better bowel habit change scores than the others. No domains differed significantly when comparing double tract reconstruction and esophagogastrostomy after PG. The total QoL score correlated significantly with postoperative body weight loss (more than 10%) and extent of resection (P<0.05 for both).Reflux as assessed by KOQUSS-40 did not correlate significantly with reflux observed on gastroscopy 1 year postoperatively (P=0.064).
Conclusions
Our prospective observation using KOQUSS-40 revealed that DG and PPG lead to better QoL than TG and PG. Further study is needed to compare postoperative QoL according to anastomosis type in DG and PG.

Result Analysis
Print
Save
E-mail