1.Study Protocol of Expanded Multicenter Prospective Cohort Study of Active Surveillance on Papillary Thyroid Microcarcinoma (MAeSTro-EXP)
Jae Hoon MOON ; Eun Kyung LEE ; Wonjae CHA ; Young Jun CHAI ; Sun Wook CHO ; June Young CHOI ; Sung Yong CHOI ; A Jung CHU ; Eun-Jae CHUNG ; Yul HWANGBO ; Woo-Jin JEONG ; Yuh-Seog JUNG ; Kyungsik KIM ; Min Joo KIM ; Su-jin KIM ; Woochul KIM ; Yoo Hyung KIM ; Chang Yoon LEE ; Ji Ye LEE ; Kyu Eun LEE ; Young Ki LEE ; Hunjong LIM ; Do Joon PARK ; Sue K. PARK ; Chang Hwan RYU ; Junsun RYU ; Jungirl SEOK ; Young Shin SONG ; Ka Hee YI ; Hyeong Won YU ; Eleanor WHITE ; Katerina MASTROCOSTAS ; Roderick J. CLIFTON-BLIGH ; Anthony GLOVER ; Matti L. GILD ; Ji-hoon KIM ; Young Joo PARK
Endocrinology and Metabolism 2025;40(2):236-246
Background:
Active surveillance (AS) has emerged as a viable management strategy for low-risk papillary thyroid microcarcinoma (PTMC), following pioneering trials at Kuma Hospital and the Cancer Institute Hospital in Japan. Numerous prospective cohort studies have since validated AS as a management option for low-risk PTMC, leading to its inclusion in thyroid cancer guidelines across various countries. From 2016 to 2020, the Multicenter Prospective Cohort Study of Active Surveillance on Papillary Thyroid Microcarcinoma (MAeSTro) enrolled 1,177 patients, providing comprehensive data on PTMC progression, sonographic predictors of progression, quality of life, surgical outcomes, and cost-effectiveness when comparing AS to immediate surgery. The second phase of MAeSTro (MAeSTro-EXP) expands AS to low-risk papillary thyroid carcinoma (PTC) tumors larger than 1 cm, driven by the hypothesis that overall risk assessment outweighs absolute tumor size in surgical decision-making.
Methods:
This protocol aims to address whether limiting AS to tumors smaller than 1 cm may result in unnecessary surgeries for low-risk PTCs detected during their rapid initial growth phase. By expanding the AS criteria to include tumors up to 1.5 cm, while simultaneously refining and standardizing the criteria for risk assessment and disease progression, we aim to minimize overtreatment and maintain rigorous monitoring to improve patient outcomes.
Conclusion
This study will contribute to optimizing AS guidelines and enhance our understanding of the natural course and appropriate management of low-risk PTCs. Additionally, MAeSTro-EXP involves a multinational collaboration between South Korea and Australia. This cross-country study aims to identify cultural and racial differences in the management of low-risk PTC, thereby enriching the global understanding of AS practices and their applicability across diverse populations.
2.Deep Learning Technology for Classification of Thyroid Nodules Using Multi-View Ultrasound Images: Potential Benefits and Challenges in Clinical Application
Jinyoung KIM ; Min-Hee KIM ; Dong-Jun LIM ; Hankyeol LEE ; Jae Jun LEE ; Hyuk-Sang KWON ; Mee Kyoung KIM ; Ki-Ho SONG ; Tae-Jung KIM ; So Lyung JUNG ; Yong Oh LEE ; Ki-Hyun BAEK
Endocrinology and Metabolism 2025;40(2):216-224
Background:
This study aimed to evaluate the applicability of deep learning technology to thyroid ultrasound images for classification of thyroid nodules.
Methods:
This retrospective analysis included ultrasound images of patients with thyroid nodules investigated by fine-needle aspiration at the thyroid clinic of a single center from April 2010 to September 2012. Thyroid nodules with cytopathologic results of Bethesda category V (suspicious for malignancy) or VI (malignant) were defined as thyroid cancer. Multiple deep learning algorithms based on convolutional neural networks (CNNs), namely ResNet, DenseNet, and EfficientNet, were utilized, and Siamese neural networks facilitated multi-view analysis of paired transverse and longitudinal ultrasound images.
Results:
Among 1,048 analyzed thyroid nodules from 943 patients, 306 (29%) were identified as thyroid cancer. In a subgroup analysis of transverse and longitudinal images, longitudinal images showed superior prediction ability. Multi-view modeling, based on paired transverse and longitudinal images, significantly improved model performance, with an accuracy of 0.82 (95% confidence interval [CI], 0.80 to 0.86) with ResNet50, 0.83 (95% CI, 0.83 to 0.88) with DenseNet201, and 0.81 (95% CI, 0.79 to 0.84) with EfficientNetv2_s. Training with high-resolution images obtained using the latest equipment tended to improve model performance in association with increased sensitivity.
Conclusion
CNN algorithms applied to ultrasound images demonstrated substantial accuracy in thyroid nodule classification, indicating their potential as valuable tools for diagnosing thyroid cancer. However, in real-world clinical settings, it is important to be aware that model performance may vary depending on the quality of images acquired by different physicians and imaging devices.
3.Unveiling Risk Factors for Treatment Failure in Patients with Graves’ Disease: A Nationwide Cohort Study in Korea
Jung A KIM ; Kyeong Jin KIM ; Jimi CHOI ; Kyoung Jin KIM ; Eyun SONG ; Ji Hee YU ; Nam Hoon KIM ; Hye Jin YOO ; Ji A SEO ; Nan Hee KIM ; Kyung Mook CHOI ; Sei Hyun BAIK ; Sin Gon KIM
Endocrinology and Metabolism 2025;40(1):125-134
Background:
Antithyroid drug (ATD) treatment is the preferred initial treatment for Graves’ disease (GD) in South Korea, despite higher treatment failure rates than radioactive iodine (RAI) therapy or thyroidectomy. This study aimed to evaluate the incidence of treatment failure associated with the primary modalities for GD treatment in real-world practice.
Methods:
We included 452,001 patients diagnosed with GD between 2004 and 2020 from the Korean National Health Insurance Service-National Health Information Database. Treatment failure was defined as switching from ATD, RAI, or thyroidectomy treatments, and for ATD specifically, inability to discontinue medication for over 2 years.
Results:
Mean age was 46.2 years, with females constituting 70.8%. Initial treatments for GD included ATDs (98.0%), thyroidectomy (1.3%), and RAI (0.7%), with ATD use increasing from 96.2% in 2004 to 98.8% in 2020. During a median follow-up of 8.5 years, the treatment failure rates were 58.5% for ATDs, 21.3% for RAI, and 2.1% for thyroidectomy. Multivariate analysis indicated a hazard ratio of 2.81 for treatment failure with ATD relative to RAI. RAI treatments ≥10 mCi had 37% lower failure rates than doses <10 mCi.
Conclusion
ATDs are the most commonly used treatment for GD in South Korea, followed by thyroidectomy and RAI. Although the risk of treatment failure for ATD is higher than that of RAI therapy, initial RAI treatment in South Korea is relatively limited compared to that in Western countries. Further studies are required to evaluate the cause of low initial RAI treatment rates in South Korea.
4.Development of a Long-Acting Follicle-Stimulating Hormone Using Serum Albumin Fab-Associated Technology for Female Infertility
Daham KIM ; Yoon Hee CHO ; Min Jeong KANG ; So Jeong LEE ; Soohyun LEE ; Bo Hyon YUN ; Hyunjin CHI ; Jeongsuk AN ; Kyungsun LEE ; Jaekyu HAN ; Susan CHI ; Moo Young SONG ; Sang-Hoon CHA ; Eun Jig LEE
Endocrinology and Metabolism 2025;40(1):146-155
Background:
Recombinant human follicle-stimulating hormone (rhFSH) is commonly used to treat female infertility, but its short half-life necessitates multiple doses. Even corifollitropin alfa, with an extended half-life, requires supplementary injections of rhFSH after 7 days. This study aimed to develop and evaluate a long-acting follicle-stimulating hormone (FSH) formulation using anti-serum albumin Fab-associated (SAFA) technology to avoid additional injections and enhance ovarian function.
Methods:
SAFA-FSH was synthesized using a Chinese hamster ovary expression system. Its biological efficacy was confirmed through assays measuring its ability to stimulate cyclic adenosine monophosphate (cAMP) production, estradiol synthesis, and the expression of human cytochrome P450 family 19 subfamily A member 1 (hCYP19α1) and human steroidogenic acute regulatory protein (hSTAR) in human ovarian granulosa (KGN) cells. To evaluate the effects of SAFA-FSH, we compared its impact on serum estradiol levels and ovarian weight increase with that of rhFSH in Sprague-Dawley (SD) rats using the modified Steelman-Pohley test.
Results:
The results indicated that SAFA-FSH induces cAMP synthesis in KGN cells and upregulates the expression of hCYP19α1 and hSTAR in a dose-dependent manner. Female SD rats, aged 21 days, receiving daily subcutaneous human chorionic gonadotropin injections for 5 days exhibited a significant increase in serum estradiol levels and ovarian weight when administered SAFA-FSH on the first day or when given nine injections of rhFSH over 5 days. Notably, the group receiving SAFA-FSH on the first and third days demonstrated an even greater rise in serum estradiol levels and ovarian weight.
Conclusion
These findings suggest that SAFA-FSH presents a promising alternative to current rhFSH treatments for female infertility. However, further research is essential to thoroughly assess its safety and efficacy in clinical contexts.
5.Artificial Intelligence Models May Aid in Predicting Lymph Node Metastasis in Patients with T1 Colorectal Cancer
Ji Eun BAEK ; Hahn YI ; Seung Wook HONG ; Subin SONG ; Ji Young LEE ; Sung Wook HWANG ; Sang Hyoung PARK ; Dong-Hoon YANG ; Byong Duk YE ; Seung-Jae MYUNG ; Suk-Kyun YANG ; Namkug KIM ; Jeong-Sik BYEON
Gut and Liver 2025;19(1):69-76
Background/Aims:
Inaccurate prediction of lymph node metastasis (LNM) may lead to unnecessary surgery following endoscopic resection of T1 colorectal cancer (CRC). We aimed to validate the usefulness of artificial intelligence (AI) models for predicting LNM in patients with T1 CRC.
Methods:
We analyzed the clinical data, laboratory results, pathological reports, and endoscopic findings of patients who underwent radical surgery for T1 CRC. We developed AI models to predict LNM using four algorithms: regularized logistic regression classifier (RLRC), random forest classifier (RFC), CatBoost classifier (CBC), and the voting classifier (VC). Four histological factors and four endoscopic findings were included to develop AI models. Areas under the receiver operating characteristics curves (AUROCs) were measured to distinguish AI model performance in accordance with the Japanese Society for Cancer of the Colon and Rectum guidelines.
Results:
Among 1,386 patients with T1 CRC, 173 patients (12.5%) had LNM. The AUROC values of the RLRC, RFC, CBC, and VC models for LNM prediction were significantly higher (0.673, 0.640, 0.679, and 0.677, respectively) than the 0.525 obtained in accordance with the Japanese Society for Cancer of the Colon and Rectum guidelines (vs RLRC, p<0.001; vs RFC, p=0.001; vs CBC, p<0.001; vs VC, p<0.001). The AUROC value was similar between T1 colon and T1 rectal cancers (0.718 vs 0.615, p=0.700). The AUROC value was also similar between the initial endoscopic resection and initial surgery groups (0.581 vs 0.746, p=0.845).
Conclusions
AI models trained on the basis of endoscopic findings and pathological features performed well in predicting LNM in patients with T1 CRC regardless of tumor location and initial treatment method.
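The AUROC values reported above can be read as the probability that a randomly chosen patient with LNM receives a higher predicted risk than a randomly chosen patient without LNM. The following is a minimal sketch of that pairwise definition in Python; the function name `auroc` and the toy scores and labels are hypothetical illustrations, not the study's data or code.

```python
def auroc(scores, labels):
    """AUROC as the fraction of (positive, negative) pairs in which the
    positive case has the higher score; tied pairs count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: predicted LNM risk and true LNM status (1 = LNM).
toy_scores = [0.9, 0.4, 0.35, 0.8, 0.2, 0.1]
toy_labels = [1, 0, 1, 1, 0, 0]
toy_auc = auroc(toy_scores, toy_labels)
print(f"toy AUROC = {toy_auc:.3f}")
```

On this scale, 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is the frame in which the reported values of 0.525 and 0.640 to 0.679 are compared.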
6.Assessing the Validity of the AASLD Surgical Treatment Algorithm in Patients with Early-Stage Hepatocellular Carcinoma
Aryoung KIM ; Byeong Geun SONG ; Wonseok KANG ; Geum-Youn GWAK ; Yong-Han PAIK ; Moon Seok CHOI ; Joon Hyeok LEE ; Myung Ji GOH ; Dong Hyun SINN
Gut and Liver 2025;19(2):265-274
Background/Aims:
The aim of this study was to investigate the effect of a surgical treatment algorithm recently proposed by the American Association for the Study of Liver Diseases (AASLD) on survival outcomes in patients with early-stage hepatocellular carcinoma (HCC) and identify effective alternative treatment modalities when liver transplantation (LT) is not available.
Methods:
We studied the clinical data of 1,442 patients who were diagnosed with early-stage HCC (a single lesion measuring 2–5 cm in size or 2 to 3 lesions measuring ≤3 cm in size) between 2013 and 2018 and classified as Child-Turcotte-Pugh (CTP) A or B. Analyses were separately performed for individuals recommended for resection (single lesion, CTP A and no clinically significant portal hypertension) and those recommended for LT (single lesion with impaired liver function such as CTP B or clinically significant portal hypertension or multiple lesions).
Results:
Of 791 patients recommended for surgical resection, 85.8% underwent resection. The 5-year survival rate was higher for patients who underwent surgical resection than for those who received other treatments (89.4% vs 72.3%). Among 651 patients recommended for LT, only 3.4% underwent the procedure. The most common alternative treatment modalities were transarterial therapy (39.3%) followed by resection (28.9%) and ablation (27.8%). The overall survival rate associated with transarterial therapy was lower than that for resection and ablation, whereas those of the latter two treatments were comparable.
Conclusions
The survival outcomes of treatment strategies that most closely aligned with the algorithm proposed by the AASLD were superior to those of alternative treatment approaches. However, LT in patients with early-stage HCC can be challenging. When LT is not feasible, resection and ablation can be considered first-line alternative options.
7.Anxiety and Depression Are Associated with Poor Long-term Quality of Life in Moderate-to-Severe Ulcerative Colitis: Results of a 3-Year Longitudinal Study of the MOSAIK Cohort
Shin Ju OH ; Chang Hwan CHOI ; Sung-Ae JUNG ; Geun Am SONG ; Yoon Jae KIM ; Ja Seol KOO ; Sung Jae SHIN ; Geom Seog SEO ; Kang-Moon LEE ; Byung Ik JANG ; Eun Suk JUNG ; Youngdoe KIM ; Chang Kyun LEE
Gut and Liver 2025;19(2):253-264
Background/Aims:
We previously reported that patients with moderate-to-severe ulcerative colitis (UC) often experience common mental disorders (CMDs) such as anxiety and depression, necessitating immediate psychological interventions within the first 4 weeks of diagnosis. In this 3-year follow-up study of the MOSAIK cohort in Korea, we examined the effects of CMDs at initial diagnosis on clinical outcomes and health-related quality of life (HRQoL).
Methods:
We examined differences in clinical outcomes (evaluated based on clinical response, relapse, hospitalization, and medication use) and HRQoL (assessed using the Inflammatory Bowel Disease Questionnaire [IBDQ] and Short Form 12 [SF-12]) according to Hospital Anxiety and Depression Scale (HADS) scores at diagnosis.
Results:
In a study involving 199 UC patients, 47.7% exhibited significant psychological distress (anxiety and/or depression) at diagnosis. Clinical follow-up showed no major differences in outcomes, including remission rates, response rates, or hospitalization rates, between patients with anxiety or depression at diagnosis and patients without anxiety or depression at diagnosis. The HRQoL at the end of follow-up was notably lower in those with baseline CMDs, particularly anxiety, across all domains of the IBDQ and SF-12. Linear mixed-effect models revealed that higher HADS scores, as well as higher Mayo scores, were independently associated with lower IBDQ scores and both summary domains of the SF-12. Additionally, regular attendance at follow-up visits during the study period was also related to improvements in HRQoL (all p<0.05).
Conclusions
While CMDs present at the time of UC diagnosis did not influence long-term clinical outcomes, they persistently impaired HRQoL. Our findings support the routine incorporation of psychological interventions into the long-term management of moderate-to-severe UC.
8.Predicting Mortality and Cirrhosis-Related Complications with MELD3.0: A Multicenter Cohort Analysis
Jihye LIM ; Ji Hoon KIM ; Ahlim LEE ; Ji Won HAN ; Soon Kyu LEE ; Hyun YANG ; Heechul NAM ; Hae Lim LEE ; Do Seon SONG ; Sung Won LEE ; Hee Yeon KIM ; Jung Hyun KWON ; Chang Wook KIM ; U Im CHANG ; Soon Woo NAM ; Seok-Hwan KIM ; Pil Soo SUNG ; Jeong Won JANG ; Si Hyun BAE ; Jong Young CHOI ; Seung Kew YOON ; Myeong Jun SONG
Gut and Liver 2025;19(3):427-437
Background/Aims:
This study aimed to evaluate the performance of the Model for End-Stage Liver Disease (MELD) 3.0 for predicting mortality and liver-related complications compared with the Child-Pugh classification, albumin-bilirubin (ALBI) grade, the MELD, and the MELD sodium (MELDNa) score.
Methods:
We evaluated a multicenter retrospective cohort of patients with cirrhosis enrolled between 2013 and 2019. We compared the area under the receiver operating characteristic curve (AUROC) of the MELD3.0 and other models for predicting 3-month mortality. Additionally, we assessed the risk of cirrhosis-related complications according to the MELD3.0 score.
Results:
A total of 3,314 patients were included. The mean age was 55.9±11.3 years, and 70.2% of the patients were male. Within the initial 3 months, 220 patients (6.6%) died, and the MELD3.0 had the best predictive performance among the tested models, with an AUROC of 0.851, outperforming the Child-Pugh classification, ALBI grade, MELD, and MELDNa. A high MELD3.0 score was associated with an increased risk of mortality. Relative to the group with a MELD3.0 score <10 points, the adjusted hazard ratio for the group with a score of 10–20 points was 2.176, and that for the group with a score of ≥20 points was 4.892. Each 1-point increase in the MELD3.0 score increased the risk of cirrhosis-related complications by 1.033-fold. The risk of hepatorenal syndrome showed the highest increase, with an adjusted hazard ratio of 1.149, followed by hepatic encephalopathy and ascites.
Conclusions
The MELD3.0 demonstrated robust prognostic performance in predicting mortality in patients with cirrhosis. Moreover, the MELD3.0 score was linked to cirrhosis-related complications, particularly those involving kidney function, such as hepatorenal syndrome and ascites.
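A per-point hazard ratio such as the reported 1.033 compounds multiplicatively over larger score differences if the score enters the model linearly on the log-hazard scale. That linearity is an assumption here (it is the usual reading of a per-unit hazard ratio from a Cox model, but the abstract does not state the model form), and the helper name below is hypothetical. A minimal sketch of the arithmetic:

```python
def hazard_ratio_for_increase(hr_per_point, points):
    """Hazard ratio implied by a `points`-unit increase, assuming the
    covariate acts linearly on the log-hazard scale (an assumption)."""
    return hr_per_point ** points


# Reported per-point hazard ratio for cirrhosis-related complications.
HR_PER_POINT = 1.033

hr_10 = hazard_ratio_for_increase(HR_PER_POINT, 10)
print(f"Implied hazard ratio for a 10-point increase: {hr_10:.2f}")
```

Under that assumption, a 10-point difference in MELD3.0 score corresponds to a hazard ratio of roughly 1.38 for complications. This figure is not directly comparable to the categorical mortality hazard ratios (2.176 and 4.892), which refer to a different outcome and to score groups rather than a continuous score.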
9.Korean Registry on the Current Management of Helicobacter pylori (K-Hp-Reg): Interim Analysis of Adherence to the Revised Evidence-Based Guidelines for First-Line Treatment
Hyo-Joon YANG ; Joon Sung KIM ; Ji Yong AHN ; Ok-Jae LEE ; Gwang Ha KIM ; Chang Seok BANG ; Moo In PARK ; Jae Yong PARK ; Sun Moon KIM ; Su Jin HONG ; Joon Hyun CHO ; Shin Hee KIM ; Hyun Joo SONG ; Jin Woong CHO ; Sam Ryong JEE ; Hyun LIM ; Yong Hwan KWON ; Ju Yup LEE ; Seong Woo JEON ; Seon-Young PARK ; Younghee CHOE ; Moon Kyung JOO ; Dae-Hyun KIM ; Jae Myung PARK ; Beom Jin KIM ; Jong Yeul LEE ; Tae Hoon OH ; Jae Gyu KIM ;
Gut and Liver 2025;19(3):364-375
Background/Aims:
The Korean guidelines for Helicobacter pylori treatment were revised in 2020; however, the extent of adherence to these guidelines in clinical practice remains unclear. Herein, we initiated a prospective, nationwide, multicenter registry study in 2021 to evaluate the current management of H. pylori infection in Korea.
Methods:
This interim report describes the adherence to the revised guidelines and their impact on first-line eradication rates. Data on patient demographics, diagnoses, treatments, and eradication outcomes were collected using a web-based electronic case report form.
Results:
A total of 7,261 patients from 66 hospitals who received first-line treatment were analyzed. The modified intention-to-treat eradication rate for first-line treatment was 81.0%, with 80.4% of the prescriptions adhering to the revised guidelines. The most commonly prescribed regimen was the 14-day clarithromycin-based triple therapy (CTT; 42.0%), followed by tailored therapy (TT; 21.2%), 7-day CTT (14.1%), and 10-day concomitant therapy (CT; 10.1%). Time-trend analysis demonstrated significant increases in guideline adherence and the use of 10-day CT and TT, along with a decrease in the use of 7-day CTT (all p<0.001). Multivariate logistic regression analysis revealed that guideline adherence was significantly associated with first-line eradication success (odds ratio, 2.03; 95% confidence interval, 1.61 to 2.56; p<0.001).
Conclusions
The revised guidelines for the treatment of H. pylori infection have been increasingly adopted in routine clinical practice in Korea, which may have contributed to improved first-line eradication rates. Notably, the 14-day CTT, 10-day CT, and TT regimens are emerging as the preferred first-line treatment options among Korean physicians.
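The reported odds ratio and confidence interval (2.03; 1.61 to 2.56) can be checked for internal consistency. The sketch below assumes a Wald-type interval that is symmetric on the log-odds scale with a 1.96 multiplier, which is the common default for logistic regression but is not stated in the abstract; the variable names are illustrative only.

```python
import math

# Values reported in the abstract.
OR_REPORTED = 2.03
CI_LOWER, CI_UPPER = 1.61, 2.56

# Assumption: Wald interval, symmetric on the log scale, z = 1.96.
Z_95 = 1.96

# If the interval is symmetric in log-odds, the point estimate should sit
# at the geometric mean of the two limits.
geo_mean = math.sqrt(CI_LOWER * CI_UPPER)

# Standard error of log(OR) implied by the interval width.
se_log_or = (math.log(CI_UPPER) - math.log(CI_LOWER)) / (2 * Z_95)

# Implied Wald z statistic for the null hypothesis OR = 1.
z_stat = math.log(OR_REPORTED) / se_log_or

print(f"geometric mean of CI limits: {geo_mean:.2f}")
print(f"implied SE of log(OR): {se_log_or:.3f}")
print(f"implied z statistic: {z_stat:.2f}")
```

Under this assumption the geometric mean of the limits matches the point estimate to two decimals, and the implied z statistic is close to 6, in line with the reported p<0.001.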
10.Effect of Helicobacter pylori Eradication on Metabolic Parameters and Body Composition including Skeletal Muscle Mass: A Matched Case-Control Study
Suh Eun BAE ; Kee Don CHOI ; Jaewon CHOE ; Min Jung LEE ; Seonok KIM ; Ji Young CHOI ; Hana PARK ; Jaeil KIM ; Hye Won PARK ; Hye-Sook CHANG ; Hee Kyong NA ; Ji Yong AHN ; Kee Wook JUNG ; Jeong Hoon LEE ; Do Hoon KIM ; Ho June SONG ; Gin Hyug LEE ; Hwoon-Yong JUNG
Gut and Liver 2025;19(3):346-354
Background/Aims:
Findings on the impact of Helicobacter pylori eradication on metabolic parameters are inconsistent. This study aimed to evaluate the effects of H. pylori eradication on metabolic parameters and body composition, including body fat mass and skeletal muscle mass.
Methods:
We retrospectively reviewed the data of asymptomatic patients who underwent health screenings, including bioelectrical impedance analysis, before and after H. pylori eradication between 2005 and 2021. After matching individuals based on key factors, we compared lipid profiles, metabolic parameters, and body composition between 823 patients from the eradicated group and 823 patients from the non-eradicated group.
Results:
Blood pressure, erythrocyte sedimentation rate, and glycated hemoglobin values were significantly lower in the eradicated group than in the non-eradicated group. However, changes in body mass index (BMI), body fat mass, appendicular skeletal muscle mass (ASM), waist circumference, and lipid profiles were not significantly different between the two groups. In a subgroup analysis of individuals aged >45 years, blood pressure, erythrocyte sedimentation rate, and glycated hemoglobin changes were significantly lower in the eradicated group than in the non-eradicated group. BMI values were significantly higher in the eradicated group than in the non-eradicated group; however, no significant differences were observed between the two groups regarding changes in body weight, body fat mass, ASM, or waist circumference. Total cholesterol and low-density lipoprotein cholesterol levels were significantly lower in the eradicated group than in the non-eradicated group.
Conclusions
H. pylori eradication significantly reduced blood pressure, glucose levels, and systemic inflammation and improved lipid profiles in patients aged >45 years. BMI, body fat mass, ASM, and waist circumference did not significantly differ between patients in the eradicated group and those in the non-eradicated group.
