1.Adherence of Studies on Large Language Models for Medical Applications Published in Leading Medical Journals According to the MI-CLEAR-LLM Checklist
Ji Su KO ; Hwon HEO ; Chong Hyun SUH ; Jeho YI ; Woo Hyun SHIM
Korean Journal of Radiology 2025;26(4):304-312
Objective:
To evaluate the adherence of large language model (LLM)-based healthcare research to the Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM) checklist, a framework designed to enhance the transparency and reproducibility of studies on the accuracy of LLMs for medical applications.
Materials and Methods:
A systematic PubMed search was conducted to identify articles on LLM performance published in high-ranking clinical medicine journals (the top 10% in each of the 59 specialties according to the 2023 Journal Impact Factor) from November 30, 2022, through June 25, 2024. Data on the six MI-CLEAR-LLM checklist items: 1) identification and specification of the LLM used, 2) stochasticity handling, 3) prompt wording and syntax, 4) prompt structuring, 5) prompt testing and optimization, and 6) independence of the test data—were independently extracted by two reviewers, and adherence was calculated for each item.
Results:
Of 159 studies, 100% (159/159) reported the name of the LLM, 96.9% (154/159) reported the version, and 91.8% (146/159) reported the manufacturer. However, only 54.1% (86/159) reported the training data cutoff date, 6.3% (10/159) documented access to web-based information, and 50.9% (81/159) provided the date of the query attempts. Clear documentation regarding stochasticity management was provided in 15.1% (24/159) of the studies. Regarding prompt details, 49.1% (78/159) provided exact prompt wording and syntax but only 34.0% (54/159) documented prompt-structuring practices. While 46.5% (74/159) of the studies detailed prompt testing, only 15.7% (25/159) explained the rationale for specific word choices. Test data independence was reported for only 13.2% (21/159) of the studies, and 56.6% (43/76) provided URLs for internet-sourced test data.
Conclusion
Although basic LLM identification details were relatively well reported, other key aspects, including stochasticity, prompts, and test data, were frequently underreported. Enhancing adherence to the MI-CLEAR-LLM checklist will allow LLM research to achieve greater transparency and will foster more credible and reliable future studies.
2.Adherence of Studies on Large Language Models for Medical Applications Published in Leading Medical Journals According to the MI-CLEAR-LLM Checklist
Ji Su KO ; Hwon HEO ; Chong Hyun SUH ; Jeho YI ; Woo Hyun SHIM
Korean Journal of Radiology 2025;26(4):304-312
Objective:
To evaluate the adherence of large language model (LLM)-based healthcare research to the Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM) checklist, a framework designed to enhance the transparency and reproducibility of studies on the accuracy of LLMs for medical applications.
Materials and Methods:
A systematic PubMed search was conducted to identify articles on LLM performance published in high-ranking clinical medicine journals (the top 10% in each of the 59 specialties according to the 2023 Journal Impact Factor) from November 30, 2022, through June 25, 2024. Data on the six MI-CLEAR-LLM checklist items: 1) identification and specification of the LLM used, 2) stochasticity handling, 3) prompt wording and syntax, 4) prompt structuring, 5) prompt testing and optimization, and 6) independence of the test data—were independently extracted by two reviewers, and adherence was calculated for each item.
Results:
Of 159 studies, 100% (159/159) reported the name of the LLM, 96.9% (154/159) reported the version, and 91.8% (146/159) reported the manufacturer. However, only 54.1% (86/159) reported the training data cutoff date, 6.3% (10/159) documented access to web-based information, and 50.9% (81/159) provided the date of the query attempts. Clear documentation regarding stochasticity management was provided in 15.1% (24/159) of the studies. Regarding prompt details, 49.1% (78/159) provided exact prompt wording and syntax but only 34.0% (54/159) documented prompt-structuring practices. While 46.5% (74/159) of the studies detailed prompt testing, only 15.7% (25/159) explained the rationale for specific word choices. Test data independence was reported for only 13.2% (21/159) of the studies, and 56.6% (43/76) provided URLs for internet-sourced test data.
Conclusion
Although basic LLM identification details were relatively well reported, other key aspects, including stochasticity, prompts, and test data, were frequently underreported. Enhancing adherence to the MI-CLEAR-LLM checklist will allow LLM research to achieve greater transparency and will foster more credible and reliable future studies.
3.The Effect of Hematopoietic Stem Cell Transplantation on Treatment Outcome in Children with Acute Lymphoblastic Leukemia
Hee Young JU ; Na Hee LEE ; Eun Sang YI ; Young Bae CHOI ; So Jin KIM ; Ju Kyung HYUN ; Hee Won CHO ; Jae Kyung LEE ; Ji Won LEE ; Ki Woong SUNG ; Hong Hoe KOO ; Keon Hee YOO
Cancer Research and Treatment 2025;57(1):240-249
Purpose:
Hematopoietic stem cell transplantation (HSCT) has been an important method of treatment in the advance of pediatric acute lymphoblastic leukemia (ALL). The indications for HSCT are evolving and require updated establishment. In this study, we aimed to investigate the efficacy of HSCT on the treatment outcome of pediatric ALL, considering the indications for HSCT and subgroups.
Materials and Methods:
A retrospective analysis was conducted on ALL patients diagnosed and treated at a single center. Risk groups were categorized based on age at diagnosis, initial white blood cell count, disease lineage (B/T), and cytogenetic study results. Data on the patients’ disease status at HSCT and indications of HSCT were collected. Indications for HSCT were categorized as upfront HSCT at 1st complete remission, relapse, and refractory disease.
Results:
Among the 549 screened patients, a total of 418 patients were included in the study; B-cell ALL (n=379) and T-cell ALL (T-ALL) (n=39). HSCT was conducted on a total of 106 patients (25.4%), with a higher frequency as upfront HSCT in higher-risk groups and specific cytogenetics. The overall survival (OS) was significantly better when done upfront than in relapsed or refractory state in T-ALL patients (p=0.002). The KMT2A-rearranged ALL patients showed superior event-free survival (p=0.002) and OS (p=0.022) when HSCT was done as upfront treatment.
Conclusion
HSCT had a substantial positive effect in a specific subset of pediatric ALL. In particular, frontline HSCT for T-ALL and KMT2A-rearranged ALL offered a better prognosis than when HSCT was conducted in a relapsed or refractory setting.
4.Exploring methylation signatures for high de novo recurrence risk in hepatocellular carcinoma
Da-Won KIM ; Jin Hyun PARK ; Suk Kyun HONG ; Min-Hyeok JUNG ; Ji-One PYEON ; Jin-Young LEE ; Kyung-Suk SUH ; Nam-Joon YI ; YoungRok CHOI ; Kwang-Woong LEE ; Young-Joon KIM
Clinical and Molecular Hepatology 2025;31(2):563-576
Background/Aims:
Hepatocellular carcinoma (HCC) exhibits high de novo recurrence rates post-resection. Current post-surgery recurrence prediction methods are limited, emphasizing the need for reliable biomarkers to assess recurrence risk. We aimed to develop methylation-based markers for classifying HCC patients and predicting their risk of de novo recurrence post-surgery.
Methods:
In this retrospective cohort study, we analyzed data from HCC patients who underwent surgical resection in Korea, excluding those with recurrence within one year post-surgery. Using the Infinium Methylation EPIC array on 140 samples in the discovery cohort, we classified patients into low- and high-risk groups based on methylation profiles. Distinctive markers were identified through random forest analysis. These markers were validated in the cancer genome atlas (n=217), Validation cohort 1 (n=63) and experimental Validation using a methylation-sensitive high-resolution melting (MS-HRM) assay in Validation cohort 1 and Validation cohort 2 (n=63).
Results:
The low-risk recurrence group (methylation group 1; MG1) showed a methylation average of 0.73 (95% confidence interval [CI] 0.69–0.77) with a 23.5% recurrence rate, while the high-risk group (MG2) had an average of 0.17 (95% CI 0.14–0.20) with a 44.1% recurrence rate (P<0.03). Validation confirmed the applicability of methylation markers across diverse populations, showing high accuracy in predicting the probability of HCC recurrence risk (area under the curve 96.8%). The MS-HRM assay confirmed its effectiveness in predicting de novo recurrence with 95.5% sensitivity, 89.7% specificity, and 92.2% accuracy.
Conclusions
Methylation markers effectively classified HCC patients by de novo recurrence risk, enhancing prediction accuracy and potentially offering personalized management strategies.
5.Adherence of Studies on Large Language Models for Medical Applications Published in Leading Medical Journals According to the MI-CLEAR-LLM Checklist
Ji Su KO ; Hwon HEO ; Chong Hyun SUH ; Jeho YI ; Woo Hyun SHIM
Korean Journal of Radiology 2025;26(4):304-312
Objective:
To evaluate the adherence of large language model (LLM)-based healthcare research to the Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM) checklist, a framework designed to enhance the transparency and reproducibility of studies on the accuracy of LLMs for medical applications.
Materials and Methods:
A systematic PubMed search was conducted to identify articles on LLM performance published in high-ranking clinical medicine journals (the top 10% in each of the 59 specialties according to the 2023 Journal Impact Factor) from November 30, 2022, through June 25, 2024. Data on the six MI-CLEAR-LLM checklist items: 1) identification and specification of the LLM used, 2) stochasticity handling, 3) prompt wording and syntax, 4) prompt structuring, 5) prompt testing and optimization, and 6) independence of the test data—were independently extracted by two reviewers, and adherence was calculated for each item.
Results:
Of 159 studies, 100% (159/159) reported the name of the LLM, 96.9% (154/159) reported the version, and 91.8% (146/159) reported the manufacturer. However, only 54.1% (86/159) reported the training data cutoff date, 6.3% (10/159) documented access to web-based information, and 50.9% (81/159) provided the date of the query attempts. Clear documentation regarding stochasticity management was provided in 15.1% (24/159) of the studies. Regarding prompt details, 49.1% (78/159) provided exact prompt wording and syntax but only 34.0% (54/159) documented prompt-structuring practices. While 46.5% (74/159) of the studies detailed prompt testing, only 15.7% (25/159) explained the rationale for specific word choices. Test data independence was reported for only 13.2% (21/159) of the studies, and 56.6% (43/76) provided URLs for internet-sourced test data.
Conclusion
Although basic LLM identification details were relatively well reported, other key aspects, including stochasticity, prompts, and test data, were frequently underreported. Enhancing adherence to the MI-CLEAR-LLM checklist will allow LLM research to achieve greater transparency and will foster more credible and reliable future studies.
6.The Effect of Hematopoietic Stem Cell Transplantation on Treatment Outcome in Children with Acute Lymphoblastic Leukemia
Hee Young JU ; Na Hee LEE ; Eun Sang YI ; Young Bae CHOI ; So Jin KIM ; Ju Kyung HYUN ; Hee Won CHO ; Jae Kyung LEE ; Ji Won LEE ; Ki Woong SUNG ; Hong Hoe KOO ; Keon Hee YOO
Cancer Research and Treatment 2025;57(1):240-249
Purpose:
Hematopoietic stem cell transplantation (HSCT) has been an important method of treatment in the advance of pediatric acute lymphoblastic leukemia (ALL). The indications for HSCT are evolving and require updated establishment. In this study, we aimed to investigate the efficacy of HSCT on the treatment outcome of pediatric ALL, considering the indications for HSCT and subgroups.
Materials and Methods:
A retrospective analysis was conducted on ALL patients diagnosed and treated at a single center. Risk groups were categorized based on age at diagnosis, initial white blood cell count, disease lineage (B/T), and cytogenetic study results. Data on the patients’ disease status at HSCT and indications of HSCT were collected. Indications for HSCT were categorized as upfront HSCT at 1st complete remission, relapse, and refractory disease.
Results:
Among the 549 screened patients, a total of 418 patients were included in the study; B-cell ALL (n=379) and T-cell ALL (T-ALL) (n=39). HSCT was conducted on a total of 106 patients (25.4%), with a higher frequency as upfront HSCT in higher-risk groups and specific cytogenetics. The overall survival (OS) was significantly better when done upfront than in relapsed or refractory state in T-ALL patients (p=0.002). The KMT2A-rearranged ALL patients showed superior event-free survival (p=0.002) and OS (p=0.022) when HSCT was done as upfront treatment.
Conclusion
HSCT had a substantial positive effect in a specific subset of pediatric ALL. In particular, frontline HSCT for T-ALL and KMT2A-rearranged ALL offered a better prognosis than when HSCT was conducted in a relapsed or refractory setting.
7.Exploring methylation signatures for high de novo recurrence risk in hepatocellular carcinoma
Da-Won KIM ; Jin Hyun PARK ; Suk Kyun HONG ; Min-Hyeok JUNG ; Ji-One PYEON ; Jin-Young LEE ; Kyung-Suk SUH ; Nam-Joon YI ; YoungRok CHOI ; Kwang-Woong LEE ; Young-Joon KIM
Clinical and Molecular Hepatology 2025;31(2):563-576
Background/Aims:
Hepatocellular carcinoma (HCC) exhibits high de novo recurrence rates post-resection. Current post-surgery recurrence prediction methods are limited, emphasizing the need for reliable biomarkers to assess recurrence risk. We aimed to develop methylation-based markers for classifying HCC patients and predicting their risk of de novo recurrence post-surgery.
Methods:
In this retrospective cohort study, we analyzed data from HCC patients who underwent surgical resection in Korea, excluding those with recurrence within one year post-surgery. Using the Infinium Methylation EPIC array on 140 samples in the discovery cohort, we classified patients into low- and high-risk groups based on methylation profiles. Distinctive markers were identified through random forest analysis. These markers were validated in the cancer genome atlas (n=217), Validation cohort 1 (n=63) and experimental Validation using a methylation-sensitive high-resolution melting (MS-HRM) assay in Validation cohort 1 and Validation cohort 2 (n=63).
Results:
The low-risk recurrence group (methylation group 1; MG1) showed a methylation average of 0.73 (95% confidence interval [CI] 0.69–0.77) with a 23.5% recurrence rate, while the high-risk group (MG2) had an average of 0.17 (95% CI 0.14–0.20) with a 44.1% recurrence rate (P<0.03). Validation confirmed the applicability of methylation markers across diverse populations, showing high accuracy in predicting the probability of HCC recurrence risk (area under the curve 96.8%). The MS-HRM assay confirmed its effectiveness in predicting de novo recurrence with 95.5% sensitivity, 89.7% specificity, and 92.2% accuracy.
Conclusions
Methylation markers effectively classified HCC patients by de novo recurrence risk, enhancing prediction accuracy and potentially offering personalized management strategies.
8.Adherence of Studies on Large Language Models for Medical Applications Published in Leading Medical Journals According to the MI-CLEAR-LLM Checklist
Ji Su KO ; Hwon HEO ; Chong Hyun SUH ; Jeho YI ; Woo Hyun SHIM
Korean Journal of Radiology 2025;26(4):304-312
Objective:
To evaluate the adherence of large language model (LLM)-based healthcare research to the Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM) checklist, a framework designed to enhance the transparency and reproducibility of studies on the accuracy of LLMs for medical applications.
Materials and Methods:
A systematic PubMed search was conducted to identify articles on LLM performance published in high-ranking clinical medicine journals (the top 10% in each of the 59 specialties according to the 2023 Journal Impact Factor) from November 30, 2022, through June 25, 2024. Data on the six MI-CLEAR-LLM checklist items: 1) identification and specification of the LLM used, 2) stochasticity handling, 3) prompt wording and syntax, 4) prompt structuring, 5) prompt testing and optimization, and 6) independence of the test data—were independently extracted by two reviewers, and adherence was calculated for each item.
Results:
Of 159 studies, 100% (159/159) reported the name of the LLM, 96.9% (154/159) reported the version, and 91.8% (146/159) reported the manufacturer. However, only 54.1% (86/159) reported the training data cutoff date, 6.3% (10/159) documented access to web-based information, and 50.9% (81/159) provided the date of the query attempts. Clear documentation regarding stochasticity management was provided in 15.1% (24/159) of the studies. Regarding prompt details, 49.1% (78/159) provided exact prompt wording and syntax but only 34.0% (54/159) documented prompt-structuring practices. While 46.5% (74/159) of the studies detailed prompt testing, only 15.7% (25/159) explained the rationale for specific word choices. Test data independence was reported for only 13.2% (21/159) of the studies, and 56.6% (43/76) provided URLs for internet-sourced test data.
Conclusion
Although basic LLM identification details were relatively well reported, other key aspects, including stochasticity, prompts, and test data, were frequently underreported. Enhancing adherence to the MI-CLEAR-LLM checklist will allow LLM research to achieve greater transparency and will foster more credible and reliable future studies.
9.The Effect of Hematopoietic Stem Cell Transplantation on Treatment Outcome in Children with Acute Lymphoblastic Leukemia
Hee Young JU ; Na Hee LEE ; Eun Sang YI ; Young Bae CHOI ; So Jin KIM ; Ju Kyung HYUN ; Hee Won CHO ; Jae Kyung LEE ; Ji Won LEE ; Ki Woong SUNG ; Hong Hoe KOO ; Keon Hee YOO
Cancer Research and Treatment 2025;57(1):240-249
Purpose:
Hematopoietic stem cell transplantation (HSCT) has been an important method of treatment in the advance of pediatric acute lymphoblastic leukemia (ALL). The indications for HSCT are evolving and require updated establishment. In this study, we aimed to investigate the efficacy of HSCT on the treatment outcome of pediatric ALL, considering the indications for HSCT and subgroups.
Materials and Methods:
A retrospective analysis was conducted on ALL patients diagnosed and treated at a single center. Risk groups were categorized based on age at diagnosis, initial white blood cell count, disease lineage (B/T), and cytogenetic study results. Data on the patients’ disease status at HSCT and indications of HSCT were collected. Indications for HSCT were categorized as upfront HSCT at 1st complete remission, relapse, and refractory disease.
Results:
Among the 549 screened patients, a total of 418 patients were included in the study; B-cell ALL (n=379) and T-cell ALL (T-ALL) (n=39). HSCT was conducted on a total of 106 patients (25.4%), with a higher frequency as upfront HSCT in higher-risk groups and specific cytogenetics. The overall survival (OS) was significantly better when done upfront than in relapsed or refractory state in T-ALL patients (p=0.002). The KMT2A-rearranged ALL patients showed superior event-free survival (p=0.002) and OS (p=0.022) when HSCT was done as upfront treatment.
Conclusion
HSCT had a substantial positive effect in a specific subset of pediatric ALL. In particular, frontline HSCT for T-ALL and KMT2A-rearranged ALL offered a better prognosis than when HSCT was conducted in a relapsed or refractory setting.
10.Exploring methylation signatures for high de novo recurrence risk in hepatocellular carcinoma
Da-Won KIM ; Jin Hyun PARK ; Suk Kyun HONG ; Min-Hyeok JUNG ; Ji-One PYEON ; Jin-Young LEE ; Kyung-Suk SUH ; Nam-Joon YI ; YoungRok CHOI ; Kwang-Woong LEE ; Young-Joon KIM
Clinical and Molecular Hepatology 2025;31(2):563-576
Background/Aims:
Hepatocellular carcinoma (HCC) exhibits high de novo recurrence rates post-resection. Current post-surgery recurrence prediction methods are limited, emphasizing the need for reliable biomarkers to assess recurrence risk. We aimed to develop methylation-based markers for classifying HCC patients and predicting their risk of de novo recurrence post-surgery.
Methods:
In this retrospective cohort study, we analyzed data from HCC patients who underwent surgical resection in Korea, excluding those with recurrence within one year post-surgery. Using the Infinium Methylation EPIC array on 140 samples in the discovery cohort, we classified patients into low- and high-risk groups based on methylation profiles. Distinctive markers were identified through random forest analysis. These markers were validated in the cancer genome atlas (n=217), Validation cohort 1 (n=63) and experimental Validation using a methylation-sensitive high-resolution melting (MS-HRM) assay in Validation cohort 1 and Validation cohort 2 (n=63).
Results:
The low-risk recurrence group (methylation group 1; MG1) showed a methylation average of 0.73 (95% confidence interval [CI] 0.69–0.77) with a 23.5% recurrence rate, while the high-risk group (MG2) had an average of 0.17 (95% CI 0.14–0.20) with a 44.1% recurrence rate (P<0.03). Validation confirmed the applicability of methylation markers across diverse populations, showing high accuracy in predicting the probability of HCC recurrence risk (area under the curve 96.8%). The MS-HRM assay confirmed its effectiveness in predicting de novo recurrence with 95.5% sensitivity, 89.7% specificity, and 92.2% accuracy.
Conclusions
Methylation markers effectively classified HCC patients by de novo recurrence risk, enhancing prediction accuracy and potentially offering personalized management strategies.

Result Analysis
Print
Save
E-mail