1. Adherence of Studies on Large Language Models for Medical Applications Published in Leading Medical Journals According to the MI-CLEAR-LLM Checklist
Ji Su KO ; Hwon HEO ; Chong Hyun SUH ; Jeho YI ; Woo Hyun SHIM
Korean Journal of Radiology 2025;26(4):304-312
Objective:
To evaluate the adherence of large language model (LLM)-based healthcare research to the Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM) checklist, a framework designed to enhance the transparency and reproducibility of studies on the accuracy of LLMs for medical applications.
Materials and Methods:
A systematic PubMed search was conducted to identify articles on LLM performance published in high-ranking clinical medicine journals (the top 10% in each of the 59 specialties according to the 2023 Journal Impact Factor) from November 30, 2022, through June 25, 2024. Data on the six MI-CLEAR-LLM checklist items, namely 1) identification and specification of the LLM used, 2) stochasticity handling, 3) prompt wording and syntax, 4) prompt structuring, 5) prompt testing and optimization, and 6) independence of the test data, were independently extracted by two reviewers, and adherence was calculated for each item.
Results:
Of 159 studies, 100% (159/159) reported the name of the LLM, 96.9% (154/159) reported the version, and 91.8% (146/159) reported the manufacturer. However, only 54.1% (86/159) reported the training data cutoff date, 6.3% (10/159) documented access to web-based information, and 50.9% (81/159) provided the date of the query attempts. Clear documentation regarding stochasticity management was provided in 15.1% (24/159) of the studies. Regarding prompt details, 49.1% (78/159) provided the exact prompt wording and syntax, but only 34.0% (54/159) documented prompt-structuring practices. While 46.5% (74/159) of the studies detailed prompt testing, only 15.7% (25/159) explained the rationale for specific word choices. Test data independence was reported in only 13.2% (21/159) of the studies, and among studies that used internet-sourced test data, 56.6% (43/76) provided URLs.
Conclusion:
Although basic LLM identification details were relatively well reported, other key aspects, including stochasticity, prompts, and test data, were frequently underreported. Enhancing adherence to the MI-CLEAR-LLM checklist will allow LLM research to achieve greater transparency and will foster more credible and reliable future studies.
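The adherence percentages reported above follow directly from the per-item counts. As a quick illustration only (not the authors' analysis code), the short Python sketch below recomputes each percentage from the counts given in the Results; the item labels and the dictionary layout are our own shorthand.

```python
# Recompute the adherence percentages reported in the Results section
# from the raw counts (reported adherent studies, denominator).
# Only the counts come from the abstract; the labels are illustrative.
counts = {
    "LLM name reported": (159, 159),
    "LLM version reported": (154, 159),
    "Manufacturer reported": (146, 159),
    "Training data cutoff date": (86, 159),
    "Web access documented": (10, 159),
    "Query date provided": (81, 159),
    "Stochasticity management": (24, 159),
    "Exact prompt wording and syntax": (78, 159),
    "Prompt structuring": (54, 159),
    "Prompt testing detailed": (74, 159),
    "Rationale for word choices": (25, 159),
    "Test data independence": (21, 159),
    "URLs for internet-sourced data": (43, 76),
}

for item, (n, total) in counts.items():
    pct = 100 * n / total
    print(f"{item}: {pct:.1f}% ({n}/{total})")
```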
2. Korean Thyroid Association Guidelines on the Management of Differentiated Thyroid Cancers; Overview and Summary 2024
Young Joo PARK ; Eun Kyung LEE ; Young Shin SONG ; Bon Seok KOO ; Hyungju KWON ; Keunyoung KIM ; Mijin KIM ; Bo Hyun KIM ; Won Gu KIM ; Won Bae KIM ; Won Woong KIM ; Jung-Han KIM ; Hee Kyung KIM ; Hee Young NA ; Shin Je MOON ; Jung-Eun MOON ; Sohyun PARK ; Jun-Ook PARK ; Ji-In BANG ; Kyorim BACK ; Youngduk SEO ; Dong Yeob SHIN ; Su-Jin SHIN ; Hwa Young AHN ; So Won OH ; Seung Hoon WOO ; Ho-Ryun WON ; Chang Hwan RYU ; Jee Hee YOON ; Ka Hee YI ; Min Kyoung LEE ; Sang-Woo LEE ; Seung Eun LEE ; Sihoon LEE ; Young Ah LEE ; Joon-Hyop LEE ; Ji Ye LEE ; Jieun LEE ; Cho Rok LEE ; Dong-Jun LIM ; Jae-Yol LIM ; Yun Kyung JEON ; Kyong Yeun JUNG ; Ari CHONG ; Yun Jae CHUNG ; Chan Kwon JUNG ; Kwanhoon JO ; Yoon Young CHO ; A Ram HONG ; Chae Moon HONG ; Ho-Cheol KANG ; Sun Wook KIM ; Woong Youn CHUNG ; Do Joon PARK ; Dong Gyu NA ;
International Journal of Thyroidology 2024;17(1):1-20
Differentiated thyroid cancer demonstrates a wide range of clinical presentations, from very indolent cases to those with an aggressive prognosis. Therefore, diagnosing and treating each cancer appropriately based on its risk status is important. The Korean Thyroid Association (KTA) has provided and amended the clinical guidelines for thyroid cancer management since 2007. The main changes in this revised 2024 guideline include 1) individualization of surgical extent according to pathological tests and clinical findings, 2) application of active surveillance in low-risk papillary thyroid microcarcinoma, 3) indications for minimally invasive surgery, 4) adoption of World Health Organization pathological diagnostic criteria and definition of terminology in Korean, 5) update on literature evidence of recurrence risk for initial risk stratification, 6) addition of the role of molecular testing, 7) addition of definition of initial risk stratification and targeting thyroid stimulating hormone (TSH) concentrations according to ongoing risk stratification (ORS), 8) addition of treatment of perioperative hypoparathyroidism, 9) update on systemic chemotherapy, and 10) addition of treatment for pediatric patients with thyroid cancer.
3. The 2022 Annual Report on Toxicology Surveillance and Severe Poisoning Cases at Emergency Departments in Korea
Eun Sun LEE ; Su Jin KIM ; Gyu Chong CHO ; Mi Jin LEE ; Byung Hak SO ; Kyung Su KIM ; Juhyun SONG ; Sung Woo LEE
Journal of The Korean Society of Clinical Toxicology 2023;21(1):1-16
Purpose:
This study investigated the actual incidence of acute poisoning in Korea on a nationwide scale, with the aim of laying the groundwork for future initiatives in prevention, strategic antidote distribution, and the development of effective emergency treatment for acute poisoning.
Methods:
The study analyzed data from 3,038 patients who presented to emergency departments with poisoning-related conditions from June 1, 2022, to December 31, 2022, at 10 sites in nine cities across the country. We extracted data on the general characteristics of the poisoning cases, including demographic characteristics (age and gender), place of exposure, reason for poisoning, route of exposure, and the substance involved in the poisoning incident. Age-related patterns in reasons for poisoning, medical outcomes, frequent and primary poisoning substances, and deaths were also analyzed.
Results:
The population analyzed in our study was predominantly female, with women constituting 54.74% of all cases. Among infants and children, non-intentional poisoning due to general accidents was the most common cause, accounting for 71.43% of cases. Conversely, suicidal poisoning was more prevalent among teenagers and adults over 20. Fifty-two patients died during the study period, with males comprising approximately two-thirds (67.31%) of these fatalities. Pesticides were the most common poisoning substance among those who died, accounting for 55.77% of such cases. Notably, a significant majority of the victims were elderly individuals aged 60 and above.
Conclusion:
This study is significant because it represents the first comprehensive investigation and analysis of the symptoms, treatment, and causes of death due to poisoning in Korea on a national scale. By substantially expanding the range and types of poisonous substances examined, we were able to more precisely identify the characteristics and clinical patterns of poisoning cases nationwide.
4. Korean guidelines for the management of gout
Jennifer Jooha LEE ; Ji Soo LEE ; Min Kyung CHUNG ; Joong Kyong AHN ; Hyo-Jin CHOI ; Seung-Jae HONG ; Chong-Hyeon YOON ; Su-Hyun KIM ; Kyung-Hwan JEONG ; Jong-Woo KIM ; Bo-Yeon KIM ; Jin-Ho SHIN ; Woo Gyu KIM ; Soo-Young KIM ; Hyun-Jung KIM ; Jeong-Soo SONG ; Jae-Bum JUN ; Hyun-Ah PARK ; Shung Chull CHAE ; Bum Soon CHOI ; Tae Nyun KIM ; Hyun Ah KIM
Journal of Rheumatic Diseases 2023;30(3):141-150
Gout is the most common form of inflammatory arthritis, and its prevalence is increasing worldwide. The present treatment guidelines provide recommendations for the appropriate treatment of acute gout, management during the inter-critical period, and prevention of chronic complications. The guidelines were developed based on evidence-based medicine, and the draft recommendations were finalized after expert consensus. These guidelines are designed to provide clinicians with clinical evidence to enable efficient treatment of gout.
5. Korean guidelines for the management of gout
Jennifer Jooha LEE ; Ji Soo LEE ; Min Kyung CHUNG ; Joong Kyong AHN ; Hyo-Jin CHOI ; Seung-Jae HONG ; Chong-Hyeon YOON ; Su-Hyun KIM ; Kyung-Hwan JEONG ; Jong-Woo KIM ; Bo-Yeon KIM ; Jin-Ho SHIN ; Woo Gyu KIM ; Soo-Young KIM ; Hyun-Jung KIM ; Jeong-Soo SONG ; Jae-Bum JUN ; Hyun-Ah PARK ; Shung Chull CHAE ; Bum Soon CHOI ; Tae Nyun KIM ; Hyun Ah KIM
The Korean Journal of Internal Medicine 2023;38(5):641-650
Gout is the most common form of inflammatory arthritis, and its prevalence is increasing worldwide. The present treatment guidelines provide recommendations for the appropriate treatment of acute gout, management during the inter-critical period, and prevention of chronic complications. The guidelines were developed based on evidence-based medicine, and the draft recommendations were finalized after expert consensus. These guidelines are designed to provide clinicians with clinical evidence to enable efficient treatment of gout.
6. A Position Statement of the Utilization and Support Status of Continuous Glucose Monitoring in Korea
Won Jun KIM ; Jae Hyun KIM ; Hye Jin YOO ; Jang Won SON ; Ah Reum KHANG ; Su Kyoung KWON ; Ji Hye KIM ; Tae Ho KIM ; Ohk Hyun RYU ; Kyeong Hye PARK ; Sun Ok SONG ; Kang-Woo LEE ; Woo Je LEE ; Jung Hwa JUNG ; Ho-Chan CHO ; Min Jeong GU ; Jeongrim LEE ; Dal Lae JU ; Yeon Hee LEE ; Eun Kyung KIM ; Young Sil EOM ; Sung Hoon YU ; Chong Hwa KIM ;
Journal of Korean Diabetes 2021;22(4):225-237
The accuracy and convenience of continuous glucose monitoring (CGM), which efficiently evaluates glycemic variability and hypoglycemia, are improving. There are two types of CGM: professional CGM and personal CGM. Personal CGM is subdivided into real-time CGM (rt-CGM) and intermittently scanned CGM (isCGM). CGM is emphasized in both Korean and international diabetes management guidelines. Regardless of age or type of diabetes, CGM is useful for patients with diabetes who are undergoing multiple insulin injection therapy or using an insulin pump. rt-CGM is recommended for all adults with type 1 diabetes (T1D) and can also be used in the treatment of type 2 diabetes (T2D) with multiple insulin injections. In some cases, short-term or intermittent use of CGM may be helpful for patients with T2D who use insulin therapy other than multiple insulin injections and/or oral hypoglycemic agents. CGM can help achieve A1C targets in patients with diabetes during pregnancy. CGM is a safe and cost-effective alternative to self-monitoring of blood glucose in T1D and some T2D patients. CGM works optimally in diabetes management when combined with proper education, training, and follow-up. To promote the uptake of CGM and realize its associated benefits, sufficient repeated training and adequate time for data analysis, management, and education must be secured. Various forms of support, such as compensation, expanded insurance coverage, and reimbursement, are required to increase the effectiveness of CGM, taking into account the number of benefit recipients, policy priorities, and financial requirements.
