1.Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction
Jean SEO ; Sumin PARK ; Sungjoo BYUN ; Jinwook CHOI ; Jinho CHOI ; Hyopil SHIN
Healthcare Informatics Research 2025;31(2):166-174
Objectives:
Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.
Methods:
KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a “chosen” response and a “rejected” response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.
Results:
Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.
Conclusions
KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.
2.Era of Digital Healthcare: Emergence of the Smart Patient
Dooyoung HUHH ; Kwangsoo SHIN ; Miyeong KIM ; Jisan LEE ; Hana KIM ; Jinho CHOI ; Suyeon BAN
Healthcare Informatics Research 2025;31(1):107-110
3.Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction
Jean SEO ; Sumin PARK ; Sungjoo BYUN ; Jinwook CHOI ; Jinho CHOI ; Hyopil SHIN
Healthcare Informatics Research 2025;31(2):166-174
Objectives:
Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.
Methods:
KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a “chosen” response and a “rejected” response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.
Results:
Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.
Conclusions
KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.
4.Era of Digital Healthcare: Emergence of the Smart Patient
Dooyoung HUHH ; Kwangsoo SHIN ; Miyeong KIM ; Jisan LEE ; Hana KIM ; Jinho CHOI ; Suyeon BAN
Healthcare Informatics Research 2025;31(1):107-110
5.Sociobehavioural factors associated with SARS-CoV-2 infection and COVID-19 vaccine effectiveness against medically attended, symptomatic SARS-CoV-2 infection in the Philippines: a prospective case-control study (FASCINATE-P study)
Takeshi Arashiro ; Regina Pascua Berba ; Joy Potenciano Calayo ; Marie Kris ; Reby Marie Garcia ; Shuichi Suzuki ; Cecile Dungog ; Jonathan Rivera ; Greco Mark Malijan ; Kristal An Agrupis ; Mary Jane Salazar ; Mary Ann Salazar ; Jinho Shin ; Martin Hibberd ; Koya Ariyoshi ; Chris Smith
Western Pacific Surveillance and Response 2025;16(1):49-60
Objective: We examined sociobehavioural factors associated with SARS-CoV-2 infection and estimated COVID-19 vaccine effectiveness against symptomatic SARS-CoV-2 infection in the Philippines. Such studies are limited in low- and middle-income countries, especially in Asia and the Pacific.
Methods: A case-control study was conducted in two hospitals in Manila, Philippines, from March 2022 to June 2023. Sociobehavioural factors and vaccination history were collected. PCR-positive individuals were cases, while PCR-negative individuals were controls. Adjusted odds ratios (aORs) were calculated to examine associations between sociobehavioural factors/vaccination and medically attended SARS-CoV-2 infection.
Results: The analysis included 2489 individuals (574 positive cases, 23.1%; 1915 controls, 76.9%; median age [interquartile range]: 35 [27–51] years). Although education and household income were not associated with infection, being a health-care worker was (aOR: 1.45; 95% confidence interval [CI]: 1.03–2.06). The odds of infection were higher among individuals who attended gatherings of five or more people compared to those who attended smaller gatherings (aOR: 2.58; 95% CI: 1.14–5.83). Absolute vaccine effectiveness for vaccination status was not estimated due to a high risk of bias, for example, unascertained prior infection. Moderate relative vaccine effectiveness for the first booster (32%; 95% CI: -120–79) and the second booster (48%; 95% CI: -23–78) were observed (both with wide CI), albeit with a waning trend after half a year.
Discussion: The higher odds of infection among health-care workers emphasize the importance of infection prevention and control measures. Moderate relative vaccine effectiveness with a waning trend reiterates the need for more efficacious vaccines against symptomatic infection caused by circulating variants and with longer duration of protection.
6.Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction
Jean SEO ; Sumin PARK ; Sungjoo BYUN ; Jinwook CHOI ; Jinho CHOI ; Hyopil SHIN
Healthcare Informatics Research 2025;31(2):166-174
Objectives:
Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.
Methods:
KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a “chosen” response and a “rejected” response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.
Results:
Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.
Conclusions
KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.
7.Era of Digital Healthcare: Emergence of the Smart Patient
Dooyoung HUHH ; Kwangsoo SHIN ; Miyeong KIM ; Jisan LEE ; Hana KIM ; Jinho CHOI ; Suyeon BAN
Healthcare Informatics Research 2025;31(1):107-110
8.Experience conducting COVID-19 vaccine effectiveness studies in response to the COVID-19 pandemic in Japan and the Philippines: lessons for future epidemics and potential pandemics
Takeshi Arashiro ; Regina Pascua Berba ; Joy Potenciano Calayo ; Rontgene Solante ; Shuichi Suzuki ; Jinho Shin ; Motoi Suzuki ; Martin Hibberd ; Koya Ariyoshi ; Chris Smith
Western Pacific Surveillance and Response 2025;16(2):03-10
roblem: Once COVID-19 vaccines were rolled out, there was a need to monitor real-world vaccine effectiveness to accumulate evidence to inform policy and risk communication. This was especially true in Japan and the Philippines, given historical issues that affected vaccine confidence.
Context: Neither country had public health surveillance that could be enhanced to evaluate vaccine effectiveness or readily available national vaccination databases.
Action: Study groups were established in multiple health-care facilities in each country to assess vaccine effectiveness against both symptomatic infection and severe disease.
Outcome: In Japan, multiple study reports were published in Japanese on the website of the National Institute of Infectious Diseases and presented at the national government’s advisory board. Nationwide media coverage facilitated transparency and increased the confidence of the government and the public in the vaccination programme. In the Philippines, the launch of the study was delayed so as to align the research plan with the interests of various stakeholders and to obtain institutional review board approval. Ultimately, the studies were successfully initiated and completed.
Discussion: There were four main challenges in conducting our studies: finding health-care facilities for data collection; obtaining exposure (vaccination) data; identifying epidemiological biases and confounders; and informing policy and risk communication in a timely manner. Preparedness during inter-emergency/epidemic/pandemic periods to rapidly evaluate relevant interventions such as vaccination is critical and should include the following considerations: (1) the establishment and maintenance of prospective data collection platforms, ideally under public health surveillance (if not, clinical research networks or linked databases); (2) uniform and practical protocols considering biases and confounders; and (3) communication with stakeholders including institutional review boards.
9.Prognostic Value of Residual Circulating Tumor DNA in Metastatic Pancreatic Ductal Adenocarcinoma
Hongkyung KIM ; Jinho LEE ; Mi Ri PARK ; Zisun CHOI ; Seung Jung HAN ; Dongha KIM ; Saeam SHIN ; Seung-Tae LEE ; Jong Rak CHOI ; Seung Woo PARK
Annals of Laboratory Medicine 2025;45(2):199-208
Background:
Circulating tumor DNA (ctDNA) is a potential biomarker in pancreatic ductal adenocarcinoma (PDAC). However, studies on residual ctDNA in patients post-chemotherapy are limited. We assessed the prognostic value of residual ctDNA in metastatic PDAC relative to that of carbohydrate antigen 19-9 (CA19-9).
Methods:
ctDNA analysis using a targeted next-generation sequencing panel was performed at baseline and during chemotherapy response evaluation in 53 patients. Progression-free survival (PFS) and overall survival (OS) were first evaluated based on ctDNA positivity at baseline. For further comparison, patients testing ctDNA-positive at baseline were subdivided based on residual ctDNA into ctDNA responders (no residual ctDNA post-chemotherapy) and ctDNA non-responders (residual ctDNA post-chemotherapy). Additional survival analysis was performed based on CA19-9 levels.
Results:
The baseline ctDNA detection rate was 56.6%. Although clinical outcomes tended to be poorer in patients with baseline ctDNA positivity than in those without, the differences were not significant. Residual ctDNA post-chemotherapy was associated with reduced PFS and OS. The prognosis of ctDNA responders was better than that of non-responders but did not significantly differ from that of ctDNA-negative individuals (no ctDNA both at baseline and during post-chemotherapy). Compared with ctDNA responses to che-motherapy, a ≥ 50% decrease in the CA19-9 level had less effect on both PFS and OSbased on hazard ratios and significance levels. ctDNA could be monitored in half of the patients whose baseline CA19-9 levels were within the reference range.
Conclusions
Residual ctDNA analysis post-chemotherapy is a promising approach for predicting the clinical outcomes of patients with metastatic PDAC.
10.Unusual or Uncommon Histology of Gastric Cancer
Journal of Gastric Cancer 2024;24(1):69-88
This review comprehensively examines the diverse spectrum of gastric cancers, focusing on unusual or uncommon histology that presents significant diagnostic and therapeutic challenges. While the predominant form, tubular adenocarcinoma, is well-characterized, this review focuses on lesser-known variants, including papillary adenocarcinoma, micropapillary carcinoma, adenosquamous carcinoma, squamous cell carcinoma (SCC), hepatoid adenocarcinoma, gastric choriocarcinoma, gastric carcinoma with lymphoid stroma, carcinosarcoma, gastroblastoma, parietal cell carcinoma, oncocytic adenocarcinoma, Paneth cell carcinoma, gastric adenocarcinoma of the fundic gland type, undifferentiated carcinoma, and extremely well-differentiated adenocarcinoma. Although these diseases have different nomenclatures characterized by distinct histopathological features, these phenotypes often overlap, making it difficult to draw clear boundaries. Furthermore, the number of cases was limited, and the unique histopathological nature and potential pathogenic mechanisms were not well defined. This review highlights the importance of understanding these rare variants for accurate diagnosis, effective treatment planning, and improving patient outcomes. This review emphasizes the need for ongoing research and case studies to enhance our knowledge of these uncommon forms of gastric cancer, which will ultimately contribute to more effective treatments and better prognostic assessments. This review aimed to broaden the pathological narrative by acknowledging and addressing the intricacies of all cancer types, regardless of their rarity, to advance patient care and improve prognosis.


Result Analysis
Print
Save
E-mail