1.Exploring the mechanism and treatment principles of testicular radiation injury from the perspective of "the struggle between vital qi and pathogen" theory
Xiaoying CHEN ; An WANG ; Yifan YE ; Yan WANG ; Yuankai GAO ; Qing XU ; Shuran WANG ; Zhangdi ZHAO ; Sumin HU
Journal of Beijing University of Traditional Chinese Medicine 2025;48(3):379-385
Testicular radiation injury is a structural and functional abnormality of the testes caused directly or indirectly by radiation, which disrupts spermatogenesis and compromises male fertility. The development of effective preventive and therapeutic interventions is essential because of the high prevalence of this condition in clinical settings and its profound effect on patients′ reproductive health and overall well-being. The concept of "the struggle between vital qi and pathogen" is first seen in the Treatise on Cold Pathogenic Diseases. It denotes the dynamic struggle between vital and pathogenic qi. The occurrence, development, and sequelae of all diseases reflect this ongoing conflict. In this context, this study defines the "vital qi" of the testis as its capacity to generate and preserve the essence of reproduction and to resist damage. The pathogenic qi associated with testicular radiation injury is categorized into two types: ionizing poison and retaining evil. The pathogenesis of testicular radiation damage is delineated into three stages by integrating the characteristics of vital and pathogenic qi: the injury, adhesion, and recovery phases. Based on the theoretical framework advanced by this study, the therapeutic approach for testicular radiation injury should adhere to the fundamental principle of strengthening vital qi and eliminating pathogenic factors. Although the primary focus of treatment should be on strengthening vital qi, it should also be complemented by strategies to eliminate pathogenic influences. This paper aims to provide a novel perspective and strategic approach to the traditional Chinese medicine diagnosis, prevention, and treatment of testicular radiation injury. By elucidating the process of testicular radiation injury and its corresponding treatment principles, it seeks to offer valuable insights for clinical practice.
2.Whole genome sequencing analysis of enteropathogenic Escherichia coli from human and companion animals in Korea
Jae Young OH ; Kyung-Hyo DO ; Jae Hong JEONG ; SuMin KWAK ; Sujin CHOE ; Dongheui AN ; Jong-Chan CHAE ; Kwangjun LEE ; Kwang-Won SEO
Journal of Veterinary Science 2025;26(1):e1-
Objective:
To improve our understanding of EPEC, this study focused on analyzing and comparing the genomic characteristics of EPEC isolates from humans and companion animals in Korea.
Methods:
The whole genome of 26 EPEC isolates from patients with diarrhea and 20 EPEC isolates from companion animals in Korea were sequenced using the Illumina HiSeq X (Illumina, USA) and Oxford Nanopore MinION (Oxford Nanopore Technologies, UK) platforms.
Results:
Most isolates were atypical EPEC, and did not harbor the bfpA gene. The most prevalent virulence genes were found to be ompT (humans: 61.5%; companion animals:60.0%) followed by lpfA (humans: 46.2%; companion animals: 60.0%). Although pangenome analyses showed no apparent correlation among the origin of the strains, virulence profiles, and antimicrobial resistance profiles, isolates included in clade A obtained from both humans and companion animals exhibited high similarity. Additionally, all the isolates included in clade A encoded the ompT gene and did not encode the hlyE gene. The two isolates from companion animals harbored an incomplete bundle-forming pilus region encoding bfpA and bfpB. Moreover, the type IV secretion system-associated genes tra and trb were found in the bfpA-encoding isolates from humans.
Conclusions
and Relevance: Whole-genome sequencing enabled a more accurate analysis of the phylogenetic structure of EPEC and provided better insights into the understanding of EPEC epidemiology and pathogenicity.
3.Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction
Jean SEO ; Sumin PARK ; Sungjoo BYUN ; Jinwook CHOI ; Jinho CHOI ; Hyopil SHIN
Healthcare Informatics Research 2025;31(2):166-174
Objectives:
Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.
Methods:
KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a “chosen” response and a “rejected” response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.
Results:
Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.
Conclusions
KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.
4.Enhancing Identification of High-Risk cN0 Lung Adenocarcinoma Patients Using MRI-Based Radiomic Features
Harim KIM ; Jonghoon KIM ; Soohyun HWANG ; You Jin OH ; Joong Hyun AHN ; Min-Ji KIM ; Tae Hee HONG ; Sung Goo PARK ; Joon Young CHOI ; Hong Kwan KIM ; Jhingook KIM ; Sumin SHIN ; Ho Yun LEE
Cancer Research and Treatment 2025;57(1):57-69
Purpose:
This study aimed to develop a magnetic resonance imaging (MRI)–based radiomics model to predict high-risk pathologic features for lung adenocarcinoma: micropapillary and solid pattern (MPsol), spread through air space, and poorly differentiated patterns.
Materials and Methods:
As a prospective study, we screened clinical N0 lung cancer patients who were surgical candidates and had undergone both 18F-fluorodeoxyglucose (FDG) positron emission tomography–computed tomography (PET/CT) and chest CT from August 2018 to January 2020. We recruited patients meeting our proposed imaging criteria indicating high-risk, that is, poorer prognosis of lung adenocarcinoma, using CT and FDG PET/CT. If possible, these patients underwent an MRI examination from which we extracted 77 radiomics features from T1-contrast-enhanced and T2-weighted images. Additionally, patient demographics, maximum standardized uptake value on FDG PET/CT, and the mean apparent diffusion coefficient value on diffusion-weighted image, were considered together to build prediction models for high-risk pathologic features.
Results:
Among 616 patients, 72 patients met the imaging criteria for high-risk lung cancer and underwent lung MRI. The magnetic resonance (MR)–eligible group showed a higher prevalence of nodal upstaging (29.2% vs. 4.2%, p < 0.001), vascular invasion (6.5% vs. 2.1%, p=0.011), high-grade pathologic features (p < 0.001), worse 4-year disease-free survival (p < 0.001) compared with non-MR-eligible group. The prediction power for MR-based radiomics model predicting high-risk pathologic features was good, with mean area under the receiver operating curve (AUC) value measuring 0.751-0.886 in test sets. Adding clinical variables increased the predictive performance for MPsol and the poorly differentiated pattern using the 2021 grading system (AUC, 0.860 and 0.907, respectively).
Conclusion
Our imaging criteria can effectively screen high-risk lung cancer patients and predict high-risk pathologic features by our MR-based prediction model using radiomics.
5.Whole genome sequencing analysis of enteropathogenic Escherichia coli from human and companion animals in Korea
Jae Young OH ; Kyung-Hyo DO ; Jae Hong JEONG ; SuMin KWAK ; Sujin CHOE ; Dongheui AN ; Jong-Chan CHAE ; Kwangjun LEE ; Kwang-Won SEO
Journal of Veterinary Science 2025;26(1):e1-
Objective:
To improve our understanding of EPEC, this study focused on analyzing and comparing the genomic characteristics of EPEC isolates from humans and companion animals in Korea.
Methods:
The whole genome of 26 EPEC isolates from patients with diarrhea and 20 EPEC isolates from companion animals in Korea were sequenced using the Illumina HiSeq X (Illumina, USA) and Oxford Nanopore MinION (Oxford Nanopore Technologies, UK) platforms.
Results:
Most isolates were atypical EPEC, and did not harbor the bfpA gene. The most prevalent virulence genes were found to be ompT (humans: 61.5%; companion animals:60.0%) followed by lpfA (humans: 46.2%; companion animals: 60.0%). Although pangenome analyses showed no apparent correlation among the origin of the strains, virulence profiles, and antimicrobial resistance profiles, isolates included in clade A obtained from both humans and companion animals exhibited high similarity. Additionally, all the isolates included in clade A encoded the ompT gene and did not encode the hlyE gene. The two isolates from companion animals harbored an incomplete bundle-forming pilus region encoding bfpA and bfpB. Moreover, the type IV secretion system-associated genes tra and trb were found in the bfpA-encoding isolates from humans.
Conclusions
and Relevance: Whole-genome sequencing enabled a more accurate analysis of the phylogenetic structure of EPEC and provided better insights into the understanding of EPEC epidemiology and pathogenicity.
7.Enhancing Identification of High-Risk cN0 Lung Adenocarcinoma Patients Using MRI-Based Radiomic Features
Harim KIM ; Jonghoon KIM ; Soohyun HWANG ; You Jin OH ; Joong Hyun AHN ; Min-Ji KIM ; Tae Hee HONG ; Sung Goo PARK ; Joon Young CHOI ; Hong Kwan KIM ; Jhingook KIM ; Sumin SHIN ; Ho Yun LEE
Cancer Research and Treatment 2025;57(1):57-69
Purpose:
This study aimed to develop a magnetic resonance imaging (MRI)–based radiomics model to predict high-risk pathologic features for lung adenocarcinoma: micropapillary and solid pattern (MPsol), spread through air space, and poorly differentiated patterns.
Materials and Methods:
As a prospective study, we screened clinical N0 lung cancer patients who were surgical candidates and had undergone both 18F-fluorodeoxyglucose (FDG) positron emission tomography–computed tomography (PET/CT) and chest CT from August 2018 to January 2020. We recruited patients meeting our proposed imaging criteria indicating high-risk, that is, poorer prognosis of lung adenocarcinoma, using CT and FDG PET/CT. If possible, these patients underwent an MRI examination from which we extracted 77 radiomics features from T1-contrast-enhanced and T2-weighted images. Additionally, patient demographics, maximum standardized uptake value on FDG PET/CT, and the mean apparent diffusion coefficient value on diffusion-weighted image, were considered together to build prediction models for high-risk pathologic features.
Results:
Among 616 patients, 72 patients met the imaging criteria for high-risk lung cancer and underwent lung MRI. The magnetic resonance (MR)–eligible group showed a higher prevalence of nodal upstaging (29.2% vs. 4.2%, p < 0.001), vascular invasion (6.5% vs. 2.1%, p=0.011), high-grade pathologic features (p < 0.001), worse 4-year disease-free survival (p < 0.001) compared with non-MR-eligible group. The prediction power for MR-based radiomics model predicting high-risk pathologic features was good, with mean area under the receiver operating curve (AUC) value measuring 0.751-0.886 in test sets. Adding clinical variables increased the predictive performance for MPsol and the poorly differentiated pattern using the 2021 grading system (AUC, 0.860 and 0.907, respectively).
Conclusion
Our imaging criteria can effectively screen high-risk lung cancer patients and predict high-risk pathologic features by our MR-based prediction model using radiomics.
8.Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction
Jean SEO ; Sumin PARK ; Sungjoo BYUN ; Jinwook CHOI ; Jinho CHOI ; Hyopil SHIN
Healthcare Informatics Research 2025;31(2):166-174
Objectives:
Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.
Methods:
KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a “chosen” response and a “rejected” response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.
Results:
Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.
Conclusions
KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.
10.Whole genome sequencing analysis of enteropathogenic Escherichia coli from human and companion animals in Korea
Jae Young OH ; Kyung-Hyo DO ; Jae Hong JEONG ; SuMin KWAK ; Sujin CHOE ; Dongheui AN ; Jong-Chan CHAE ; Kwangjun LEE ; Kwang-Won SEO
Journal of Veterinary Science 2025;26(1):e1-
Objective:
To improve our understanding of EPEC, this study focused on analyzing and comparing the genomic characteristics of EPEC isolates from humans and companion animals in Korea.
Methods:
The whole genome of 26 EPEC isolates from patients with diarrhea and 20 EPEC isolates from companion animals in Korea were sequenced using the Illumina HiSeq X (Illumina, USA) and Oxford Nanopore MinION (Oxford Nanopore Technologies, UK) platforms.
Results:
Most isolates were atypical EPEC, and did not harbor the bfpA gene. The most prevalent virulence genes were found to be ompT (humans: 61.5%; companion animals:60.0%) followed by lpfA (humans: 46.2%; companion animals: 60.0%). Although pangenome analyses showed no apparent correlation among the origin of the strains, virulence profiles, and antimicrobial resistance profiles, isolates included in clade A obtained from both humans and companion animals exhibited high similarity. Additionally, all the isolates included in clade A encoded the ompT gene and did not encode the hlyE gene. The two isolates from companion animals harbored an incomplete bundle-forming pilus region encoding bfpA and bfpB. Moreover, the type IV secretion system-associated genes tra and trb were found in the bfpA-encoding isolates from humans.
Conclusions
and Relevance: Whole-genome sequencing enabled a more accurate analysis of the phylogenetic structure of EPEC and provided better insights into the understanding of EPEC epidemiology and pathogenicity.


Result Analysis
Print
Save
E-mail