1.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
2.Non-Linear Association Between Physical Activities and Type 2Diabetes in 2.4 Million Korean Population, 2009–2022: A Nationwide Representative Study
Wonwoo JANG ; Seokjun KIM ; Yejun SON ; Soeun KIM ; Hayeon LEE ; Jaeyu PARK ; Kyeongmin LEE ; Jiseung KANG ; Damiano PIZZOL ; Jiyoung HWANG ; Sang Youl RHEE ; Dong Keon YON
Journal of Korean Medical Science 2025;40(12):e42-
Background:
Although excessive physical activity (PA) does not always confer additional health benefits, there is a paucity of studies that have quantitatively examined the doseresponse relationship between PA and type 2 diabetes. Therefore, this study investigated the relationship between the type 2 diabetes prevalence and intensity, frequency, and metabolic equivalent of task (MET) score of PA in a large population sample.
Methods:
We conducted a nationwide cross-sectional analysis examining sociodemographic variables, PA habits, and type 2 diabetes prevalence in 2,428,448 participants included in the Korea Community Health Survey. The non-linear association between MET score and odds ratios (ORs) for type 2 diabetes prevalence was plotted using a weighted generalized additive model. Categorical analysis was used to examine the joint association of moderate-intensity PA (MPA) and vigorous-intensity PA (VPA), and the influence of PA frequency.
Results:
MET score and diabetes prevalence revealed a non-linear association with the nadir at 1,028 MET-min/week, beyond which ORs increased with additional PA. Joint analysis of MPA and VPA showed the lowest OR of 0.79 (95% confidence interval, 0.75–0.84) for those engaging in 300–600 MET-min/week of MPA and > 600 MET-min/week of VPA concurrently, corresponding with World Health Organization recommendations. Additionally, both “weekend warriors” and “regularly active” individuals showed lower ORs compared to the inactive, although no significant difference was noted between the active groups.
Conclusion
In a large South Korean sample, higher PA is not always associated with a lower prevalence of type 2 diabetes, as the association follows a non-linear pattern; differences existed across sociodemographic variables. Considering the joint association, an adequate combination of MPA and VPA is recommended. The frequency of PA does not significantly influence the type 2 diabetes prevalence.
3.Non-Linear Association Between Physical Activities and Type 2Diabetes in 2.4 Million Korean Population, 2009–2022: A Nationwide Representative Study
Wonwoo JANG ; Seokjun KIM ; Yejun SON ; Soeun KIM ; Hayeon LEE ; Jaeyu PARK ; Kyeongmin LEE ; Jiseung KANG ; Damiano PIZZOL ; Jiyoung HWANG ; Sang Youl RHEE ; Dong Keon YON
Journal of Korean Medical Science 2025;40(12):e42-
Background:
Although excessive physical activity (PA) does not always confer additional health benefits, there is a paucity of studies that have quantitatively examined the doseresponse relationship between PA and type 2 diabetes. Therefore, this study investigated the relationship between the type 2 diabetes prevalence and intensity, frequency, and metabolic equivalent of task (MET) score of PA in a large population sample.
Methods:
We conducted a nationwide cross-sectional analysis examining sociodemographic variables, PA habits, and type 2 diabetes prevalence in 2,428,448 participants included in the Korea Community Health Survey. The non-linear association between MET score and odds ratios (ORs) for type 2 diabetes prevalence was plotted using a weighted generalized additive model. Categorical analysis was used to examine the joint association of moderate-intensity PA (MPA) and vigorous-intensity PA (VPA), and the influence of PA frequency.
Results:
MET score and diabetes prevalence revealed a non-linear association with the nadir at 1,028 MET-min/week, beyond which ORs increased with additional PA. Joint analysis of MPA and VPA showed the lowest OR of 0.79 (95% confidence interval, 0.75–0.84) for those engaging in 300–600 MET-min/week of MPA and > 600 MET-min/week of VPA concurrently, corresponding with World Health Organization recommendations. Additionally, both “weekend warriors” and “regularly active” individuals showed lower ORs compared to the inactive, although no significant difference was noted between the active groups.
Conclusion
In a large South Korean sample, higher PA is not always associated with a lower prevalence of type 2 diabetes, as the association follows a non-linear pattern; differences existed across sociodemographic variables. Considering the joint association, an adequate combination of MPA and VPA is recommended. The frequency of PA does not significantly influence the type 2 diabetes prevalence.
4.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
5.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
6.Non-Linear Association Between Physical Activities and Type 2Diabetes in 2.4 Million Korean Population, 2009–2022: A Nationwide Representative Study
Wonwoo JANG ; Seokjun KIM ; Yejun SON ; Soeun KIM ; Hayeon LEE ; Jaeyu PARK ; Kyeongmin LEE ; Jiseung KANG ; Damiano PIZZOL ; Jiyoung HWANG ; Sang Youl RHEE ; Dong Keon YON
Journal of Korean Medical Science 2025;40(12):e42-
Background:
Although excessive physical activity (PA) does not always confer additional health benefits, there is a paucity of studies that have quantitatively examined the doseresponse relationship between PA and type 2 diabetes. Therefore, this study investigated the relationship between the type 2 diabetes prevalence and intensity, frequency, and metabolic equivalent of task (MET) score of PA in a large population sample.
Methods:
We conducted a nationwide cross-sectional analysis examining sociodemographic variables, PA habits, and type 2 diabetes prevalence in 2,428,448 participants included in the Korea Community Health Survey. The non-linear association between MET score and odds ratios (ORs) for type 2 diabetes prevalence was plotted using a weighted generalized additive model. Categorical analysis was used to examine the joint association of moderate-intensity PA (MPA) and vigorous-intensity PA (VPA), and the influence of PA frequency.
Results:
MET score and diabetes prevalence revealed a non-linear association with the nadir at 1,028 MET-min/week, beyond which ORs increased with additional PA. Joint analysis of MPA and VPA showed the lowest OR of 0.79 (95% confidence interval, 0.75–0.84) for those engaging in 300–600 MET-min/week of MPA and > 600 MET-min/week of VPA concurrently, corresponding with World Health Organization recommendations. Additionally, both “weekend warriors” and “regularly active” individuals showed lower ORs compared to the inactive, although no significant difference was noted between the active groups.
Conclusion
In a large South Korean sample, higher PA is not always associated with a lower prevalence of type 2 diabetes, as the association follows a non-linear pattern; differences existed across sociodemographic variables. Considering the joint association, an adequate combination of MPA and VPA is recommended. The frequency of PA does not significantly influence the type 2 diabetes prevalence.
7.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
8.Non-Linear Association Between Physical Activities and Type 2Diabetes in 2.4 Million Korean Population, 2009–2022: A Nationwide Representative Study
Wonwoo JANG ; Seokjun KIM ; Yejun SON ; Soeun KIM ; Hayeon LEE ; Jaeyu PARK ; Kyeongmin LEE ; Jiseung KANG ; Damiano PIZZOL ; Jiyoung HWANG ; Sang Youl RHEE ; Dong Keon YON
Journal of Korean Medical Science 2025;40(12):e42-
Background:
Although excessive physical activity (PA) does not always confer additional health benefits, there is a paucity of studies that have quantitatively examined the doseresponse relationship between PA and type 2 diabetes. Therefore, this study investigated the relationship between the type 2 diabetes prevalence and intensity, frequency, and metabolic equivalent of task (MET) score of PA in a large population sample.
Methods:
We conducted a nationwide cross-sectional analysis examining sociodemographic variables, PA habits, and type 2 diabetes prevalence in 2,428,448 participants included in the Korea Community Health Survey. The non-linear association between MET score and odds ratios (ORs) for type 2 diabetes prevalence was plotted using a weighted generalized additive model. Categorical analysis was used to examine the joint association of moderate-intensity PA (MPA) and vigorous-intensity PA (VPA), and the influence of PA frequency.
Results:
MET score and diabetes prevalence revealed a non-linear association with the nadir at 1,028 MET-min/week, beyond which ORs increased with additional PA. Joint analysis of MPA and VPA showed the lowest OR of 0.79 (95% confidence interval, 0.75–0.84) for those engaging in 300–600 MET-min/week of MPA and > 600 MET-min/week of VPA concurrently, corresponding with World Health Organization recommendations. Additionally, both “weekend warriors” and “regularly active” individuals showed lower ORs compared to the inactive, although no significant difference was noted between the active groups.
Conclusion
In a large South Korean sample, higher PA is not always associated with a lower prevalence of type 2 diabetes, as the association follows a non-linear pattern; differences existed across sociodemographic variables. Considering the joint association, an adequate combination of MPA and VPA is recommended. The frequency of PA does not significantly influence the type 2 diabetes prevalence.
9.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
10.The Moderating Effect of Resilience on the Relationship Between the Relevance to Victims With Post-Trauma Psychiatric Symptoms of Community Residents After Seoul Halloween Crowd Crush
Se Youl KIM ; Sra JUNG ; Mi Yeon LEE ; Kang-Seob OH ; Young-Chul SHIN ; Dong-Won SHIN ; Junhyung KIM ; Eun Soo KIM ; Sun Wook JUNG ; Kwang-yeol LEE ; Nahyun OH ; Sung Joon CHO ; Sang-Won JEON
Psychiatry Investigation 2024;21(11):1183-1192
Objective:
This study aimed to examine the psychiatric impact of the Seoul Halloween crowd crush on individuals related to the victims compared to the general population. It also explores the moderating effect of resilience on the relationship between trauma exposure and psychiatric symptoms.
Methods:
In total, 2,220 participants completed various post-incident questionnaires (Patient Health Questionnaire-9, Generalized Anxiety Disorder-7, Hwa-byung symptom scale, post-traumatic stress disorder checklist for DSM-5, and Brief Resilience Scale) 30 days after the incident. Moderation analyses were conducted using the PROCESS macro in the statistical package for the social sciences.
Results:
Individuals related to the victims exhibited higher symptom severity and a greater risk for clinically significant levels of depression, anxiety, anger, and post-traumatic stress disorder (PTSD) (odds ratio=3.28, 3.33, 1.51, and 4.39 respectively). The impact of relevance to victims on anxiety and PTSD symptoms was moderated by resilience, with a stronger effect observed for individuals with low resilience (β=3.51, 95% confidence interval [CI] 2.78–4.24 for anxiety and β=14.53, 95% CI 12.43–16.63 for PTSD) than for those with high resilience (β=1.69, 95% CI 0.72–2.65 for anxiety and β=8.33, 95% CI 5.56–11.09 for PTSD).
Conclusion
When related to the victims, it was found that not only PTSD, but also depression, anxiety, and anger could intensify. Resilience emerged as a potential buffer against these adverse effects, emphasizing its significance in mitigating the psychiatric impact of community trauma.

Result Analysis
Print
Save
E-mail