1.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
2.The combination of CDX2 expression status and tumor-infiltrating lymphocyte density as a prognostic factor in adjuvant FOLFOX-treated patients with stage III colorectal cancers
Ji-Ae LEE ; Hye Eun PARK ; Hye-Yeong JIN ; Lingyan JIN ; Seung Yeon YOO ; Nam-Yun CHO ; Jeong Mo BAE ; Jung Ho KIM ; Gyeong Hoon KANG
Journal of Pathology and Translational Medicine 2025;59(1):50-59
Background:
Colorectal carcinomas (CRCs) with caudal-type homeobox 2 (CDX2) loss are recognized to pursue an aggressive behavior but tend to be accompanied by a high density of tumor-infiltrating lymphocytes (TILs). However, little is known about whether there is an interplay between CDX2 loss and TIL density in the survival of patients with CRC.
Methods:
Stage III CRC tissues were assessed for CDX2 loss using immunohistochemistry and analyzed for their densities of CD8 TILs in both intraepithelial (iTILs) and stromal areas using a machine learning-based analytic method.
Results:
CDX2 loss was significantly associated with a higher density of CD8 TILs in both intraepithelial and stromal areas. Both CDX2 loss and a high CD8 iTIL density were found to be prognostic parameters and showed hazard ratios of 2.314 (1.050–5.100) and 0.378 (0.175–0.817), respectively, for cancer-specific survival. A subset of CRCs with retained CDX2 expression and a high density of CD8 iTILs showed the best clinical outcome (hazard ratio of 0.138 [0.023–0.826]), whereas a subset with CDX2 loss and a high density of CD8 iTILs exhibited the worst clinical outcome (15.781 [3.939–63.230]).
Conclusions
Altogether, a high density of CD8 iTILs did not make a difference in the survival of patients with CRC with CDX2 loss. The combination of CDX2 expression and intraepithelial CD8 TIL density was an independent prognostic marker in adjuvant chemotherapy-treated patients with stage III CRC.
3.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
4.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
5.The combination of CDX2 expression status and tumor-infiltrating lymphocyte density as a prognostic factor in adjuvant FOLFOX-treated patients with stage III colorectal cancers
Ji-Ae LEE ; Hye Eun PARK ; Hye-Yeong JIN ; Lingyan JIN ; Seung Yeon YOO ; Nam-Yun CHO ; Jeong Mo BAE ; Jung Ho KIM ; Gyeong Hoon KANG
Journal of Pathology and Translational Medicine 2025;59(1):50-59
Background:
Colorectal carcinomas (CRCs) with caudal-type homeobox 2 (CDX2) loss are recognized to pursue an aggressive behavior but tend to be accompanied by a high density of tumor-infiltrating lymphocytes (TILs). However, little is known about whether there is an interplay between CDX2 loss and TIL density in the survival of patients with CRC.
Methods:
Stage III CRC tissues were assessed for CDX2 loss using immunohistochemistry and analyzed for their densities of CD8 TILs in both intraepithelial (iTILs) and stromal areas using a machine learning-based analytic method.
Results:
CDX2 loss was significantly associated with a higher density of CD8 TILs in both intraepithelial and stromal areas. Both CDX2 loss and a high CD8 iTIL density were found to be prognostic parameters and showed hazard ratios of 2.314 (1.050–5.100) and 0.378 (0.175–0.817), respectively, for cancer-specific survival. A subset of CRCs with retained CDX2 expression and a high density of CD8 iTILs showed the best clinical outcome (hazard ratio of 0.138 [0.023–0.826]), whereas a subset with CDX2 loss and a high density of CD8 iTILs exhibited the worst clinical outcome (15.781 [3.939–63.230]).
Conclusions
Altogether, a high density of CD8 iTILs did not make a difference in the survival of patients with CRC with CDX2 loss. The combination of CDX2 expression and intraepithelial CD8 TIL density was an independent prognostic marker in adjuvant chemotherapy-treated patients with stage III CRC.
6.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
7.The combination of CDX2 expression status and tumor-infiltrating lymphocyte density as a prognostic factor in adjuvant FOLFOX-treated patients with stage III colorectal cancers
Ji-Ae LEE ; Hye Eun PARK ; Hye-Yeong JIN ; Lingyan JIN ; Seung Yeon YOO ; Nam-Yun CHO ; Jeong Mo BAE ; Jung Ho KIM ; Gyeong Hoon KANG
Journal of Pathology and Translational Medicine 2025;59(1):50-59
Background:
Colorectal carcinomas (CRCs) with caudal-type homeobox 2 (CDX2) loss are recognized to pursue an aggressive behavior but tend to be accompanied by a high density of tumor-infiltrating lymphocytes (TILs). However, little is known about whether there is an interplay between CDX2 loss and TIL density in the survival of patients with CRC.
Methods:
Stage III CRC tissues were assessed for CDX2 loss using immunohistochemistry and analyzed for their densities of CD8 TILs in both intraepithelial (iTILs) and stromal areas using a machine learning-based analytic method.
Results:
CDX2 loss was significantly associated with a higher density of CD8 TILs in both intraepithelial and stromal areas. Both CDX2 loss and a high CD8 iTIL density were found to be prognostic parameters and showed hazard ratios of 2.314 (1.050–5.100) and 0.378 (0.175–0.817), respectively, for cancer-specific survival. A subset of CRCs with retained CDX2 expression and a high density of CD8 iTILs showed the best clinical outcome (hazard ratio of 0.138 [0.023–0.826]), whereas a subset with CDX2 loss and a high density of CD8 iTILs exhibited the worst clinical outcome (15.781 [3.939–63.230]).
Conclusions
Altogether, a high density of CD8 iTILs did not make a difference in the survival of patients with CRC with CDX2 loss. The combination of CDX2 expression and intraepithelial CD8 TIL density was an independent prognostic marker in adjuvant chemotherapy-treated patients with stage III CRC.
8.Synthetic data production for biomedical research
Yun Gyeong LEE ; Mi-Sook KWAK ; Jeong Eun KIM ; Min Sun KIM ; Dong Un NO ; Hee Youl CHAI
Osong Public Health and Research Perspectives 2025;16(2):94-99
Synthetic data, generated using advanced artificial intelligence (AI) techniques, replicates the statistical properties of real-world datasets while excluding identifiable information.Although synthetic data does not consist of actual data points, it is derived from original datasets, thereby enabling analyses that yield results comparable to those obtained with real data. Synthetic datasets are evaluated based on their utility—a measure of how effectively they mirror real data for analytical purposes. This paper presents the generation of synthetic datasets through the Healthcare Big Data Showcase Project (2019–2023). The original dataset comprises comprehensive multi-omics data from 400 individuals, including cancer survivors, chronic disease patients, and healthy participants. Synthetic data facilitates efficient access and robust analyses, serving as a practical tool for research and education. It addresses privacy concerns, supports AI research, and provides a foundation for innovative applications across diverse fields, such as public health and precision medicine.
9.Outcomes in Refractory Diffuse Large B-Cell Lymphoma: Results from Two Prospective Korean Cohorts
Jun Ho YI ; Seong Hyun JEONG ; Seok Jin KIM ; Dok Hyun YOON ; Hye Jin KANG ; Youngil KOH ; Jin Seok KIM ; Won-Sik LEE ; Deok-Hwan YANG ; Young Rok DO ; Min Kyoung KIM ; Kwai Han YOO ; Yoon Seok CHOI ; Whan Jung YUN ; Yong PARK ; Jae-Cheol JO ; Hyeon-Seok EOM ; Jae-Yong KWAK ; Ho-Jin SHIN ; Byeong Bae PARK ; Seong Yoon YI ; Ji-Hyun KWON ; Sung Yong OH ; Hyo Jung KIM ; Byeong Seok SOHN ; Jong Ho WON ; Dae-Sik HONG ; Ho-Sup LEE ; Gyeong-Won LEE ; Cheolwon SUH ; Won Seog KIM
Cancer Research and Treatment 2023;55(1):325-333
Purpose:
Diffuse large B-cell lymphoma (DLBCL) is the most common hematologic malignancy worldwide. Although substantial improvement has been achieved by the frontline rituximab-based chemoimmunotherapy, up to 40%-50% of patients will eventually have relapsed or refractory disease, whose prognosis is extremely dismal.
Materials and Methods:
We have carried out two prospective cohort studies that include over 1,500 DLBCL patients treated with rituximab plus CHOP (#NCT01202448 and #NCT02474550). In the current report, we describe the outcomes of refractory DLBCL patients. Patients were defined to have refractory DLBCL if they met one of the followings, not achieving at least partial response after 4 or more cycles of R-CHOP; not achieving at least partial response after 2 or more cycles of salvage therapy; progressive disease within 12 months after autologous stem cell transplantation.
Results:
Among 1,581 patients, a total of 260 patients met the criteria for the refractory disease after a median time to progression of 9.1 months. The objective response rate of salvage treatment was 26.4%, and the complete response rate was 9.6%. The median overall survival (OS) was 7.5 months (95% confidence interval, 6.4 to 8.6), and the 2-year survival rate was 22.1%±2.8%. The median OS for each refractory category was not significantly different (p=0.529).
Conclusion
In line with the previous studies, the outcomes of refractory DLBCL patients were extremely poor, which necessitates novel approaches for this population.
10.Erythema Nodosum after Vaccination against Coronavirus Disease 2019
Na Gyeong YANG ; Jeong Yeon HONG ; Jae Yun KIM ; Sung Yul LEE ; Jung Eun KIM ; Shi Nae YU ; Euy Hyun CHUNG
Korean Journal of Dermatology 2022;60(6):383-386
Erythema nodosum (EN) is the most common form of panniculitis and may be triggered by a variety of stimuli, including infections, drugs, pregnancy, sarcoidosis, inflammatory bowel disease, and malignancies. Rare cases of vaccination-related EN have been reported, but none due to the coronavirus disease 2019 (COVID-19) vaccine of Pfizer have been documented. We report a case of EN associated with the Pfizer vaccine. A 43-year-old woman presented with acute-onset painful nodular lesions that appeared bilaterally on the extensor surface of the lower legs. These lesions appeared 5 days after the first dose of Pfizer vaccination. The patient reported no recent infectious history other than fever for 3 days after vaccination. Skin biopsy revealed inflammation extending into the subcutaneous fat with a septal distribution. It is important for physicians to be aware of the side effects of the COVID-19 vaccine because more people are bound to be vaccinated.

Result Analysis
Print
Save
E-mail