1.Mock communities to assess biases in nextgeneration sequencing of bacterial species representation
Younjee HWANG ; Ju Yeong KIM ; Se Il KIM ; Ji Yeon SUNG ; Hye Su MOON ; Tai-Soon YONG ; Ki Ho HONG ; Hyukmin LEE ; Dongeun YONG
Annals of Clinical Microbiology 2025;28(1):3-
Background:
The 16S rRNA-targeted next-generation sequencing (NGS) has been widely used as the primary tool for microbiome analysis. However, whether the sequenced microbial diversity absolutely represents the original sample composition remains unclear. This study aimed to evaluate whether 16S rRNA gene-targeted NGS accurately captures bacterial community composition.
Methods:
Mock communities were constructed using equal amounts of DNA from 18 bacterial strains in three formats: genomic DNA, recombinant plasmids, and polymerase chain reaction (PCR) templates. The V3V4 region of the 16S rRNA gene was amplified and sequenced using the Illumina MiSeq.
Results:
Data regression analysis revealed that the recombinant plasmid produced more accurate and precise correlation curve than that by the gDNA and PCR products, with a slope closest to 1 (1.0082) and the highest R² value (0.9975). Despite the same input amount of bacterial DNA, the NGS read distribution varied across all three mock communities. Using multiple regression analysis, we found that the guanine-cytosine (GC) content of the V3V4 region, 16S rRNA gene, size of gDNA, and copy number of 16S rRNA were significantly associated with the NGS output of each bacterial species.
Conclusion
This study demonstrated that recombinant plasmids are the preferred option for quality control and that NGS output is biased owing to certain bacterial characteristics, such as %GC content, gDNA size, and 16S rRNA gene copy number. Further research is required to develop a system that compensates for NGS process biases using mock communities.
2.Mock communities to assess biases in nextgeneration sequencing of bacterial species representation
Younjee HWANG ; Ju Yeong KIM ; Se Il KIM ; Ji Yeon SUNG ; Hye Su MOON ; Tai-Soon YONG ; Ki Ho HONG ; Hyukmin LEE ; Dongeun YONG
Annals of Clinical Microbiology 2025;28(1):3-
Background:
The 16S rRNA-targeted next-generation sequencing (NGS) has been widely used as the primary tool for microbiome analysis. However, whether the sequenced microbial diversity absolutely represents the original sample composition remains unclear. This study aimed to evaluate whether 16S rRNA gene-targeted NGS accurately captures bacterial community composition.
Methods:
Mock communities were constructed using equal amounts of DNA from 18 bacterial strains in three formats: genomic DNA, recombinant plasmids, and polymerase chain reaction (PCR) templates. The V3V4 region of the 16S rRNA gene was amplified and sequenced using the Illumina MiSeq.
Results:
Data regression analysis revealed that the recombinant plasmid produced more accurate and precise correlation curve than that by the gDNA and PCR products, with a slope closest to 1 (1.0082) and the highest R² value (0.9975). Despite the same input amount of bacterial DNA, the NGS read distribution varied across all three mock communities. Using multiple regression analysis, we found that the guanine-cytosine (GC) content of the V3V4 region, 16S rRNA gene, size of gDNA, and copy number of 16S rRNA were significantly associated with the NGS output of each bacterial species.
Conclusion
This study demonstrated that recombinant plasmids are the preferred option for quality control and that NGS output is biased owing to certain bacterial characteristics, such as %GC content, gDNA size, and 16S rRNA gene copy number. Further research is required to develop a system that compensates for NGS process biases using mock communities.
3.Mock communities to assess biases in nextgeneration sequencing of bacterial species representation
Younjee HWANG ; Ju Yeong KIM ; Se Il KIM ; Ji Yeon SUNG ; Hye Su MOON ; Tai-Soon YONG ; Ki Ho HONG ; Hyukmin LEE ; Dongeun YONG
Annals of Clinical Microbiology 2025;28(1):3-
Background:
The 16S rRNA-targeted next-generation sequencing (NGS) has been widely used as the primary tool for microbiome analysis. However, whether the sequenced microbial diversity absolutely represents the original sample composition remains unclear. This study aimed to evaluate whether 16S rRNA gene-targeted NGS accurately captures bacterial community composition.
Methods:
Mock communities were constructed using equal amounts of DNA from 18 bacterial strains in three formats: genomic DNA, recombinant plasmids, and polymerase chain reaction (PCR) templates. The V3V4 region of the 16S rRNA gene was amplified and sequenced using the Illumina MiSeq.
Results:
Data regression analysis revealed that the recombinant plasmid produced more accurate and precise correlation curve than that by the gDNA and PCR products, with a slope closest to 1 (1.0082) and the highest R² value (0.9975). Despite the same input amount of bacterial DNA, the NGS read distribution varied across all three mock communities. Using multiple regression analysis, we found that the guanine-cytosine (GC) content of the V3V4 region, 16S rRNA gene, size of gDNA, and copy number of 16S rRNA were significantly associated with the NGS output of each bacterial species.
Conclusion
This study demonstrated that recombinant plasmids are the preferred option for quality control and that NGS output is biased owing to certain bacterial characteristics, such as %GC content, gDNA size, and 16S rRNA gene copy number. Further research is required to develop a system that compensates for NGS process biases using mock communities.
4.Mock communities to assess biases in nextgeneration sequencing of bacterial species representation
Younjee HWANG ; Ju Yeong KIM ; Se Il KIM ; Ji Yeon SUNG ; Hye Su MOON ; Tai-Soon YONG ; Ki Ho HONG ; Hyukmin LEE ; Dongeun YONG
Annals of Clinical Microbiology 2025;28(1):3-
Background:
The 16S rRNA-targeted next-generation sequencing (NGS) has been widely used as the primary tool for microbiome analysis. However, whether the sequenced microbial diversity absolutely represents the original sample composition remains unclear. This study aimed to evaluate whether 16S rRNA gene-targeted NGS accurately captures bacterial community composition.
Methods:
Mock communities were constructed using equal amounts of DNA from 18 bacterial strains in three formats: genomic DNA, recombinant plasmids, and polymerase chain reaction (PCR) templates. The V3V4 region of the 16S rRNA gene was amplified and sequenced using the Illumina MiSeq.
Results:
Data regression analysis revealed that the recombinant plasmid produced more accurate and precise correlation curve than that by the gDNA and PCR products, with a slope closest to 1 (1.0082) and the highest R² value (0.9975). Despite the same input amount of bacterial DNA, the NGS read distribution varied across all three mock communities. Using multiple regression analysis, we found that the guanine-cytosine (GC) content of the V3V4 region, 16S rRNA gene, size of gDNA, and copy number of 16S rRNA were significantly associated with the NGS output of each bacterial species.
Conclusion
This study demonstrated that recombinant plasmids are the preferred option for quality control and that NGS output is biased owing to certain bacterial characteristics, such as %GC content, gDNA size, and 16S rRNA gene copy number. Further research is required to develop a system that compensates for NGS process biases using mock communities.
5.Mock communities to assess biases in nextgeneration sequencing of bacterial species representation
Younjee HWANG ; Ju Yeong KIM ; Se Il KIM ; Ji Yeon SUNG ; Hye Su MOON ; Tai-Soon YONG ; Ki Ho HONG ; Hyukmin LEE ; Dongeun YONG
Annals of Clinical Microbiology 2025;28(1):3-
Background:
The 16S rRNA-targeted next-generation sequencing (NGS) has been widely used as the primary tool for microbiome analysis. However, whether the sequenced microbial diversity absolutely represents the original sample composition remains unclear. This study aimed to evaluate whether 16S rRNA gene-targeted NGS accurately captures bacterial community composition.
Methods:
Mock communities were constructed using equal amounts of DNA from 18 bacterial strains in three formats: genomic DNA, recombinant plasmids, and polymerase chain reaction (PCR) templates. The V3V4 region of the 16S rRNA gene was amplified and sequenced using the Illumina MiSeq.
Results:
Data regression analysis revealed that the recombinant plasmid produced more accurate and precise correlation curve than that by the gDNA and PCR products, with a slope closest to 1 (1.0082) and the highest R² value (0.9975). Despite the same input amount of bacterial DNA, the NGS read distribution varied across all three mock communities. Using multiple regression analysis, we found that the guanine-cytosine (GC) content of the V3V4 region, 16S rRNA gene, size of gDNA, and copy number of 16S rRNA were significantly associated with the NGS output of each bacterial species.
Conclusion
This study demonstrated that recombinant plasmids are the preferred option for quality control and that NGS output is biased owing to certain bacterial characteristics, such as %GC content, gDNA size, and 16S rRNA gene copy number. Further research is required to develop a system that compensates for NGS process biases using mock communities.
6.Real-World Eligibility and Cost-Effectiveness Analysis of Empagliflozin for Heart Failure in Korea
Eui-Soon KIM ; Sun-Kyeong PARK ; Jong-Chan YOUN ; Hye Sun LEE ; Hae-Young LEE ; Hyun-Jai CHO ; Jin-Oh CHOI ; Eun-Seok JEON ; Sang Eun LEE ; Min-Seok KIM ; Jae-Joong KIM ; Kyung-Kuk HWANG ; Myeong-Chan CHO ; Shung Chull CHAE ; Seok-Min KANG ; Jin Joo PARK ; Dong-Ju CHOI ; Byung-Su YOO ; Jae Yeong CHO ; Kye Hun KIM ; Byung-Hee OH ; Barry GREENBERG ; Sang Hong BAEK
Journal of Korean Medical Science 2024;39(1):e8-
Background:
The US Food and Drug Administration (FDA) and European Medicines Agency (EMA) approved empagliflozin for reducing cardiovascular mortality and heart failure (HF) hospitalization in patients with both HF with reduced ejection fraction (HFrEF) and HF with preserved ejection fraction (HFpEF). However, limited data are available on the generalizability of empagliflozin to clinical practice. Therefore, we evaluated real-world eligibility and potential cost-effectiveness based on a nationwide prospective HF registry.
Methods:
A total of 3,108 HFrEF and 2,070 HFpEF patients from the Korean Acute Heart Failure (KorAHF) registry were analyzed. Eligibility was estimated by inclusion and exclusion criteria of EMPagliflozin outcomE tRial in Patients With chrOnic heaRt Failure With Reduced Ejection Fraction (EMPEROR-Reduced) and EMPagliflozin outcomE tRial in Patients With chrOnic heaRt Failure With Preserved Ejection Fraction (EMPEROR-Preserved) trials and by FDA & EMA label criteria. The cost-utility analysis was done using a Markov model to project the lifetime medical cost and quality-adjusted life year (QALY).
Results:
Among the KorAHF patients, 91.4% met FDA & EMA label criteria, while 44.7% met the clinical trial criteria. The incremental cost-effectiveness ratio of empagliflozin was calculated at US$6,764 per QALY in the overall population, which is far below a threshold of US$18,182 per QALY. The cost-effectiveness benefit was more evident in patients with HFrEF (US$5,012 per QALY) than HFpEF (US$8,971 per QALY).
Conclusion
There is a large discrepancy in real-world eligibility for empagliflozin between FDA & EMA labels and clinical trial criteria. Empagliflozin is cost-effective in HF patients regardless of ejection fraction in South Korea health care setting. The efficacy and safety of empagliflozin in real-world HF patients should be further investigated for a broader range of clinical applications.
7.Korea Seroprevalence Study of Monitoring of SARS-COV-2 Antibody Retention and Transmission (K-SEROSMART): findings from national representative sample
Jina HAN ; Hye Jin BAEK ; Eunbi NOH ; Kyuhyun YOON ; Jung Ae KIM ; Sukhyun RYU ; Kay O LEE ; No Yai PARK ; Eunok JUNG ; Sangil KIM ; Hyukmin LEE ; Yoo-Sung HWANG ; Jaehun JUNG ; Hun Jae LEE ; Sung-il CHO ; Sangcheol OH ; Migyeong KIM ; Chang-Mo OH ; Byengchul YU ; Young-Seoub HONG ; Keonyeop KIM ; Sunjae JUNG ; Mi Ah HAN ; Moo-Sik LEE ; Jung-Jeung LEE ; Young HWANGBO ; Hyeon Woo YIM ; Yu-Mi KIM ; Joongyub LEE ; Weon-Young LEE ; Jae-Hyun PARK ; Sungsoo OH ; Heui Sug JO ; Hyeongsu KIM ; Gilwon KANG ; Hae-Sung NAM ; Ju-Hyung LEE ; Gyung-Jae OH ; Min-Ho SHIN ; Soyeon RYU ; Tae-Yoon HWANG ; Soon-Woo PARK ; Sang Kyu KIM ; Roma SEOL ; Ki-Soo PARK ; Su Young KIM ; Jun-wook KWON ; Sung Soon KIM ; Byoungguk KIM ; June-Woo LEE ; Eun Young JANG ; Ah-Ra KIM ; Jeonghyun NAM ; ; Soon Young LEE ; Dong-Hyun KIM
Epidemiology and Health 2023;45(1):e2023075-
OBJECTIVES:
We estimated the population prevalence of antibodies to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), including unreported infections, through a Korea Seroprevalence Study of Monitoring of SARS-CoV-2 Antibody Retention and Transmission (K-SEROSMART) in 258 communities throughout Korea.
METHODS:
In August 2022, a survey was conducted among 10,000 household members aged 5 years and older, in households selected through two stage probability random sampling. During face-to-face household interviews, participants self-reported their health status, COVID-19 diagnosis and vaccination history, and general characteristics. Subsequently, participants visited a community health center or medical clinic for blood sampling. Blood samples were analyzed for the presence of antibodies to spike proteins (anti-S) and antibodies to nucleocapsid proteins (anti-N) SARS-CoV-2 proteins using an electrochemiluminescence immunoassay. To estimate the population prevalence, the PROC SURVEYMEANS statistical procedure was employed, with weighting to reflect demographic data from July 2022.
RESULTS:
In total, 9,945 individuals from 5,041 households were surveyed across 258 communities, representing all basic local governments in Korea. The overall population-adjusted prevalence rates of anti-S and anti-N were 97.6% and 57.1%, respectively. Since the Korea Disease Control and Prevention Agency has reported a cumulative incidence of confirmed cases of 37.8% through July 31, 2022, the proportion of unreported infections among all COVID-19 infection was suggested to be 33.9%.
CONCLUSIONS
The K-SEROSMART represents the first nationwide, community-based seroepidemiologic survey of COVID-19, confirming that most individuals possess antibodies to SARS-CoV-2 and that a significant number of unreported cases existed. Furthermore, this study lays the foundation for a surveillance system to continuously monitor transmission at the community level and the response to COVID-19.
8.Generative Adversarial Network-Based Image Conversion Among Different Computed Tomography Protocols and Vendors: Effects on Accuracy and Variability in Quantifying Regional Disease Patterns of Interstitial Lung Disease
Hye Jeon HWANG ; Hyunjong KIM ; Joon Beom SEO ; Jong Chul YE ; Gyutaek OH ; Sang Min LEE ; Ryoungwoo JANG ; Jihye YUN ; Namkug KIM ; Hee Jun PARK ; Ho Yun LEE ; Soon Ho YOON ; Kyung Eun SHIN ; Jae Wook LEE ; Woocheol KWON ; Joo Sung SUN ; Seulgi YOU ; Myung Hee CHUNG ; Bo Mi GIL ; Jae-Kwang LIM ; Youkyung LEE ; Su Jin HONG ; Yo Won CHOI
Korean Journal of Radiology 2023;24(8):807-820
Objective:
To assess whether computed tomography (CT) conversion across different scan parameters and manufacturers using a routable generative adversarial network (RouteGAN) can improve the accuracy and variability in quantifying interstitial lung disease (ILD) using a deep learning-based automated software.
Materials and Methods:
This study included patients with ILD who underwent thin-section CT. Unmatched CT images obtained using scanners from four manufacturers (vendors A-D), standard- or low-radiation doses, and sharp or medium kernels were classified into groups 1–7 according to acquisition conditions. CT images in groups 2–7 were converted into the target CT sty le (Group 1: vendor A, standard dose, and sharp kernel) using a RouteGAN. ILD was quantified on original and converted CT images using a deep learning-based software (Aview, Coreline Soft). The accuracy of quantification was analyzed using the dice similarity coefficient (DSC) and pixel-wise overlap accuracy metrics against manual quantification by a radiologist. Five radiologists evaluated quantification accuracy using a 10-point visual scoring system.
Results:
Three hundred and fifty CT slices from 150 patients (mean age: 67.6 ± 10.7 years; 56 females) were included. The overlap accuracies for quantifying total abnormalities in groups 2–7 improved after CT conversion (original vs. converted: 0.63vs. 0.68 for DSC, 0.66 vs. 0.70 for pixel-wise recall, and 0.68 vs. 0.73 for pixel-wise precision; P < 0.002 for all). The DSCs of fibrosis score, honeycombing, and reticulation significantly increased after CT conversion (0.32 vs. 0.64, 0.19 vs. 0.47, and 0.23 vs. 0.54, P < 0.002 for all), whereas those of ground-glass opacity, consolidation, and emphysema did not change significantly or decreased slightly. The radiologists’ scores were significantly higher (P < 0.001) and less variable on converted CT.
Conclusion
CT conversion using a RouteGAN can improve the accuracy and variability of CT images obtained using different scan parameters and manufacturers in deep learning-based quantification of ILD.
9.A Report on a Nationwide Surveillance System for Pediatric Acute Hepatitis of Unknown Etiology in Korea
Kyung Jae LEE ; Jae Sung KO ; Kie Young PARK ; Ki Soo KANG ; Kunsong LEE ; Jeana HONG ; Soon Chul KIM ; Yoon LEE ; Ben KANG ; Yu Bin KIM ; Hyun Jin KIM ; Byung Wook EUN ; Hye-Kyung CHO ; Yae-Jean KIM ; Mi Jin KIM ; Jin LEE ; Taek-Jin LEE ; Seak Hee OH ; Sowon PARK ; Eun Ha HWANG ; Sangjun SOHN ; Jin Gyu LIM ; YooJin KIM ; Yeoun Joo LEE
Journal of Korean Medical Science 2023;38(47):e401-
Background:
Several cases of pediatric acute hepatitis of unknown etiology related to adenoviral infections have been reported in Europe since January 2022. The aim of this study was to compare the incidence, severity, possible etiology, and prognosis of the disease with those in the past in Korea.
Methods:
The surveillance group collected data between May and November 2022 using a surveillance system. Acute hepatitis of unknown etiology was defined in patients aged < 16 years with a serum transaminase level > 500 IU/L, not due to hepatitis A-E or other underlying causes. For comparison, data from 18 university hospitals were retrospectively collected as a control group between January 2021 and April 2022.
Results:
We enrolled 270 patients (mean age, 5 years). The most common symptom was fever. However, the incidence was similar between 2021 and 2022. Liver function test results, number of patients with acute liver failure (ALF), liver transplantation (LT), death, and adenovirus detection rates did not differ between the two groups. None of the adenoviruspositive patients in either group experienced ALF, LT, or death. In the surveillance group, adenovirus-associated virus-2 was detected in four patients, one of whom underwent LT. Patients with an unknown etiology showed significantly higher bilirubin levels, a lower platelet count, and a higher LT rate than patients with a possible etiology.
Conclusion
The incidence of pediatric acute hepatitis of unknown etiology and adenovirus detection rate have not increased in Korea.
10.Myocardial Infarction, Stroke, and All-Cause Mortality according to Low-Density Lipoprotein Cholesterol Level in the Elderly, a Nationwide Study
You-Bin LEE ; Minji KOO ; Eunjin NOH ; Soon Young HWANG ; Jung A KIM ; Eun ROH ; So-hyeon HONG ; Kyung Mook CHOI ; Sei Hyun BAIK ; Geum Joon CHO ; Hye Jin YOO
Diabetes & Metabolism Journal 2022;46(5):722-732
Background:
We assessed the myocardial infarction (MI), stroke, and all-cause death risks during follow-up according to the low-density lipoprotein cholesterol (LDL-C) levels among older adults.
Methods:
The Korean National Health Insurance Service datasets (2002 to 2020) were used for this population-based cohort study. The hazards of MI, stroke, and all-cause mortality during follow-up were analyzed according to LDL-C level in individuals aged ≥65 years without baseline cardiovascular diseases (n=1,391,616).
Results:
During a mean 7.55 years, 52,753 MIs developed; 84,224 strokes occurred over a mean 7.47 years. After a mean 8.50 years, 233,963 died. A decrease in LDL-C was associated with lower hazards of MI and stroke. The decreased hazard of stroke in lower LDL-C was more pronounced in statin users, and individuals with diabetes or obesity. The hazard of all-cause death during follow-up showed an inverted J-shaped pattern according to the LDL-C levels. However, the paradoxically increased hazard of mortality during follow-up in lower LDL-C was attenuated in statin users and individuals with diabetes, hypertension, or obesity. In statin users, lower LDL-C was associated with a decreased hazard of mortality during follow-up.
Conclusion
Among the elderly, lower LDL-C was associated with decreased risks of MI and stroke. Lower LDL-C achieved by statins in the elderly was associated with a decreased risk of all-cause death during follow-up, suggesting that LDL-C paradox for the premature death risk in the elderly should not be applied to statin users. Intensive statin therapy should not be hesitated for older adults with cardiovascular risk factors including diabetes.

Result Analysis
Print
Save
E-mail