- Author:
Younjee HWANG
1
;
Ju Yeong KIM
;
Se Il KIM
;
Ji Yeon SUNG
;
Hye Su MOON
;
Tai-Soon YONG
;
Ki Ho HONG
;
Hyukmin LEE
;
Dongeun YONG
Author Information
- Publication Type:Original article
- From:Annals of Clinical Microbiology 2025;28(1):3-
- CountryRepublic of Korea
- Language:English
-
Abstract:
Background:The 16S rRNA-targeted next-generation sequencing (NGS) has been widely used as the primary tool for microbiome analysis. However, whether the sequenced microbial diversity absolutely represents the original sample composition remains unclear. This study aimed to evaluate whether 16S rRNA gene-targeted NGS accurately captures bacterial community composition.
Methods:Mock communities were constructed using equal amounts of DNA from 18 bacterial strains in three formats: genomic DNA, recombinant plasmids, and polymerase chain reaction (PCR) templates. The V3V4 region of the 16S rRNA gene was amplified and sequenced using the Illumina MiSeq.
Results:Data regression analysis revealed that the recombinant plasmid produced more accurate and precise correlation curve than that by the gDNA and PCR products, with a slope closest to 1 (1.0082) and the highest R² value (0.9975). Despite the same input amount of bacterial DNA, the NGS read distribution varied across all three mock communities. Using multiple regression analysis, we found that the guanine-cytosine (GC) content of the V3V4 region, 16S rRNA gene, size of gDNA, and copy number of 16S rRNA were significantly associated with the NGS output of each bacterial species.
Conclusion:This study demonstrated that recombinant plasmids are the preferred option for quality control and that NGS output is biased owing to certain bacterial characteristics, such as %GC content, gDNA size, and 16S rRNA gene copy number. Further research is required to develop a system that compensates for NGS process biases using mock communities.