A practical guide to amplicon and metagenomic analysis of microbiome data.
10.1007/s13238-020-00724-8
- Author:
Yong-Xin LIU
1
;
Yuan QIN
2
;
Tong CHEN
3
;
Meiping LU
4
;
Xubo QIAN
4
;
Xiaoxuan GUO
2
;
Yang BAI
5
Author Information
1. State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. yxliu@genetics.ac.cn.
2. State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.
3. National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China.
4. Department of Rheumatology Immunology & Allergy, Children's Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, 310053, China.
5. State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. ybai@genetics.ac.cn.
- Publication Type:Review
- Keywords:
high-throughput sequencing;
marker genes;
metagenome;
pipeline;
reproducible analysis;
visualization
- From:
Protein & Cell
2021;12(5):315-330
- CountryChina
- Language:English
-
Abstract:
Advances in high-throughput sequencing (HTS) have fostered rapid developments in the field of microbiome research, and massive microbiome datasets are now being generated. However, the diversity of software tools and the complexity of analysis pipelines make it difficult to access this field. Here, we systematically summarize the advantages and limitations of microbiome methods. Then, we recommend specific pipelines for amplicon and metagenomic analyses, and describe commonly-used software and databases, to help researchers select the appropriate tools. Furthermore, we introduce statistical and visualization methods suitable for microbiome analysis, including alpha- and beta-diversity, taxonomic composition, difference comparisons, correlation, networks, machine learning, evolution, source tracing, and common visualization styles to help researchers make informed choices. Finally, a step-by-step reproducible analysis guide is introduced. We hope this review will allow researchers to carry out data analysis more effectively and to quickly select the appropriate tools in order to efficiently mine the biological significance behind the data.