Bacterial signatures for diagnosis of colorectal cancer by fecal metagenomics analysis
10.3969/j.issn.1674-8115.2018.09.004
- Author:
Xin-Yu ZHANG
1
Author Information
1. Department of Gastroenterology and Hepatology, Renji Hospital, Shanghai Jiao Tong University School of Medicine
- Publication Type:Journal Article
- Keywords:
Colorectal cancer;
Diagnosis;
Intestinal bacteria;
Machine learning;
Random forest
- From:
Journal of Shanghai Jiaotong University(Medical Science)
2018;38(9):1019-1026
- CountryChina
- Language:Chinese
-
Abstract:
Objective • To construct bacterial signatures by analyzing fecal metagenomics for the screening and diagnosis of colorectal cancer (CRC). Methods • A total of 285 samples were included in the study. Diagnostic models for CRC according to six different machine learning algorithms were developed using the featured bacteria selected by random forest algorithm, and validated in validation sets. Results • Nine bacteria that differentiated CRC and the control were identified, with which 6 models were established. The best model was random forest model, with an accuracy of 0.847 7 in the training set. Its accuracy in two test sets was 0.815 8 and 0.734 4, respectively. The area under curve (AUC) of receiver operating characteristic of the random forest model in the set including all samples was 0.894. Conclusion • Bacterial signatures based on random forest algorithm for the diagnosis of CRC can differentiate patients with CRC and the control effectively, which suggests the potential clinical value of the bacterial signatures.