Comprehensive functional annotation of susceptibility variants identifies genetic heterogeneity between lung adenocarcinoma and squamous cell carcinoma.
- Author:
Na QIN
1
;
Yuancheng LI
1
;
Cheng WANG
1
;
Meng ZHU
1
;
Juncheng DAI
1
;
Tongtong HONG
1
;
Demetrius ALBANES
2
;
Stephen LAM
3
;
Adonina TARDON
4
;
Chu CHEN
5
;
Gary GOODMAN
6
;
Stig E BOJESEN
7
;
Maria Teresa LANDI
8
;
Mattias JOHANSSON
9
;
Angela RISCH
10
;
H-Erich WICHMANN
11
;
Heike BICKEBOLLER
12
;
Gadi RENNERT
13
;
Susanne ARNOLD
14
;
Paul BRENNAN
9
;
John K FIELD
15
;
Sanjay SHETE
16
;
Loic LE MARCHAND
17
;
Olle MELANDER
18
;
Hans BRUNNSTROM
18
;
Geoffrey LIU
19
;
Rayjean J HUNG
20
;
Angeline ANDREW
21
;
Lambertus A KIEMENEY
22
;
Shan ZIENOLDDINY
23
;
Kjell GRANKVIST
24
;
Mikael JOHANSSON
25
;
Neil CAPORASO
26
;
Penella WOLL
27
;
Philip LAZARUS
28
;
Matthew B SCHABATH
29
;
Melinda C ALDRICH
30
;
Victoria L STEVENS
31
;
Guangfu JIN
1
;
David C CHRISTIANI
32
;
Zhibin HU
1
;
Christopher I AMOS
33
;
Hongxia MA
34
;
Hongbing SHEN
35
Author Information
- Publication Type:Meta-Analysis
- Keywords: function annotation; genetic heterogeneity; genome-wide association study; homologous recombination repair deficiency; immune; lung cancer
- MeSH: Adenocarcinoma of Lung/genetics*; Carcinoma, Non-Small-Cell Lung/genetics*; Carcinoma, Squamous Cell/genetics*; Genetic Heterogeneity; Genetic Predisposition to Disease; Genome-Wide Association Study; Humans; Lung Neoplasms/genetics*; Polymorphism, Single Nucleotide
- From: Frontiers of Medicine 2021;15(2):275-291
- CountryChina
- Language:English
- Abstract: Although genome-wide association studies have identified more than eighty genetic variants associated with non-small cell lung cancer (NSCLC) risk, biological mechanisms of these variants remain largely unknown. By integrating a large-scale genotype data of 15 581 lung adenocarcinoma (AD) cases, 8350 squamous cell carcinoma (SqCC) cases, and 27 355 controls, as well as multiple transcriptome and epigenomic databases, we conducted histology-specific meta-analyses and functional annotations of both reported and novel susceptibility variants. We identified 3064 credible risk variants for NSCLC, which were overrepresented in enhancer-like and promoter-like histone modification peaks as well as DNase I hypersensitive sites. Transcription factor enrichment analysis revealed that USF1 was AD-specific while CREB1 was SqCC-specific. Functional annotation and gene-based analysis implicated 894 target genes, including 274 specifics for AD and 123 for SqCC, which were overrepresented in somatic driver genes (ER = 1.95, P = 0.005). Pathway enrichment analysis and Gene-Set Enrichment Analysis revealed that AD genes were primarily involved in immune-related pathways, while SqCC genes were homologous recombination deficiency related. Our results illustrate the molecular basis of both well-studied and new susceptibility loci of NSCLC, providing not only novel insights into the genetic heterogeneity between AD and SqCC but also a set of plausible gene targets for post-GWAS functional experiments.