GRCh38 · COSMIC v87


This section shows a general overview of information for the selected study (COSU identifier) or publication (COSP identifier). Studies may have been performed by the Sanger Institute Cancer Genome Project, or imported from the ICGC/TCGA. You can see more information on the help pages.

Somatic Genomics and Clinical Features of Lung Adenocarcinoma: A Retrospective Study.
Paper ID
Shi J, Hua X, Zhu B, Ravichandran S, Wang M, Nguyen C, Brodie SA, Palleschi A, Alloisio M, Pariscenti G, Jones K, Zhou W, Bouk AJ, Boland J, Hicks B, Risch A, Bennett H, Luke BT, Song L, Duan J, Liu P, Kohno T, Chen Q, Meerzaman D, Marconett C, Laird-Offringa I, Mills I, Caporaso NE, Gail MH, Pesatori AC, Consonni D, Bertazzi PA, Chanock SJ and Landi MT
Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland, United States of America.
PLoS medicine 2016;13(12):e1002162
Background: Lung adenocarcinoma (LUAD) is the most common histologic subtype of lung cancer and has a high risk of distant metastasis at every disease stage. We aimed to characterize the genomic landscape of LUAD and identify mutation signatures associated with tumor progression.We performed an integrative genomic analysis, incorporating whole exome sequencing (WES), determination of DNA copy number and DNA methylation, and transcriptome sequencing for 101 LUAD samples from the Environment And Genetics in Lung cancer Etiology (EAGLE) study. We detected driver genes by testing whether the nonsynonymous mutation rate was significantly higher than the background mutation rate and replicated our findings in public datasets with 724 samples. We performed subclonality analysis for mutations based on mutant allele data and copy number alteration data. We also tested the association between mutation signatures and clinical outcomes, including distant metastasis, survival, and tumor grade. We identified and replicated two novel candidate driver genes, POU class 4 homeobox 2 (POU4F2) (mutated in 9 [8.9%] samples) and ZKSCAN1 (mutated in 6 [5.9%] samples), and characterized their major deleterious mutations. ZKSCAN1 was part of a mutually exclusive gene set that included the RTK/RAS/RAF pathway genes BRAF, EGFR, KRAS, MET, and NF1, indicating an important driver role for this gene. Moreover, we observed strong associations between methylation in specific genomic regions and somatic mutation patterns. In the tumor evolution analysis, four driver genes had a significantly lower fraction of subclonal mutations (FSM), including TP53 (p = 0.007), KEAP1 (p = 0.012), STK11 (p = 0.0076), and EGFR (p = 0.0078), suggesting a tumor initiation role for these genes. Subclonal mutations were significantly enriched in APOBEC-related signatures (p < 2.5×10-50). The total number of somatic mutations (p = 0.0039) and the fraction of transitions (p = 5.5×10-4) were associated with increased risk of distant metastasis. Our study's limitations include a small number of LUAD patients for subgroup analyses and a single-sample design for investigation of subclonality.Conclusions: These data provide a genomic characterization of LUAD pathogenesis and progression. The distinct clonal and subclonal mutation signatures suggest possible diverse carcinogenesis pathways for endogenous and exogenous exposures, and may serve as a foundation for more effective treatments for this lethal disease. LUAD's high heterogeneity emphasizes the need to further study this tumor type and to associate genomic findings with clinical outcomes.
Paper Status
Genes Analysed
Mutated Samples
Total No. of Samples

Mutation Matrix

This section shows the correlation plot between the top 20 genes and samples. There is more information in our help pages.


This table shows genes with mutations in the selected study/paper [more details]
Genes Mutated Samples
This table shows genes without mutations in the selected study/paper [more details]

Table Information


This is a whole exome/systematic screen paper and the negatives for this paper should be inferred.


This tab shows genes with mutations in the selected study/paper [more details]

Genes Samples CDS Mutation AA Mutation

This tab shows non coding variant in the selected study/paper [more details]

Sample ID Sample Name ID NCV Annotation Zygosity Chromosome Genome start Genome stop Genome version Strand WT seq Mut seq FATHMM-MKL

This tab shows the gene expression and copy number variation data for this study [more details]

Table Information


The table currently shows only high value (numeric) copy number data. Copy number segments are excluded if the total copy number and minor allele values are unknown.

Click here to include all copy number data. For more detailed information about copy number data and gain/loss definitions click here.

Sample Gene Expression Expr Level (Z-Score)

Over Expressed; Z-Score > 2.0

Under Expressed; Z-Score < -2.0

Normal; Z-Score within the range -2.0 to 2.0

CN Type Minor Allele Copy Number CN Segment Posn. Average Ploidy

1. N/A represents cases where the average ploidy value is not available( mostly ICGC samples). For some TCGA samples where the minor allele information is not available the average ploidy value could not be calculated.

2. For TCGA samples, the ASCAT algorithm was used to calculate the average ploidy.

3. For CGP samples, the PICNIC algorithm was used to calculate the average ploidy.


This table lists the samples in the selected study which have low/high methylation for each gene. [more details]

No data

This tab shows the fusion mutations observed in this sample [more details]

Gene Sample Name Id Sample(COSS) CDS Mutation Somatic status Zygosity Validated Type


This table shows mutated samples in the selected study/paper.

Sample Name Mutation Count

This table shows samples without mutations in the selected study/paper.

Non-Mutant Samples Sample Id (COSS)