GRCh38 · COSMIC v82


This section shows a general overview of information for the selected study (COSU identifier) or publication (COSP identifier). Studies may have been performed by the WTSI Cancer Genome Project, or imported from the ICGC/TCGA. You can see more information on the help pages.

The mutation spectrum revealed by paired genome sequences from a lung cancer patient.
Paper ID
Lee W, Jiang Z, Liu J, Haverty PM, Guan Y, Stinson J, Yue P, Zhang Y, Pant KP, Bhatt D, Ha C, Johnson S, Kennemer MI, Mohan S, Nazarenko I, Watanabe C, Sparks AB, Shames DS, Gentleman R, de Sauvage FJ, Stern H, Pandita A, Ballinger DG, Drmanac R, Modrusan Z, Seshagiri S and Zhang Z
Department of Bioinformatics and Computational Biology, Genentech Inc., South San Francisco, California 94080, USA.
Nature 2010;465(7297):473-7
Lung cancer is the leading cause of cancer-related mortality worldwide, with non-small-cell lung carcinomas in smokers being the predominant form of the disease. Although previous studies have identified important common somatic mutations in lung cancers, they have primarily focused on a limited set of genes and have thus provided a constrained view of the mutational spectrum. Recent cancer sequencing efforts have used next-generation sequencing technologies to provide a genome-wide view of mutations in leukaemia, breast cancer and cancer cell lines. Here we present the complete sequences of a primary lung tumour (60x coverage) and adjacent normal tissue (46x). Comparing the two genomes, we identify a wide variety of somatic variations, including >50,000 high-confidence single nucleotide variants. We validated 530 somatic single nucleotide variants in this tumour, including one in the KRAS proto-oncogene and 391 others in coding regions, as well as 43 large-scale structural variations. These constitute a large set of new somatic mutations and yield an estimated 17.7 per megabase genome-wide somatic mutation rate. Notably, we observe a distinct pattern of selection against mutations within expressed genes compared to non-expressed genes and in promoter regions up to 5 kilobases upstream of all protein-coding genes. Furthermore, we observe a higher rate of amino acid-changing mutations in kinase genes. We present a comprehensive view of somatic alterations in a single lung tumour, and provide the first evidence, to our knowledge, of distinct selective pressures present within the tumour environment.
Paper Status
Genes Analysed
Mutated Samples
Total No. of Samples

Mutation Matrix

This section shows the correlation plot between the top 20 genes and samples. There is more information in our help pages.


This table shows genes with mutations in the selected study/paper [more details]
Genes Mutated Samples
This table shows genes without mutations in the selected study/paper [more details]

Table Information


This is a whole-exome/systematic screen paper, so the negatives for this paper should be inferred.


This tab shows genes with mutations in the selected study/paper [more details]

Genes Samples CDS Mutation AA Mutation

This tab shows non-coding variants in the selected study/paper [more details]

Sample ID Sample Name ID NCV Annotation Zygosity Chromosome Genome start Genome stop Genome version Strand WT seq Mut seq FATHMM-MKL

This tab shows the gene expression and copy number variation data for this study [more details]

Table Information


The table currently shows only high value (numeric) copy number data. Copy number segments are excluded if the total copy number and minor allele values are unknown.


Sample Gene Expression Expr Level (Z-Score)

Over Expressed; Z-Score > 2.0

Under Expressed; Z-Score < -2.0

Normal; Z-Score within the range -2.0 to 2.0
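The three expression categories above can be sketched as a small classifier. This is a minimal illustration, not COSMIC's own code: the function name `classify_expression` and the returned labels are hypothetical, and the sketch assumes the "Normal" range includes both endpoints (-2.0 and 2.0), since the other two categories use strict inequalities.

```python
import math


def classify_expression(z_score: float) -> str:
    """Map an expression Z-score to one of the three categories listed above.

    Hypothetical helper: the thresholds come from the page text, the name and
    return labels are assumptions made for illustration.
    """
    if math.isnan(z_score):
        # A missing Z-score cannot be placed in any category.
        raise ValueError("Z-score is not a number")
    if z_score > 2.0:
        return "Over Expressed"
    if z_score < -2.0:
        return "Under Expressed"
    # Everything from -2.0 to 2.0 inclusive falls through to here.
    return "Normal"


if __name__ == "__main__":
    for z in (3.1, 2.0, 0.0, -2.0, -2.5):
        print(z, classify_expression(z))
```

Under this reading, a Z-score of exactly 2.0 or -2.0 is treated as normal expression.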

CN Type Minor Allele Copy Number CN Segment Posn. Average Ploidy

1. N/A represents cases where the average ploidy value is not available (mostly ICGC samples). For some TCGA samples where the minor allele information is not available, the average ploidy value could not be calculated.

2. For TCGA samples, the ASCAT algorithm was used to calculate the average ploidy.

3. For CGP samples, the PICNIC algorithm was used to calculate the average ploidy.


This table lists the samples in the selected study which have low/high methylation for each gene. [more details]

No data

This tab shows the fusion mutations observed in the selected study/paper [more details]

Gene Sample Name Id Sample(COSS) CDS Mutation Somatic status Zygosity Validated Type


This table shows mutated samples in the selected study/paper.

Sample Name Mutation Count

This table shows samples without mutations in the selected study/paper.

Non-Mutant Samples Sample Id (COSS)