Lung cancer is the leading cause of cancer-related mortality worldwide, with non-small-cell lung carcinomas in smokers being the predominant form of the disease. Although previous studies have identified important common somatic mutations in lung cancers, they have primarily focused on a limited set of genes and have thus provided a constrained view of the mutational spectrum. Recent cancer sequencing efforts have used next-generation sequencing technologies to provide a genome-wide view of mutations in leukaemia, breast cancer and cancer cell lines. Here we present the complete sequences of a primary lung tumour (60x coverage) and adjacent normal tissue (46x). Comparing the two genomes, we identify a wide variety of somatic variations, including >50,000 high-confidence single nucleotide variants. We validated 530 somatic single nucleotide variants in this tumour, including one in the KRAS proto-oncogene and 391 others in coding regions, as well as 43 large-scale structural variations. These constitute a large set of new somatic mutations and yield an estimated 17.7 per megabase genome-wide somatic mutation rate. Notably, we observe a distinct pattern of selection against mutations within expressed genes compared to non-expressed genes and in promoter regions up to 5 kilobases upstream of all protein-coding genes. Furthermore, we observe a higher rate of amino acid-changing mutations in kinase genes. We present a comprehensive view of somatic alterations in a single lung tumour, and provide the first evidence, to our knowledge, of distinct selective pressures present within the tumour environment.

Paper Status

Curated

Genes Analysed

773

Mutated Samples

1

Total No. of Samples

1

This tab shows the correlation plot between top 20 genes and samples
[more details]

This tab shows genes with mutations in the selected study/paper
[more details]

Genes

Samples

CDS Mutation

AA Mutation

This tab shows genes without mutations in the selected study/paper
[more details]

Table Information

Hide

This is a whole exome/systematic screen paper and the negatives for this paper should be inferred.

This tab shows samples without mutations in the selected study/paper
[more details]

This tab shows the gene expression and copy number variation data for this study.
[more details]

Table Information

Hide

The table currently shows only high value (numeric) copy number data. Copy number segments are excluded if the total copy number and minor allele values are unknown.

Click here to include all copy number data. For more detailed information about copy number data and gain/loss definitions click here.

Sample

Gene

Expression

Expr Level (Z-Score)

Over Expressed; Z-Score > 2.0

Under Expressed; Z-Score < -2.0

Normal; Z-Score within the range -2.0 to 2.0

CN Type

Minor Allele

Copy Number

CN Segment Posn.

Average Ploidy

1. N/A represents cases where the average ploidy value is not available( mostly ICGC samples). For some TCGA samples where the minor allele information is not available the average ploidy value could not be calculated.

2. For TCGA samples, the ASCAT algorithm was used to calculate the average ploidy.

3. For CGP samples, the PICNIC algorithm was used to calculate the average ploidy.

CNV

This table lists the samples in the selected study which have low/high methylation for each gene.
[more details]

No data

This tab shows the fusion mutations observed in this sample
[more details]