Genomic medicine, in its broadest sense of being medical developments informed by 'omic' advances, has continued to move towards the clinic in 2011. To mark the end of the year and the beginning of 2012, the editors of the six sections within Genome Medicine were invited to provide their highlights of the past year and to hint at the developments that we are likely to see in the near future.

Six different areas of progress are covered here, but the core of genomic medicine continues to be intrinsically linked to improvements in the underlying technology, and two obvious examples are sequencing and mass spectrometry. Technological advances have enabled larger studies and more complex analyses, allowing researchers and clinicians to track changes within a single cell and yet spot patterns across a whole population and within an entire physiological system. The foundations laid in 2011 should help the field to tackle the challenges of translating genomic medicine to the clinic in 2012.

Complex genomic rearrangements and disease

The past year has been marked by advances in the speed, accuracy and scale of genome sequencing. These improvements have led to the first population-scale genome sequencing study to provide information on structural variants [1]. Over 15,000 novel structural variants were identified from 185 individuals. Analysis of breakpoint junctions revealed that 70% of deletions and almost 90% of insertions showed microhomology ranging from 2 bp to 376 bp at the junctions. This suggests that nonhomologous recombination mechanisms are predominant in copy number variation, and that microhomology-mediated DNA replication mechanisms, such as microhomology-mediated break-induced replication, might have a major role in human genome structural variation.

Genome sequencing also revealed the extent of complex genomic rearrangements (CGRs) in disease. Over 700 genomes from different cancers were studied, and 'chromosome catastrophes' were identified in 2 to 3% of all cancers and in up to 25% of bone cancers [2]. This phenomenon, also termed 'chromothripsis' (shattering and regluing of chromosomes), is primarily localized to single chromosomes, but includes multiple structural genomic changes, such as gains, losses and inversions. As a result, chromothripsis can lead to the simultaneous occurrence of mutations in a number of different cancer-causing genes. Cancer is known to be driven by somatically acquired point mutations and chromosomal rearrangements, conventionally thought to accumulate gradually over time. However, chromothripsis is a one-off event resulting in multigenic changes [2]. It remains to be shown whether chromothripsis is a major driver of cancer.

Intriguingly, a similar chromosome catastrophe event that resulted in CGRs was found to be associated with a small fraction of genomic disorders [3]. This involved a germline or constitutional rearrangement event early in embryogenesis rather than somatically acquired mutations and seemed to occur via a DNA-replication-based mechanism. Triplicated genomic segments were also identified within the rearrangements, and microhomology was present at many of the breakpoint junctions. The triplication cannot readily be explained by chromothripsis, as it requires a gain of genetic information, probably resulting from DNA replication. Other CGRs identified in association with genomic disorders included triplications; triplicated dosage-sensitive genes conveyed a more severe clinical phenotype than duplication. Triplications seem to occur by a double crossover event within a flanking low-copy repeat and are tandem in nature, similar to earlier predictions from an unequal crossing-over or non-allelic homologous recombination (NAHR) model [4]. However, many of the observed triplications have a complex 'duplication-inverted triplication-duplication' structure, which requires only two breakpoint junctions. One of these is generated by an initial NAHR event between inverted repeats, and the other occurs by a nonhomologous mechanism, potentially a DNA replication-based mechanism that results in microhomology at the breakpoint junction [5]. The inversion allows distant genomic regions to be brought into spatial proximity.

A summary of 2011 would not be complete without highlighting exome sequencing and its role in the elucidation of the gene(s) responsible for a number of different Mendelian disorders [6]. Genomic studies have so far revealed the tremendous extent of rare variants in individual genomes. This has led to the concept of 'clan genomics' [7], wherein the most relevant medically actionable variation may be rare variants that occurred as novel combinations or as new mutations in your personal genome or that of your nearest relatives.

Bringing quantification to the proteome

The ability to conduct global analysis of proteomes has existed for about a decade now. During this period most of the progress has been on increasing the number of proteins identified within the proteome. Because of the lack of inherent quantification within the data supplied by high-throughput mass spectrometry (MS), measuring the absolute concentration of proteins across the proteome has lagged behind. This situation is analogous to having a recipe in which the ingredients are listed but the amount of each required is not supplied. Without knowing how much of each ingredient to add, it is unlikely that the dish would taste very good.

Recent studies have moved to develop techniques to supply this vital piece of information. Selbach and colleagues [8] used a combination of stable isotope incorporation and MS to quantify pulse-labeled proteins and 4-thiouridine incorporation to quantify newly synthesized RNA in mouse cells. These results showed that genes that respond quickly to stimuli (such as those encoding transcription factors and signaling proteins) have short protein and mRNA half lives. Constitutive processes (such as translation and central metabolism) were populated by genes giving rise to stable proteins and mRNA. A more recent study by Aebersold and colleagues [9] provided an absolute quantitative estimate of approximately 7,000 proteins in a human cell line during exponential and M-phase growth. This study also found that proteins related to translation were found in high abundance, whereas those involved in processes such as transcription and signaling are present in very low abundance. These results are in agreement with those found by Selbach and colleagues [8], with both studies suggesting that the correlation between mRNA and protein levels is greater than previously reported [10, 11].

Although it is possible to measure the absolute number of specific proteins within complex mixtures [12], these studies represent a new, hopefully broader, effort in determining the absolute abundances of proteins across an entire proteome. Frankly, without this quantitative information it is impossible to accurately understand cellular processes at a systems-level view.

Timothy Veenstra, Section Editor, Post-genomic advances in medicine

The human microbiome, our personal health and diseases

Microbial cells in the human body outnumber human cells by two orders of magnitude. During the past decade, there has been growing interest in the analysis of these microbial communities, which are an integral part of the human body and provide us with nutrients essential to our health. With the rapid increase in sequencing power, the characterization of the so-called 'microbiome' has given rise to large 'metagenomic' datasets, revealing a wealth of microbial species that were previously unknown and cannot be cultured.

The gut microbiome is the most extensively studied part of the human microbiome so far, and this past year the international MetaHIT Consortium has consolidated the data to support the existence of a limited number of 'enterotypes', stable clusters of bacterial communities that are common to groups of individuals belonging to different human populations rather than specific for each population [13, 14]. Enterotypes are 'well-balanced host-microbial symbiotic states that might respond differently to diet and drug intake' and warrant functional analysis to complement species identification obtained through next-generation sequencing.

The same sequencing approach is being successfully extended to characterize the microbiome in other human tissues, such as nasal passages, oral cavities, skin and urogenital tract within the Human Microbiome Project [15, 16]. Significant challenges remain in establishing robust and efficient bioinformatics pipelines to deal with the very large sequence datasets generated [17], but the integration of metagenomics with functional genomics has started to provide new insights into the role of the microbiome in health and in disease susceptibility, diagnosis, progression and treatment, as reviewed extensively by Nicholson and colleagues in this journal [18]. Although emphasis so far has been on bacterial communities, metagenomics provide simultaneous access to fungi, protozoa and viruses, revealing that the vast majority had not been identified through classical culture-based techniques. Functional metagenomics is thus opening a window into an as-yet largely unexplored biological world, and is starting to fill gaps in our knowledge of health and diseases. It is likely that microbiome studies will flourish in the coming years and will contribute to a more personal appraisal of our medical condition and potential response to treatment.

Charles Auffray, Section Editor, Systems medicine and informatics

Progress in pharmacogenomics and individualized medicine

In the past year some very important contributions have been made indicating that genomic information predicts drug response and adverse drug reactions, thereby promoting the concept of personalized medicine and pharmacogenomics. Although the integration of individualized medicine into clinical decision making is still limited [19], there is increasing evidence for genetic markers for major drug-induced hypersensitivity reactions in the major histocompatibility complex (MHC) region. Examples of such adverse reactions include the Steven-Johnson syndrome (SJS) and the related toxic epidermal necrolysis (TEN), with mortalities of 10% and 30%, respectively.

Carbamazepine, an antiepileptic drug also used for treatment of other conditions, including bipolar disorders, and for pain management in trigeminal neuralgia, induces different manifestations of these hypersensitivity reactions. In 2004 the HLA-B*1502 allele was strongly associated with SJS/TEN in Han Chinese people [20]. In a recent study, 4,855 Taiwanese were prospectively screened for the HLA-B*1502 allele before onset of carbamazepine treatment, and all HLA-B*1502-positive people (7.7%) were offered alternative medications to carbamazepine and did not develop SJS/TEN [21]. Although the estimated historical incidence for SJS/TEN in Southeast Asians is 0.23%, no cases of SJS/TEN were identified among the 4,120 study participants treated with carbamazepine. This prospective study clearly indicates that genetic testing for HLA-B*1502 seems to be warranted in clinical practice for the prevention of carbamazepine-associated SJS/TEN in Southeast Asians.

Recent data have suggested that different HLA alleles might contribute to carbamazepine-related hypersensitivity reactions in patients with other ethnic backgrounds [22], and a genome-wide association study (GWAS) found a strong association for the HLA-A*3101 allele in 22 people with Northern European ancestry and carbamazepine-induced hypersensitivity reactions, as compared with 2,691 healthy controls [23]. Whereas the HLA-B*1502 allele seems to be a phenotype-specific predictor for the development of SJS/TEN in Asians, in Europeans the HLA-A*3101 allele seems to be associated with multiple phenotypes of carbamazepine hypersensitivity reactions, including SJS/TEN. In a separate GWAS, multiple HLA class I and II alleles were identified as susceptibility factors for idiosyncratic amoxicillin-clavulanate-induced liver injury, further supporting the importance of genetic variation in the MHC region in idiosyncratic adverse drug reactions [24].

In addition to the major efforts to individualize drug therapy using genomic information alone, metabolomics promises significant achievements towards a better understanding of the potential relevance of genetic variants in disease susceptibility as well as drug therapy, by the study of metabolism at the global level. A highlight in this field was the first analysis that comprehensively combined genomic information (from GWAS) with non-targeted metabolic profiling of serum from subjects of two independent cohorts (n = 2,820) for hypothesis generation [25]. Meta-analysis revealed 37 independent loci with genome-wide significance, providing not only new functional insights for disease susceptibility but also novel biochemical data for genotype-dependent reactions to drug therapy. This approach highlights the enormous potential of integrated analyses for pharmacology and clinical pharmacology in the near future.

Matthias Schwab, Section Editor, Personalized medicine & therapeutics

Beyond base pairs to bedside

In February 2011, more than 10 years after a draft sequence of the human genome was published, the US National Human Genome Research Institute announced its new strategic plan for genomic medicine from base pairs to bedside [26]. The plan calls for evaluating the structure and biology of genomes; understanding the biology of disease; advancing the science of medicine; and improving the effectiveness of healthcare. Nevertheless, fulfilling the promise of genomics in improving health requires a multidisciplinary research agenda beyond bench to bedside, an agenda that will demonstrate added value of genome-based information for improving health in populations [27].

Currently, this translational research agenda is much less robust than discovery research, accounting for less than 2% of funded genomics research and research publications [28], but this is likely to change in the next decade as more and more applications make it to the bedside. Ongoing horizon scanning for genomic tests has revealed more than 330 genomic tests have reached the bedside since 2009 [29]. In addition, the US Food and Drug Administration has provided drug labeling information about adverse reactions for more than 80 gene-drug pairs [30].

With rapid improvements in technologies, we are seeing the leading edge of the applications of whole genome sequencing (WGS) in practice primarily in the identification of rare heritable conditions [31]. Deploying WGS in practice will require a systematic evidence-based approach to binning genes and gene variants into tiers based on levels of evidence for improving health; recently, Berg et al. [32] proposed a three-tier binning schema for WGS, based on the availability of information on their clinical validity and utility. The ultimate success of genomics for improving health will require adoption of evidence-based approaches for their use in clinical and public health practice.

Direct-to-consumer testing

As genomic technologies advance at an accelerating pace, direct-to-consumer (DTC) genetic testing has emerged as a dominant area of ELSI (ethical, legal and social issues) inquiry. Indeed, over the past few years, DTC testing has received a tremendous amount of attention from policymakers, ELSI scholars and the popular press [33]. Although it remains uncertain whether this industry, which is fueled by the availability of cheap and efficient testing technologies, will attract a broad consumer base [34], its growth has triggered a diverse array of regulatory responses, from outright bans [35] to recommendations that call for the provision of accurate information and valid results [36].

The DTC industry has also allowed ELSI researchers to explore popular culture representations of genetics [37], how individuals react to genetic risk information and the motivations behind the use of DTC services [38]. Much of this research has provided surprising results - highlighting both problems (for example, the degree to which the benefits and limitations of testing are often misrepresented [39] and the reality that much provided risk information may be inaccurate [40]) and the fact that some concerns may not be as problematic as previously thought (for example, the degree to which people become anxious after receiving predisposition information [41]). Given the appearance of the DTC industry, ELSI researchers are now contemplating future policy challenges. For example, the use of DTC services for the testing of children and the availability of cheap WGS, which is inevitable in the near future, seem likely to add new and challenging dimensions to the DTC story.

Declarations

Competing interests

JRL is a consultant for Athena Diagnostics, has stock ownership in 23andMe and Ion Torrent Systems, and is a co-inventor on multiple United States and European patents for DNA diagnostics.

The Department of Molecular and Human Genetics at Baylor College of Medicine derives revenue from clinical testing by high-resolution human genome analyses. All other authors declare that they have no competing interests.