George Church

We present an approach for engineering evolving DNA barcodes in living cells. A homing guide RNA (hgRNA) scaffold directs the Cas9-hgRNA complex to the DNA locus of the hgRNA itself. We show that this homing CRISPR-Cas9 system acts as an expressed genetic barcode that diversifies its sequence and that the rate of diversification can be controlled in cultured cells. We further evaluate these barcodes in cell populations and show that they can be used to record lineage history and that the barcode RNA can be amplified in situ, a prerequisite for in situ sequencing...

INTRODUCTION: African American (AA) high school-age girls are more likely to have had sex before age 13 years and have higher rates of all sexually transmitted infections. Cognition and religion/spirituality are associated with adolescent sexuality, therefore, the purpose of this study was to identify cognitive and religious substrates of AA girls' risky sexual behaviors. METHOD: A descriptive study was conducted with 65 AA girls aged 15 to 20 years using computerized questionnaires and cognitive function tasks...

BACKGROUND: In pedobarography, clinically meaningful comparison of measurements within or between subjects is limited by data variability and measurement error. This study aims to determine the components of the minimal detectable change (MDC) in impulse across all foot regions and the reliability of these measures. METHODS: A convenience sample of foot pressures from 108 visits by normal, healthy subjects aged 2-17 years was studied. Each subject had three pedobarograph measurements taken per foot, with six subjects returning for a second visit for assessment of day-to-day variability...

Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project...

Precise editing is essential for biomedical research and gene therapy. Yet, homology-directed genome modification is limited by the requirements for genomic lesions, homology donors and the endogenous DNA repair machinery. Here we engineered programmable cytidine deaminases and test if we could introduce site-specific cytidine to thymidine transitions in the absence of targeted genomic lesions. Our programmable deaminases effectively convert specific cytidines to thymidines with 13% efficiency in Escherichia coli and 2...

BACKGROUND: Pigeons and doves (Columbiformes) are one of the oldest and most diverse extant lineages of birds. However, the nature and timing of the group's evolutionary radiation remains poorly resolved, despite recent advances in DNA sequencing and assembly and the growing database of pigeon mitochondrial genomes. One challenge has been to generate comparative data from the large number of extinct pigeon lineages, some of which are morphologically unique and therefore difficult to place in a phylogenetic context...

Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin-polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis...

BACKGROUND: Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. FINDINGS: As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology...

The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 "recalcitrant" AGRs required diversification to identify viable alternatives...

Recoding--the repurposing of genetic codons--is a powerful strategy for enhancing genomes with functions not commonly found in nature. Here, we report computational design, synthesis, and progress toward assembly of a 3.97-megabase, 57-codon Escherichia coli genome in which all 62,214 instances of seven codons were replaced with synonymous alternatives across all protein-coding genes. We have validated 63% of recoded genes by individually testing 55 segments of 50 kilobases each. We observed that 91% of tested essential genes retained functionality with limited fitness effect...

The ability to image RNA identity and location with nanoscale precision in intact tissues is of great interest for defining cell types and states in normal and pathological biological settings. Here, we present a strategy for expansion microscopy of RNA. We developed a small-molecule linker that enables RNA to be covalently attached to a swellable polyelectrolyte gel synthesized throughout a biological specimen. Then, postexpansion, fluorescent in situ hybridization (FISH) imaging of RNA can be performed with high yield and specificity as well as single-molecule precision in both cultured cells and intact brain tissue...

The ability to write a stable record of identified molecular events into a specific genomic locus would enable the examination of long cellular histories and have many applications, ranging from developmental biology to synthetic devices. We show that the type I-E CRISPR (clustered regularly interspaced short palindromic repeats)-Cas system of Escherichia coli can mediate acquisition of defined pieces of synthetic DNA. We harnessed this feature to generate records of specific DNA sequences into a population of bacterial genomes...

BACKGROUND: The combination of the inhaled muscarinic antagonist umeclidinium (UMEC) with the long-acting β2-agonist vilanterol (VI) has been shown to provide significant improvements in lung function compared with UMEC, VI, or placebo (PBO) in patients with chronic obstructive pulmonary disease (COPD). This study was specifically designed to support these findings by assessing health-related quality of life and symptomatic outcomes in a similar population. METHODS: This was a 12-week multicenter, randomized, double-blind, parallel-group, placebo-controlled study...

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry...

Clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR-associated (Cas) proteins offer a breakthrough platform for cheap, programmable, and effective sequence-specific DNA targeting. The CRISPR-Cas system is naturally equipped for targeted DNA cutting through its native nuclease activity. As such, groups researching a broad spectrum of biological organisms have quickly adopted the technology with groundbreaking applications to genomic sequence editing in over 20 different species. However, the biological code of life is not only encoded in genetics but also in epigenetics as well...