Contact

Links

Research & Scholarship

Current Research and Scholarly Interests

->What We Do:Our lab studies the mechanisms by which cells and organisms respond to genetic change.

The genetic landscape faced by a living cell is constantly changing. Developmental transitions, environmental shifts, and pathogenic invasions lend a dynamic character to both the genome and its activity pattern.We study a variety of natural mechanisms that are utilized by cells adapting to genetic change. These include mechanisms activated during normal development and systems for detecting and responding to foreign or unwanted genetic activity. At the root of these studies are questions of how a cell can distinguish "self" versus "nonself" and "wanted" versus "unwanted" gene expression.

We primarily make use of the nematode C. elegans in our experimental studies. C. elegans is small, easily cultured, and can readily be made to accept foreign DNA or RNA. The results of such experiments have outlined a number of concerted responses that recognize (and in most cases work to silence) the foreign nucleic acid. One such mechanism ("RNAi") responds to double stranded character in RNA: either as introduced experimentally into the organism or as produced from foreign DNA that has not undergone selection to avoid a dsRNA response. Much of the current effort in the lab is directed toward a molecular understanding of the RNAi machinery and its roles in the cell. RNAi is not the only cellular defense against unwanted nucleic acid, and substantial current effort in the lab is also directed at identification of other triggers and mechanisms used in recognition and response to foreign information.------>Who we are:PI: Andrew Fire, Professor of Pathology and Genetics, Stanford University School of Medicine

Prospective postdoctoral applicants should send a resume and summary of research to Dr. Fire (afire <at> stanford <dot> edu), and arrange to have 3-4 letters of reference likewise sent to this address.

Prospective graduate students are encouraged to apply to the Stanford Genetics Ph.D. program (or to any of the biosciences Ph.D. programs): http://biosciences.stanford.edu/prospective/

Rotation Students: We welcome rotation students from any program at Stanford, with Spring being the preferred quarter. Email the PI.

We occasionally have positions for undergraduate researchers in the lab (especially summers, and particularly straightforward for current or incoming Stanford students). Email the PI at the above address.

All Publications

Abstract

Conventional plasmid vectors are incapable of achieving sustained levels of transgene expression in vivo even in quiescent mammalian tissues because the transgene expression cassette is silenced. Transcriptional silencing results from the presence of the bacterial plasmid backbone or virtually any DNA sequence of >1 kb in length placed outside of the expression cassette. Here, we show that transcriptional silencing can be substantially forestalled by increasing the An/Tn sequence composition in the plasmid bacterial backbone. Increasing numbers of An/Tn sequences increased sustained transcription of both backbone sequences and adjacent expression cassettes. In order to recapitulate these expression profiles in compact and portable plasmid DNA backbones, we engineered the standard kanamycin or ampicillin antibiotic resistance genes, optimizing the number of An/Tn sequence without altering the encoded amino acids. The resulting vector backbones yield sustained transgene expression from mouse liver, providing generic DNA vectors capable of sustained transgene expression without additional genes or mammalian regulatory elements.

Abstract

A fraction of ribosomes engaged in translation will fail to terminate when reaching a stop codon, yielding nascent proteins inappropriately extended on their C termini. Although such extended proteins can interfere with normal cellular processes, known mechanisms of translational surveillance are insufficient to protect cells from potential dominant consequences. Here, through a combination of transgenics and CRISPR–Cas9 gene editing in Caenorhabditis elegans, we demonstrate a consistent ability of cells to block accumulation of C-terminal-extended proteins that result from failure to terminate at stop codons. Sequences encoded by the 3′ untranslated region (UTR) were sufficient to lower protein levels. Measurements of mRNA levels and translation suggested a co- or post-translational mechanism of action for these sequences in C. elegans. Similar mechanisms evidently operate in human cells, in which we observed a comparable tendency for translated human 3′ UTR sequences to reduce mature protein expression in tissue culture assays, including 3′ UTR sequences from the hypomorphic ‘Constant Spring’ haemoglobin stop codon variant. We suggest that 3′ UTRs may encode peptide sequences that destabilize the attached protein, providing mitigation of unwelcome and varied translation errors.

Abstract

Specific immunotherapy (SIT) is the only treatment with proved long-term curative potential in patients with allergic disease. Allergen-specific IgE is the causative agent of allergic disease, and antibodies contribute to SIT, but the effects of SIT on aeroallergen-specific B-cell repertoires are not well understood.We sought to characterize the IgE sequences expressed by allergen-specific B cells and track the fate of these B-cell clones during SIT.We used high-throughput antibody gene sequencing and identification of allergen-specific IgE with combinatorial antibody fragment library technology to analyze immunoglobulin repertoires of blood and the nasal mucosa from aeroallergen-sensitized subjects before and during the first year of subcutaneous SIT.Of 52 distinct allergen-specific IgE heavy chains from 8 allergic donors, 37 were also detected by using high-throughput antibody gene sequencing of blood samples, nasal mucosal samples, or both. The allergen-specific clones had increased persistence, higher likelihood of belonging to clones expressing other switched isotypes, and possibly larger clone size than the rest of the IgE repertoire. Clone members in nasal tissue showed close mutational relationships.In the future, combining functional binding studies, deep antibody repertoire sequencing, and information on clinical outcomes in larger studies might aid assessment of SIT mechanisms and efficacy.

Abstract

Antibodies with ontogenies from VH1-2 or VH1-46-germline genes dominate the broadly neutralizing response against the CD4-binding site (CD4bs) on HIV-1. Here, we define with longitudinal sampling from time-of-infection the development of a VH1-46-derived antibody lineage that matured to neutralize 90% of HIV-1 isolates. Structures of lineage antibodies CH235 (week 41 from time-of-infection, 18% breadth), CH235.9 (week 152, 77%), and CH235.12 (week 323, 90%) demonstrated the maturing epitope to focus on the conformationally invariant portion of the CD4bs. Similarities between CH235 lineage and five unrelated CD4bs lineages in epitope focusing, length-of-time to develop breadth, and extraordinary level of somatic hypermutation suggested commonalities in maturation among all CD4bs antibodies. Fortunately, the required CH235-lineage hypermutation appeared substantially guided by the intrinsic mutability of the VH1-46 gene, which closely resembled VH1-2. We integrated our CH235-lineage findings with a second broadly neutralizing lineage and HIV-1 co-evolution to suggest a vaccination strategy for inducing both lineages.

Abstract

Identification of locus-locus contacts at the chromatin level provides a valuable foundation for understanding of nuclear architecture and function and a valuable tool for inferring long-range linkage relationships. As one approach to this, chromatin conformation capture-based techniques allow creation of genome spatial organization maps. While such approaches have been available for some time, methodological advances will be of considerable use in minimizing both time and input material required for successful application.Here we report a modified tethered conformation capture protocol that utilizes a series of rapid and efficient molecular manipulations. We applied the method to Caenorhabditis elegans, obtaining chromatin interaction maps that provide a sequence-anchored delineation of salient aspects of Caenorhabditis elegans chromosome structure, demonstrating a high level of consistency in overall chromosome organization between biological samples collected under different conditions. In addition to the application of the method to defining nuclear architecture, we found the resulting chromatin interaction maps to be of sufficient resolution and sensitivity to enable detection of large-scale structural variants such as inversions or translocations.Our streamlined protocol provides an accelerated, robust, and broadly applicable means of generating chromatin spatial organization maps and detecting genome rearrangements without a need for cellular or chromatin fractionation.

Abstract

CRISPR systems mediate adaptive immunity in diverse prokaryotes. CRISPR-associated Cas1 and Cas2 proteins have been shown to enable adaptation to new threats in type I and II CRISPR systems by the acquisition of short segments of DNA (spacers) from invasive elements. In several type III CRISPR systems, Cas1 is naturally fused to a reverse transcriptase (RT). In the marine bacterium Marinomonas mediterranea (MMB-1), we showed that a RT-Cas1 fusion protein enables the acquisition of RNA spacers in vivo in a RT-dependent manner. In vitro, the MMB-1 RT-Cas1 and Cas2 proteins catalyze the ligation of RNA segments into the CRISPR array, which is followed by reverse transcription. These observations outline a host-mediated mechanism for reverse information flow from RNA to DNA.

Abstract

Chikungunya is caused by the mosquito-borne arthrogenic alphavirus, chikungunya virus (CHIKV). Chikungunya was introduced into the Americas in late 2013 and Nicaragua in mid-2014. Here, we sequenced five imported and 30 autochthonous Nicaraguan CHIKV from cases identified in the first epidemic in the country between August 2014 and April 2015. One full-length and two partial genomic sequences were obtained by deep sequencing; Sanger methodology yielded 33 E1 sequences from five imported and 28 autochthonous cases. Phylogenetic analysis indicates that Nicaraguan CHIKV all belonged to the Asian genotype, Caribbean clade. Moreover, E1 gene sequences revealed accumulation of mutations in later months of the epidemic, including four silent mutations in 11 autochthonous cases and three non-synonymous mutations in three autochthonous cases. No mutations contributing to increased transmissibility by Aedes albopictus were identified in the E1 gene. This represents the most comprehensive set of CHIKV sequences available from the Americas to date.

Abstract

The proliferation of CRISPR/Cas9-based methods in Caenorhabditis elegans has enabled efficient genome editing and precise genomic tethering of Cas9 fusion proteins. Experimental designs using CRISPR/Cas9 are currently limited by the need for a protospacer adjacent motif (PAM) in the target with the sequence NGG. Here we report the characterization of two modified Cas9 proteins in C. elegans that recognize NGA and NGCG PAMs. We found that each variant could stimulate homologous recombination with a donor template at multiple loci and that PAM specificity was comparable to that of wild-type Cas9. To directly compare effectiveness, we used CRISPR/Cas9 genome editing to generate a set of assay strains with a common single-guide RNA (sgRNA) target sequence, but that differ in the juxtaposed PAM (NGG, NGA, or NGCG). In this controlled setting, we determined that the NGA PAM Cas9 variant can be as effective as wild-type Cas9. We similarly edited a genomic target to study the influence of the base following the NGA PAM. Using four strains with four NGAN PAMs differing only at the fourth position and adjacent to the same sgRNA target, we observed that efficient homologous replacement was attainable with any base in the fourth position, with an NGAG PAM being the most effective. In addition to demonstrating the utility of two Cas9 mutants in C. elegans and providing reagents that permit CRISPR/Cas9 experiments with fewer restrictions on potential targets, we established a means to benchmark the efficiency of different Cas9::PAM combinations that avoids variations owing to differences in the sgRNA sequence.

Abstract

The founding heterochronic microRNAs, lin-4 and let-7, together with their validated targets and well-characterized phenotypes in C. elegans, offer an opportunity to test functionality of microRNAs in a developmental context. In this study, we defined sequence requirements at the microRNA level for these two microRNAs, evaluating lin-4 and let-7 mutant microRNAs for their ability to support temporal development under conditions where the wild-type lin-4 and let-7 gene products are absent. For lin-4, we found a strong requirement for seed sequences, with function drastically affected by several central mutations in the seed sequence, while rescue was retained by a set of mutations peripheral to the seed. let-7 rescuing activity was retained to a surprising degree by a variety of central seed mutations, while several non-seed mutant effects support potential noncanonical contributions to let-7 function. Taken together, this work illustrates both the functional partnership between seed and non-seed sequences in mediating C. elegans temporal development and a diversity among microRNA effectors in the contributions of seed and non-seed regions to activity.

Abstract

To study target sequence specificity, selectivity, and reaction kinetics of Streptococcus pyogenes Cas9 activity, we challenged libraries of random variant targets with purified Cas9::guide RNA complexes in vitro. Cleavage kinetics were nonlinear, with a burst of initial activity followed by slower sustained cleavage. Consistent with other recent analyses of Cas9 sequence specificity, we observe considerable (albeit incomplete) impairment of cleavage for targets mutated in the PAM sequence or in 'seed' sequences matching the proximal 8 bp of the guide. A second target region requiring close homology was located at the other end of the guide::target duplex (positions 13-18 relative to the PAM). Sequences flanking the guide+PAM region had measurable (albeit modest) effects on cleavage. In addition, the first-base Guanine constraint commonly imposed by gRNA expression systems has little effect on overall cleavage efficiency. Taken together, these studies provide an in vitro understanding of the complexities of Cas9-gRNA interaction and cleavage beyond the general paradigm of site determination based on the 'seed' sequence and PAM.

Abstract

Induction of HIV-1 broad neutralizing antibodies (bnAbs) is a goal of HIV-1 vaccine development but has remained challenging partially due to unusual traits of bnAbs, including high somatic hypermutation (SHM) frequencies and in-frame insertions and deletions (indels). Here we examined the propensity and functional requirement for indels within HIV-1 bnAbs. High-throughput sequencing of the immunoglobulin (Ig) VHDJH genes in HIV-1 infected and uninfected individuals revealed that the indel frequency was elevated among HIV-1-infected subjects, with no unique properties attributable to bnAb-producing individuals. This increased indel occurrence depended only on the frequency of SHM point mutations. Indel-encoded regions were generally proximal to antigen binding sites. Additionally, reconstruction of a HIV-1 CD4-binding site bnAb clonal lineage revealed that a large compound VHDJH indel was required for bnAb activity. Thus, vaccine development should focus on designing regimens targeted at sustained activation of bnAb lineages to achieve the required SHM and indel events.

Abstract

Elderly humans show decreased humoral immunity to pathogens and vaccines, yet the effects of aging on B cells are not fully known. Chronic viral infection by CMV is implicated as a driver of clonal T cell proliferations in some aging humans, but whether CMV or EBV infection contributes to alterations in the B cell repertoire with age is unclear. We have used high-throughput DNA sequencing of IGH gene rearrangements to study the BCR repertoires over two successive years in 27 individuals ranging in age from 20 to 89 y. Some features of the B cell repertoire remain stable with age, but elderly subjects show increased numbers of B cells with long CDR3 regions, a trend toward accumulation of more highly mutated IgM and IgG Ig genes, and persistent clonal B cell populations in the blood. Seropositivity for CMV or EBV infection alters B cell repertoires, regardless of the individual's age: EBV infection correlates with the presence of persistent clonal B cell expansions, whereas CMV infection correlates with the proportion of highly mutated Ab genes. These findings isolate effects of aging from those of chronic viral infection on B cell repertoires and provide a baseline for understanding human B cell responses to vaccination or infectious stimuli.

Abstract

In certain organisms, numbers of crossover events for any single chromosome are limited ("crossover interference") so that double crossover events are obtained at much lower frequencies than would be expected from the simple product of independent single-crossover events. We present a number of observations during which we examined interference over a large region of Caenorhabditis elegans chromosome V. Examining this region for multiple crossover events in heteroallelic configurations with limited dimorphism, we observed high levels of crossover interference in oocytes with only partial interference in spermatocytes.

Abstract

In order to identify novel somatic mutations associated with classic BCR/ABL1-negative myeloproliferative neoplasms, we performed high-coverage genome sequencing of DNA from peripheral blood granulocytes and cultured skin fibroblasts from a patient with MPL W515K-positive primary myelofibrosis. The primary myelofibrosis genome had a low somatic mutation rate, consistent with that observed in similar hematopoietic tumor genomes. Interfacing of whole-genome DNA sequence data with RNA expression data identified three somatic mutations of potential functional significance: a nonsense mutation in CARD6, implicated in modulation of NF-kappaB activation; a 19-base pair deletion involving a potential regulatory region in the 5'-untranslated region of BRD2, implicated in transcriptional regulation and cell cycle control; and a non-synonymous point mutation in KIAA0355, an uncharacterized protein. Additional mutations in three genes (CAP2, SOX30, and MFRP) were also evident, albeit with no support for expression at the RNA level. Re-sequencing of these six genes in 178 patients with polycythemia vera, essential thrombocythemia, and myelofibrosis did not identify recurrent somatic mutations in these genes. Finally, we describe methods for reducing false-positive variant calls in the analysis of hematologic malignancies with a low somatic mutation rate. This trial is registered with ClinicalTrials.gov (NCT01108159).

Abstract

Germ cells in animals are highly specialized to preserve the genome. A distinct set of chromatin structures must be properly established in germ cells to maintain cell fate and genome integrity. We describe DNA-surface interactions in activated Caenorhabditis elegans oocytes that are revealed through the activity of an endogenous nuclease ('endocleavage').Our analysis began with an unexpected observation that a majority (>50%) of DNA from ovulated but unfertilized C. elegans oocytes can be recovered in fragments of approximately 500 base pairs or shorter, cleaved at regular intervals (10 to 11 nt) along the DNA helix. In some areas of the genome, DNA cleavage patterns in these endoreduplicated oocytes appear consistent from cell-to-cell, indicating coherent rotational positioning of the DNA in chromatin. Particularly striking in this analysis are arrays of sensitive sites with a periodicity of approximately 10 bp that persist for several hundred base pairs of genomic DNA, longer than a single nucleosome core. Genomic regions with a strong bias toward a 10-nt periodic occurrence of A(n)/T(n) (so-called PATC regions) appear to exhibit a high degree of rotational constraint in endocleavage phasing, with a strong tendency for the periodic A(n)/T(n) sites to remain on the face of the helix protected from nuclease digestion.The present analysis provides evidence for an unusual structure in C. elegans oocytes in which genomic DNA and associated protein structures are coherently linked.

Abstract

Nematodes of the genus Caenorhabditis enter a developmental diapause state after hatching in the absence of food. To better understand the relative contributions of distinct regulatory modalities to gene expression changes associated with this developmental transition, we characterized genome-wide changes in mRNA abundance and translational efficiency associated with L1 diapause exit in four species using ribosome profiling and mRNA-seq. We found a strong tendency for translational regulation and mRNA abundance processes to act synergistically, together effecting a dramatic remodeling of the gene expression program. While gene-specific differences were observed between species, overall translational dynamics were broadly and functionally conserved. A striking, conserved feature of the response was strong translational suppression of ribosomal protein production during L1 diapause, followed by activation upon resumed development. On a global scale, ribosome footprint abundance changes showed greater similarity between species than changes in mRNA abundance, illustrating a substantial and genome-wide contribution of translational regulation to evolutionary maintenance of stable gene expression.

Abstract

More than half of C. elegans pre-mRNAs lose their original 5' ends in a process termed "trans-splicing" in which the RNA extending from the transcription start site (TSS) to the site of trans-splicing of the primary transcript, termed the "outron", is replaced with a 22nt spliced leader. This complicates the mapping of TSSs, leading to a lack of available TSS mapping data for these genes. We used growth at low temperature and nuclear isolation to enrich for transcripts still containing outrons, applying a modified SAGE capture procedure and high throughput sequencing to characterize 5' termini in this transcript population. We report from this data both a landscape of 5' end utilization for C. elegans and a representative collection of TSSs for 7,351 trans-spliced genes. TSS distributions for individual genes were often dispersed, with a greater average number of TSSs for trans-spliced genes, suggesting that trans-splicing may remove selective pressure for a single TSS. Upstream of newly defined TSSs, we observed well-known motifs (including TATAA box and SP1) as well as novel motifs. A number of these motifs showed association with tissue-specific expression and/or conservation among six worm species. Comparing TSS features between trans-spliced and non-trans-spliced genes, we found stronger signals among outron TSSs for preferentially-positioning of flanking nucleosomes and for downstream Pol II enrichment. Our data provides an enabling resource for both experimental and theoretical analysis of gene structure and function in C. elegans.

Abstract

Dengue is the most prevalent mosquito-borne viral disease in humans, and the lack of early prognostics, vaccines, and therapeutics contributes to immense disease burden. To identify patterns that could be used for sequence-based monitoring of the antibody response to dengue, we examined antibody heavy-chain gene rearrangements in longitudinal peripheral blood samples from 60 dengue patients. Comparing signatures between acute dengue, postrecovery, and healthy samples, we found increased expansion of B cell clones in acute dengue patients, with higher overall clonality in secondary infection. Additionally, we observed consistent antibody sequence features in acute dengue in the highly variable major antigen-binding determinant, complementarity-determining region 3 (CDR3), with specific CDR3 sequences highly enriched in acute samples compared to postrecovery, healthy, or non-dengue samples. Dengue thus provides a striking example of a human viral infection where convergent immune signatures can be identified in multiple individuals. Such signatures could facilitate surveillance of immunological memory in communities.

Abstract

Current human immunodeficiency virus-1 (HIV-1) vaccines elicit strain-specific neutralizing antibodies. However, cross-reactive neutralizing antibodies arise in approximately 20% of HIV-1-infected individuals, and details of their generation could provide a blueprint for effective vaccination. Here we report the isolation, evolution and structure of a broadly neutralizing antibody from an African donor followed from the time of infection. The mature antibody, CH103, neutralized approximately 55% of HIV-1 isolates, and its co-crystal structure with the HIV-1 envelope protein gp120 revealed a new loop-based mechanism of CD4-binding-site recognition. Virus and antibody gene sequencing revealed concomitant virus evolution and antibody maturation. Notably, the unmutated common ancestor of the CH103 lineage avidly bound the transmitted/founder HIV-1 envelope glycoprotein, and evolution of antibody neutralization breadth was preceded by extensive viral diversification in and near the CH103 epitope. These data determine the viral and antibody evolution leading to induction of a lineage of HIV-1 broadly neutralizing antibodies, and provide insights into strategies to elicit similar antibodies by vaccination.

Abstract

Current efforts in nonviral gene therapy are plagued by a pervasive difficulty in sustaining therapeutic levels of delivered transgenes. Minicircles (plasmid derivatives with the same expression cassette but lacking a bacterial backbone) show sustained expression and hold promise for therapeutic use where persistent transgene expression is required. To characterize the widely-observed silencing process affecting expression of foreign DNA in mammals, we used a system in which mouse liver presented with either plasmid or minicircle consistently silences plasmid but not minicircle expression. We found that preferential silencing of plasmid DNA occurs at a nuclear stage that precedes transport of mRNA to the cytoplasm, evident from a consistent >25-fold minicircle/plasmid transcript difference observed in both nuclear and total RNA. Among possible mechanisms of nuclear silencing, our data favor chromatin-linked transcriptional blockage rather than targeted degradation, aberrant processing, or compromised mRNA transport. In particular, we observe dramatic enrichment of H3K27 trimethylation on plasmid sequences. Also, it appears that Pol II can engage the modified plasmid chromatin, potentially in a manner that is not productive in the synthesis of high levels of new transcript. We outline a scenario in which sustained differences at the chromatin level cooperate to determine the activity of foreign DNA.

Abstract

miRNAs are post-transcriptional regulators of gene activity that reduce protein accumulation from target mRNAs. Elucidating precise molecular effects that animal miRNAs have on target transcripts has proven complex, with varied evidence indicating that miRNA regulation may produce different molecular outcomes in different species, systems, and/or physiological conditions. Here we use high-throughput ribosome profiling to analyze detailed translational parameters for five well-studied targets of miRNAs that regulate C. elegans developmental timing. For two targets of the miRNA lin-4 (lin-14 and lin-28), functional down-regulation was associated with decreases in both overall mRNA abundance and ribosome loading; however, these changes were of substantially smaller magnitude than corresponding changes observed in protein abundance. For three functional targets of the let-7 miRNA family for which down-regulation is critical in temporal progression of the animal (daf-12, hbl-1, and lin-41), we observed only modest changes in mRNA abundance and ribosome loading. lin-41 provides a striking example in that populations of ribosome-protected fragments from this gene remained essentially unchanged during the L3-L4 time interval when lin-41 activity is substantially down-regulated by let-7. Spectra of ribosomal positions were also examined for the five lin-4 and let-7 target mRNAs as a function of developmental time, with no indication of miRNA-induced ribosomal drop-off or significant pauses in translation. These data are consistent with models in which physiological regulation by this set of C. elegans miRNAs derives from combinatorial effects including suppressed recruitment/activation of translational machinery, compromised stability of target messages, and post- or peri-translational effects on lifetimes of polypeptide products.

Abstract

The effectiveness of RNA interference (RNAi) in many organisms is potentiated through the signal-amplifying activity of a targeted RNA-directed RNA polymerase (RdRP) system that can convert a small population of exogenously-encountered dsRNA fragments into an abundant internal pool of small interfering RNA (siRNA). As for any biological amplification system, we expect an underlying architecture that will limit the ability of a randomly encountered trigger to produce an uncontrolled and self-escalating response. Investigating such limits in Caenorhabditis elegans, we find that feed-forward amplification is limited by biosynthetic and structural distinctions at the RNA level between (1) triggers that can produce amplification and (2) siRNA products of the amplification reaction. By assuring that initial (primary) siRNAs can act as triggers but not templates for activation, and that the resulting (secondary) siRNAs can enforce gene silencing on additional targets without unbridled trigger amplification, the system achieves substantial but fundamentally limited signal amplification.

Abstract

In quiescent tissues, minicircle DNA vectors provide at least 10 times higher sustained levels of transgene expression compared to that achieved with a canonical plasmid containing the same expression cassette. It is not known if there is a specific DNA sequence or structure that is needed for DNA silencing. To directly address this question, we substituted the bacterial plasmid DNA with various lengths of extragenic spacer DNAs between the 5' and 3' ends of the transgene expression cassette and determined the expression profiles using two different reporter expression cassettes. Both the human alphoid repeat (AR) and randomly generated DNA sequences of ≥1 kb in length resulted in transgene silencing while shorter spacers, ≤500 bp exhibited similar transgene expression patterns to conventional minicircle DNA vectors. In contrast, when the ≥1 kb random DNA (RD) sequences were expressed as part of the 3'-untranslated region (UTR) transgene silencing was not observed. These data suggest that the length and not the sequence or origin of the extragenic DNA flanking the expression cassette is responsible for plasmid-mediated transgene silencing. This has implications for the design of nonviral vectors for gene transfer applications as well as providing insights into how genes are regulated.

Abstract

Epigenetic information is frequently erased near the start of each new generation. In some cases, however, epigenetic information can be transmitted from parent to progeny (multigenerational epigenetic inheritance). A particularly notable example of this type of epigenetic inheritance is double-stranded RNA-mediated gene silencing in Caenorhabditis elegans. This RNA-mediated interference (RNAi) can be inherited for more than five generations. To understand this process, here we conduct a genetic screen for nematodes defective in transmitting RNAi silencing signals to future generations. This screen identified the heritable RNAi defective 1 (hrde-1) gene. hrde-1 encodes an Argonaute protein that associates with small interfering RNAs in the germ cells of progeny of animals exposed to double-stranded RNA. In the nuclei of these germ cells, HRDE-1 engages the nuclear RNAi defective pathway to direct the trimethylation of histone H3 at Lys 9 (H3K9me3) at RNAi-targeted genomic loci and promote RNAi inheritance. Under normal growth conditions, HRDE-1 associates with endogenously expressed short interfering RNAs, which direct nuclear gene silencing in germ cells. In hrde-1- or nuclear RNAi-deficient animals, germline silencing is lost over generational time. Concurrently, these animals exhibit steadily worsening defects in gamete formation and function that ultimately lead to sterility. These results establish that the Argonaute protein HRDE-1 directs gene-silencing events in germ-cell nuclei that drive multigenerational RNAi inheritance and promote immortality of the germ-cell lineage. We propose that C. elegans use the RNAi inheritance machinery to transmit epigenetic information, accrued by past generations, into future generations to regulate important biological processes.

Abstract

Exogenous double-stranded RNA (dsRNA) has been shown to exert homology-dependent effects at the level of both target mRNA stability and chromatin structure. Using C. elegans undergoing RNAi as an animal model, we have investigated the generality, scope and longevity of dsRNA-targeted chromatin effects and their dependence on components of the RNAi machinery. Using high-resolution genome-wide chromatin profiling, we found that a diverse set of genes can be induced to acquire locus-specific enrichment of histone H3 lysine 9 trimethylation (H3K9me3), with modification footprints extending several kilobases from the site of dsRNA homology and with locus specificity sufficient to distinguish the targeted locus from the other 20,000 genes in the C. elegans genome. Genetic analysis of the response indicated that factors responsible for secondary siRNA production during RNAi were required for effective targeting of chromatin. Temporal analysis revealed that H3K9me3, once triggered by dsRNA, can be maintained in the absence of dsRNA for at least two generations before being lost. These results implicate dsRNA-triggered chromatin modification in C. elegans as a programmable and locus-specific response defining a metastable state that can persist through generational boundaries.

Abstract

The existence of many highly similar genes in the lymphocyte receptor gene loci makes them difficult to investigate, and the determination of phased "haplotypes" has been particularly problematic. However, V(D)J gene rearrangements provide an opportunity to infer the association of Ig genes along the chromosomes. The chromosomal distribution of H chain genes in an Ig genotype can be inferred through analysis of VDJ rearrangements in individuals who are heterozygous at points within the IGH locus. We analyzed VDJ rearrangements from 44 individuals for whom sufficient unique rearrangements were available to allow comprehensive genotyping. Nine individuals were identified who were heterozygous at the IGHJ6 locus and for whom sufficient suitable VDJ rearrangements were available to allow comprehensive haplotyping. Each of the 18 resulting IGHV│IGHD│IGHJ haplotypes was unique. Apparent deletion polymorphisms were seen that involved as many as four contiguous, functional IGHV genes. Two deletion polymorphisms involving multiple contiguous IGHD genes were also inferred. Three previously unidentified gene duplications were detected, where two sequences recognized as allelic variants of a single gene were both inferred to be on a single chromosome. Phased genomic data brings clarity to the study of the contribution of each gene to the available repertoire of rearranged VDJ genes. Analysis of rearrangement frequencies suggests that particular genes may have substantially different yet predictable propensities for rearrangement within different haplotypes. Together with data highlighting the extent of haplotypic variation within the population, this suggests that there may be substantial variability in the available Ab repertoires of different individuals.

Abstract

The primary cause of poor outcome following allogeneic hematopoietic cell transplantation (HCT) for chronic lymphocytic leukemia (CLL) is disease recurrence. Detection of increasing minimal residual disease (MRD) following HCT may permit early intervention to prevent clinical relapse; however, MRD quantification remains an uncommon diagnostic test because of logistical and financial barriers to widespread use. Here we describe a method for quantifying CLL MRD using widely available consensus primers for amplification of all Ig heavy chain (IGH) genes in a mixture of peripheral blood mononuclear cells, followed by high-throughput sequencing (HTS) for disease-specific IGH sequence quantification. To achieve accurate MRD quantification, we developed a systematic bioinformatic methodology to aggregate cancer clone sequence variants arising from systematic and random artifacts occurring during IGH-HTS. We then compared the sensitivity of IGH-HTS, flow cytometry, and allele-specific oligonucleotide PCR for MRD quantification in 28 samples collected from 6 CLL patients following allogeneic HCT. Using amplimer libraries generated with consensus primers from patient blood samples, we demonstrate the sensitivity of IGH-HTS with 454 pyrosequencing to be 10(-5), with a high correlation between quantification by allele-specific oligonucleotide PCR and IGH-HTS (r = 0.85). From the same dataset used to quantify MRD, IGH-HTS also allowed us to profile IGH repertoire reconstitution after HCT-information not provided by the other MRD methods. IGH-HTS using consensus primers will broaden the availability of MRD quantification in CLL and other B cell malignancies, and this approach has potential for quantitative evaluation of immune diversification following transplant and nontransplant therapies.

Abstract

In the universal genetic code, most amino acids can be encoded by multiple trinucleotide codons, and the choice among available codons can influence position-specific translation elongation rates. By using sequence-based ribosome profiling, we obtained transcriptome-wide profiles of in vivo ribosome occupancy as a function of codon identity in Caenorhabditis elegans and human cells. Particularly striking in these profiles was a universal trend of higher ribosome occupancy for codons translated via G:U wobble base-pairing compared with synonymous codons that pair with the same tRNA family using G:C base-pairing. These data support a model in which ribosomal translocation is slowed at wobble codon positions.

Abstract

The initial antibody response to HIV-1 is targeted to envelope (Env) gp41, and is nonneutralizing and ineffective in controlling viremia. To understand the origins and characteristics of gp41-binding antibodies produced shortly after HIV-1 transmission, we isolated and studied gp41-reactive plasma cells from subjects acutely infected with HIV-1. The frequencies of somatic mutations were relatively high in these gp41-reactive antibodies. Reverted unmutated ancestors of gp41-reactive antibodies derived from subjects acutely infected with HIV-1 frequently did not react with autologous HIV-1 Env; however, these antibodies were polyreactive and frequently bound to host or bacterial antigens. In one large clonal lineage of gp41-reactive antibodies, reactivity to HIV-1 Env was acquired only after somatic mutations. Polyreactive gp41-binding antibodies were also isolated from uninfected individuals. These data suggest that the majority of gp41-binding antibodies produced after acute HIV-1 infection are cross-reactive responses generated by stimulating memory B cells that have previously been activated by non-HIV-1 antigens.

Abstract

Adenosine deaminases that act on RNAs (ADARs) interact with double-stranded RNAs, deaminating adenosines to inosines. Previous studies of Caenorhabditis elegans indicated an antagonistic interaction between ADAR and RNAi machineries, with ADAR defects suppressed upon additional knockout of RNAi. This suggests a pool of common RNA substrates capable of engaging both pathways. To define and characterize such substrates, we examined small RNA and mRNA populations of ADAR mutants and identified a distinct set of loci from which RNAi-dependent short RNAs are markedly upregulated. At these same loci, we observed populations of multiply edited transcripts, supporting a specific role for ADARs in preventing access to the RNAi pathway for an extensive population of dsRNAs. Characterization of these loci revealed a substantial overlap with noncoding and intergenic regions, suggesting that the landscape of ADAR targets may extend beyond previously annotated classes of transcripts.

Abstract

The discovery of microRNAs (miRNAs) lin-4 and let-7 as temporal regulators in Caenorhabditis elegans led to broader searches for novel miRNAs and their biological roles. Unlike protein-coding genes and some long noncoding RNAs, canonical metazoan miRNAs are not known to contain introns within their genomic precursor sequences. Because the short length of miRNAs complicates a statistically definitive assignment of split genes in RNA sequencing data sets, we took an experimental approach toward testing the compatibility of splicing and functional miRNA biogenesis. To definitively evaluate the possibility that miRNAs could derive from interrupted genes, we constructed intron-interrupted variants of C. elegans lin-4 and assayed for their miRNA-encoding capability and biological activity in the developing organism. Our studies indicate that (1) intron-containing miRNAs (inc-miRs) can be efficiently spliced and processed to produce miRNAs with normal termini, and (2) these miRNAs can be functional in full rescue of developmental phenotypes in null mutants lacking endogenous lin-4. This study provides the first evidence to support the ability of intron-interrupted miRNA precursors to produce functional regulators and identifies an additional modality available for metazoan miRNA production.

Abstract

Nucleosomes are the basic packaging units of chromatin, modulating accessibility of regulatory proteins to DNA and thus influencing eukaryotic gene regulation. Elaborate chromatin remodelling mechanisms have evolved that govern nucleosome organization at promoters, regulatory elements, and other functional regions in the genome. Analyses of chromatin landscape have uncovered a variety of mechanisms, including DNA sequence preferences, that can influence nucleosome positions. To identify major determinants of nucleosome organization in the human genome, we used deep sequencing to map nucleosome positions in three primary human cell types and in vitro. A majority of the genome showed substantial flexibility of nucleosome positions, whereas a small fraction showed reproducibly positioned nucleosomes. Certain sites that position in vitro can anchor the formation of nucleosomal arrays that have cell type-specific spacing in vivo. Our results unveil an interplay of sequence-based nucleosome preferences and non-nucleosomal factors in determining nucleosome organization within mammalian cells.

Abstract

MicroRNAs (miRNAs) are small non-coding RNAs (18-24 nucleotides) that have recently been shown to regulate gene expression during cancer progression. Dicer, a central enzyme in the multi-component miRNA biogenesis pathway, is involved in cutting precursor miRNAs to functionally mature forms. Emerging evidence shows that Dicer expression is deregulated in some human malignancies and it correlates with tumor progression, yet this role has not yet been investigated in skin cancers.Using an anti-human monoclonal antibody against Dicer and immunohistochemistry, we compared the expression of Dicer protein among 404 clinically annotated controls and skin tumors consisting of melanocytic nevi (n = 71), a variety of melanomas (n = 223), carcinomas (n = 73) and sarcomas (n = 12). Results showed a cell-specific up-regulated Dicer in 81% of cutaneous, 80% of acrolentiginous and 96% of metastatic melanoma specimens compared to carcinoma or sarcoma specimens (P<0.0001). The expression of Dicer was significantly higher in melanomas compared to benign melanocytic nevi (P<0.0001). In patients with cutaneous melanomas, Dicer up-regulation was found to be significantly associated with an increased tumor mitotic index (P = 0.04), Breslow's depth of invasion (P = 0.03), nodal metastasis (P = 0.04) and a higher American Joint Committee on Caner (AJCC) clinical stage (P = 0.009). Using western blot analysis, we confirmed the cell-specific up-regulation of Dicer protein in vitro. A pooled-analysis on mRNA profiling in cutaneous tumors showed up-regulation of Dicer at the RNA level in cutaneous melanoma, also showing deregulation of other enzymes that participate in the biogenesis and maturation of canonical miRNAs.Increased Dicer expression may be a clinically useful biomarker for patients with cutaneous melanoma. Understanding deregulation of Dicer and its influence on miRNA maturation is needed to predict the susceptibility of melanoma patients to miRNA-based therapy in the future.

Abstract

C. elegans RDE-4 is a double-stranded RNA binding protein that has been shown to play a key role in response to foreign double-stranded RNA (dsRNA). We have used diverse tools for analysis of gene function to characterize the domain and organismal foci of RDE-4 action in C. elegans. First, we examined the focus of activity within the RDE-4 protein, by testing a series of RDE-4 deletion constructs for their ability to support dsRNA-triggered gene silencing. These assays indicated a molecular requirement for a linker region and the second dsRNA-binding domain of RDE-4, with ancillary contributions to function from the C and N terminal domains. Second, we used mosaic analysis to explore the cellular focus of action of RDE-4. These experiments indicated an ability of RDE-4 to function non-autonomously in foreign RNA responses. Third, we used growth under stressful conditions to search for evidence of an organismal focus of action for RDE-4 distinct from its role in response to foreign dsRNA. Propagation at high temperatures exposed a conditional requirement for RDE-4 for optimal growth and fertility, indicating at least under these conditions that RDE-4 can serve an essential role in C. elegans.

Abstract

The development of the germline in Caenorhabditis elegans is a complex process involving the regulation of thousands of genes in a coordinated manner. Several genes required for small RNA biogenesis and function are among those required for the proper organization of the germline. EGO-1 is a putative RNA-directed RNA polymerase (RdRP) that is required for multiple aspects of C. elegans germline development and efficient RNA interference (RNAi) of germline-expressed genes. RdRPs have been proposed to act through a variety of mechanisms, including the posttranscriptional targeting of specific mRNAs, as well as through a direct interaction with chromatin. Despite extensive investigation, the molecular role of EGO-1 has remained enigmatic.Here we use high-throughput small RNA and messenger RNA sequencing to investigate EGO-1 function. We found that EGO-1 is required to produce a distinct pool of small RNAs antisense to a number of germline-expressed mRNAs through several developmental stages. These potential mRNA targets fall into distinct classes, including genes required for kinetochore and nuclear pore assembly, histone-modifying activities, and centromeric proteins. We also found several RNAi-related genes to be targets of EGO-1. Finally, we show a strong association between the loss of small RNAs and the rise of mRNA levels in ego-1(-) animals.Our data support the conclusion that EGO-1 produces triphosphorylated small RNAs derived from mRNA templates and that these small RNAs modulate gene expression through the targeting of their cognate mRNAs.

Abstract

We have used a combination of three high-throughput RNA capture and sequencing methods to refine and augment the transcriptome map of a well-studied genetic model, Caenorhabditis elegans. The three methods include a standard (non-directional) library preparation protocol relying on cDNA priming and foldback that has been used in several previous studies for transcriptome characterization in this species, and two directional protocols, one involving direct capture of single-stranded RNA fragments and one involving circular-template PCR (CircLigase). We find that each RNA-seq approach shows specific limitations and biases, with the application of multiple methods providing a more complete map than was obtained from any single method. Of particular note in the analysis were substantial advantages of CircLigase-based and ssRNA-based capture for defining sequences and structures of the precise 5' ends (which were lost using the double-strand cDNA capture method). Of the three methods, ssRNA capture was most effective in defining sequences to the poly(A) junction. Using data sets from a spectrum of C. elegans strains and stages and the UCSC Genome Browser, we provide a series of tools, which facilitate rapid visualization and assignment of gene structures.

Abstract

MicroRNAs provide developing systems with substantial flexibility in posttranscriptional gene regulation. Despite advances made in understanding microRNA structure and function, the relationships between their site-of-synthesis and site-of-action ("autonomy" versus "non-autonomy") remain an open question. Given the well-defined role of microRNA lin-4 in a reproducible series of time-specific developmental switches, lin-4 is an excellent candidate for understanding whether microRNAs and the resulting heterochronic regulatory pathway have the potential to act cell autonomously. By monitoring temporal development and reporter activity in animals where lin-4 is modulated, we have demonstrated that lin-4 acts cell autonomously to specify temporal identity. This work (i) provides an example of cell autonomy in microRNA functions, and (ii) reveals a cell autonomous component of temporal regulation in C. elegans.

Abstract

Tissue differentiation is accompanied by genome-wide changes in the underlying chromatin structure and dynamics, or epigenome. By controlling when, where, and what regulatory factors have access to the underlying genomic DNA, the epigenome influences the cell's transcriptome and ultimately its function. Existing genomic methods for analyzing cell-type-specific changes in chromatin generally involve two elements: (i) a source for purified cells (or nuclei) of distinct types, and (ii) a specific treatment that partitions or degrades chromatin by activity or structural features. For many cell types of great interest, such assays are limited by our inability to isolate the relevant cell populations in an organism or complex tissue containing an intertwined mixture of other cells. This limitation has confined available knowledge of chromatin dynamics to a narrow range of biological systems (cell types that can be sorted/separated/dissected in large numbers and tissue culture models) or to amalgamations of diverse cell types (tissue chunks, whole organisms).Transgene-driven expression of DNA/chromatin modifying enzymes provides one opportunity to query chromatin structures in expression-defined cell subsets. In this work we combine in vivo expression of a bacterial DNA adenine methyltransferase (DAM) with high throughput sequencing to sample tissue-specific chromatin accessibility on a genome-wide scale. We have applied the method (DALEC: Direct Asymmetric Ligation End Capture) towards mapping a cell-type-specific view of genome accessibility as a function of differentiated state. Taking advantage of C. elegans strains expressing the DAM enzyme in diverse tissues (body wall muscle, gut, and hypodermis), our efforts yield a genome-wide dataset measuring chromatin accessibility at each of 538,000 DAM target sites in the C. elegans (diploid) genome.Validating the DALEC mapping results, we observe a strong association between observed coverage by nucleosomes and low DAM accessibility. Strikingly, we observed no extended regions of inaccessible chromatin for any of the tissues examined. These results are consistent with "local choreography" models in which differential gene expression is driven by intricate local rearrangements of chromatin structure rather than gross impenetrability of large chromosomal regions.

Abstract

The physiological function of eukaryotic DNA occurs in the context of nucleosomal arrays that can expose or obscure defined segments of the genome. Certain DNA sequences are capable of strongly positioning a nucleosome in vitro, suggesting the possibility that favorable intrinsic signals might reproducibly structure chromatin segments. As high-throughput sequencing analyses of nucleosome coverage in vitro and in vivo have become possible, a vigorous debate has arisen over the degree to which intrinsic DNA:nucleosome affinities orchestrate the in vivo positions of nucleosomes, thereby controlling physical accessibility of specific sequences in DNA.We describe here the in vivo consequences of placing a synthetic high-affinity nucleosome-positioning signal, the 601 sequence, into a DNA plasmid vector in mice. Strikingly, the 601 sequence was sufficient to position nucleosomes during an early phase after introduction of the DNA into the mice (when the plasmid vector transgene was active). This positioning capability was transient, with a loss of strong positioning at a later time point when the transgenes had become silent.These results demonstrate an ability of DNA sequences selected solely for nucleosome affinity to organize chromatin in vivo, and the ability of other mechanisms to overcome these interactions in a dynamic nuclear environment.

Abstract

Individual variation in the Ig germline gene repertoire leads to individual differences in the combinatorial diversity of the Ab repertoire, but the study of such variation has been problematic. The application of high-throughput DNA sequencing to the study of rearranged Ig genes now makes this possible. The sequencing of thousands of VDJ rearrangements from an individual, either from genomic DNA or expressed mRNA, should allow their germline IGHV, IGHD, and IGHJ repertoires to be inferred. In addition, where previously mere glimpses of diversity could be gained from sequencing studies, new large data sets should allow the rearrangement frequency of different genes and alleles to be seen with clarity. We analyzed the DNA of 108,210 human IgH chain rearrangements from 12 individuals and determined their individual IGH genotypes. The number of reportedly functional IGHV genes and allelic variants ranged from 45 to 60, principally because of variable levels of gene heterozygosity, and included 14 previously unreported IGHV polymorphisms. New polymorphisms of the IGHD3-16 and IGHJ6 genes were also seen. At heterozygous loci, remarkably different rearrangement frequencies were seen for the various IGHV alleles, and these frequencies were consistent between individuals. The specific alleles that make up an individual's Ig genotype may therefore be critical in shaping the combinatorial repertoire. The extent of genotypic variation between individuals is highlighted by an individual with aplastic anemia who appears to lack six contiguous IGHD genes on both chromosomes. These deletions significantly alter the potential expressed IGH repertoire, and possibly immune function, in this individual.

Abstract

Ultra-high throughput sequencing technologies provide opportunities both for discovery of novel molecular species and for detailed comparisons of gene expression patterns. Small RNA populations are particularly well suited to this analysis, as many different small RNAs can be completely sequenced in a single instrument run.We prepared small RNA libraries from 29 tumour/normal pairs of human cervical tissue samples. Analysis of the resulting sequences (42 million in total) defined 64 new human microRNA (miRNA) genes. Both arms of the hairpin precursor were observed in twenty-three of the newly identified miRNA candidates. We tested several computational approaches for the analysis of class differences between high throughput sequencing datasets and describe a novel application of a log linear model that has provided the most effective analysis for this data. This method resulted in the identification of 67 miRNAs that were differentially-expressed between the tumour and normal samples at a false discovery rate less than 0.001.This approach can potentially be applied to any kind of RNA sequencing data for analysing differential sequence representation between biological sample sets.

Abstract

Competition between mammalian RNAi-related gene silencing pathways is well documented. It is therefore important to identify all classes of small RNAs to determine their relationship with RNAi and how they affect each other functionally. Here, we identify two types of 5'-phosphate, 3'-hydroxylated human tRNA-derived small RNAs (tsRNAs). tsRNAs differ from microRNAs in being essentially restricted to the cytoplasm and in associating with Argonaute proteins, but not MOV10. The first type belongs to a previously predicted Dicer-dependent class of small RNAs that we find can modestly down-regulate target genes in trans. The 5' end of type II tsRNA was generated by RNaseZ cleavage downstream from a tRNA gene, while the 3' end resulted from transcription termination by RNA polymerase III. Consistent with their preferential association with the nonslicing Argonautes 3 and 4, canonical gene silencing activity was not observed for type II tsRNAs. The addition, however, of an oligonucleotide that was sense to the reporter gene, but antisense to an overexpressed version of the type II tsRNA, triggered robust, >80% gene silencing. This correlated with the redirection of the thus reconstituted fully duplexed double-stranded RNA into Argonaute 2, whereas Argonautes 3 and 4 were skewed toward less structured small RNAs, particularly single-strand RNAs. We observed that the modulation of tsRNA levels had minor effects on the abundance of microRNAs, but more pronounced changes in the silencing activities of both microRNAs and siRNAs. These findings support that tsRNAs are involved in the global control of small RNA silencing through differential Argonaute association, suggesting that small RNA-mediated gene regulation may be even more finely regulated than previously realized.

Abstract

Endogenous RNA-directed RNA polymerases (RdRPs) are cellular components capable of synthesizing new complementary RNAs from existing RNA templates. We present evidence for successive engagement of two different RdRPs in an endogenous siRNA-based mechanism targeting specific mRNAs in C. elegans soma. In the initiation stage of this process, a group of mRNA species are chosen as targets for downregulation, leading to accumulation of rare 26 nt 5'-phosphorylated antisense RNAs that depend on the RdRP homolog RRF-3, the Argonaute ERGO-1, DICER, and a series of associated ("ERI") factors. This primary process leads to production of a much more abundant class of 22 nt antisense RNAs, dependent on a secondary RdRP (RRF-1) and associating with at least one distinct Argonaute (NRDE-3). The requirement for two RdRP/Argonaute combinations and initiation by a rare class of uniquely structured siRNAs in this pathway illustrate the caution and flexibility used as biological systems exploit the physiological copying of RNA.

Abstract

We have used multiplexed high-throughput sequencing to characterize changes in small RNA populations that occur during viral infection in animal cells. Small RNA-based mechanisms such as RNA interference (RNAi) have been shown in plant and invertebrate systems to play a key role in host responses to viral infection. Although homologs of the key RNAi effector pathways are present in mammalian cells, and can launch an RNAi-mediated degradation of experimentally targeted mRNAs, any role for such responses in mammalian host-virus interactions remains to be characterized. Six different viruses were examined in 41 experimentally susceptible and resistant host systems. We identified virus-derived small RNAs (vsRNAs) from all six viruses, with total abundance varying from "vanishingly rare" (less than 0.1% of cellular small RNA) to highly abundant (comparable to abundant micro-RNAs "miRNAs"). In addition to the appearance of vsRNAs during infection, we saw a number of specific changes in host miRNA profiles. For several infection models investigated in more detail, the RNAi and Interferon pathways modulated the abundance of vsRNAs. We also found evidence for populations of vsRNAs that exist as duplexed siRNAs with zero to three nucleotide 3' overhangs. Using populations of cells carrying a Hepatitis C replicon, we observed strand-selective loading of siRNAs onto Argonaute complexes. These experiments define vsRNAs as one possible component of the interplay between animal viruses and their hosts.

Abstract

We have characterized two post-translational histone modifications in Caenorhabditis elegans on a genomic scale. Micrococcal nuclease digestion and immunoprecipitation were used to obtain distinct populations of single nucleosome cores, which were analyzed using massively parallel DNA sequencing to obtain positional and coverage maps. Two methylated histone H3 populations were chosen for comparison: H3K4 histone methylation (associated with active chromosomal regions) and H3K9 histone methylation (associated with inactivity). From analysis of the sequence data, we found nucleosome cores with these modifications to be enriched in two distinct partitions of the genome; H3K4 methylation was particularly prevalent in promoter regions of widely expressed genes, while H3K9 methylation was enriched on specific chromosomal arms. For each of the six chromosomes, the highest level of H3K9 methylation corresponds to the pairing center responsible for chromosome alignment during meiosis. Enrichment of H3K9 methylation at pairing centers appears to be an early mark in meiotic chromosome sorting, occurring in the absence of components required for proper pairing of homologous chromosomes. H3K9 methylation shows an intricate pattern within the chromosome arms with a particular anticorrelation to regions that display a strong approximately 10.5 bp periodicity of AA/TT dinucleotides that is known to associate with germline transcription. By contrast to the global features observed with H3K9 methylation, H3K4 methylation profiles were most striking in their local characteristics around promoters, providing a unique promoter-central landmark for 3,903 C. elegans genes and allowing a precise analysis of nucleosome positioning in the context of transcriptional initiation.

Abstract

The complex repertoire of immune receptors generated by B and T cells enables recognition of diverse threats to the host organism. In this work, we show that massively parallel DNA sequencing of rearranged immune receptor loci can provide direct detection and tracking of immune diversity and expanded clonal lymphocyte populations in physiological and pathological contexts. DNA was isolated from blood and tissue samples, a series of redundant primers was used to amplify diverse DNA rearrangements, and the resulting mixtures of barcoded amplicons were sequenced using long-read ultra deep sequencing. Individual DNA molecules were then characterized on the basis of DNA segments that had been joined to make a functional (or nonfunctional) immune effector. Current experimental designs can accommodate up to 150 samples in a single sequence run, with the depth of sequencing sufficient to identify stable and dynamic aspects of the immune repertoire in both normal and diseased circumstances. These data provide a high-resolution picture of immune spectra in normal individuals and in patients with hematological malignancies, illuminating, in the latter case, both the initial behavior of clonal tumor populations and the later suppression or re-emergence of such populations after treatment.

Abstract

Short interfering RNAs (siRNAs) are a class of regulatory effectors that enforce gene silencing through formation of RNA duplexes. Although progress has been made in identifying the capabilities of siRNAs in silencing foreign RNA and transposable elements, siRNA functions in endogenous gene regulation have remained mysterious. In certain organisms, siRNA biosynthesis involves novel enzymes that act as RNA-directed RNA polymerases (RdRPs). Here we analyze the function of a Caenorhabditis elegans RdRP, RRF-3, during spermatogenesis. We found that loss of RRF-3 function resulted in pleiotropic defects in sperm development and that sperm defects led to embryonic lethality. Notably, sperm nuclei in mutants of either rrf-3 or another component of the siRNA pathway, eri-1, were frequently surrounded by ectopic microtubule structures, with spindle abnormalities in a subset of the resulting embryos. Through high-throughput small RNA sequencing, we identified a population of cellular mRNAs from spermatogenic cells that appear to serve as templates for antisense siRNA synthesis. This set of genes includes the majority of genes known to have enriched expression during spermatogenesis, as well as many genes not previously known to be expressed during spermatogenesis. In a subset of these genes, we found that RRF-3 was required for effective siRNA accumulation. These and other data suggest a working model in which a major role of the RRF-3/ERI pathway is to generate siRNAs that set patterns of gene expression through feedback repression of a set of critical targets during spermatogenesis.

Abstract

Archived formalin-fixed, paraffin-embedded human tumors are widely available and represent a unique source of morphologically defined material. Formalin-fixed, paraffin-embedded tissue is known to contain a wealth of molecular information in the form of microRNAs (miRNAs), which could be correlated with clinical outcome for improved prognostication and/or treatment response. miRNAs are endogenous, noncoding RNAs ( approximately 22 nucleotides) and may function as tumor suppressors or oncogenes. A reliable, robust methodology is needed to take full advantage of archived human cancers, especially for those where fresh-frozen tumor banks are unavailable, for example, malignant melanoma. To this end, we applied a simple-to-use protocol for extracting total RNA from various formalin-fixed, paraffin-embedded specimens (colon, liver, prostate, thyroid, uterus, and skin), optimized for small RNA recovery. Using a "poison primer" strategy (ie, primer silencing), we blocked the amplification of ribosomal RNA, enabling the successful sequencing of 17 novel and 53 known miRNAs (including small RNAs) from 10-year-old archived normal skin, cutaneous scalp melanoma, and sentinel lymph nodes (both negative and positive for metastasis) excised from a 52-year-old man. The cloning incidence provided an estimation of the level of specific miRNA expression, which was confirmed by Northern analysis and quantitative real-time polymerase chain reaction. This methodology can therefore be used to facilitate miRNA discovery from archived human cancers.

Abstract

Previous work in C. elegans has shown that posterior embryonic bodywall muscle lineages are regulated through a genetically defined transcriptional cascade that includes PAL-1/Caudal-mediated activation of muscle-specific transcription factors, including HLH-1/MRF and UNC-120/SRF, which together orchestrate specification and differentiation. Using chromatin immunoprecipitation (ChIP) in embryos, we now demonstrate direct binding of PAL-1 in vivo to an hlh-1 enhancer element. Through mutational analysis of the evolutionarily conserved sequences within this enhancer, we identify two cis-acting elements and their associated transacting factors (PAL-1 and HLH-1) that are crucial for the temporal-spatial expression of hlh-1 and proper myogenesis. Our data demonstrate that hlh-1 is indeed a direct target of PAL-1 in the posterior embryonic C. elegans muscle lineages, defining a novel in vivo binding site for this crucial developmental regulator. We find that the same enhancer element is also a target of HLH-1 positive auto regulation, underlying (at least in part) the sustained high levels of CeMyoD in bodywall muscle throughout development. Together, these results provide a molecular framework for the gene regulatory network activating the muscle module during embryogenesis.

Abstract

Might DNA sequence variation reflect germline genetic activity and underlying chromatin structure? We investigated this question using medaka (Japanese killifish, Oryzias latipes), by comparing the genomic sequences of two strains (Hd-rR and HNI) and by mapping approximately 37.3 million nucleosome cores from Hd-rR blastulae and 11,654 representative transcription start sites from six embryonic stages. We observed a distinctive approximately 200-base pair (bp) periodic pattern of genetic variation downstream of transcription start sites; the rate of insertions and deletions longer than 1 bp peaked at positions of approximately +200, +400, and +600 bp, whereas the point mutation rate showed corresponding valleys. This approximately 200-bp periodicity was correlated with the chromatin structure, with nucleosome occupancy minimized at positions 0, +200, +400, and +600 bp. These data exemplify the potential for genetic activity (transcription) and chromatin structure to contribute to molding the DNA sequence on an evolutionary time scale.

Abstract

Heritable silencing effects are gene suppression phenomena that can persist for generations after induction. In the majority of RNAi experiments conducted in Caenorhabditis elegans, the silencing response results in a hypomorphic phenotype where the effects recede after the F1 generation. F2 and subsequent generations revert to the original phenotype. Specific examples of transgenerational RNAi in which effects persist to the F2 generation and beyond have been described. In this study, we describe a systematic pedigree-based analysis of heritable silencing processes resulting from initiation of interference targeted at the C. elegans oocyte maturation factor oma-1. Heritable silencing of oma-1 is a dose-dependent process where the inheritance of the silencing factor is unequally distributed among the population. Heritability is not constant over generational time, with silenced populations appearing to undergo a bottleneck three to four generations following microinjection of RNA. Transmission of silencing through these generations can be through either maternal or paternal gamete lines and is surprisingly more effective through the male gametic line. Genetic linkage tests reveal that silencing in the early generations is transmitted independently of the original targeted locus, in a manner indicative of a diffusible epigenetic element.

Abstract

Mitochondrial homeostasis reflects a dynamic balance between membrane fission and fusion events thought essential for mitochondrial function. We report here that altered expression of the C. elegans BCL2 homolog CED-9 affects both mitochondrial fission and fusion. Although striated muscle cells lacking CED-9 have no alteration in mitochondrial size or ultrastructure, these cells appear more sensitive to mitochondrial fragmentation. By contrast, increased CED-9 expression in these cells produces highly interconnected mitochondria. This mitochondrial phenotype is partially suppressed by increased expression of the dynamin-related GTPase DRP-1, with suppression dependent on the BH3 binding pocket of CED-9. This suppression suggests that CED-9 directly regulates DRP-1, a model supported by our finding that CED-9 activates the GTPase activity of human DRP1. Thus, CED-9 is capable of regulating the mitochondrial fission-fusion cycle but is not essential for either fission or fusion.

Abstract

Using the massively parallel technique of sequencing by oligonucleotide ligation and detection (SOLiD; Applied Biosystems), we have assessed the in vivo positions of more than 44 million putative nucleosome cores in the multicellular genetic model organism Caenorhabditis elegans. These analyses provide a global view of the chromatin architecture of a multicellular animal at extremely high density and resolution. While we observe some degree of reproducible positioning throughout the genome in our mixed stage population of animals, we note that the major chromatin feature in the worm is a diversity of allowed nucleosome positions at the vast majority of individual loci. While absolute positioning of nucleosomes can vary substantially, relative positioning of nucleosomes (in a repeated array structure likely to be maintained at least in part by steric constraints) appears to be a significant property of chromatin structure. The high density of nucleosomal reads enabled a substantial extension of previous analysis describing the usage of individual oligonucleotide sequences along the span of the nucleosome core and linker. We release this data set, via the UCSC Genome Browser, as a resource for the high-resolution analysis of chromatin conformation and DNA accessibility at individual loci within the C. elegans genome.

Abstract

The evolutionary origin of human hepatitis delta virus (HDV) replication by RNA-directed transcription is unclear. Here we identify two species of 5'-capped, approximately 18-25-nucleotide small RNAs. One was of antigenomic polarity, corresponding to the 5' end of hepatitis delta antigen (HDAg) mRNA, and interacted with HDAg and RNA polymerase II (Pol II), whereas the other mapped to a structurally analogous region on the genomic RNA hairpin. An HDAg-interaction screen indicated that HDAg interacts with MOV10, the human homolog of the Arabidopsis thaliana RNA amplification factor gene SDE3 and Drosophila melanogaster RISC-maturation factor gene Armitage (armi). MOV10 knockdown inhibited HDV replication, but not HDAg mRNA translation, further supporting a role for MOV10 in RNA-directed transcription. Together, our studies define RNA hairpins as critical elements for the initiation of HDV-related, RNA-directed transcription. The identification of capped small RNAs and the involvement of MOV10 in HDV replication further suggest a conserved mechanism related to RNA-directed transcription in lower eukaryotes.

Abstract

MicroRNAs (miRNAs) are approximately 22 nucleotide-long noncoding RNAs involved in several biological processes including development, differentiation and proliferation. Recent studies suggest that knowledge of miRNA expression patterns in cancer may have substantial value for diagnostic and prognostic determinations as well as for eventual therapeutic intervention. We performed comprehensive analysis of miRNA expression profiles of 27 sarcomas, 5 normal smooth muscle and 2 normal skeletal muscle tissues using microarray technology and/or small RNA cloning approaches. The miRNA expression profiles are distinct among the tumor types as demonstrated by an unsupervised hierarchical clustering, and unique miRNA expression signatures were identified in each tumor class. Remarkably, the miRNA expression patterns suggested that two of the sarcomas had been misdiagnosed and this was confirmed by reevaluation of the tumors using histopathologic and molecular analyses. Using the cloning approach, we also identified 31 novel miRNAs or other small RNA effectors in the sarcomas and normal skeletal muscle tissues examined. Our data show that different histological types of sarcoma have distinct miRNA expression patterns, reflecting the apparent lineage and differentiation status of the tumors. The identification of unique miRNA signatures in each tumor type may indicate their role in tumorigenesis and may aid in diagnosis of soft tissue sarcomas.

Abstract

Bcl-2 proteins regulate apoptosis in organisms as diverse as mammals and nematodes. These proteins are often localized at mitochondria by a C-terminal transmembrane domain. Although the transmembrane domain and mitochondrial localization are centrally involved in specific cases of vertebrate Bcl-2 activity, the significance of this localization is not clear for all species. Studying the Caenorhabditis elegans Bcl-2 homolog CED-9, we found that the transmembrane domain was both necessary and sufficient for localization at mitochondrial outer membranes. Furthermore, we found that in our assays, ced-9 transgenes lacking the transmembrane domain, although somewhat less active than equivalent transgenes derived from wild-type ced-9, rescued embryonic lethality of ced-9(lf) animals and responded properly to upstream signals in controlling the fate of Pn.aap neurons. Both of these apoptotic activities were retained in a construct where CED-9 lacking the transmembrane domain was targeted to the cytosolic surface of the endoplasmic reticulum and derived organelles, suggesting that in wild-type animals, accumulation at mitochondria is not essential for CED-9 to either inhibit or promote apoptosis in C. elegans. Taken together, these data are consistent with a multimodal character of CED-9 action, with an ability to regulate apoptosis through interactions in the cytosol coexisting with additional evolutionarily conserved role(s) at the membrane.

Abstract

Multiplexed high-throughput pyrosequencing is currently limited in complexity (number of samples sequenced in parallel), and in capacity (number of sequences obtained per sample). Physical-space segregation of the sequencing platform into a fixed number of channels allows limited multiplexing, but obscures available sequencing space. To overcome these limitations, we have devised a novel barcoding approach to allow for pooling and sequencing of DNA from independent samples, and to facilitate subsequent segregation of sequencing capacity. Forty-eight forward-reverse barcode pairs are described: each forward and each reverse barcode unique with respect to at least 4 nt positions. With improved read lengths of pyrosequencers, combinations of forward and reverse barcodes may be used to sequence from as many as n(2) independent libraries for each set of 'n' forward and 'n' reverse barcodes, for each defined set of cloning-linkers. In two pilot series of barcoded sequencing using the GS20 Sequencer (454/Roche), we found that over 99.8% of obtained sequences could be assigned to 25 independent, uniquely barcoded libraries based on the presence of either a perfect forward or a perfect reverse barcode. The false-discovery rate, as measured by the percentage of sequences with unexpected perfect pairings of unmatched forward and reverse barcodes, was estimated to be <0.005%.

Abstract

We combined components of a previous assay referred to as Molecular Inversion Probe (MIP) with a complete gap filling strategy, creating a versatile powerful one-primer multiplex amplification system. As a proof-of-concept, this novel method, which employs a Connector Inversion Probe (CIPer), was tested as a genetic tool for pathogen diagnosis, typing, and antibiotic resistance screening with two distinct systems: i) a conserved sequence primer system for genotyping Human Papillomavirus (HPV), a cancer-associated viral agent and ii) screening for antibiotic resistance mutations in the bacterial pathogen Neisseria gonorrhoeae. We also discuss future applications and advances of the CIPer technology such as integration with digital amplification and next-generation sequencing methods. Furthermore, we introduce the concept of two-dimension informational barcodes, i.e. "multiplex multiplexing padlocks" (MMPs). For the readers' convenience, we also provide an on-line tutorial with user-interface software application CIP creator 1.0.1, for custom probe generation from virtually any new or established primer-pairs.

Abstract

MicroRNAs (miRNAs) are regulatory molecules that share both biosynthetic derivation (cleavage from short hairpin precursor RNAs) and functional roles (downregulation of specific mRNAs through targeted degradation and/or translational inhibition). A distinct family of small RNAs, termed siRNAs, have some common characteristics but exhibit distinct modes of biosynthesis and function. In this study, we report procedures for purification of a predominant species of miRNA-containing ribonucleoprotein complexes from Caenorhabditis elegans and demonstrate that this population is distinct from the predominant pool of siRNA-containing ribonucleoprotein complexes. An observed miRNP-associated RNA population consisting predominantly (>95%) of miRNAs supported the unique identity of miRNPs as biological effectors within the cell, provided clean material for analysis of changes in miRNA spectra during development, and provided strong evidence of miRNA character for a number of novel small RNAs. Likewise, the RNA spectrum derived from partial siRNP purification was useful in defining functional characteristics of this more diverse population of small RNAs.

Abstract

Recent studies suggest that knowledge of differential expression of microRNAs (miRNA) in cancer may have substantial diagnostic and prognostic value. Here, we use a direct sequencing method to characterize the profiles of miRNAs and other small RNA segments for six human cervical carcinoma cell lines and five normal cervical samples. Of 166 miRNAs expressed in normal cervix and cancer cell lines, we observed significant expression variation of six miRNAs between the two groups. To further show the biological relevance of our findings, we examined the expression level of two significantly varying miRNAs in a panel of 29 matched pairs of human cervical cancer and normal cervical samples. Reduced expression of miR-143 and increased expression of miR-21 were reproducibly displayed in cancer samples, suggesting the potential value of these miRNAs as tumor markers. In addition to the known miRNAs, we found a number of novel miRNAs and an additional set of small RNAs that do not meet miRNA criteria.

Abstract

RNA interference (RNAi) is a phylogenetically widespread gene-silencing process triggered by double-stranded RNA. In plants and Caenorhabditis elegans, two distinct populations of small RNAs have been proposed to participate in RNAi: "Primary siRNAs" (derived from DICER nuclease-mediated cleavage of the original trigger) and "secondary siRNAs" [additional small RNAs whose synthesis requires an RNA-directed RNA polymerase (RdRP)]. Analyzing small RNAs associated with ongoing RNAi in C. elegans, we found that secondary siRNAs constitute the vast majority. The bulk of secondary siRNAs exhibited structure and sequence indicative of a biosynthetic mode whereby each molecule derives from an independent de novo initiation by RdRP. Analysis of endogenous small RNAs indicated that a fraction derive from a biosynthetic mechanism that is similar to that of secondary siRNAs formed during RNAi, suggesting that small antisense transcripts derived from cellular messenger RNAs by RdRP activity may have key roles in cellular regulation.

Abstract

Nucleosome positions within the chromatin landscape are known to serve as a major determinant of DNA accessibility to transcription factors and other interacting components. To delineate nucleosomal patterns in a model genetic organism, Caenorhabditis elegans, we have carried out a genome-wide analysis in which DNA fragments corresponding to nucleosome cores were liberated using an enzyme (micrococcal nuclease) with a strong preference for cleavage in non-nucleosomal regions. Sequence analysis of 284,091 putative nucleosome cores obtained in this manner from a mixed-stage population of C. elegans reveals a combined picture of flexibility and constraint in nucleosome positioning. As has previously been observed in studies of individual loci in diverse biological systems, we observe areas in the genome where nucleosomes can adopt a wide variety of positions in a given region, areas with little or no nucleosome coverage, and areas where nucleosomes reproducibly adopt a specific positional pattern. In addition to illuminating numerous aspects of chromatin structure for C. elegans, this analysis provides a reference from which to begin an investigation of relationships between the nucleosomal pattern, chromosomal architecture, and lineage-based gene activity on a genome-wide scale.

Abstract

We have developed a differential cytolocalization assay (DCLA) that allows the observation of cytoplasmic protein/protein interactions in vivo. In the DCLA, interactions are visualized as a relocalization of a green fluorescent protein-tagged "prey" by a membrane-bound "bait." This assay was tested and utilized in Caenorhabditis elegans to probe interactions among proteins involved in RNA interference (RNAi) and nonsense-mediated decay (NMD) pathways. Several previously documented interactions were confirmed with DCLA, whereas uniformly negative results were obtained in several controls in which no interaction was expected. Novel interactions were also observed, including the association of SMG-5, a protein required for NMD, to several components of the RNAi pathway. The DCLA can be readily carried out under diverse conditions, allowing a dynamic assessment of protein interactions in vivo. We used this property to test a subset of the RNAi and NMD interactions in animals in which proteins central to each mechanism were mutated; several key associations in each machinery that can occur in vivo in the absence of a functional process were identified.

Abstract

In C. elegans, the Sma/Mab TGFbeta signaling pathway regulates body size and male tail patterning. SMA-9, the C. elegans homolog of Schnurri, has been shown to function as a downstream component to mediate the Sma/Mab TGFbeta signaling pathway in these processes. We have discovered a new role for SMA-9 in dorsoventral patterning of the C. elegans post-embryonic mesoderm, the M lineage. In addition to a small body size, sma-9 mutant animals exhibit a dorsal-to-ventral fate transformation within the M lineage. This M lineage defect of sma-9 mutants is unique in that animals carrying mutations in all other known components of the TGFbeta pathway exhibit no M lineage defects. Surprisingly, mutations in the core components of the Sma/Mab TGFbeta signaling pathway suppressed the M lineage defects of sma-9 mutants without suppressing their body size defects. We show that this suppression specifically happens within the M lineage. Our studies have uncovered an unexpected role of SMA-9 in antagonizing the TGFbeta signaling pathway during mesodermal patterning, suggesting a novel mode of function for the SMA-9/Schnurri family of proteins.

Abstract

We describe a surprising long-range periodicity that underlies a substantial fraction of C. elegans genomic sequence. Extended segments (up to several hundred nucleotides) of the C. elegans genome show a strong bias toward occurrence of AA/TT dinucleotides along one face of the helix while little or no such constraint is evident on the opposite helical face. Segments with this characteristic periodicity are highly overrepresented in intron sequences and are associated with a large fraction of genes with known germline expression in C. elegans. In addition to altering the path and flexibility of DNA in vitro, sequences of this character have been shown by others to constrain DNA::nucleosome interactions, potentially producing a structure that could resist the assembly of highly ordered (phased) nucleosome arrays that have been proposed as a precursor to heterochromatin. We propose a number of ways that the periodic occurrence of An/Tn clusters could reflect evolution and function of genes that express in the germ cell lineage of C. elegans.

Abstract

Several bioinformatics studies have identified an unexpected but remarkably prevalent approximately 10 bp periodicity of AA/TT dinucleotides (hyperperiodicity) in certain regions of the Caenorhabditis elegans genome. Although the relevant C.elegans DNA segments share certain sequence characteristics with bent DNAs from other sources (e.g. trypanosome mitochondria), the nematode sequences exhibit a much more extensive and defined hyperperiodicity. Given the presence of hyperperiodic structures in a number of critical C.elegans genes, the physical characteristics of hyperperiodic DNA are of considerable interest. In this work, we demonstrate that several hyperperiodic DNA segments from C.elegans exhibit structural anomalies using high-resolution atomic force microscopy (AFM) and gel electrophoresis. Our quantitative analysis of AFM images reveals that hyperperiodic DNA adopts a significantly smaller mean square end-to-end distance, hence a more compact coil structure, compared with non-periodic DNA of similar length. While molecules remain capable of adopting both bent and straight (rod-like) configurations, indicating that their flexibility is still retained, examination of the local curvatures along the DNA contour length reveals that the decreased mean square end-to-end distance can be attributed to the presence of long-scale intrinsic bending in hyperperiodic DNA. Such bending is not detected in non-periodic DNA. Similar studies of shorter, nucleosome-length DNAs that survived micrococcal nuclease digestion show that sequence hyperperiodicity in short segments can likewise induce strong intrinsic bending. It appears, therefore, that regions of the C.elegans genome display a significant correlation between DNA sequence and unusual mechanical properties.

Nucleic acid structure and intracellular immunity: some recent ideas from the world of RNAiWorkshop on Fundamentals of Biomolecular Function - Nucleic Acids, Proteins and MembranesFire, A.CAMBRIDGE UNIV PRESS.2005: 303–9

Abstract

Cells face a constant struggle against unwanted instructions that arrive in the form of viruses and transposons. At the core of this battle are two issues: how can cellular machinery recognize certain informational molecules as 'unwanted' and how can the cell use this recognition to effectively silence malicious genetic activity. While defenses against some specific parasites may be triggered by individual nucleic acid or protein sequences, such sequence-specific mechanisms have the limitation of allowing the parasite to evade following relatively minor evolutionary change. A more general set of defense mechanisms is based on recognition of structural features that are intrinsic aspects of one or more parasitic lifestyle. Recognition of extended regions of double-stranded RNA (dsRNA) provides cells with one such defensive modality. Essentially absent during 'normal' gene expression, long stretches of dsRNA within a cell serve as a dramatic warning that a segment of information may be replicating as RNA. In addition to exemplifying many of the mechanistic issues in genome defense, the cellular response to dsRNA provides several examples of the logic by which organisms attempt to focus their limited immunity resources on the most immediate and dangerous targets.

Abstract

We have observed a gamete-of-origin imprinting effect in C. elegans using a set of GFP reporter transgenes. From a single progenitor line carrying an extrachromosomal unc-54::gfp transgene array, we generated three independent autosomal integrations of the unc-54::gfp transgene. The progenitor line, two of its three integrated derivatives, and a nonrelated unc-119:gfp transgene exhibit an imprinting effect: single-generation transmission of these transgenes through the male germline results in approximately 1.5- to 2.0-fold greater expression than transmission through the female germline. There is a detectable resetting of the imprint after passage through the opposite germline for a single generation, indicating that the imprinted status of the transgenes is reversible. In cases where the transgene is maintained in either the oocyte lineage or sperm lineage for multiple, consecutive generations, a full reset requires passage through the opposite germline for several generations. Taken together, our results indicate that C. elegans has the ability to imprint chromosomes and that differences in the cell and/or molecular biology of oogenesis and spermatogenesis are manifest in an imprint that can persist in both somatic and germline gene expression for multiple generations.

Abstract

During development, progression through the cell cycle must be coordinately regulated with cellular differentiation. Despite significant progress in identifying genes required independently for each of these processes, the molecules which facilitate this cross talk have for the most part been elusive. Using the six macrophage-like coelomocytes of the nematode Caenorhabditis elegans as a model system to gain insight into the mesodermal differentiation pathway, we have isolated a set of mutants that alter coelomocyte numbers. One of these mutations, cc600, apparently results from a partial loss-of-function in the C. elegans cyclin D gene, cyd-1. The mutant has coelomocyte-specific defects without changes in other lineages. The mutants show that cell growth, terminal differentiation and cellular function proceed in the absence of cyd-1 activity and cell division. The results suggest that certain mesodermal lineages may be uniquely affected by changes in cyd-1 activity.

Abstract

Mutations in the unc-39 gene of C. elegans lead to migration and differentiation defects in a subset of mesodermal and ectodermal cells, including muscles and neurons. Defects include mesodermal specification and differentiation as well a neuronal migration and axon pathfinding defects. Molecular analysis revealed that unc-39 corresponds to the previously named gene ceh-35 and that the UNC-39 protein belongs to the Six4/5 family of homeodomain transcription factors and is similar to human Six5, a protein implicated in the pathogenesis of type I myotonic dystrophy (DM1). We show that human Six5 and UNC-39 are functional homologs, suggesting that further characterization of the C. elegans unc-39 gene might provide insight into the etiology of DM1.

Abstract

Introduction of double-stranded RNA (dsRNA) can elicit a gene-specific RNA interference response in a variety of organisms and cell types. In many cases, this response has a systemic character in that silencing of gene expression is observed in cells distal from the site of dsRNA delivery. The molecular mechanisms underlying the mobile nature of RNA silencing are unknown. For example, although cellular entry of dsRNA is possible, cellular exit of dsRNA from normal animal cells has not been directly observed. We provide evidence that transgenic strains of Caenorhabditis elegans transcribing dsRNA from a tissue-specific promoter do not exhibit comprehensive systemic RNA interference phenotypes. In these same animals, modifications of environmental conditions can result in more robust systemic RNA silencing. Additionally, we find that genetic mutations can influence the systemic character of RNA silencing in C. elegans and can separate mechanisms underlying systemic RNA silencing into tissue-specific components. These data suggest that trafficking of RNA silencing signals in C. elegans is regulated by specific physiological and genetic factors.

Abstract

RNA interference (RNAi) is a broadly used reverse genetics method in C. elegans. Unfortunately, RNAi does not inhibit all genes. We show that loss of function of a putative RNA-directed RNA polymerase (RdRP) of C. elegans, RRF-3, results in a substantial enhancement of sensitivity to RNAi in diverse tissues. This is particularly striking in the nervous system; neurons that are generally refractory to RNAi in a wild-type genetic background can respond effectively to interference in an rrf-3 mutant background. These data provide the first indication of physiological negative modulation of the RNAi response and implicate an RdRP-related factor in this effect. The rrf-3 strain can be useful to study genes that, in wild-type, do not show a phenotype after RNAi, and it is probably the strain of choice for genome-wide RNAi screens.

Abstract

RNA interference (RNAi) is a mechanism that appears to control unwanted gene expression in a wide range of species. In Drosophila, RNAi is most effectively induced by double-stranded RNAs (dsRNAs) of over approximately 80 nucleotides (nt) and in mammalian cells an RNAi-like inhibition of gene expression has been shown to be mediated by dsRNAs of approximately 21-23 nt. To test if RNAi can be used to specifically down-regulate a human disease-related transcript we have used Drosophila and human tissue culture models of the dominant genetic disorder spinobulbar muscular atrophy (SBMA). A variety of different dsRNAs were assessed for the ability to inhibit expression of transcripts that included a truncated human androgen receptor (ar) gene containing different CAG repeat lengths (16-112 repeats). In Drosophila cells, dsRNAs corresponding to non-repetitive sequences mediated a high degree of sequence-specific inhibition, whereas RNA duplexes containing CAG repeat tracts only induced gene-specific inhibition when flanking ar sequences were included; dsRNAs containing various lengths of CAG repeats plus ar sequences were unable to induce allele-specific interference. In mammalian cells we tested sequence-specific small dsRNAs of 22 nt; these rescued the toxicity and caspase-3 activation induced by plasmids expressing a transcript encoding an expanded polyglutamine tract. This study demonstrates the feasibility of targeting a transcript associated with an important group of genetic diseases by RNAi.

Abstract

We have isolated mutations in a gene mls-1 that is required for proper specification of nonstriated muscle fates in Caenorhabditis elegans. Loss of MLS-1 activity causes uterine muscle precursors to forego their normal fates, instead differentiating as vulval muscles. We have cloned mls-1 and shown that the product is a member of the T-box family of transcriptional regulators. MLS-1 acts as a cell fate determinant in that ectopic expression can transform other cell types to uterine muscle precursors. Uterine muscle patterning is executed by regulation of MLS-1 at several different levels. The mls-1 promoter is activated by the C. elegans orthologs of Twist and Daughterless, but is only active in a subset of the lineage where these two transcription factors are present. mls-1 activity also appears to be regulated by posttranscriptional processes, as expression occurs in both uterine and vulval muscle precursors.

Abstract

We have investigated the role of trigger RNA amplification during RNA interference (RNAi) in Caenorhabditis elegans. Analysis of small interfering RNAs (siRNAs) produced during RNAi in C. elegans revealed a substantial fraction that cannot derive directly from input dsRNA. Instead, a population of siRNAs (termed secondary siRNAs) appeared to derive from the action of a cellular RNA-directed RNA polymerase (RdRP) on mRNAs that are being targeted by the RNAi mechanism. The distribution of secondary siRNAs exhibited a distinct polarity (5'-->3' on the antisense strand), suggesting a cyclic amplification process in which RdRP is primed by existing siRNAs. This amplification mechanism substantially augments the potency of RNAi-based surveillance, while ensuring that the RNAi machinery will focus on expressed mRNAs.

Abstract

RNA interference (RNAi) is a cellular defense mechanism that uses double-stranded RNA (dsRNA) as a sequence-specific trigger to guide the degradation of homologous single-stranded RNAs. RNAi is a multistep process involving several proteins and at least one type of RNA intermediate, a population of small 21-25 nt RNAs (called siRNAs) that are initially derived from cleavage of the dsRNA trigger. Genetic screens in Caenorhabditis elegans have identified numerous mutations that cause partial or complete loss of RNAi. In this work, we analyzed cleavage of injected dsRNA to produce the initial siRNA population in animals mutant for rde-1 and rde-4, two genes that are essential for RNAi but that are not required for organismal viability or fertility. Our results suggest distinct roles for RDE-1 and RDE-4 in the interference process. Although null mutants lacking rde-1 show no phenotypic response to dsRNA, the amount of siRNAs generated from an injected dsRNA trigger was comparable to that of wild-type. By contrast, mutations in rde-4 substantially reduced the population of siRNAs derived from an injected dsRNA trigger. Injection of chemically synthesized 24- or 25-nt siRNAs could circumvent RNAi resistance in rde-4 mutants, whereas no bypass was observed in rde-1 mutants. These results support a model in which RDE-4 is involved before or during production of siRNAs, whereas RDE-1 acts after the siRNAs have been formed.

Specific inhibition of gene expression by small double-stranded RNAs in invertebrate and vertebrate systemsPROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICACaplen, N. J., Parrish, S., Imani, F., Fire, A., Morgan, R. A.2001; 98 (17): 9742-9747

Abstract

Short interfering RNAs (siRNAs) are double-stranded RNAs of approximately 21-25 nucleotides that have been shown to function as key intermediaries in triggering sequence-specific RNA degradation during posttranscriptional gene silencing in plants and RNA interference in invertebrates. siRNAs have a characteristic structure, with 5'-phosphate/3'-hydroxyl ends and a 2-base 3' overhang on each strand of the duplex. In this study, we present data that synthetic siRNAs can induce gene-specific inhibition of expression in Caenorhabditis elegans and in cell lines from humans and mice. In each case, the interference by siRNAs was superior to the inhibition of gene expression mediated by single-stranded antisense oligonucleotides. The siRNAs seem to avoid the well documented nonspecific effects triggered by longer double-stranded RNAs in mammalian cells. These observations may open a path toward the use of siRNAs as a reverse genetic and therapeutic tool in mammalian cells.

Abstract

RNAi is a gene-silencing phenomenon triggered by double-stranded (ds) RNA and involves the generation of 21 to 26 nt RNA segments that guide mRNA destruction. In Caenorhabditis elegans, lin-4 and let-7 encode small temporal RNAs (stRNAs) of 22 nt that regulate stage-specific development. Here we show that inactivation of genes related to RNAi pathway genes, a homolog of Drosophila Dicer (dcr-1), and two homologs of rde-1 (alg-1 and alg-2), cause heterochronic phenotypes similar to lin-4 and let-7 mutations. Further we show that dcr-1, alg-1, and alg-2 are necessary for the maturation and activity of the lin-4 and let-7 stRNAs. Our findings suggest that a common processing machinery generates guide RNAs that mediate both RNAi and endogenous gene regulation.

Abstract

Genetic interference mediated by double-stranded RNA (RNAi) has been a valuable tool in the analysis of gene function in Caenorhabditis elegans. Here we report an efficient induction of RNAi using bacteria to deliver double-stranded RNA. This method makes use of bacteria that are deficient in RNaseIII, an enzyme that normally degrades a majority of dsRNAs in the bacterial cell. Bacteria deficient for RNaseIII were engineered to produce high quantities of specific dsRNA segments. When fed to C. elegans, such engineered bacteria were found to produce populations of RNAi-affected animals with phenotypes that were comparable in expressivity to the corresponding loss-of-function mutants. We found the method to be most effective in inducing RNAi for non-neuronal tissue of late larval and adult hermaphrodites, with decreased effectiveness in the nervous system, in early larval stages, and in males. Bacteria-induced RNAi phenotypes could be maintained over the course of several generations with continuous feeding, allowing for convenient assessments of the biological consequences of specific genetic interference and of continuous exposure to dsRNAs.

Abstract

Members of the Hox family of homeoproteins and their cofactors play a central role in pattern formation of all germ layers. During postembryonic development of C. elegans, non-gonadal mesoderm arises from a single mesoblast cell M. Starting in the first larval stage, M divides to produce 14 striated muscles, 16 non-striated muscles, and two non-muscle cells (coelomocytes). We investigated the role of the C. elegans Hox cluster and of the exd ortholog ceh-20 in patterning of the postembryonic mesoderm. By examining the M lineage and its differentiation products in different Hox mutant combinations, we found an essential but overlapping role for two of the Hox cluster genes, lin-39 and mab-5, in diversification of the postembryonic mesoderm. This role of the two Hox gene products required the CEH-20 cofactor. One target of these two Hox genes is the C. elegans twist ortholog hlh-8. Using both in vitro and in vivo assays, we demonstrated that twist is a direct target of Hox activation. We present evidence from mutant phenotypes that twist is not the only target for Hox genes in the M lineage: in particular we show that lin-39 mab-5 double mutants exhibit a more severe M lineage defect than the hlh-8 null mutant.

Abstract

Caenorhabditis elegans has a single lamin gene, designated lmn-1 (previously termed CeLam-1). Antibodies raised against the lmn-1 product (Ce-lamin) detected a 64-kDa nuclear envelope protein. Ce-lamin was detected in the nuclear periphery of all cells except sperm and was found in the nuclear interior in embryonic cells and in a fraction of adult cells. Reductions in the amount of Ce-lamin protein produce embryonic lethality. Although the majority of affected embryos survive to produce several hundred nuclei, defects can be detected as early as the first nuclear divisions. Abnormalities include rapid changes in nuclear morphology during interphase, loss of chromosomes, unequal separation of chromosomes into daughter nuclei, abnormal condensation of chromatin, an increase in DNA content, and abnormal distribution of nuclear pore complexes (NPCs). Under conditions of incomplete RNA interference, a fraction of embryos escaped embryonic arrest and continue to develop through larval life. These animals exhibit additional phenotypes including sterility and defective segregation of chromosomes in germ cells. Our observations show that lmn-1 is an essential gene in C. elegans, and that the nuclear lamins are involved in chromatin organization, cell cycle progression, chromosome segregation, and correct spacing of NPCs.

Abstract

In RNA-mediated interference (RNAi), externally provided mixtures of sense and antisense RNA trigger concerted degradation of homologous cellular RNAs. We show that RNAi requires duplex formation between the two trigger strands, that the duplex must include a region of identity between trigger and target RNAs, and that duplexes as short as 26 bp can trigger RNAi. Consistent with in vitro observations, a fraction of input dsRNA is converted in vivo to short segments of approximately 25 nt. Interference assays with modified dsRNAs indicate precise chemical requirements for both bases and backbone of the RNA trigger. Strikingly, certain modifications are well tolerated on the sense, but not the antisense, strand, indicating that the two trigger strands have distinct roles in the interference process.

Abstract

MEF2 is an evolutionarily conserved MADS (MCM1, Agamous, Deficiens, and serum response factor) box-type transcription factor that plays a critical role in vertebrate and Drosophila melanogaster myogenesis. We have addressed the developmental role of the single MEF2-like factor, CeMEF2, in Caenorhabditis elegans. Using expression assays and two mef-2 deletion alleles, we show that CeMEF2 is not required for proper myogenesis or development. Moreover, a putative null mef-2 allele fails to enhance or suppress the phenotypes of mutants in CeMyoD or CeTwist. Our results suggest that despite its evolutionary conservation of sequence and DNA binding properties, CeMEF2 has adopted a divergent role in development in the nematode compared with Drosophila and vertebrates.

Abstract

RNA interference (RNAi) is a form of post-transcriptional gene silencing that has been described in a number of plant, nematode, protozoan, and invertebrate species. RNAi is characterized by a number of features: induction by double stranded RNA (dsRNA), a high degree of specificity, remarkable potency and spread across cell boundaries, and a sustained down-regulation of the target gene. Previous studies of RNAi have examined this effect in whole organisms or in extracts thereof; we have now examined the induction of RNAi in tissue culture. A screen of mammalian cells from three different species showed no evidence for the specific down-regulation of gene expression by dsRNA. By contrast, RNAi was observed in Drosophila Schneider 2 (S2) cells. Green fluorescent protein (GFP) expression in S2 cells was inhibited in a dose-dependent manner by transfection of dsRNA corresponding to gfp when GFP was expressed either transiently or stably. This effect was structure- and sequence-specific in that: (1) little or no effect was seen when antisense (or sense) RNA was transfected; (2) an unrelated dsRNA did not reduce GFP expression; and (3) dsRNA corresponding to gfp had no effect on the expression of an unrelated target transgene. This invertebrate tissue culture model should allow facile assays for loss of function in a well-defined cellular system and facilitate further understanding of the mechanism of RNAi and the genes involved in this process.

Abstract

Lysosome associated membrane proteins (LAMPs) constitute a family of vertebrate proteins located predominantly in lysosomes, with lesser amounts present in endosomes and at the cell surface. Macrosialin/CD68s are similar to LAMPs in their subcellular distribution and amino acid sequence and presumed structure across the carboxyl terminal two thirds of their length. The functions of LAMPs and CD68s are not known. In the present study, a bioinformatics approach was used to identify a Caenorhabditis elegans protein (LMP-1) with sequence and presumed structural similarity to LAMPs and CD68s. LMP-1 appears to be the only membrane protein in C. elegans that carries a GYXX(phi) vertebrate lysosomal targeting sequence at its C terminus (where (phi) is a large, hydrophobic residue). LMP-1 was found to be present from early embryonic stages through adulthood and to be predominantly localized at the periphery of a population of large, membrane-bound organelles, called granules, that are seen throughout the early embryo but in later stages are restricted to the cells of the intestine. Analysis of an LMP-1 deficient C. elegans mutant revealed that LMP-1 is not required for viability under laboratory conditions, but the absence of LMP-1 leads to an alteration in intestinal granule populations, with apparent loss of one type of granule.

Abstract

The basic helix-loop-helix (bHLH) transcription factor Twist plays a role in mesodermal development in both invertebrates and vertebrates. In an effort to understand the role of the unique Caenorhabditis elegans Twist homolog, hlh-8, we analyzed mesodermal development in animals with a deletion in the hlh-8 locus. This deletion was predicted to represent a null allele because the HLH domain is missing and the reading frame for the protein is disrupted. Animals lacking CeTwist function were constipated and egg-laying defective. Both of these defects were rescued in transgenic mutant animals expressing wild-type hlh-8. Observing a series of mesoderm-specific markers allowed us to characterize the loss of hlh-8 function more thoroughly. Our results demonstrate that CeTwist performs an essential role in the proper development of a subset of mesodermal tissues in C. elegans. We found that CeTwist was required for the formation of three out of the four non-striated enteric muscles born in the embryo. In contrast, CeTwist was not required for the formation of the embryonically derived striated muscles. Most of the post-embryonic mesoderm develops from a single lineage. CeTwist was necessary for appropriate patterning in this lineage and was required for expression of two downstream target genes, but was not required for the expression of myosin, a marker of differentiation. Our results suggest that mesodermal patterning by Twist is an evolutionarily conserved function.

Abstract

Mechanisms for repetition of DNA pose both opportunities and challenges to a functional genome: opportunities for increasing gene expression by amplification of useful sequences, and challenges of controlling amplification by unwanted sequences such as transposons and viruses. Experiments in numerous organisms have suggested the likely existence of a general mechanism for recognition of repeated character in DNA. This review focuses (a) on the nature of these recognition mechanisms, and (b) on types of chromatin modification and gene silencing that are used to control repeated DNA.

Abstract

Context-dependent gene silencing is used by many organisms to stably modulate gene activity for large chromosomal regions. We have used tandem array transgenes as a model substrate in a screen for Caenorhabditis elegans mutants that affect context-dependent gene silencing in somatic tissues. This screen yielded multiple alleles of a previously uncharacterized gene, designated tam-1 (for tandem-array-modifier). Loss-of-function mutations in tam-1 led to a dramatic reduction in the activity of numerous highly repeated transgenes. These effects were apparently context dependent, as nonrepetitive transgenes retained activity in a tam-1 mutant background. In addition to the dramatic alterations in transgene activity, tam-1 mutants showed modest alterations in expression of a subset of endogenous cellular genes. These effects include genetic interactions that place tam-1 into a group called the class B synMuv genes (for a Synthetic Multivulva phenotype); this family plays a negative role in the regulation of RAS pathway activity in C. elegans. Loss-of-function mutants in other members of the class-B synMuv family, including lin-35, which encodes a protein similar to the tumor suppressor Rb, exhibit a hypersilencing in somatic transgenes similar to that of tam-1 mutants. Molecular analysis reveals that tam-1 encodes a broadly expressed nuclear protein with RING finger and B-box motifs.

Abstract

Double-stranded (ds) RNA can induce sequence-specific inhibition of gene function in several organisms. However, both the mechanism and the physiological role of the interference process remain mysterious. In order to study the interference process, we have selected C. elegans mutants resistant to dsRNA-mediated interference (RNAi). Two loci, rde-1 and rde-4, are defined by mutants strongly resistant to RNAi but with no obvious defects in growth or development. We show that rde-1 is a member of the piwi/sting/argonaute/zwille/eIF2C gene family conserved from plants to vertebrates. Interestingly, several, but not all, RNAi-deficient strains exhibit mobilization of the endogenous transposons. We discuss implications for the mechanism of RNAi and the possibility that one natural function of RNAi is transposon silencing.

Abstract

Double-stranded RNA (dsRNA) has recently been shown to trigger sequence-specific gene silencing in a wide variety of organisms, including nematodes, plants, trypanosomes, fruit flies and planaria; meanwhile an as yet uncharacterized RNA trigger has been shown to induce DNA methylation in several different plant systems. In addition to providing a surprisingly effective set of tools to interfere selectively with gene function, these observations are spurring new inquiries to understand RNA-triggered genetic-control mechanisms and their biological roles.

Abstract

We describe the use of modified versions of the Aequora victoria green fluorescent protein (GFP) to simultaneously follow the expression and distribution of two different proteins in the nematode, Caenorhabditis elegans. A cyan-colored GFP derivative, designated CFP, contains amino acid (aa) substitutions Y66W, N146I, M153T and V163A relative to the original GFP sequence and is similar to the previously reported "W7" form. A yellow-shifted GFP derivative, designated YFP, contains aa substitutions S65G, V68A, S72A and T203Y and is similar to the previously described "I0C" variant. Coding regions for CFP and YFP were constructed in the context of a high-activity C. elegans expression system. Previously characterized promoters and localization signals have been used to express CFP and YFP in C. elegans. Filter sets designed to distinguish YFP and CFP fluorescence spectra allowed visualization of the two distinct forms of GFP in neurons and in muscle cells. A series of expression vectors carrying CFP and YFP have been constructed and are being made available to the scientific community.

Abstract

The formation of striated muscle in both vertebrates and invertebrates involves the activity of the MyoD family of basic-helix-loop-helix (bHLH) transcription factors. The high degree of evolutionary conservation of MyoD-related proteins, both in the sequence of their bHLH domains and in their general developmental expression patterns, suggests that these factors are also conserved at the level of function. We have addressed this directly using MyoD and E protein factors from vertebrates, Drosophila, and Caenorhabditis elegans. Various MyoD and E factor combinations were tested for their ability to interact in vitro and to function in vivo in the myogenic conversion of 10T12 mouse fibroblasts. We found that the ability of different homo- and heterodimers to bind DNA in vitro was an accurate measure of biological activity in vivo. A second assessment of conserved function comes from the ability of these factors to rescue a C. elegans hlh-1 (CeMyoD) null mutation. We found that both Drosophila and chicken MyoD-related factors were able to rescue a C. elegans CeMyoD loss-of-function mutation. These results demonstrate a remarkable degree of functional conservation of these myogenic factors despite differences in E-protein interactions.

RNA as a target of double-stranded RNA-mediated genetic interference in Caenorhabditis elegansPROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICAMontgomery, M. K., Xu, S. Q., Fire, A.1998; 95 (26): 15502-15507

Abstract

Introduction of exogenous double-stranded RNA (dsRNA) into Caenorhabditis elegans has been shown to specifically and potently disrupt the activity of genes containing homologous sequences. In this study we present evidence that the primary interference effects of dsRNA are post-transcriptional. First, we examined the primary DNA sequence after dsRNA-mediated interference and found no evidence for alterations. Second, we found that dsRNA-mediated interference with the upstream gene in a polar operon had no effect on the activity of the downstream gene; this finding argues against an effect on initiation or elongation of transcription. Third, we observed by in situ hybridization that dsRNA-mediated interference produced a substantial, although not complete, reduction in accumulation of nascent transcripts in the nucleus, while cytoplasmic accumulation of transcripts was virtually eliminated. These results indicate that the endogenous mRNA is the target for interference and suggest a mechanism that degrades the targeted RNA before translation can occur. This mechanism is not dependent on the SMG system, an mRNA surveillance system in C. elegans responsible for targeting and destroying aberrant messages. We suggest a model of how dsRNA might function in a catalytic mechanism to target homologous mRNAs for degradation.

Abstract

Mesodermal development is a multistep process in which cells become increasingly specialized to form specific tissue types. In Drosophila and mammals, proper segregation and patterning of the mesoderm involves the bHLH factor Twist. We investigated the activity of a Twist-related factor, CeTwist, during Caenorhabditis elegans mesoderm development. Embryonic mesoderm in C. elegans derives from a number of distinct founder cells that are specified during the early lineages; in contrast, a single blast cell (M) is responsible for all nongonadal mesoderm formation during postembryonic development. Using immunofluorescence and reporter fusions, we determined the activity pattern of the gene encoding CeTwist. No activity was observed during specification of mesodermal lineages in the early embryo; instead, the gene was active within the M lineage and in a number of mesodermal cells with nonstriated muscle fates. A role for CeTwist in postembryonic mesodermal cell fate specification was indicated by ectopic expression and genetic interference assays. These experiments showed that CeTwist was responsible for activating two target genes normally expressed in specific subsets of nonstriated muscles derived from the M lineage. In vitro and in vivo assays suggested that CeTwist cooperates with the C. elegans E/Daughterless homolog in directly activating these targets. The two target genes that we have studied, ceh-24 and egl-15, encode an NK-2 class homeodomain and an FGF receptor (FGFR) homolog, respectively. Twist activates FGFR and NK-homeodomain target genes during mesodermal patterning of Drosophila and similar target interactions have been proposed to modulate mesenchymal growth during closure of the vertebrate skull. These results suggest the possibility that a conserved pathway may be used for diverse functions in mesodermal specification.

Abstract

The germline of the nematode Caenorhabditis elegans exhibits a remarkable ability to specifically silence transgenic DNA. We have shown that this silencing mechanism is disrupted in animals mutant for the maternal effect sterile genes mes-2, mes-3, mes-4 and mes-6. The proteins encoded by mes-2 and mes-6 have been shown to be related to the Polycomb Group of transcriptional repressors (Holdeman, R., Nehrt, S. and Strome, S. (1998). Development 125, 2457-2467; Korf, I., Fan, F. and Strome, S. (1998). Development 125, 2469-2478). These results suggest that a genetic silencing process is essential for sustained germline function, and that this silencing is mediated, at least in part, by Polycomb Group proteins.

Abstract

Basic-helix-loop helix factors of the myoD/myf5/ myogenin/MRF4 family have been implicated in acquisition and elaboration of muscle cell fates. Here we describe both myogenic and non-myogenic roles for the Caenorhabditis elegans member of this family (CeMyoD) in postembryonic mesodermal patterning. The postembryonic mesodermal lineage in C. elegans provides a paradigm for many of the issues in mesodermal fate specification: a single mesoblast ('M') divides to generate 14 striated muscles, 16 non-striated muscles, and two non-muscle cells. To study CeMyoD function in the M lineage, we needed to circumvent an embryonic requirement for the protein. Two approaches were used: (1) isolation of mutants that decrease CeMyoD levels while retaining viability, and (2) analysis of genetic mosaics that had lost CeMyoD in the M lineage. With either manipulation, we observed a series of cell-fate transformations affecting a subset of both striated muscles and non-muscle cells. In place of these normal fates, the affected lineages produced a number of myoblast-like cells that initially failed to differentiate, instead swelling to acquire a resemblance to sex myoblasts (M-lineage-derived precursors to non-striated uterine and vulval muscles). Like normal sex myoblasts, the ectopic myoblast-like cells were capable of migration and proliferation followed by differentiation of progeny cells into vulval and uterine muscle. Our results demonstrate a cell-intrinsic contribution of CeMyoD to specification of both non-muscle and muscle fates.

Abstract

Experimental introduction of RNA into cells can be used in certain biological systems to interfere with the function of an endogenous gene. Such effects have been proposed to result from a simple antisense mechanism that depends on hybridization between the injected RNA and endogenous messenger RNA transcripts. RNA interference has been used in the nematode Caenorhabditis elegans to manipulate gene expression. Here we investigate the requirements for structure and delivery of the interfering RNA. To our surprise, we found that double-stranded RNA was substantially more effective at producing interference than was either strand individually. After injection into adult animals, purified single strands had at most a modest effect, whereas double-stranded mixtures caused potent and specific interference. The effects of this interference were evident in both the injected animals and their progeny. Only a few molecules of injected double-stranded RNA were required per affected cell, arguing against stochiometric interference with endogenous mRNA and suggesting that there could be a catalytic or amplification component in the interference process.

Abstract

We have identified a new Caenorhabditis elegans NK-2 class homeobox gene, designated ceh-24. Distinct cis-acting elements generate a complex neuronal and mesodermal expression pattern. A promoter-proximal enhancer mediates expression in a single pharyngeal muscle, the donut-shaped m8 cell at the posterior end of the pharynx. A second mesodermal enhancer is active in a set of eight nonstriated vulval muscles used in egg laying. Activation in the egg laying muscles requires an 'NdE-box' consensus motif (CATATG) which is related to, but distinct from, the standard E-box motif bound by the MyoD family of transcriptional activators. Ectodermal expression of ceh-24 is limited to a subset of sublateral motor neurons in the head of the animal; this activity requires a cis-acting activator element that is distinct from the control elements for pharyngeal and vulval muscle expression. Activation of ceh-24 in each of the three cell types coincides with the onset of differentiation. Using a set of transposon-induced null mutations, we show that ceh-24 is not essential for the formation of any of these cells. Although ceh-24 mutants have no evident defects under laboratory conditions, the pattern of ceh-24 activity is apparently important for Rhabditid nematodes: the related species C. briggsae contains a close homologue of C. elegans ceh-24 including a highly conserved and functionally equivalent set of cis-acting control signals.

Abstract

The elimination of identified cells is a powerful tool for investigating development and system function. Here we report on genetically mediated cell disruption effected by the toxic Caenorhabditis elegans mec-4(d) allele. We found that ectopic expression of mec-4(d) in the nematode causes dysfunction of a wide range of nerve, muscle, and hypodermal cells. mec-4(d)-mediated toxicity is dependent on the activity of a second gene, mec-6, rendering cell disruption conditionally dependent on genetic background. We describe a set of mec-4(d) vectors that facilitate construction of cell-specific disruption reagents and note that genetic cell disruption can be used for functional analyses of specific neurons or neuronal classes, for confirmation of neuronal circuitry, for generation of nematode populations lacking defined classes of functional cells, and for genetic screens. We suggest that mec-4(d) and/or related genes may be effective general tools for cell inactivation that could be used toward similar purposes in higher organisms.

Abstract

Pharyngeal muscle development in the nematode Caenorhabditis elegans appears to share similarities with cardiac muscle development in other species. We have previously described CEH-22, an NK-2 class homeodomain transcription factor similar to Drosophila tinman and vertebrate Nkx2-5, which is expressed exclusively in the pharyngeal muscles. In vitro, CEH-22 binds the enhancer from myo-2, a pharyngeal muscle-specific myosin heavy chain gene. In this paper, we examine the role CEH-22 plays in pharyngeal muscle development and gene activation by (a) ectopically expressing ceh-22 in transgenic C. elegans and (b) examining the phenotype of a ceh-22 loss-of-function mutant. These experiments indicate that CEH-22 is an activator of myo-2 expression and that it is required for normal pharyngeal muscle development. However, ceh-22 is necessary for neither formation of the pharyngeal muscles, nor for myo-2 expression. Our data suggest parallel and potentially compensating pathways contribute to pharyngeal muscle differentiation. We also examine the relationship between ceh-22 and the pharyngeal organ-specific differentiation gene pha-1. Mutations in ceh-22 and pha-1 have strongly synergistic effects on pharyngeal muscle gene expression; in addition, a pha-1 mutation enhances the lethal phenotype caused by a mutation in ceh-22. Wild-type pha-1 is not required for the onset of ceh-22 expression but it appears necessary for maintained expression of ceh-22.

Abstract

We show that three of the eleven genes of the nematode Caenorhabditis elegans that mediate resistance to the nematocide levamisole and to other cholinergic agonists encode nicotinic acetylcholine receptor (nAChR) subunits. unc-38 encodes an alpha subunit while lev-1 and unc-29 encode non-alpha subunits. The nematode nAChR subunits show conservation of many mammalian nAChR sequence features, implying an ancient evolutionary origin of nAChR proteins. Expression in Xenopus oocytes of combinations of these subunits that include the unc-38 alpha subunit results in levamisole-induced currents that are suppressed by the nAChR antagonists mecamylamine, neosurugatoxin, and d-tubocurarine but not alpha-bungarotoxin. The mutant phenotypes reveal that unc-38 and unc-29 subunits are necessary for nAChR function, whereas the lev-1 subunit is not. An UNC-29-GFP fusion shows that UNC-29 is expressed in body and head muscles. Two dominant mutations of lev-1 result in a single amino acid substitution or addition in or near transmembrane domain 2, a region important to ion channel conductance and desensitization. The identification of viable nAChR mutants in C. elegans provides an advantageous system in which receptor expression and synaptic targeting can be manipulated and studied in vivo.

Abstract

The E proteins of mammals, and the related Daughterless (DA) protein of Drosophila, are ubiquitously expressed helix-loop-helix (HLH) transcription factors that play a role in many developmental processes. We report here the characterization of a related C. elegans protein, CeE/DA, which has a dynamic and restricted distribution during development. CeE/DA is present embryonically in neuronal precursors, some of which are marked by promoter activity of a newly described Achaete-scute-like gene hlh-3. In contrast, we have been unable to detect CeE/DA in CeMyoD-positive striated muscle cells. In vitro gel mobility shift analysis detects dimerization of CeE/DA with HLH-3 while efficient interaction of CeE/DA with CeMyoD is not seen. These studies suggest multiple roles for CeE/DA in C. elegans development and provide evidence that both common and alternative strategies have evolved for the use of related HLH proteins in controlling cell fates in different species.

Abstract

In screening for embryonic-lethal mutations in Caenorhabditis elegans, we defined an essential gene (let-858) that encodes a nuclear protein rich in acidic and basic residues. We have named this product nucampholin. Closely homologous sequences in yeast, plants, and mammals demonstrate strong evolutionary conservation in eukaryotes. Nucampholin resides in all nuclei of C. elegans and is essential in early development and in differentiating tissue. Antisense-mediated depletion of LET-858 activity in early embryos causes a lethal phenotype similar to characterized treatments blocking embryonic gene expression. Using transgene-rescue, we demonstrated the additional requirement for let-858 in the larval germline. The broad requirements allowed investigation of soma-germline differences in gene expression. When introduced into standard transgene arrays, let-858 (like many other C. elegans genes) functions well in soma but poorly in germline. We observed incremental silencing of simple let-858 arrays in the first few generations following transformation and hypothesized that silencing might reflect recognition of arrays as repetitive or heterochromatin-like. To give the transgene a more physiological context, we included an excess of random genomic fragments with the injected DNA. The resulting transgenes show robust expression in both germline and soma. Our results suggest the possibility of concerted mechanisms for silencing unwanted germiline expression of repetitive sequences.

Abstract

The distinction between soma and germline was recognized more than a century ago: somatic cells form the body of an organism, whereas germ cells serve to produce future generations. In Caenorhabditis elegans, the separation of some and germline occurs through a series of asymmetrical divisions, in which embryonic germline blastomeres divide unequally to produce one somatic daughter and one germline daughter. Here we show that after each asymmetrical division, embryonically transcribed RNAs are detected in somatic, but not germline, blastomeres. This asymmetry depends on the activity of the germline specific factor, PIE-1. In the absence of PIE-1, embryonically transcribed RNAs are detected in both somatic and germline blastomeres. Furthermore, ectopic expression of PIE-1 in somatic blastomeres can significantly reduce the accumulation of new transcripts in these cells. Taken together, these results suggest that germ-cell fate depends on an inhibitory mechanism that blocks new gene expression in the early embryonic germ lineage.

Abstract

During the 4-cell stage of C. elegans embryogenesis, the P2 blastomere provides a signal that allows two initially equivalent sister blastomeres, called ABa and ABp, to adopt different fates. Preventing P2 signalling in wild-type embryos results in defects in ABp development that are similar to those caused by mutations in the glp-1 and apx-1 genes, which are homologs of the Drosophila genes Notch and Delta, respectively. Previous studies have shown that GLP-1 protein is expressed in 4-cell stage embryos in both ABa and ABp. In this report, we show that APX-1 protein is expressed in the P2 blastomere and that a temperature-sensitive apx-1 mutant has a temperature-sensitive period between the 4-cell and 8-cell stages. We propose that APX-1 is part or all of the P2 signal that induces ABp to adopt a fate different than ABa.

ROLLING REPLICATION OF SHORT DNA CIRCLESPROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICAFire, A., Xu, S. Q.1995; 92 (10): 4641-4645

Abstract

Natural genes and proteins often contain tandemly repeated sequence motifs that dramatically increase physiological specificity and activity. Given the selective value of such repeats, it is likely that several different mechanisms have been responsible for their generation. One mechanism that has been shown to generate relatively long tandem repeats (in the kilobase range) is rolling circle replication. In this communication, we demonstrate that rolling circle synthesis in a simple enzymatic system can produce tandem repeats of monomers as short as 34 bp. In addition to suggesting possible origins for natural tandem repeats, these observations provide a facile means for constructing libraries of repeated motifs for use in "in vitro evolution" experiments designed to select molecules with defined biological or chemical properties.

Abstract

We investigated the cis-acting sequences regulating expression of the Caenorhabditis elegans gene hlh-1, a homolog of the MyoD family of myogenic regulatory factors. The hlh-1 gene is expressed in mature body wall muscle, in clonal muscle precursors, in a set of early embryonic blastomeres (the MS-granddaughters), and in six glial-like cells called GLRs. The entire natural hlh-1 expression pattern is recapitulated in transgenic animals containing an hlh-1::lacZ fusion with 5.1 kb of hlh-1 sequences beginning upstream of the coding region and extending into the second exon. Deletions and rearrangements in this 5.1-kb sequence were assayed for their effects on reporter gene expression in transgenic animals. Deletions removing a segment 434-550 bp upstream of the coding region resulted in a loss of all aspects of the expression pattern, suggesting that at least one common regulatory factor is required for all expression. Deletions of other regulatory elements affected distinct aspects of the expression pattern; hence, each expression subpattern exhibits a different set of sequence requirements. Interspecies sequence comparison and lacZ expression constructs with the related nematode Caenorhabditis briggsae indicated that the C. briggsae and C. elegans genes contain equivalent sets of control signals. The results from this work implicate the hlh-1 promoter as an integration point for diverse temporal and spatial control signals.

Abstract

We describe the dissection of a body muscle-specific enhancer sequence contained within the Caenorhabditis elegans myosin heavy chain gene unc-54. A 90-base pair segment that was sufficient for both enhancer function and tissue specificity was subjected to mutational analysis. Several separated sites within this region were required for activity; mutations in these sites led to dramatic decreases in enhancer activity, while substitutions in the intervening regions had minimal effects on activity. The individual sites appear to function as semi-independent and partially interchangeable enhancer subelements, as seen by our ability to create functional enhancers by constructing novel multimers and combinations. Four different enhancer subelements (designated O, I, II, and III) were identified in this way. Although partially interchangeable, some differences between these subelements were evident. In particular, concatamers of site III exhibited the highest levels of activity but had a broader tissue specificity than the intact enhancer, including both hypodermal and muscle tissue. The specificity of the intact enhancer thus reflects a combinatorial function of the specificities of the constituent subelements.

Abstract

Early embryogenesis in Caenorhabditis elegans is characterized by a series of unequal cleavages that mark the stepwise separation of somatic and germ lineages. We have developed an in situ hybridization protocol to examine the localization of specific maternal and embryonically transcribed messenger RNAs during these early cleavages. We detected three classes of maternal RNAs: RNAs that are maintained in all cells, RNAs that are maintained in germline cells but are lost from somatic cells, and a population of RNAs that are associated with the germline-specific P granules. We observed embryonically transcribed RNAs in somatic cells as early as the 4-cell stage. These transcripts were not detected in germline cells. These observations suggest that mechanisms which distinguish between soma and germline cause asymmetries in mRNA stability and transcription within the first few cleavages of C. elegans embryogenesis.

Abstract

The pharyngeal muscles of Caenorhabditis elegans are single sarcomere muscles used for feeding. Like vertebrate cardiac and smooth muscles, C. elegans pharyngeal muscle does not express any of the known members of the MyoD family of myogenic factors. To identify mechanisms regulating gene expression in this tissue, we have characterized a pharyngeal muscle-specific enhancer from myo-2, a myosin heavy chain gene expressed exclusively in pharyngeal muscle. Assaying enhancer function in transgenic animals, we identified three subelements, designated A, B and C, that contribute to myo-2 enhancer activity. These subelements are individually inactive; however, any combination of two or more subelements forms a functional enhancer. The B and C subelements have distinct cell type specificities. A duplication of B activates transcription in a subset of pharyngeal muscles (m3, m4, m5 and m7). A duplication of C activates transcription in all pharyngeal cells, muscle and non-muscle. Thus, the activity of the myo-2 enhancer is regulated by a combination of pharyngeal muscle-type-specific and organ-specific signals. Screening a cDNA expression library, we identified a gene encoding an NK-2 class homeodomain protein, CEH-22, that specifically binds a site necessary for activity of the B subelement. CEH-22 protein is first expressed prior to myogenic differentiation and is present in the same subset of pharyngeal muscles in which B is active. Expression continues throughout embryonic and larval development. This expression pattern suggests CEH-22 plays a key role in pharyngeal muscle-specific activity of the myo-2 enhancer.

Abstract

The paper describes a digital image archiving system for time-lapse microscopy. The system uses an MS-DOS compatible computer to store video images while simultaneously controlling a stepping motor. In a typical experiment, images might be taken at 30 s intervals in each of 25 consecutive focal planes. A system with 2.5 Gbyte disk capacity can store approximately 18,000 full frame images: 6 h recording at maximum resolution. Once recorded, images series stored on disk can be 'played back' in any order. Generally, images from a single focal plane are displayed consecutively in either forward or reverse time. The focal plane can be shifted during playback, allowing individual cells to be followed as they move between focal planes. To facilitate the annotation and interpretation of the real-time images, a mouse-driven interface allows users to define and follow individual objects (e.g. cells). The recorded image series can be achieved inexpensively using standard digital tape backup hardware. In this laboratory, the system has been particularly useful for tracing embryonic cell lineages and cell migrations. Detailed system specifications, including source code, compiled programs, hardware requirements and users manual are available directly from the author or by anonymous FTP (ciw1.ciwemb.edu).

Abstract

A family of muscle-specific helix-loop-helix transcription factors (myoD, myogenin, myf-5 and MRF4) has been implicated in the control of vertebrate skeletal myogenesis. Searches for homologues of this family in Caenorhabditis elegans identified a single family member, hlh-1, which is expressed in striated muscles and their clonal precursors. We have isolated a null allele of hlh-1 following chemical mutagenesis. Animals homozygous for the null mutation produce contractile body-wall muscles, although muscle contractions are weak and coordination is defective. In addition to the evident muscle defects, mutant animals fail to complete embryonic elongation and die as larvae or young adults. Ultrastructural analysis of the mutant muscle reveals an apparently normal local lattice of thick and thin filaments, with more global defects in sarcomere organization and muscle cell placement. Mosaic studies using the point mutation and an extrachromosomal transgene indicate that the requirement for hlh-1 is fully zygotic, with no maternal hlh-1 requirement for either muscle development or viability.

A SCREEN FOR GENETIC-LOCI REQUIRED FOR BODY-WALL MUSCLE DEVELOPMENT DURING EMBRYOGENESIS IN CAENORHABDITIS-ELEGANSGENETICSAhnn, J., Fire, A.1994; 137 (2): 483-498

Abstract

We have used available chromosomal deficiencies to screen for genetic loci whose zygotic expression is required for formation of body-wall muscle cells during embryogenesis in Caenorhabditis elegans. To test for muscle cell differentiation we have assayed for both contractile function and the expression of muscle-specific structural proteins. Monoclonal antibodies directed against two myosin heavy chain isoforms, the products of the unc-54 and myo-3 genes, were used to detect body-wall muscle differentiation. We have screened 77 deficiencies, covering approximately 72% of the genome. Deficiency homozygotes in most cases stain with antibodies to the body-wall muscle myosins and in many cases muscle contractile function is observed. We have identified two regions showing distinct defects in myosin heavy chain gene expression. Embryos homozygous for deficiencies removing the left tip of chromosome V fail to accumulate the myo-3 and unc-54 products, but express antigens characteristic of hypodermal, pharyngeal and neural development. Embryos lacking a large region on chromosome III accumulate the unc-54 product but not the myo-3 product. We conclude that there exist only a small number of loci whose zygotic expression is uniquely required for adoption of a muscle cell fate.

Abstract

Four Caenorhabditis elegans genes encode muscle-type specific myosin heavy chain isoforms: myo-1 and myo-2 are expressed in the pharyngeal muscles; unc-54 and myo-3 are expressed in body wall muscles. We have used transformation-rescue and lacZ fusion assays to determine sequence requirements for regulated myosin gene expression during development. Multiple tissue-specific activation elements are present for all four genes. For each of the four genes, sequences upstream of the coding region are tissue-specific promoters, as shown by their ability to drive expression of a reporter gene (lacZ) in the appropriate muscle type. Each gene contains at least one additional tissue-specific regulatory element, as defined by the ability to enhance expression of a heterologous promoter in the appropriate muscle type. In rescue experiments with unc-54, two further requirements apparently independent of tissue specificity were found: sequences within the 3' non-coding region are essential for activity while an intron near the 5' end augments expression levels. The general intron stimulation is apparently independent of intron sequence, indicating a mechanistic effect of splicing. To further characterize the myosin gene promoters and to examine the types of enhancer sequences in the genome, we have initiated a screen of C. elegans genomic DNA for fragments capable of enhancing the myo-2 promoter. The properties of enhancers recovered from this screen suggest that the promoter is limited to muscle cells in its ability to respond to enhancers.

Abstract

Two genes (mtl-1 and mtl-2) that encode the novel metallothioneins (MTs) of Caenorhabditis elegans (CeMTs) were cloned and characterized. Both genes contain a single intron that interrupts codon 6 and short 3'-untranslated regions. However, their promotor regions are distinctively non-homologous. The mtl-2 promoter contains a TATAA box and a single putative metal regulatory element. These elements are absent in the mtl-1 promoter. Nevertheless, both CeMT1 and CeMT2 mRNAs are induced by cadmium and contain precisely initiated, 5'-untranslated sequences. The inducibility and cell type specificity of metallothionein gene expression were investigated in transgenic C. elegans that carry the lacZ (beta-galactosidase) reporter gene under the control of an mtl-1 or mtl-2 promoter sequence. Upon treatment of transgenic C. elegans with cadmium or heat stress, the mtl-2:lacZ fusion gene is abundantly and exclusively expressed in the intestinal cells of larvae and adult animals. Expression is not detected in the absence of metal or heat shock. In contrast, an mtl-1:lacZ construct is constitutively expressed in the pharynx and induced by cadmium and heat shock in the intestinal cells of C. elegans larvae. The metal-inducible expression of the mtl-1:lacZ gene is attenuated in adult transgenic nematodes. Thus, the activity of each mtl promoter is modulated by metals as well as developmental and environmental factors.

Abstract

We have characterized two transcripts from the male-determining her-1 locus in Caenorhabditis elegans. The larger transcript, which appears more important for male development, is predicted to encode a novel 175-amino-acid, cysteine-rich polypeptide with an apparent amino-terminal signal sequence and potential cleavage and glycosylation sites. Expression of a full-length cDNA construct for the larger transcript driven by a body-wall-myosin promoter causes extensive masculinization of all sexually dimorphic tissues in XX (normally hermaphrodite) animals. This activity is dependent on the presence of the her-1 signal sequence or a substitute synthetic signal sequence in the encoded polypeptide. These results suggest that a secreted product of the her-1 gene dictates male development.

Abstract

Escherichia coli beta-galactosidase is a commonly used reporter molecule for analyzing gene expression. Recently, beta-galactosidase fusions have been applied to a variety of eukaryotic systems. The techniques for constructing and introducing beta-galactosidase fusion constructs as well as soluble assays for total enzyme function have been described in detail elsewhere. This article describes histochemical techniques for analyzing organisms that contain a functional beta-galactosidase fusion construct. The object is to determine semiquantitatively which cells are expressing the beta-galactosidase fusion protein, as well as the subcellular localization of the protein. Due to its prevalence in the author's laboratory, Caenorhabditis elegans is used as a canonical organism for the detailed methods described.

Abstract

The myoD family of DNA binding proteins has been implicated in the control of myogenesis in a variety of organisms. Searches for homologs in the nematode Caenorhabditis elegans yielded only one gene, designated hlh-1, expressed in body-wall muscle cells and their precursors. To assess the role of hlh-1 in C. elegans myogenesis, genetic deficiencies spanning the hlh-1 locus were isolated after gamma irradiation. Embryos homozygous for these deficiencies exhibited extensive body-wall muscle differentiation, including expression of several characteristic myofilament proteins and weak contracile behavior. Thus, zygotic hlh-1 expression was not required for body-wall muscle precursors to adopt muscle cell fates.

Abstract

The Caenorhabditis elegans protein, CeMyoD, is related to the vertebrate myogenic regulatory factors MyoD, myogenin, MRF-4 and Myf-5. Like its vertebrate counterparts, CeMyoD accumulates in the nucleus of striated muscle cells prior to the onset of terminal differentiation. CeMyoD also shares functional similarities with the vertebrate myogenic regulatory factors. Viral LTR driven expression of CeMyoD in mouse 10T1/2 cells can convert this cell line into myoblasts as well as efficiently trans-activate mouse muscle-specific promoters. Furthermore, mouse MyoD expression can activate a CeMyoD-beta-galactosidase reporter construct in a 10T1/2 co-transfection assay.

Abstract

We have used an antisense strategy to effectively disrupt the expression of two genes encoding myofilament proteins present in C. elegans body wall muscles. DNA segments from the unc-22 and unc-54 genes have been placed in reverse orientation in vectors designed to produce RNA in body wall muscles. When the resulting plasmids are injected into oocytes, progeny with defects in muscle function are produced. These animals have phenotypes consistent with reduction and/or elimination of function of the gene to which antisense RNA has been produced: twitching and disorganization of muscle filaments for the unc-22 antisense constructs and lack of muscle tone, slow movement, and egg laying defects for the unc-54 antisense constructs. A fraction of the affected animals transmit the defective-muscle trait to subsequent generations. In these cases the transforming DNA is present at high copy number and cosegregates with the observed muscle defects. We have examined several of the unc-22 antisense plasmid transformed lines to determine the mechanistic basis for the observed phenotypes. The RNA product of the endogenous unc-22 locus is present at normal levels and this RNA is properly spliced in the region homologous to the antisense RNA. No evidence for modification of this RNA by deamination of adenosine to inosine was found. In affected animals the level of protein product from the endogenous unc-22 locus is greatly reduced. Antisense RNA produced from the transforming DNA was detected and was much more abundant than 'sense' RNA from the endogenous locus. These data suggest that the observed phenotypes result from interference with a late step in gene expression, such as transport into the cytoplasm or translation.

Abstract

We have cloned a gene from the nematode C. elegans that is closely related to the vertebrate MyoD gene family. The nematode gene product, CeMyoD, is a nuclear protein that is expressed specifically in body wall muscle cells. Antibody staining of early embryos shows that CeMyoD accumulates in early blastomeres that will subsequently produce only body wall muscle cells. CeMyoD is not detected in pharyngeal muscle cells or in nonmyogenic lineages. A CeMyoD-beta-galactosidase fusion gene is accurately expressed in myogenic cells that also express CeMyoD. In addition, the beta-galactosidase reporter is expressed as early as the 28 cell stage of embryogenesis in specific blastomeres prior to their clonal commitment to body wall muscle. This early fusion gene activity reveals that part of the specificity for CeMyoD transcription can arise very early in development and that subsequently, negative events may restrict CeMyoD expression in progeny cells not destined to become muscle.

Abstract

We describe a series of plasmid vectors which contain modular features particularly useful for studying gene expression in eukaryotic systems. The vectors contain the Escherichia coli beta-galactosidase (beta Gal)-encoding region (the lacZ gene) flanked by unique polylinker segments on the 5' and 3' ends, and several combinations of a variety of modules: a selectable marker (an amber suppressor tRNA), a translational initiation region, a synthetic intron segment, the early polyadenylation signal from SV40, and 3' regions from two nematode genes. A segment encoding the nuclear localization peptide from the SV40 T antigen is incorporated into many of the constructs, leading to beta Gal accumulation in nuclei, which can facilitate identification of producing cells in complex tissues. To make functional beta Gal fusions to secreted proteins, we constructed plasmids with an alternate module encoding a synthetic transmembrane domain upstream from lacZ. This domain is designed to stop transfer of secreted proteins across the membrane during secretion, allowing the beta Gal domain of the fusion polypeptide to remain in the cytoplasm and thus function in enzymatic assays. We have used the vectors to analyze expression of several genes in the nematode Caenorhabditis elegans, and have demonstrated in these studies that lacZ can be expressed in a wide variety of different tissues and cell types. These vectors should be useful in studying gene expression both in C. elegans and in other experimental systems.

Abstract

Caenorhabditis elegans has four genes which encode skeletal myosin heavy chain isoforms. We have re-introduced clones of two of these genes, myo-3 and unc-54 at low copy number into the germline of C. elegans. The resulting loci behave as functional copies of the genes by two genetic criteria: (i) they can result in phenotypic rescue of strains carrying inactivating myo-3 or unc-54 mutations, and (ii) their presence in strains with wild-type copies of the endogenous myosin loci has genetic consequences similar to duplicating the endogenous loci. The re-introduced genes function at a level close to that of the endogenous loci. Monoclonal antibodies specific for the different isoforms have been used to localize the expressed proteins. The re-introduced genes express in precisely the same cell types as the endogenous genes, and the myosin products produced assemble into filament structures as in wild-type. Unexpectedly, we have found in the course of this work that very high copy numbers of the unc-54 gene lead to a disruption of muscle structure which may result from overexpression of the protein product.

Abstract

The purine nucleoside analog 5,6-dichloro-1-beta-D-ribofuranosylbenzimidazole (DRB) is a selective inhibitor of transcription by RNA polymerase II. Although a wealth of in vivo studies have suggested that DRB inhibits transcription by enhancing the premature termination of elongating polymerase molecules, in vitro studies to date have been interpreted to suggest that DRB acts at the level of transcription initiation. We have analyzed the mechanism of DRB-mediated transcription inhibition in vitro both in HeLa whole cell extracts and in a partially purified transcription system. The results indicate that the extent to which DRB inhibits the synthesis of a RNA transcript is directly proportional to its length. For example, DRB was found to preferentially inhibit transcription in vitro of promoter-distal relative to promoter-proximal portions of the adenovirus major late transcription unit. A factor potentially involved in mediating this inhibitory effect is identified. We conclude that the mechanism of DRB inhibition of transcription in vivo and in vitro are similar.

Abstract

A technique for introducing exogenous DNA into the chromosomes of the nematode Caenorhabditis elegans is presented. A cloned C. elegans amber suppressor tRNA gene, sup-7, is used as a selectable marker. The activity of this amber suppressor is selected for by injecting worms which carry an amber termination mutation in a gene (tra-3) whose function is required for fertility. Transient expression of sup-7 is evidenced by the presence of fertile (rescued) animals in the generation after injection. In a fraction of cases, these fertile animals give rise to stable suppressor lines (eight have been characterized so far). Each of the stable suppressor lines carries injected DNA sequences. The suppressor activities have been mapped to chromosomal loci, indicating that the exogenous DNA has integrated into the genome. This technique has been used to introduce a chimeric gene containing a Drosophila heat shock promoter element fused to coding sequences from the Escherichia coli beta-galactosidase gene. This chimeric gene functions and is heat inducible in the resulting stably transformed lines.

Abstract

Accurate transcription by RNA polymerase II has been shown to require multiple factors in addition to the purified polymerase. In this study, we use a reconstituted transcription system, consisting of purified RNA polymerase II and three essential HeLa cell chromatographic fractions, to study events leading to transcription from the adenovirus major late promoter. A preincubation-pulse-chase protocol resolves the reaction into events occurring before and after nucleotide addition. Preincubation of template with a mixture of RNA polymerase II and factors allows formation of "activated" complexes, which are defined by the ability to rapidly commence accurate transcription when presented nucleotides. Maximal activation requires that polymerase, template, and each of the three HeLa fractions be present during preincubation. The activated complexes are template associated, as shown by their inability to exchange onto a second template added during further preincubation. Similar protocols are used to define functional intermediates leading to the activated complex. A template-associated functional complex is formed during the preincubation of template with just two of the HeLa fractions. Polymerase can associate with this intermediate complex in the absence of the third HeLa fraction. In the accompanying paper, we describe a direct analysis of initiation by "activated" complexes.

Abstract

Mammalian RNA polymerase II was shown to utilize dinucleoside monophosphates for priming of promoter specific RNAs. In a reconstituted system containing purified polymerase and HeLa cell fractions, dinucleotides were incorporated by complementarity with template sequences at the in vivo cap sites of the adenovirus major late and adenovirus early region IV promoters. Incorporation was shown by label transfer experiments and by determining the size of 5'-terminal RNase T1-resistant oligonucleotides. All 16 dinucleotides were tested for priming of RNA chains at the major late promoter. RNA polymerase II initiated with various primers over a contiguous region of 9 bases, centered around the in vivo initiation site. We suggest that the polymerase drifts or oscillates over this region. Using a dinucleotide challenge protocol, the rate of initiation at the major late promoter was measured following preincubation of the template DNA with RNA polymerase II and factors. Initiation with ATP was 90% complete within the 1st min after addition of nucleotide triphosphates. Stimulation of transcription by dinucleotides was not observed, due to this rapid initiation. The 5'-hydroxyl terminus of dinucleotide-primed RNAs remained unmodified. Although transcripts initiated with ATP were rapidly capped in whole cell extracts, ATP-primed RNA synthesized in the reconstituted system retained free 5'-terminal phosphates. Thus, capping was not essential for synthesis of long runoff RNAs.

Abstract

Transcription of a Xenopus laevis tRNATyr gene and splicing of the transcript have been studied in HeLa cell extracts. This tRNATyr gene has a 13-base intervening sequence and is expressed as mature tRNA when transfected into mammalian cells. The tRNATyr gene is transcribed under conditions of low concentrations of magnesium and ATP, but is processed by splicing only when both of these cofactors are added at higher concentrations. The endonucleolytic activity of the tRNA-splicing system in the HeLa extract produces exons with 3'-phosphate and 5'-hydroxyl groups. The 3'-phosphate is retained during the ligation reaction and forms the phosphodiester bond in the mature tRNA. Retention of the 3'-phosphate during tRNA splicing differs from the more extensively studied process in yeast extracts where a phosphate group from an ATP cofactor is used to form the phosphodiester bond joining the exons. Thus, eucaryotic organisms can splice tRNA precursors by at least two distinguishable mechanisms.

Abstract

A whole cell extract of HeLa cells was resolved through two successive chromatographic steps using an extension of the procedure of Matsui et al. (Matsui, T., Segall, J., Weil, P. A., and Roeder, R. G. (1980) J. Biol. Chem. 255, 11992-11996). RNA polymerase II and three of the resulting fractions were necessary and sufficient for accurate transcription of the adenovirus major late promoter. This accurate transcription was quantitated as a function of each of the required fractions, polymerase, and DNA. A linear range of response was observed in each case. Using the linear ranges for assay, it was possible to calculate net purifications and yields for each of the required transcriptional activities after chromatography. These activities were each shown to sediment with a distinct peak on sucrose gradients. The effects of variations in salt concentration, magnesium concentration, temperature, and reaction time were determined. High resolution analysis of runoff transcripts showed that the reconstituted system initiated transcription precisely at the adenovirus major late and early region IV promoters.

Abstract

A series of recombinants of adenovirus DNA fragments and pBR322 was used to test the transcriptional activity of the nine known adenovirus promoters in a cell-free extract. Specific initiation was seen at all five early promoters as well as at the major late promotor and at the intermediate promoter for polypeptide IX. The system failed to recognize the two other adenovirus promoters, which were prominent in vivo only at intermediate and late stages in infection. Microheterogeneity of 5' termini at several adenovirus promoters, previously shown in vivo, was reproduced in the in vitro reaction and indeed appeared to result from heterogeneous initiation rather than 5' processing. To test for the presence of soluble factors involved in regulation of nRNA synthesis, the activity of extracts prepared from early and late stages of infection was compared on an assortment of viral promoter sites. Although mock and early extracts showed identical transcription patterns, extracts prepared from late stages gave 5- to 10-fold relative enhancement of the late and polypeptide IX promoters as compared with early promoters.

Abstract

To study the poliovirus-induced inhibition of host-cell RNA synthesis, we prepared transcription extracts from mock-infected and poliovirus-infected HeLa cells. In contrast with the control extracts, poliovirus-infected cell extracts prepared 3 hr after infection were unable to transcribe specifically DNA templates recognized by RNA polymerase II. Accurate transcription by RNA polymerase III, however, was only slightly reduced. Supplementation of the infected cell extract with a crude preparation of transcription factors (S100) restored its ability to transcribe a polymerase II template specifically; supplementation with purified polymerase II had no effect. When the S100 was fractionated on a phosphocellulose column, the restoration activity eluted between 0.35 M and 1 M KCl. When we tested infected extracts for inhibitory activity by mixing uninfected and infected cell extracts, no in vitro inhibition of polymerase II transcription by the uninfected extract was evident. These results indicate that at least one factor required for specific transcription by polymerase II is deficient in extracts from poliovirus-infected cells.

Abstract

The lytic cycle of adenovirus is a tightly regulated sequence of stages. When this regulation is studied at the level of mRNA production, the most significant step in controlling gene expression is initiation of transcription. Thus in preceding from one stage of expression to another, viral factors seem to turn on transcription of new sets of genes. At the moment, it is thought that viral mRNA synthesis involves initiation of transcription at ten different promoter sites. It is likely that in some manner the frequency of an initiation of transcription at nine of these sites is affected by one or more viral gene products. With the recent development of soluble in vitro transcription systems that respond to exogenously added DNA, it should be possible to begin to study regulation of gene expression at this stage of transcription. At present, these systems yield the paradoxical observation that extracts prepared from uninfected human cells more efficiently recognize the late promoter as compared to the early promoter of adenovirus. As more is learned about regulation of synthesis of viral mRNAs, examples will surely be found where RNA processing and RNA turnover play a critical role in determining the level of mRNAs. Such cases are more likely to appear in the balancing of synthesis of different mRNAs derived from one transcriptional unit. Few experiments have been directed to this possibility and the study of adenovirus molecular biology is only now entering the age of maturity where these experiments are feasible.

DNA-DEPENDENT TRANSCRIPTION OF ADENOVIRUS GENES IN A SOLUBLE WHOLE-CELL EXTRACTPROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCESManley, J. L., Fire, A., Cano, A., Sharp, P. A., Gefter, M. L.1980; 77 (7): 3855-3859

Abstract

We have developed a cell-free system for studying the synthesis of mRNA in mammalian cells. The system consists of a dialyzed and concentrated whole-cell extract derived from HeLa cells, small molecules and cofactors needed for transcription, and exogenously added DNA. Accurate transcription by RNA polymerase II is entirely dependent upon addition of promoter-containing eukaryotic DNA. At optimal DNA and extract concentrations, transcription initiation from the adenovirus serotype 2 late promoter is readily detectable, and specific transcripts over 4000 nucleotides in length are observed. The RNA synthesized in vitro contains the same 5' capped RNase T1 undecanucleotide as does the in vivo transcript. RNA synthesis also initiates accurately at both an early and an intermediate adenovirus promoter site.