Across the animal kingdom, gastrulation represents a key developmental event during which embryonic pluripotent cells diversify into lineage-specific precursors that will generate the adult organism. Here we report the transcriptional profiles of 116,312 single cells from mouse embryos collected at nine sequential time points ranging from 6.5 to 8.5 days post-fertilization. We construct a molecular map of cellular differentiation from pluripotency towards all major embryonic lineages, and explore the complex events involved in the convergence of visceral and primitive streak-derived endoderm. Furthermore, we use single-cell profiling to show that Tal1 chimeric embryos display defects in early mesoderm diversification, and we thus demonstrate how combining temporal and transcriptional information can illuminate gene function. Together, this comprehensive delineation of mammalian cell differentiation trajectories in vivo represents a baseline for understanding the effects of gene mutations during development, as well as a roadmap for the optimization of in vitro differentiation protocols for regenerative medicine.

Alternative splicing is a key regulatory mechanism in eukaryotic cells and increases the effective number of functionally distinct gene products. Using bulk RNA sequencing, splicing variation has been studied across human tissues and in genetically diverse populations. This has identified disease-relevant splicing events, as well as associations between splicing and genomic features, including sequence composition and conservation. However, variability in splicing between single cells from the same tissue or cell type and its determinants remains poorly understood.

The molecular regulation of zygotic genome activation (ZGA) in mammals remains an exciting area of research. Primed mouse embryonic stem cells contain a rare subset of "2C-like" cells that are epigenetically and transcriptionally similar to the two-cell embryo and thus represent an in vitro approximation for studying ZGA transcription regulation. Recently, the transcription factor Dux, expressed in the minor wave of ZGA, was described to activate many downstream ZGA transcripts. However, it remains unknown what upstream maternal factors initiate ZGA in either a Dux-dependent or Dux-independent manner. Here we performed a candidate-based overexpression screen, identifying, among others, developmental pluripotency-associated 2 (Dppa2) and Dppa4 as positive regulators of 2C-like cells and transcription of ZGA genes. In the germline, promoter DNA demethylation coincides with expression of Dppa2 and Dppa4, which remain expressed until embryonic day 7.5 (E7.5), when their promoters are remethylated. Furthermore, Dppa2 and Dppa4 are also expressed during induced pluripotent stem cell (iPSC) reprogramming at the time that 2C-like transcription transiently peaks. Through a combination of overexpression, knockdown, knockout, and rescue experiments together with transcriptional analyses, we show that Dppa2 and Dppa4 directly regulate the 2C-like cell population and associated transcripts, including Dux and the Zscan4 cluster. Importantly, we teased apart the molecular hierarchy in which the 2C-like transcriptional program is initiated and stabilized. Dppa2 and Dppa4 require Dux to initiate 2C-like transcription, suggesting that they act upstream by directly regulating Dux. Supporting this, ChIP-seq (chromatin immunoprecipitation [ChIP] combined with high-throughput sequencing) analysis revealed that Dppa2 and Dppa4 bind to the Dux promoter and gene body and drive its expression. Zscan4c is also able to induce 2C-like cells in wild-type cells but, in contrast to Dux, can no longer do so in Dppa2/4 double-knockout cells, suggesting that it may act to stabilize rather than drive the transcriptional network. Our findings suggest a model in which Dppa2/4 binding to the Dux promoter leads to Dux up-regulation and activation of the 2C-like transcriptional program, which is subsequently reinforced by Zscan4c.

Conventional human embryonic stem cells are considered to be primed pluripotent but can be induced to enter a naive state. However, the transcriptional features associated with naive and primed pluripotency are still not fully understood. Here we used single-cell RNA sequencing to characterize the differences between these conditions. We observed that both naive and primed populations were mostly homogeneous with no clear lineage-related structure and identified an intermediate subpopulation of naive cells with primed-like expression. We found that the naive-primed pluripotency axis is preserved across species, although the timing of the transition to a primed state is species specific. We also identified markers for distinguishing human naive and primed pluripotency as well as strong co-regulatory relationships between lineage markers and epigenetic regulators that were exclusive to naive cells. Our data provide valuable insights into the transcriptional landscape of human pluripotency at a cellular and genome-wide resolution.

The inactive X chromosome (Xi) in female mammals adopts an atypical higher-order chromatin structure, manifested as a global loss of local topologically associated domains (TADs), A/B compartments and formation of two mega-domains. Here we demonstrate that the non-canonical SMC family protein, SmcHD1, which is important for gene silencing on Xi, contributes to this unique chromosome architecture. Specifically, allelic mapping of the transcriptome and epigenome in SmcHD1 mutant cells reveals the appearance of sub-megabase domains defined by gene activation, CpG hypermethylation and depletion of Polycomb-mediated H3K27me3. These domains, which correlate with sites of SmcHD1 enrichment on Xi in wild-type cells, additionally adopt features of active X chromosome higher-order chromosome architecture, including A/B compartments and partial restoration of TAD boundaries. Xi chromosome architecture changes also occurred following SmcHD1 knockout in a somatic cell model, but in this case, independent of Xi gene derepression. We conclude that SmcHD1 is a key factor in defining the unique chromosome architecture of Xi.

Dietary, pharmacological and genetic interventions can extend health- and lifespan in diverse mammalian species. DNA methylation has been implicated in mediating the beneficial effects of these interventions; methylation patterns deteriorate during ageing, and this is prevented by lifespan-extending interventions. However, whether these interventions also actively shape the epigenome, and whether such epigenetic reprogramming contributes to improved health at old age, remains underexplored. We analysed published, whole-genome, BS-seq data sets from mouse liver to explore DNA methylation patterns in aged mice in response to three lifespan-extending interventions: dietary restriction (DR), reduced TOR signaling (rapamycin), and reduced growth (Ames dwarf mice). Dwarf mice show enhanced DNA hypermethylation in the body of key genes in lipid biosynthesis, cell proliferation and somatotropic signaling, which strongly correlates with the pattern of transcriptional repression. Remarkably, DR causes a similar hypermethylation in lipid biosynthesis genes, while rapamycin treatment increases methylation signatures in genes coding for growth factor and growth hormone receptors. Shared changes of DNA methylation were restricted to hypermethylated regions, and they were not merely a consequence of slowed ageing, thus suggesting an active mechanism driving their formation. By comparing the overlap in ageing-independent hypermethylated patterns between all three interventions, we identified four regions, which, independent of genetic background or gender, may serve as novel biomarkers for longevity-extending interventions. In summary, we identified gene body hypermethylation as a novel and partly conserved signature of lifespan-extending interventions in mouse, highlighting epigenetic reprogramming as a possible intervention to improve health at old age.

The mouse embryo is the canonical model for mammalian preimplantation development. Recent advances in single cell profiling allow detailed analysis of embryogenesis in other eutherian species, including human, to distinguish conserved from divergent regulatory programs and signalling pathways in the rodent paradigm. Here, we identify and compare transcriptional features of human, marmoset and mouse embryos by single cell RNA-seq. Zygotic genome activation correlates with the presence of polycomb repressive complexes in all three species, while ribosome biogenesis emerges as a predominant attribute in primate embryos, supporting prolonged translation of maternally deposited RNAs. We find that transposable element expression signatures are species, stage and lineage specific. The pluripotency network in the primate epiblast lacks certain regulators that are operative in mouse, but encompasses WNT components and genes associated with trophoblast specification. Sequential activation of GATA6, SOX17 and GATA4 markers of primitive endoderm identity is conserved in primates. Unexpectedly, OTX2 is also associated with primitive endoderm specification in human and non-human primate blastocysts. Our cross-species analysis demarcates both conserved and primate-specific features of preimplantation development, and underscores the molecular adaptability of early mammalian embryogenesis.

Nucleosomes are the basic unit of chromatin that help the packaging of genetic material while controlling access to the genetic information. The underlying DNA sequence, together with transcription-associated proteins and chromatin remodelling complexes, are important factors that influence the organization of nucleosomes. Here, we show that the naturally occurring DNA modification, 5-formylcytosine (5fC) is linked to tissue-specific nucleosome organization. Our study reveals that 5fC is associated with increased nucleosome occupancy in vitro and in vivo. We demonstrate that 5fC-associated nucleosomes at enhancers in the mammalian hindbrain and heart are linked to elevated gene expression. Our study also reveals the formation of a reversible-covalent Schiff base linkage between lysines of histone proteins and 5fC within nucleosomes in a cellular environment. We define their specific genomic loci in mouse embryonic stem cells and look into the biological consequences of these DNA-histone Schiff base sites. Collectively, our findings show that 5fC is a determinant of nucleosome organization and plays a role in establishing distinct regulatory regions that control transcription.

Maternal overnutrition has been associated with increased susceptibility to develop obesity and neurological disorders later in life. Most epidemiological as well as experimental studies have focused on the metabolic consequences across generations following an early developmental nutritional insult. Recently, it has been shown that maternal high-fat diet (HFD) affects third-generation female body mass via the paternal lineage. We showed here that the offspring born to HFD ancestors displayed addictive-like behaviors as well as obesity and insulin resistance up to the third generation in the absence of any further exposure to HFD. These findings, implicate that the male germ line is a major player in transferring phenotypic traits. These behavioral and physiological alterations were paralleled by reduced striatal dopamine levels and increased dopamine 2 receptor density. Interestingly, by the third generation a clear gender segregation emerged, where females showed addictive-like behaviors while male HFD offspring showed an obesogenic phenotype. However, methylome profiling of F1 and F2 sperm revealed no significant difference between the offspring groups, suggesting that the sperm methylome might not be the major carrier for the transmission of the phenotypes observed in our mouse model. Together, our study for the first time demonstrates that maternal HFD insult causes sustained alterations of the mesolimbic dopaminergic system suggestive of a predisposition to develop obesity and addictive-like behaviors across multiple generations.

Pluripotency is accompanied by the erasure of parental epigenetic memory, with naïve pluripotent cells exhibiting global DNA hypomethylation both in vitro and in vivo. Exit from pluripotency and priming for differentiation into somatic lineages is associated with genome-wide de novo DNA methylation. We show that during this phase, co-expression of enzymes required for DNA methylation turnover, DNMT3s and TETs, promotes cell-to-cell variability in this epigenetic mark. Using a combination of single-cell sequencing and quantitative biophysical modeling, we show that this variability is associated with coherent, genome-scale oscillations in DNA methylation with an amplitude dependent on CpG density. Analysis of parallel single-cell transcriptional and epigenetic profiling provides evidence for oscillatory dynamics both in vitro and in vivo. These observations provide insights into the emergence of epigenetic heterogeneity during early embryo development, indicating that dynamic changes in DNA methylation might influence early cell fate decisions.

Recent research has focused on environmental effects that control tissue functionality and systemic metabolism. However, whether such stimuli affect human thermogenesis and body mass index (BMI) has not been explored. Here we show retrospectively that the presence of brown adipose tissue (BAT) and the season of conception are linked to BMI in humans. In mice, we demonstrate that cold exposure (CE) of males, but not females, before mating results in improved systemic metabolism and protection from diet-induced obesity of the male offspring. Integrated analyses of the DNA methylome and RNA sequencing of the sperm from male mice revealed several clusters of co-regulated differentially methylated regions (DMRs) and differentially expressed genes (DEGs), suggesting that the improved metabolic health of the offspring was due to enhanced BAT formation and increased neurogenesis. The conclusions are supported by cell-autonomous studies in the offspring that demonstrate an enhanced capacity to form mature active brown adipocytes, improved neuronal density and more norepinephrine release in BAT in response to cold stimulation. Taken together, our results indicate that in humans and in mice, seasonal or experimental CE induces an epigenetic programming of the sperm such that the offspring harbor hyperactive BAT and an improved adaptation to overnutrition and hypothermia.

Defective germline reprogramming in Piwil4 (Miwi2)- and Dnmt3l-deficient mice results in the failure to reestablish transposon silencing, meiotic arrest and progressive loss of spermatogonia. Here we sought to understand the molecular basis for this spermatogonial dysfunction. Through a combination of imaging, conditional genetics and transcriptome analysis, we demonstrate that germ cell elimination in the respective mutants arises as a result of defective de novo genome methylation during reprogramming rather than because of a function for the respective factors within spermatogonia. In both Miwi2 and Dnmt3l spermatogonia, the intracisternal-A particle (IAP) family of endogenous retroviruses is derepressed, but, in contrast to meiotic cells, DNA damage is not observed. Instead, we find that unmethylated IAP promoters rewire the spermatogonial transcriptome by driving expression of neighboring genes. Finally, spermatogonial numbers, proliferation and differentiation are altered in Miwi2 and Dnmt3l mice. In summary, defective reprogramming deregulates the spermatogonial transcriptome and may underlie spermatogonial dysfunction.

A remarkable epigenetic remodelling process occurs shortly after fertilization, which restores totipotency to the zygote. This involves global DNA demethylation, chromatin remodelling, genome spatial reorganization and substantial transcriptional changes. Key to these changes is the transition from the maternal environment of the oocyte to an embryonic-driven developmental expression programme, a process termed the maternal-to-zygotic transition (MZT). Zygotic genome activation occurs predominantly at the two-cell stage in mice and the eight-cell stage in humans, yet the dynamics of its control are still mostly obscure. In recent years, partly due to single-cell and low-cell number epigenomic studies, our understanding of the epigenetic and chromatin landscape of preimplantation development has improved considerably. In this Review, we discuss the latest advances in the study of the MZT, focusing on DNA methylation, histone post-translational modifications, local chromatin structure and higher-order genome organization. We also discuss key mechanistic studies that investigate the mode of action of chromatin regulators, transcription factors and non-coding RNAs during preimplantation development. Finally, we highlight areas requiring additional research, as well as new technological advances that could assist in eventually completing our understanding of the MZT.

Whole-genome bisulfite sequencing (WGBS) is becoming an increasingly accessible technique, used widely for both fundamental and disease-oriented research. Library preparation methods benefit from a variety of available kits, polymerases and bisulfite conversion protocols. Although some steps in the procedure, such as PCR amplification, are known to introduce biases, a systematic evaluation of biases in WGBS strategies is missing.

The recent advent of methods for high-throughput single-cell molecular profiling has catalyzed a growing sense in the scientific community that the time is ripe to complete the 150-year-old effort to identify all cell types in the human body. The Human Cell Atlas Project is an international collaborative effort that aims to define all human cell types in terms of distinctive molecular profiles (such as gene expression profiles) and to connect this information with classical cellular descriptions (such as location and morphology). An open comprehensive reference map of the molecular state of cells in healthy human tissues would propel the systematic study of physiological states, developmental trajectories, regulatory circuitry and interactions of cells, and also provide a framework for understanding cellular dysregulation in human disease. Here we describe the idea, its potential utility, early proofs-of-concept, and some design considerations for the Human Cell Atlas, including a commitment to open data, code, and community.

Expression of the transcription factors OCT4, SOX2, KLF4, and cMYC (OSKM) reprograms somatic cells into induced pluripotent stem cells (iPSCs). Reprogramming is a slow and inefficient process, suggesting the presence of safeguarding mechanisms that counteract cell fate conversion. One such mechanism is senescence. To identify modulators of reprogramming-induced senescence, we performed a genome-wide shRNA screen in primary human fibroblasts expressing OSKM. In the screen, we identified novel mediators of OSKM-induced senescence and validated previously implicated genes such as CDKN1A We developed an innovative approach that integrates single-cell RNA sequencing (scRNA-seq) with the shRNA screen to investigate the mechanism of action of the identified candidates. Our data unveiled regulation of senescence as a novel way by which mechanistic target of rapamycin (mTOR) influences reprogramming. On one hand, mTOR inhibition blunts the induction of cyclin-dependent kinase (CDK) inhibitors (CDKIs), including p16(INK4a), p21(CIP1), and p15(INK4b), preventing OSKM-induced senescence. On the other hand, inhibition of mTOR blunts the senescence-associated secretory phenotype (SASP), which itself favors reprogramming. These contrasting actions contribute to explain the complex effect that mTOR has on reprogramming. Overall, our study highlights the advantage of combining functional screens with scRNA-seq to accelerate the discovery of pathways controlling complex phenotypes.

Erasure of DNA methylation and repressive chromatin marks in the mammalian germline leads to risk of transcriptional activation of transposable elements (TEs). Here, we used mouse embryonic stem cells (ESCs) to identify an endosiRNA-based mechanism involved in suppression of TE transcription. In ESCs with DNA demethylation induced by acute deletion of Dnmt1, we saw an increase in sense transcription at TEs, resulting in an abundance of sense/antisense transcripts leading to high levels of ARGONAUTE2 (AGO2)-bound small RNAs. Inhibition of Dicer or Ago2 expression revealed that small RNAs are involved in an immediate response to demethylation-induced transposon activation, while the deposition of repressive histone marks follows as a chronic response. In vivo, we also found TE-specific endosiRNAs present during primordial germ cell development. Our results suggest that antisense TE transcription is a "trap" that elicits an endosiRNA response to restrain acute transposon activity during epigenetic reprogramming in the mammalian germline.

DNA methylation is an important epigenetic modification in many species that is critical for development, and implicated in ageing and many complex diseases, such as cancer. Many cost-effective genome-wide analyses of DNA modifications rely on restriction enzymes capable of digesting genomic DNA at defined sequence motifs. There are hundreds of restriction enzyme families but few are used to date, because no tool is available for the systematic evaluation of restriction enzyme combinations that can enrich for certain sites of interest in a genome. Herein, we present customised Reduced Representation Bisulfite Sequencing (cuRRBS), a novel and easy-to-use computational method that solves this problem. By computing the optimal enzymatic digestions and size selection steps required, cuRRBS generalises the traditional MspI-based Reduced Representation Bisulfite Sequencing (RRBS) protocol to all restriction enzyme combinations. In addition, cuRRBS estimates the fold-reduction in sequencing costs and provides a robustness value for the personalised RRBS protocol, allowing users to tailor the protocol to their experimental needs. Moreover, we show in silico that cuRRBS-defined restriction enzymes consistently out-perform MspI digestion in many biological systems, considering both CpG and CHG contexts. Finally, we have validated the accuracy of cuRRBS predictions for single and double enzyme digestions using two independent experimental datasets.

Mouse embryonic stem cells derived from the epiblast contribute to the somatic lineages and the germline but are excluded from the extra-embryonic tissues that are derived from the trophectoderm and the primitive endoderm upon reintroduction to the blastocyst. Here we report that cultures of expanded potential stem cells can be established from individual eight-cell blastomeres, and by direct conversion of mouse embryonic stem cells and induced pluripotent stem cells. Remarkably, a single expanded potential stem cell can contribute both to the embryo proper and to the trophectoderm lineages in a chimaera assay. Bona fide trophoblast stem cell lines and extra-embryonic endoderm stem cells can be directly derived from expanded potential stem cells in vitro. Molecular analyses of the epigenome and single-cell transcriptome reveal enrichment for blastomere-specific signature and a dynamic DNA methylome in expanded potential stem cells. The generation of mouse expanded potential stem cells highlights the feasibility of establishing expanded potential stem cells for other mammalian species.

Single-cell multi-omics has recently emerged as a powerful technology by which different layers of genomic output-and hence cell identity and function-can be recorded simultaneously. Integrating various components of the epigenome into multi-omics measurements allows for studying cellular heterogeneity at different time scales and for discovering new layers of molecular connectivity between the genome and its functional output. Measurements that are increasingly available range from those that identify transcription factor occupancy and initiation of transcription to long-lasting and heritable epigenetic marks such as DNA methylation. Together with techniques in which cell lineage is recorded, this multilayered information will provide insights into a cell's past history and its future potential. This will allow new levels of understanding of cell fate decisions, identity, and function in normal development, physiology, and disease.

The mouse inner cell mass (ICM) segregates into the epiblast and primitive endoderm (PrE) lineages coincident with implantation of the embryo. The epiblast subsequently undergoes considerable expansion of cell numbers prior to gastrulation. To investigate underlying regulatory principles, we performed systematic single-cell RNA sequencing (seq) of conceptuses from E3.5 to E6.5. The epiblast shows reactivation and subsequent inactivation of the X chromosome, with Zfp57 expression associated with reactivation and inactivation together with other candidate regulators. At E6.5, the transition from epiblast to primitive streak is linked with decreased expression of polycomb subunits, suggesting a key regulatory role. Notably, our analyses suggest elevated transcriptional noise at E3.5 and within the non-committed epiblast at E6.5, coinciding with exit from pluripotency. By contrast, E6.5 primitive streak cells became highly synchronized and exhibit a shortened G1 cell-cycle phase, consistent with accelerated proliferation. Our study systematically charts transcriptional noise and uncovers molecular processes associated with early lineage decisions.

Much attention has focussed on the conversion of human pluripotent stem cells (PSCs) to a more naïve developmental status. Here we provide a method for resetting via transient histone deacetylase inhibition. The protocol is effective across multiple PSC lines and can proceed without karyotype change. Reset cells can be expanded without feeders with a doubling time of around 24 h. WNT inhibition stabilises the resetting process. The transcriptome of reset cells diverges markedly from that of primed PSCs and shares features with human inner cell mass (ICM). Reset cells activate expression of primate-specific transposable elements. DNA methylation is globally reduced to a level equivalent to that in the ICM and is non-random, with gain of methylation at specific loci. Methylation imprints are mostly lost, however. Reset cells can be re-primed to undergo tri-lineage differentiation and germline specification. In female reset cells, appearance of biallelic X-linked gene transcription indicates reactivation of the silenced X chromosome. On reconversion to primed status, XIST-induced silencing restores monoallelic gene expression. The facile and robust conversion routine with accompanying data resources will enable widespread utilisation, interrogation, and refinement of candidate naïve cells.

Aging of the hematopoietic stem cell (HSC) compartment is characterized by lineage bias and reduced stem cell function, the molecular basis of which is largely unknown. Using single-cell transcriptomics, we identified a distinct subpopulation of old HSCs carrying a p53 signature indicative of stem cell decline alongside pro-proliferative JAK/STAT signaling. To investigate the relationship between JAK/STAT and p53 signaling, we challenged HSCs with a constitutively active form of JAK2 (V617F) and observed an expansion of the p53-positive subpopulation in old mice. Our results reveal cellular heterogeneity in the onset of HSC aging and implicate a role for JAK2V617F-driven proliferation in the p53-mediated functional decline of old HSCs.