Method

Organism

DNA methylation at the 5 position of cytosine (5mC) in the mammalian genome is a key epigenetic event critical for various cellular processes. The ten-eleven translocation (Tet) family of 5mC-hydroxylases, which convert 5mC to 5-hydroxymethylcytosine (5hmC), offers a way for dynamic regulation of DNA methylation. Here we report that Tet1 binds to unmodified… (More)

DNA methylation has been traditionally viewed as a highly stable epigenetic mark in postmitotic cells. However, postnatal brains appear to show stimulus-induced methylation changes, at least in a few identified CpG dinucleotides. How extensively the neuronal DNA methylome is regulated by neuronal activity is unknown. Using a next-generation sequencing-based… (More)

Meiosis is a germ-cell-specific cell division process through which haploid gametes are produced for sexual reproduction. Before the initiation of meiosis, mouse primordial germ cells undergo a series of epigenetic reprogramming steps, including the global erasure of DNA methylation at the 5-position of cytosine (5mC) in CpG-rich DNA. Although several… (More)

Conditional independence testing is an important problem, especially in Bayesian network learning and causal discovery. Due to the curse of dimensionality, testing for conditional independence of continuous variables is particularly challenging. We propose a Kernel-based Conditional Independence test (KCI-test), by constructing an appropriate test statistic… (More)

By taking into account the nonlinear effect of the cause, the inner noise effect, and the measurement distortion effect in the observed variables, the post-nonlinear (PNL) causal model has demonstrated its excellent performance in distinguishing the cause from effect. However, its identifiability has not been properly addressed, and how to apply it in the… (More)

In multi-label learning, each training example is associated with a set of labels and the task is to predict the proper label set for the unseen example. Due to the tremendous (exponential) number of possible label sets, the task of learning from multi-label examples is rather challenging. Therefore, the key to successful multi-label learning is how to… (More)

A new generation of technologies is poised to reduce DNA sequencing costs by several orders of magnitude. But our ability to fully leverage the power of these technologies is crippled by the absence of suitable 'front-end' methods for isolating complex subsets of a mammalian genome at a scale that matches the throughput at which these platforms will… (More)

Adenosine-to-inosine (A-to-I) RNA editing leads to transcriptome diversity and is important for normal brain function. To date, only a handful of functional sites have been identified in mammals. We developed an unbiased assay to screen more than 36,000 computationally predicted nonrepetitive A-to-I sites using massively parallel target capture and DNA… (More)

Metabolism is vital to every aspect of cell function, yet the metabolome of induced pluripotent stem cells (iPSCs) remains largely unexplored. Here we report, using an untargeted metabolomics approach, that human iPSCs share a pluripotent metabolomic signature with embryonic stem cells (ESCs) that is distinct from their parental cells, and that is… (More)