1Laboratory of Molecular Carcinogenesis, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC 27709, USA.

Abstract

Metazoan transcription is controlled through either coordinated recruitment of transcription machinery to the gene promoter or regulated pausing of RNA polymerase II (Pol II) in early elongation. We report that a striking difference between genes that use these distinct regulatory strategies lies in the "default" chromatin architecture specified by their DNA sequences. Pol II pausing is prominent at highly regulated genes whose sequences inherently disfavor nucleosome formation within the gene but favor occlusion of the promoter by nucleosomes. In contrast, housekeeping genes that lack pronounced Pol II pausing show higher nucleosome occupancy downstream, but their promoters are deprived of nucleosomes regardless of polymerase binding. Our results indicate that a key role of paused Pol II is to compete with nucleosomes for occupancy of highly regulated promoters, thereby preventing the formation of repressive chromatin architecture to facilitate further or future gene activation.

Genes with Prominent NELF-mediated Pausing Have Strong Promoters and More Focused Transcription Initiation

(A) Heatmaps depict loss of Pol II signal upon NELF-depletion or fold enrichment for the factors indicated at genes bound by Pol II in untreated cells. The rank order places promoter regions (+/- 250 bp) that lose the most Pol II signal upon NELF depletion at the top, and those least affected at bottom.(B) Promoter motifs are enriched among the most NELF-affected genes (Quartile 1), whereas less NELF-affected genes (Quartiles 2-4) are more likely to possess activator binding sites, such as the E-box, homeo domain response element (Hox RE) or DNA-replication-related element binding factor (DREF). Pol II-bound genes with high confidence TSS annotation were analyzed (n=6,461; including 1,615 of the most- and 4,846 of the less NELF-affected genes), and the number and percentage of genes that possess each motif are shown, along with P-value (Fisher's exact test).(C and D) Examples of genes that display highly focused transcription initiation (Tl, CG5490) or more dispersed initiation patterns (CG7364). Shown are the number of short RNA 5′-ends, at single-nucleotide resolution, that map near each TSS.(E) The most NELF-affected genes (Quartile 1) have more focused initiation than genes less affected by NELF RNAi (Quartiles 2-4). Initiation was considered focused when ≥50% of total promoter-proximal reads (+/- 50 bp from TSS) mapped to a single location.See also Figures S2 and S3.

(A) The most NELF-affected genes are preferentially depleted of downstream nucleosomes. Heatmaps show the change in Pol II signal upon NELF depletion for Pol II-bound genes (as in Figure 1A) and nucleosome occupancy determined by paired-end MNase-seq (color intensity indicates the number of read centers that lie in each 50 bp bin).(B) Pol II-bound genes were divided into quartiles based on the effect of NELF-depletion on Pol II promoter occupancy, from most affected (Quartile 1) to least (Quartile 4). Nucleosome distribution at genes in each quartile was determined by summing the number of nucleosome centers mapping to each position from the TSS to +1 kb.(C) Transcription elongation modestly disrupts chromatin architecture. Heatmaps show Ser2-P Pol II signal and nucleosome distribution at genes rank ordered by levels of Ser2-P enrichment within the gene.(D) Nucleosome occupancy is lower downstream of the most NELF-affected genes than at genes with the most active elongation (panel C, Quartile 1).(E) Nucleosome occupancy at genes separated into quartiles by Pausing indices, where Quartile 1 represents genes with the most pausing.(F) Predicted nucleosome occupancy at genes in each quartile of Pausing indices, based on intrinsic DNA sequence preferences of nucleosome formation (Kaplan et al., 2009).(G) Intron content is shown for genes in each quartile of Pausing indices, revealing significantly elevated intron levels at genes that are highly affected by NELF-depletion (Kruskal-Wallis test, boxes depict 25-75th percentiles, whiskers show 10-90th).See also Figure S4.

(A) Nucleosome occupancy at Pol II-bound genes (n=7,466) separated into quartiles by Pausing indices, where Quartile 1 represents the most paused genes. Nucleosome occupancy at genes in each quartile was determined by summing the number of nucleosome centers mapping to each position.(B) Predicted nucleosome occupancy at genes in each quartile of Pausing indices, based on intrinsic DNA sequence preferences of nucleosome formation, as in (Kaplan et al., 2009).(C) Loss of Pol II upon NELF-depletion is accompanied by increased nucleosome occupancy. Pol II ChIP-chip fold enrichment (red) and MNase-seq read distribution (black, depicts read centers in 25-bp bins) around a highly NELF-affected gene (CG12896) in mock-treated and NELF-depleted samples.(D) Genes down-regulated by NELF-depletion show increased promoter nucleosome occupancy. Nucleosome occupancy (calculated as in A) at genes whose expression decreased >2-fold following NELF-depletion.(E and F) NELF-depletion leads to increased nucleosome occupancy over highly paused promoters. The change in nucleosome counts upon NELF-depletion (MNase-seq reads in NELF-depleted/Mock-treated samples) is shown for genes in each quartile of Pausing indices as: fold change in read number at each position (E) or the raw increase in the number of nucleosome reads +/- 200 bp from the TSS (F).

The Relationship Between Pol II and Nucleosome Occupancy at Highly Paused and Less Paused Genes

Pol II distribution in S2 cells +/- 24-hour treatment with ecdysone is depicted as fold enrichment from ChIP-chip experiments. MNase protection assays are shown below to compare nucleosome occupancy at each gene in the Pol II-bound vs. unbound state. Data points represent average qPCR signal of DNA protected against MNase digestion from two biological replicates at primer pairs centered at the indicated distance from the TSS; error bars depict range.(A) Ugt35A (CG6644), a gene with a high Pausing index in control cells that becomes unbound by Pol II following treatment with ecdysone.(B) CG9664, a gene unbound by Pol II in control cells that becomes highly paused in ecdysone-treated cells.(C) Ecdysone causes CG9799, a gene with a low Pausing index in control cells, to become unbound by Pol II.(D) CG8950, an unbound gene in control cells, has uniform Pol II distribution upon ecdysone treatment.See also Figure S5.

Promoters Adopt their Predicted Nucleosome Configuration in the Absence of Paused Pol II

(A) Pol II distribution and nucleosome occupancy (H2A.Z nucleosomes from Mavrich et al., 2008) in Drosophila embryos. Genes are rank ordered by descending Pausing indices in embryos.(B) The most highly paused genes in embryos are depleted of downstream nucleosomes. Pol II-bound genes in embryos were divided into quartiles based on Pausing indices and composite metagene analyses of nucleosome reads around their promoters were generated from the data in (Mavrich et al., 2008).(C) Predicted nucleosome occupancy at genes in each quartile of Pausing indices as determined in Drosophila embryos.(D) A promoter with a high Pausing index in embryos (top panel) that is not occupied by Pol II in S2 cells (middle panel) becomes occluded by nucleosomes in the unbound state. Bottom panel shows nucleosome occupancy at the unbound gene in S2 cells as determined by MNase-seq, with read centers displayed in 25 bp bins.(E) In the absence of Pol II, in vivo nucleosome occupancy closely resembles predictions. Nucleosome occupancies were determined in S2 cells, where these genes are not bound by Pol II. Shown are genes that are highly paused (Quartile 1) or lacking paused Pol II (Quartile 4) in embryos.See also Figure S6.