Cell- and gene-specific regulation of primary target genes by the androgen receptor.

Department of Cellular and Molecular Pharmacology, University of California, San Francisco, California 94143, USA.

Abstract

The androgen receptor (AR) mediates the physiologic and pathophysiologic effects of androgens including sexual differentiation, prostate development, and cancer progression by binding to genomic androgen response elements (AREs), which influence transcription of AR target genes. The composition and context of AREs differ between genes, thus enabling AR to confer multiple regulatory functions within a single nucleus. We used expression profiling of an immortalized human prostate epithelial cell line to identify 205 androgen-responsive genes (ARGs), most of them novel. In addition, we performed chromatin immunoprecipitation to identify 524 AR binding regions and validated in reporter assays the ARE activities of several such regions. Interestingly, 67% of our AREs resided within approximately 50 kb of the transcription start sites of 84% of our ARGs. Indeed, most ARGs were associated with two or more AREs, and ARGs were sometimes themselves linked in gene clusters containing up to 13 AREs and 12 ARGs. AREs appeared typically to be composite elements, containing AR binding sequences adjacent to binding motifs for other transcriptional regulators. Functionally, ARGs were commonly involved in prostate cell proliferation, communication, differentiation, and possibly cancer progression. Our results provide new insights into cell- and gene-specific mechanisms of transcriptional regulation of androgen-responsive gene networks.

qPCR validation of the expression of ARGs identified by microarray. (A) Most of the 157 putative ARGs identified by expression microarray analysis were subsequently validated using qPCR. The colorimetric representation shows genes, indicated by HUGO Gene Nomenclature Committee gene symbols, whose transcripts were repressed (B, green) and induced (C, red) by androgen. The color intensity reflects the relative fold change in transcript level for androgen- versus vehicle-treated cells. The time course spanned 4, 8, 15, and 24 h. The mean change in expression level for genes above the orange line was greater than twofold across the time course, whereas the mean change in expression for those genes below the line was greater than 1.5 fold.

ARBRs identified by AR ChIP-chip lead to the discovery of novel primary ARGs. (A) Many of the 189 ARBR-proximal genes that were not detected by microarray analysis were, in fact, primary ARGs. The colorimetric representation shows genes, near identified ARBRs, whose transcripts were repressed (B, green) or induced (C, red) by androgen treatment. See for colorimetric and time course details. (D) Of the 524 ARBRs (blue) and 205 (108 initial and 97 additional ARGs, red) differentially expressed ARGs, we found 352 ARBRs near 172 ARGs, which represent direct, functional targets of AR. The response of 62 genes containing in total 68 intragenic ARBRs (gray) was independent of androgen (androgen-unresponsive genes, URGs).

Intersection of ARBR and ARG data reveals primary ARG clusters. (A) An 800-kb region on chromosome 4 containing seven validated ARBRs (red asterisks) was found to contain genes that were repressed (green) or unresponsive (black) following treatment with androgen. (B) A 370-kb region on chromosome 1 containing 13 validated ARBRs (red asterisks) was found to contain genes that were induced (red) or unresponsive (black) following treatment with androgen. Gene loci are shown based on the May 2004 human genome freeze hg17 in the UCSC browser using GenBank RefSeq positions. The androgen responsiveness of some genes was not determined (gray). Genes above the heavy black line are transcribed from left to right whereas those below the line are transcribed from right to left. The longest transcriptional variants for each gene are represented. The ChIP–chip signals (red bars) are shown for duplicate experiments and indicate the relative fold enrichment of AR-immunoprecipitated DNA fragments for androgen- versus vehicle-treated cells.

Distribution analysis of AREs located near ARGs. (A) The distribution of AREs identified by ChIP–chip was plotted relative to the predicted transcription start site (+1) of the longest transcriptional variant of the nearest ARG. AREs reside great distances upstream and downstream, as well as near the promoter. Notably, fewer AREs were found upstream of than downstream from ARG transcription start sites of the longest transcriptional variants. Relative distances upstream of (negative) and downstream (positive) from the transcription start sites (vertical black line) were assigned to 10 kb bins. AREs were generally found beyond 10 kb of the transcription start site. (B) The location of AREs is shown relative to the predicted gene substructure of the longest transcriptional variant of the nearby ARGs. When the distribution of all AREs near all ARGs was assessed, most AREs were situated downstream from the transcription start site. These distributions were subdivided based on whether the ARG was repressed or induced, suggesting biases in terms of ARE location and the mean AREs/ARG ratios.

ARBSs and other cis-regulatory motifs are enriched within AREs. Unbiased motif searches of all 524 AREs revealed the presence of significantly recurring motifs. (A) BioProspector analysis identified a consensus ARBS, which is visually represented in WebLogo (http://weblogo.berkeley.edu). (B) MobyDick analysis detected a consensus AR half-site motif as well as putative motifs for AP-1, RAR, ZNF42, HNF-4α, and EGR. These enriched sequence motifs are represented using IUPAC symbols (http://www.iupac.org). Motifs where the putative regulatory factors were not determined (ND) are indicated. P-value calculations are described in Materials and Methods.