The K homology (KH) motif is one of the major classes of nucleic acid binding proteins. Some members of this family have been shown to interact with DNA while others have RNA targets. There have been no reports containing direct experimental evidence regarding the nature of KH module-DNA interaction. In this study, the interaction of the C-terminal KH domain of heterogeneous nuclear ribonucleoprotein K (KH3) with its cognate single-stranded DNA (ssDNA) are investigated. Chemical shift perturbation mapping indicates that the first two helices, the conserved GxxG loop, beta 1, and beta 2, are the primary regions involved in DNA binding for KH3. The nature of the KH3-ssDNA interaction is further illuminated by a comparison of backbone 15N relaxation data for the bound and unbound KH3. Relaxation data are also used to confirm that the backbone of wild-type KH3 is structurally identical to that of the G26R mutant KH3, which was previously published. Amide proton exchange experiments indicate that the two helices involved in DNA binding are less stable than other regions of secondary structure and that a large portion of KH3 backbone amide hydrogens are protected in some manner upon ssDNA binding. The major backbone dynamics features of KH3 are similar to those of the structurally comparable human papillomavirus-31 E2 DNA binding domain. Secondary structure information for ssDNA-bound wild-type KH3 is also presented and shows that binding results in no global changes in the protein fold.

Solution structure of the MEF2A-DNA complex: structural basis for the modulation of DNA bending and specificity by MADS-box transcription factors.

EMBO J. 2000; 19: 2615-28

Display abstract

The solution structure of the 33 kDa complex between the dimeric DNA-binding core domain of the transcription factor MEF2A (residues 1-85) and a 20mer DNA oligonucleotide comprising the consensus sequence CTA(A/T)(4)TAG has been solved by NMR. The protein comprises two domains: a MADS-box (residues 1-58) and a MEF2S domain (residues 59-73). Recognition and specificity are achieved by interactions between the MADS-box and both the major and minor grooves of the DNA. A number of critical differences in protein-DNA contacts observed in the MEF2A-DNA complex and the DNA complexes of the related MADS-box transcription factors SRF and MCM1 provide a molecular explanation for modulation of sequence specificity and extent of DNA bending ( approximately 15 versus approximately 70 degrees ). The structure of the MEF2S domain is entirely different from that of the equivalent SAM domain in SRF and MCM1, accounting for the absence of cross-reactivity with other proteins that interact with these transcription factors.

AHM1, a novel type of nuclear matrix-localized, MAR binding protein with a single AT hook and a J domain-homologous region.

Plant Cell. 2000; 12: 1903-16

Display abstract

Interactions between the nuclear matrix and special regions of chromosomal DNA called matrix attachment regions (MARs) have been implicated in various nuclear functions. We have identified a novel protein from wheat, AT hook-containing MAR binding protein1 (AHM1), that binds preferentially to MARs. A multidomain protein, AHM1 has the special combination of a J domain-homologous region and a Zn finger-like motif (a J-Z array) and an AT hook. For MAR binding, the AT hook at the C terminus was essential, and an internal portion containing the Zn finger-like motif was additionally required in vivo. AHM1 was found in the nuclear matrix fraction and was localized in the nucleoplasm. AHM1 fused to green fluorescent protein had a speckled distribution pattern inside the nucleus. AHM1 is most likely a nuclear matrix component that functions between intranuclear framework and MARs. J-Z arrays can be found in a group of (hypothetical) proteins in plants, which may share some functions, presumably to recruit specific Hsp70 partners as co-chaperones.

SARs (scaffold attachment regions) are candidate DNA elements for partitioning eukaryotic genomes into independent chromatin loops by attaching DNA to proteins of a nuclear scaffold or matrix. The interaction of SARs with the nuclear scaffold is evolutionarily conserved and appears to be due to specific DNA binding proteins that recognize SARs by a mechanism not yet understood. We describe a novel, evolutionarily conserved protein domain that specifically binds to SARs but is not related to SAR binding motifs of other proteins. This domain was first identified in human scaffold attachment factor A (SAF-A) and was thus designated SAF-Box. The SAF-Box is present in many different proteins ranging from yeast to human in origin and appears to be structurally related to a homeodomain. We show here that SAF-Boxes from four different origins, as well as a synthetic SAF-Box peptide, bind to natural and artificial SARs with high specificity. Specific SAR binding of the novel domain is achieved by an unusual mass binding mode, is sensitive to distamycin but not to chromomycin, and displays a clear preference for long DNA fragments. This is the first characterization of a specific SAR binding domain that is conserved throughout evolution and has DNA binding properties that closely resemble that of the unfractionated nuclear scaffold.

The DNA-binding domain of the gene regulatory protein AreA extends beyond the minimal zinc-finger region conserved between GATA proteins.

Biochim Biophys Acta. 2000; 1493: 325-32

Display abstract

The AreA protein of Aspergillus nidulans regulates the activity of over 100 genes involved in the utilisation of nitrogen, and has a limited region of homology with the vertebrate family of GATA proteins around a zinc finger (Zf) motif. A 66 amino acid (a.a.) residue fragment (Zf(66)) corresponding to the zinc finger, a 91 a.a fragment (Zf(91)) containing an additional 25 a.a. at the C-terminus, and a much larger 728 a.a. sequence (3'EX) corresponding to the 3'exon have been over-expressed as fusion proteins in E. coli and purified. The DNA-protein complexes formed by these proteins have been examined by gel retardation analysis. The 91 a.a. protein forms a discrete shifted species with a GATA-containing DNA fragment with high affinity (K(d)=0.15 nM), whereas the 66 a.a. protein has very low ( approximately microM) affinity for the same sequence. The results show that the region of AreA required for high affinity DNA binding extends beyond the zinc finger motif that is homologous to GATA-1, requiring in addition a region within the 25 a.a. sequence C-terminal to the zinc finger. Using hydroxyl radical and ethylation interference footprinting, the minimal Zinc finger protein (Zf(66)) shows no appreciable interference effects whereas Zf(91) shows much stronger interference effects, identical to those of the larger protein. These effects extend over sequences up to two nucleotides either side of the GATA site, and indicate contacts additional to those observed in the three-dimensional structure of the complex of the minimal zinc-finger protein with DNA. We suggest that these additional contacts are responsible for the enhanced DNA binding affinity of the extended zinc-finger protein Zf(91).

MIDA1, an Id-associating protein, has two distinct DNA binding activities that are converted by the association with Id1: a novel function of Id protein.

Biochem Biophys Res Commun. 1999; 266: 147-51

Display abstract

Id proteins not only regulate cell differentiation negatively, but they also promote growth, immortalization, and apoptosis. To know the mechanism of how Id regulates cell fate, we previously isolated an Id-associating protein, MIDA1, which positively regulates cell growth (1). Its predicted amino acid sequence consists of a Zuotin (a Z-DNA binding protein in yeast) homology region and tryptophan-mediated repeats (Tryp-med repeats). MIDA1 exhibits a sequence-specific DNA binding activity through the Tryp-med repeats (manuscript in preparation). In this study, we revealed that, like Zuotin, MIDA1 can specifically bind to Z-DNA. This suggested that MIDA is a novel DNA binding protein that has two different DNA binding activities. Furthermore, association of Id1 with MIDA1 stimulated the sequence-specific DNA binding activity, while it inhibited the Z-DNA binding activity. Therefore, we concluded that MIDA1 may act as a mediator of the growth-promoting function of Id, by switching the two DNA binding activities of MIDA1.

The cyclophilin-like domain mediates the association of Ran-binding protein 2 with subunits of the 19 S regulatory complex of the proteasome.

J Biol Chem. 1998; 273: 24676-82

Display abstract

The combination of the Ran-binding domain 4 and cyclophilin domains of Ran-binding protein 2 selectively associate with a subset of G protein-coupled receptors, red/green opsins, upon cis-trans prolyl isomerase-dependent and direct modification of opsin followed by association of the modified opsin isoform to Ran-binding domain 4. This effect enhances in vivo the production of functional receptor and generates an opsin isoform with no propensity to self-aggregate in vitro. We now show that another domain of Ran-binding protein 2, cyclophilin-like domain, specifically associates with the 112-kDa subunit, P112, and other subunits of the 19 S regulatory complex of the 26 S proteasome in the neuroretina. This association possibly mediates Ran-binding protein 2 limited proteolysis into a smaller and stable isoform. Also, the interaction of Ran-binding protein 2 with P112 regulatory subunit of the 26 S proteasome involves still another protein, a putative kinesin-like protein. Our results indicate that Ran-binding protein 2 is a key component of a macro-assembly complex selectively linking protein biogenesis with the proteasome pathway and, thus, with potential implications for the presentation of misfolded and ubiquitin-like modified proteins to this proteolytic machinery.

A nondenaturing purification scheme for the DNA-binding domain of poly(ADP-ribose) polymerase, a structure-specific DNA-binding protein.

Protein Expr Purif. 1998; 14: 79-86

Display abstract

Poly(ADP-ribose) polymerase (PARP) is thought to be involved in DNA repair given its ability to recognize and bind to DNA strand breaks. During apoptosis, PARP is proteolytically cleaved into two stable fragments, the N-terminal 25-kDa DNA-binding domain (DBD) and the 85-kDa fragment containing the automodification and catalytic domains. To understand the DNA-binding properties of PARP, we expressed a recombinant hexahistidine tagged protein (His-DBD) in Escherichia coli. We modified expression to facilitate protein folding by including zinc and reducing the induction temperature. Properly folded, the DNA-binding domain of PARP binds to single- and double-stranded DNA in a structure-specific manner. To eliminate contamination with bacterial DNA that occurred during the purification process, a purification procedure was developed to produce DNA-free protein. In addition, to remove the hexahistidine tag from the recombinant protein, thrombin cleavage was carried out while the recombinant protein was bound to a DNA column. This procedure stabilized the recombinant protein and resulted in nearly 100% cleavage with no appreciable loss to unwanted proteolytic degradation. This nondenaturing purification scheme results in high-quality, native PARP-DBD for use in structural and biochemical studies.

Fifteen sequences belonging to the Chinese hamster genome were isolated from a library of sequences preferentially binding to the nuclear matrix (matrix attachment regions, MAR), sequenced, and characterized. Fourteen of the 15 sequences (> 90%) bound to the nuclear matrix with affinities 2.5-60 times higher than those of control DNA fragments containing no MARs. One clone displayed a considerable homology to the ORF1 region of the mouse LINE repeat. Such MARs within LINE repeats may considerably alter the activities of some genes and the transcription status of chromatin domains upon the LINE repeat propagation in the genome over the course of evolution.

Here, we describe the cloning and further characterization of chicken ARBP, an abundant nuclear protein with a high affinity for MAR/SARs. Surprisingly, ARBP was found to be homologous to the rat protein MeCP2, previously identified as a methyl-CpG-binding protein. A region spanning 125 amino acids in the N-terminal halves is 96.8% identical between chicken ARBP and rat MeCP2. A deletion mutation analysis using Southwestern and band shift assays identified this highly conserved region as the MAR DNA binding domain. Alignment of chicken ARBP with rat and human MeCP2 proteins revealed six trinucleotide amplifications generating up to 34-fold repetitions of a single amino acid. Because MeCP2 was previously localized to pericentromeric heterochromatin in mouse chromosomes, we analyzed the in vitro binding of ARBP to various repetitive sequences. In band shift experiments, ARBP binds to two chicken repetitive sequences as well as to mouse satellite DNA with high affinity similar to that of its binding to chicken lysozyme MAR fragments. In mouse satellite DNA, use of several footprinting techniques characterized two high-affinity binding sites, whose sequences are related to the ARBP binding site consensus in the chicken lysozyme MAR (5'-GGTGT-3'). Band shift experiments indicated that methylation increased in vitro binding of ARBP to mouse satellite DNA two- to fivefold. Our results suggest that ARBP/MeCP2 is a multifunctional protein with roles in loop domain organization of chromatin, the structure of pericentromeric heterochromatin, and DNA methylation.