RNA-binding proteins (RBPs) determine RNA fate from synthesis to decay. Employing two complementary protocols for covalent UV crosslinking of RBPs to RNA, we describe a systematic, unbiased, and comprehensive approach, termed "interactome capture," to define the mRNA interactome of proliferating human HeLa cells. We identify 860 proteins that qualify as RBPs by biochemical and statistical criteria, adding more than 300 RBPs to those previously known and shedding light on RBPs in disease, RNA-binding enzymes of intermediary metabolism, RNA-binding kinases, and RNA-binding architectures. Unexpectedly, we find that many proteins of the HeLa mRNA interactome are highly intrinsically disordered and enriched in short repetitive amino acid motifs. Interactome capture is broadly applicable to study mRNA interactome composition and dynamics in varied biological settings.

Protein-RNA interactions are fundamental to core biological processes, such as mRNA splicing, localization, degradation, and translation. We developed a photoreactive nucleotide-enhanced UV crosslinking and oligo(dT) purification approach to identify the mRNA-bound proteome using quantitative proteomics and to display the protein occupancy on mRNA transcripts by next-generation sequencing. Application to a human embryonic kidney cell line identified close to 800 proteins. To our knowledge, nearly one-third were not previously annotated as RNA binding, and about 15% were not predictable by computational methods to interact with RNA. Protein occupancy profiling provides a transcriptome-wide catalog of potential cis-regulatory regions on mammalian mRNAs and showed that large stretches in 3' UTRs can be contacted by the mRNA-bound proteome, with numerous putative binding sites in regions harboring disease-associated nucleotide polymorphisms. Our observations indicate the presence of a large number of mRNA binders with diverse molecular functions participating in combinatorial posttranscriptional gene-expression networks.

Mutations in PHF6 are the cause of Börjeson-Forssman-Lehman syndrome (BFLS), an X-linked intellectual disability (XLID) disorder, and both T-cell acute lymphoblastic leukemia (T-ALL) and acute myeloid leukemia (AML). The PHF6 gene encodes a protein with two plant homeodomain (PHD)-like zinc finger domains. As many PHD-like domains function to target chromatin remodelers to post-translationally modified histones, this suggests a role for PHF6 in chromatin regulation. However, PHD domains are usually found in association with a catalytic domain, a feature that is lacking in PHF6. This distinct domain structure and the minimal information on its cellular function prompted us to perform a proteomic screen to identify PHF6 binding partners. We expressed recombinant Flag-tagged PHF6 in HEK 293T cells for coimmunoprecipitation, and analyzed the purified products by mass spectrometry. We identified proteins involved in ribosome biogenesis, RNA splicing, and chromatin regulation, consistent with PHF6 localization to both the nucleoplasm and nucleolus. Notably, PHF6 copurified with multiple constituents of the nucleosome remodeling and deacetylation (NuRD) complex, including CHD4, HDAC1, and RBBP4. We demonstrate that this PHF6-NuRD complex is not present in the nucleolus but is restricted to the nucleoplasm. The association with NuRD represents the first known interaction for PHF6 and implicates it in chromatin regulation.

A cellular process that results in the biosynthesis of constituent macromolecules, assembly, and arrangement of constituent parts of ribosome subunits; includes transport to the sites of protein synthesis.

The cellular metabolic process in which a protein is formed, using the sequence of a mature mRNA molecule to specify the sequence of amino acids in a polypeptide chain. Translation is mediated by the ribosome, and begins with the formation of a ternary complex between aminoacylated initiator methionine tRNA, GTP, and initiation factor 2, which subsequently associates with the small subunit of the ribosome and an mRNA. Translation ends with the release of a polypeptide chain from the ribosome.

Hepatitis C virus uses an internal ribosome entry site (IRES) in the viral RNA to directly recruit human 40S ribosome subunits during cap-independent translation initiation. Although IRES-mediated translation initiation is not subject to many of the regulatory mechanisms that control cap-dependent translation initiation, it is unknown whether other noncanonical protein factors are involved in this process. Thus, a global protein composition analysis of native and IRES-bound 40S ribosomal complexes has been conducted to facilitate an understanding of the IRES ribosome recruitment mechanism. A combined top-down and bottom-up mass spectrometry approach was used to identify both the proteins and their posttranslational modifications (PTMs) in the native 40S subunit and the IRES recruited translation initiation complex. Thirty-one out of a possible 32 ribosomal proteins were identified by combining top-down and bottom-up mass spectrometry techniques. Proteins were found to contain PTMs, including loss of methionine, acetylation, methylation, and disulfide bond formation. In addition to the 40S ribosomal proteins, RACK1 was consistently identified in the 40S fraction, indicating that this protein is associated with the 40S subunit. Similar methodology was then applied to the hepatitis C virus IRES-bound 40S complex. Two 40S ribosomal proteins, RS25 and RS29, were found to contain different PTMs than those in the native 40S subunit. In addition, RACK1, eukaryotic initiation factor 3 proteins and nucleolin were identified in the IRES-mediated translation initiation complex.

Reverse-phase HPLC was used to fractionate 40S ribosomal proteins from human placenta. Application of a C4 reverse-phase column allowed us to obtain 27 well-resolved peaks. The protein composition of each chromatographic fraction was established by two-dimensional polyacrylamide gel electrophoresis and N-terminal sequencing. N-terminally blocked proteins were cleaved with endoproteinase Lys-C, and suitable peptides were sequenced. All sequences were compared with those of ribosomal proteins available from data bases. This allowed us to identify all proteins from the 40S human ribosomal subunit in the HPLC elution profile. By matrix-assisted laser-desorption ionization mass spectrometry the masses of the 40S proteins were determined and checked for the presence of post-translational modifications. For several proteins differences to the deduced sequences and the calculated masses were found to be due to post-translational modifications.

Keywords

Proteins conjugated with ribonucleic acid (RNA). Ribonucleoprotein are involved in a wide range of cellular processes. Besides ribosomes, in eukaryotic cells both initial RNA transcripts in the nucleus (hnRNA) and cytoplasmic mRNAs exist as complexes with specific sets of proteins. Processing (splicing) of the former is carried out by small nuclear RNPs (snRNPs). Other examples are the signal recognition particle responsible for targetting proteins to endoplasmic reticulum and a complex involved in termination of transcription.

Protein of the ribosome, large ribonucleoprotein particles where the translation of messenger RNA (mRNA) into protein occurs. They are both free in the cytoplasm and attached to membranes of eukaryotic and prokaryotic cells. Ribosomes are also present in all plastids and mitochondria, where they translate organelle-encoded mRNA.

Protein which is part of a reference proteome. Reference proteomes are a subset of proteomes that have been selected either manually or algorithmically according to a number of criteria to provide a broad coverage of the tree of life and a representative cross-section of the taxonomic diversity found within UniProtKB, as well as the proteomes of well-studied model organisms and other species of interest for biomedical research.