Research & Scholarship

Current Research and Scholarly Interests

The Elias Lab seeks to develop and apply methods for large-scale proteome characterization to solve fundamental problems in cell biology and disease. The growing field of proteomics has the lofty goal of characterizing the proteins in any isolated complex, subcellular compartment, cultured cell line or tissue. We use mass spectrometry-based approaches that can do more than simply determine the identity of a protein isolated in a polyacrylamide gel; rapidly advancing technologies are allowing us to measure dynamic changes in protein abundances, post-translational modification states, splice isoforms, interaction partners, and localization across multiple cell states. The combination of liquid chromatography with tandem mass spectrometry (LC-MS/MS) has emerged as the most robust technology for making proteome-scale discoveries. Although the tools of proteomics have greatly advanced in recent years, many challenges lie ahead. Our lab focuses on developing new methods in protein fractionation, instrumentation, and data analysis to meet these challenges, and then applies them to studying important biomedical paradigms, including antigen presentation in cancer and monitoring host responses to the gut microbiome.

Abstract

Cancer somatic mutations can generate neoantigens that distinguish malignant from normal cells. However, the personalized identification and validation of neoantigens remains a major challenge. Here we discover neoantigens in human mantle-cell lymphomas by using an integrated genomic and proteomic strategy that interrogates tumour antigen peptides presented by major histocompatibility complex (MHC) class I and class II molecules. We applied this approach to systematically characterize MHC ligands from 17 patients. Remarkably, all discovered neoantigenic peptides were exclusively derived from the lymphoma immunoglobulin heavy- or light-chain variable regions. Although we identified MHC presentation of private polymorphic germline alleles, no mutated peptides were recovered from non-immunoglobulin somatically mutated genes. Somatic mutations within the immunoglobulin variable region were almost exclusively presented by MHC class II. We isolated circulating CD4(+) T cells specific for immunoglobulin-derived neoantigens and found these cells could mediate killing of autologous lymphoma cells. These results demonstrate that an integrative approach combining MHC isolation, peptide identification, and exome sequencing is an effective platform to uncover tumour neoantigens. Application of this strategy to human lymphoma implicates immunoglobulin neoantigens as targets for lymphoma immunotherapy.

Abstract

Dependent on concise, predefined protein sequence databases, traditional search algorithms perform poorly when analyzing mass spectra derived from wholly uncharacterized protein products. Conversely, de novo peptide sequencing algorithms can interpret mass spectra without relying on reference databases. However, such algorithms have been difficult to apply to complex protein mixtures, in part due to a lack of methods for automatically validating de novo sequencing results. Here, we present novel metrics for benchmarking de novo sequencing algorithm performance on large-scale proteomics data sets and present a method for accurately calibrating false discovery rates on de novo results. We also present a novel algorithm (LADS) that leverages experimentally disambiguated fragmentation spectra to boost sequencing accuracy and sensitivity. LADS improves sequencing accuracy on longer peptides relative to that of other algorithms and improves discriminability of correct and incorrect sequences. Using these advancements, we demonstrate accurate de novo identification of peptide sequences not identifiable using database search-based approaches.

Abstract

Improved understanding of the interplay between host and microbes stands to illuminate new avenues for disease diagnosis, treatment, and prevention. Here, we provide a high-resolution view of the dynamics between host and gut microbiota during antibiotic-induced intestinal microbiota depletion, opportunistic Salmonella typhimurium and Clostridium difficile pathogenesis, and recovery from these perturbed states in a mouse model. Host-centric proteome and microbial community profiles provide a nuanced longitudinal view, revealing the interdependence between host and microbiota in evolving dysbioses. Time- and condition-specific molecular and microbial signatures are evident and clearly distinguished from pathogen-independent inflammatory fingerprints. Our data reveal that mice recovering from antibiotic treatment or C. difficile infection retain lingering signatures of inflammation, despite compositional normalization of the microbiota, and host responses could be rapidly and durably relieved through fecal transplant. These experiments demonstrate insights that emerge from the combination of these orthogonal, untargeted approaches to the gastrointestinal ecosystem.

Abstract

The diverse community of microbes that inhabits the human bowel is vitally important to human health. Host-expressed proteins are essential for maintaining this mutualistic relationship and serve as reporters on the status of host-microbiota interaction. Therefore, unbiased and sensitive methods focused on host proteome characterization are needed. Herein we describe a novel method for applying shotgun proteomics to the analysis of feces, focusing on the secreted host proteome. We have conducted the most complete analysis of the extracellular mouse gut proteome to date by employing a gnotobiotic mouse model. Using mice colonized with defined microbial communities of increasing complexity or a complete human microbiota ('humanized'), we show that the complexity of the host stool proteome mirrors the complexity of microbiota composition. We further show that host responses exhibit signatures specific to the different colonization states. We demonstrate feasibility of this approach in human stool samples and provide evidence for a "core" stool proteome as well as personalized host response features. Our method provides a new avenue for noninvasive monitoring of host-microbiota interaction dynamics via host-produced proteins in stool.

Abstract

Plasmodium falciparum and Toxoplasma gondii are obligate intracellular apicomplexan parasites that rapidly invade and extensively modify host cells. Protein phosphorylation is one mechanism by which these parasites can control such processes. Here we present a phosphoproteome analysis of peptides enriched from schizont stage P. falciparum and T. gondii tachyzoites that are either "intracellular" or purified away from host material. Using liquid chromatography-tandem mass spectrometry, we identified over 5,000 and 10,000 previously unknown phosphorylation sites in P. falciparum and T. gondii, respectively, revealing that protein phosphorylation is an extensively used regulation mechanism both within and beyond parasite boundaries. Unexpectedly, both parasites have phosphorylated tyrosines, and P. falciparum has unusual phosphorylation motifs that are apparently shaped by its A:T-rich genome. This data set provides important information on the role of phosphorylation in the host-pathogen interaction and clues to the evolutionary forces operating on protein phosphorylation motifs in both parasites.

Abstract

Cyclin D1 is a component of the core cell cycle machinery. Abnormally high levels of cyclin D1 are detected in many human cancer types. To elucidate the molecular functions of cyclin D1 in human cancers, we performed a proteomic screen for cyclin D1 protein partners in several types of human tumours. Analyses of cyclin D1 interactors revealed a network of DNA repair proteins, including RAD51, a recombinase that drives the homologous recombination process. We found that cyclin D1 directly binds RAD51, and that cyclin D1-RAD51 interaction is induced by radiation. Like RAD51, cyclin D1 is recruited to DNA damage sites in a BRCA2-dependent fashion. Reduction of cyclin D1 levels in human cancer cells impaired recruitment of RAD51 to damaged DNA, impeded the homologous recombination-mediated DNA repair, and increased sensitivity of cells to radiation in vitro and in vivo. This effect was seen in cancer cells lacking the retinoblastoma protein, which do not require D-cyclins for proliferation. These findings reveal an unexpected function of a core cell cycle protein in DNA repair and suggest that targeting cyclin D1 may be beneficial also in retinoblastoma-negative cancers which are currently thought to be unaffected by cyclin D1 inhibition.

Abstract

The continual destruction and renewal of proteins that maintain cellular homeostasis has been rigorously studied since the late 1930s. Experimental techniques for measuring protein turnover have evolved to measure the dynamic regulation of key proteins and now, entire proteomes. In the past decade, the proteomics field has aimed to discover how cells adjust their proteomes to execute numerous regulatory programs in response to specific cellular and environmental cues. By combining classical biochemical techniques with modern, high-throughput technologies, researchers have begun to reveal the synthesis and degradation mechanisms that shape protein turnover on a global scale. This review examines several recent developments in protein turnover research, emphasizing the combination of metabolic labeling and mass spectrometry.

Abstract

Although most tissues in an organism are genetically identical, the biochemistry of each is optimized to fulfill its unique physiological roles, with important consequences for human health and disease. Each tissue's unique physiology requires tightly regulated gene and protein expression coordinated by specialized, phosphorylation-dependent intracellular signaling. To better understand the role of phosphorylation in maintenance of physiological differences among tissues, we performed proteomic and phosphoproteomic characterizations of nine mouse tissues. We identified 12,039 proteins, including 6296 phosphoproteins harboring nearly 36,000 phosphorylation sites. Comparing protein abundances and phosphorylation levels revealed specialized, interconnected phosphorylation networks within each tissue while suggesting that many proteins are regulated by phosphorylation independently of their expression. Our data suggest that the "typical" phosphoprotein is widely expressed yet displays variable, often tissue-specific phosphorylation that tunes protein activity to the specific needs of each tissue. We offer this dataset as an online resource for the biological research community.

Abstract

Liquid chromatography and tandem mass spectrometry (LC-MS/MS) has become the preferred method for conducting large-scale surveys of proteomes. Automated interpretation of tandem mass spectrometry (MS/MS) spectra can be problematic, however, for a variety of reasons. As most sequence search engines return results even for 'unmatchable' spectra, proteome researchers must devise ways to distinguish correct from incorrect peptide identifications. The target-decoy search strategy represents a straightforward and effective way to manage this effort. Despite the apparent simplicity of this method, some controversy surrounds its successful application. Here we clarify our preferred methodology by addressing four issues based on observed decoy hit frequencies: (i) the major assumptions made with this database search strategy are reasonable; (ii) concatenated target-decoy database searches are preferable to separate target and decoy database searches; (iii) the theoretical error associated with target-decoy false positive (FP) rate measurements can be estimated; and (iv) alternate methods for constructing decoy databases are similarly effective once certain considerations are taken into account.
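As a rough illustration of the concatenated target-decoy idea described above, the sketch below estimates the false positive rate among accepted peptide-spectrum matches (PSMs). It assumes the commonly used convention that each decoy hit stands in for one undetected incorrect target hit, so the estimate is twice the decoy count divided by all accepted hits. The `PSM` class and `estimate_fp_rate` function are hypothetical names introduced here for illustration, not part of any published software.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class PSM:
    """One peptide-spectrum match from a concatenated target-decoy search."""
    score: float      # search-engine score; higher is assumed to be better
    is_decoy: bool    # True if the best match came from the decoy half


def estimate_fp_rate(psms: Iterable[PSM], threshold: float) -> float:
    """Estimate the false positive rate among PSMs scoring >= threshold.

    Assumption: incorrect matches land on target and decoy sequences
    equally often, so every accepted decoy implies roughly one accepted
    but incorrect target. Hence FP rate ~ 2 * decoys / (targets + decoys).
    """
    accepted = [p for p in psms if p.score >= threshold]
    if not accepted:
        return 0.0
    decoys = sum(1 for p in accepted if p.is_decoy)
    return min(1.0, 2 * decoys / len(accepted))


if __name__ == "__main__":
    # Toy data: 8 target hits and 2 decoy hits above the threshold.
    toy = [PSM(5.0, False)] * 8 + [PSM(4.0, True)] * 2 + [PSM(1.0, True)]
    print(estimate_fp_rate(toy, threshold=3.0))
```

The estimate is only as good as the assumption that decoys model incorrect target matches, which is the first of the four issues the abstract addresses.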

Abstract

Researchers have several options when designing proteomics experiments. Primary among these are choices of experimental method, instrumentation and spectral interpretation software. To evaluate these choices on a proteome scale, we compared triplicate measurements of the yeast proteome by liquid chromatography tandem mass spectrometry (LC-MS/MS) using linear ion trap (LTQ) and hybrid quadrupole time-of-flight (QqTOF; QSTAR) mass spectrometers. Acquired MS/MS spectra were interpreted with Mascot and SEQUEST algorithms with and without the requirement that all returned peptides be tryptic. Using a composite target decoy database strategy, we selected scoring criteria yielding 1% estimated false positive identifications at maximum sensitivity for all data sets, allowing reasonable comparisons between them. These comparisons indicate that Mascot and SEQUEST yield similar results for LTQ-acquired spectra but less so for QSTAR spectra. Furthermore, low reproducibility between replicate data acquisitions made on one or both instrument platforms can be exploited to increase sensitivity and confidence in large-scale protein identifications.
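The selection of scoring criteria at a fixed estimated error rate can be sketched as a scan over ranked matches. The example below is a minimal sketch, assuming a single score per match and the same hypothetical estimate of twice the decoy count over all accepted hits; real criteria in such studies typically combine several scores and filters. The function name `pick_threshold` and the tuple layout are illustrative assumptions.

```python
from typing import List, Optional, Tuple

# Each match is (score, is_decoy); higher scores are assumed to be better.
Match = Tuple[float, bool]


def pick_threshold(matches: List[Match], max_fp_rate: float = 0.01) -> Optional[float]:
    """Return the lowest score cutoff whose estimated FP rate is <= max_fp_rate.

    Accepting everything at or above the returned score maximizes the
    number of accepted matches (sensitivity) under the error constraint.
    Returns None if no cutoff satisfies the constraint.
    """
    ranked = sorted(matches, key=lambda m: m[0], reverse=True)
    best: Optional[float] = None
    decoys = 0
    for n, (score, is_decoy) in enumerate(ranked, start=1):
        decoys += int(is_decoy)
        # Evaluate only at the end of a run of tied scores, because a
        # cutoff at this score would accept every tied match.
        if n < len(ranked) and ranked[n][0] == score:
            continue
        if 2 * decoys / n <= max_fp_rate:
            best = score
    return best


if __name__ == "__main__":
    # Toy data: 100 high-scoring targets, one decoy, then 50 more targets.
    toy = [(float(s), False) for s in range(200, 100, -1)]
    toy.append((100.5, True))
    toy += [(float(s), False) for s in range(100, 50, -1)]
    print(pick_threshold(toy, max_fp_rate=0.01))
```

Applying one error-rate target to every data set, as the abstract describes, is what makes comparisons between instruments and search engines reasonable.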

Abstract

Measuring protein changes over time or following stimuli is one of the important tasks of proteomics. In the past decade, several strategies have been developed for the relative quantification of proteins using mass spectrometry (MS). Isobaric labeling strategies for relative quantitative proteomics allow for parallel multiplexing of quantitative experiments. With this technique, multiple peptide samples are chemically labeled with isobaric chemical tag variants, each of which has the same molecular structure and mass. Each variant, however, is designed to produce a unique "reporter ion" when fragmented inside a mass spectrometer. Once peptide samples are labeled, combined, and analyzed using MS, differentially labeled peptides are indistinguishable in the first MS spectrum of intact peptides. However, since each tag variant contains a labile component with a different mass, "reporter ions" can be generated and recorded in a subsequent MS2 spectrum. Intensities from each variant are recorded to represent the relative abundances of the peptide in each sample. Isobaric tags for relative and absolute quantitation (iTRAQ) and tandem mass tags (TMT) are commercially available reagents for performing this technique. Here, we describe the general workflow of relative quantification of proteins using TMT by MS2, or an additional MS3 spectrum.
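The final step described above, turning reporter ion intensities into relative abundances, can be sketched in a few lines. This is a minimal illustration, assuming one intensity per channel for a single peptide and no correction for isotopic impurities or loading differences; the channel labels and the function name `relative_abundances` are hypothetical.

```python
from typing import Dict


def relative_abundances(reporters: Dict[str, float]) -> Dict[str, float]:
    """Convert reporter ion intensities into per-sample fractions.

    `reporters` maps a channel label (one per labeled sample) to the
    intensity of that channel's reporter ion in one fragment spectrum.
    The returned fractions sum to 1 across channels.
    """
    total = sum(reporters.values())
    if total <= 0:
        raise ValueError("no reporter ion signal to quantify")
    return {channel: intensity / total for channel, intensity in reporters.items()}


if __name__ == "__main__":
    # Toy spectrum: the peptide is three times more abundant in sample B.
    print(relative_abundances({"sample_A": 100.0, "sample_B": 300.0}))
```

In practice, protein-level values are usually obtained by combining many such peptide measurements, a step this sketch leaves out.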

Abstract

Major histocompatibility complex Class I (MHCI) and Class II (MHCII) presented peptides powerfully modulate T cell immunity and play a vital role in generating effective anti-tumor and anti-viral immune responses in mammals. Characterizing these MHCI or MHCII presented peptides can help generate therapeutic treatments, afford information on T cell mediated biomarkers, provide insight into disease progression, and reduce adverse anti-drug side effects from engineered biotherapeutics. Here, we explore the tools and techniques commonly employed to discover both MHCI- and MHCII-presented peptides. We describe complementary strategies that enhance the characterization of these peptides and the informatics tools employed for both predicting and characterizing MHCI- and MHCII-presented epitopes. The evolution of methodologies for isolating MHC-presented peptides is discussed, as are the mass spectrometric workflows that can be employed for their characterization. We provide a perspective on where this field is headed, and how these tools may be applicable to the discovery and monitoring of epitopes in a variety of scenarios.

Abstract

Plasmodium parasites contain several unique membrane compartments in which prenylated proteins may play important roles in pathogenesis. Protein prenylation has also been proposed as an antimalarial drug target because farnesyltransferase inhibitors cause potent growth inhibition of blood-stage Plasmodium. However, the specific prenylated proteins that mediate antimalarial activity have yet to be identified. Given the potential for new parasite biology and elucidating drug mechanism-of-action, we performed a large-scale identification of the prenylated proteome in blood-stage P. falciparum parasites using an alkyne-labeled prenyl analog to specifically enrich parasite prenylated proteins. Twenty high-confidence candidates were identified, including several examples of pathogen-specific prenylation activity. One unique parasite prenylated protein was FYVE-containing coiled-coil protein (FCP), which is only conserved in Plasmodium and related Apicomplexan parasites and localizes to the parasite food vacuole. Targeting of FCP to this parasite-specific compartment was dependent on prenylation of its CaaX motif, as mutation of the prenylation site caused cytosolic mislocalization. We also showed that PfRab5b, which lacks C-terminal cysteines that are the only known site of Rab GTPase modification, is prenylated. Finally, we show that the THQ class of farnesyltransferase inhibitors abolishes FCP prenylation and causes its mislocalization, providing the first demonstration of a specific prenylated protein disrupted by antimalarial farnesyltransferase inhibitors. Altogether, these findings identify prenylated proteins that reveal unique parasite biology and are useful for evaluating prenyltransferase inhibitors for antimalarial drug development.

Abstract

T cell activation in response to Ag is largely regulated by protein posttranslational modifications. Although phosphorylation has been extensively characterized in T cells, much less is known about the glycosylation of serine/threonine residues by O-linked N-acetylglucosamine (O-GlcNAc). Given that O-GlcNAc appears to regulate cell signaling pathways and protein activity similarly to phosphorylation, we performed a comprehensive analysis of O-GlcNAc during T cell activation to address the functional importance of this modification and to identify the modified proteins. Activation of T cells through the TCR resulted in a global elevation of O-GlcNAc levels and in the absence of O-GlcNAc, IL-2 production and proliferation were compromised. T cell activation also led to changes in the relative expression of O-GlcNAc transferase (OGT) isoforms and accumulation of OGT at the immunological synapse of murine T cells. Using a glycoproteomics approach, we identified >200 O-GlcNAc proteins in human T cells. Many of the identified proteins had a functional relationship to RNA metabolism, and consistent with a connection between O-GlcNAc and RNA, inhibition of OGT impaired nascent RNA synthesis upon T cell activation. Overall, our studies provide a global analysis of O-GlcNAc dynamics during T cell activation and the first characterization, to our knowledge, of the O-GlcNAc glycoproteome in human T cells.

Abstract

The Flaviviridae are a family of viruses that cause severe human diseases. For example, dengue virus (DENV) is a rapidly emerging pathogen causing an estimated 100 million symptomatic infections annually worldwide. No approved antivirals are available to date and clinical trials with a tetravalent dengue vaccine showed disappointingly low protection rates. Hepatitis C virus (HCV) also remains a major medical problem, with 160 million chronically infected patients worldwide and only expensive treatments available. Despite distinct differences in their pathogenesis and modes of transmission, the two viruses share common replication strategies. A detailed understanding of the host functions that determine viral infection is lacking. Here we use a pooled CRISPR genetic screening strategy to comprehensively dissect host factors required for these two highly important Flaviviridae members. For DENV, we identified endoplasmic-reticulum (ER)-associated multi-protein complexes involved in signal sequence recognition, N-linked glycosylation and ER-associated degradation. DENV replication was nearly completely abrogated in cells deficient in the oligosaccharyltransferase (OST) complex. Mechanistic studies pinpointed viral RNA replication and not entry or translation as the crucial step requiring the OST complex. Moreover, we show that viral non-structural proteins bind to the OST complex. The identified ER-associated protein complexes were also important for infection by other mosquito-borne flaviviruses including Zika virus, an emerging pathogen causing severe birth defects. By contrast, the most significant genes identified in the HCV screen were distinct and included viral receptors, RNA-binding proteins and enzymes involved in metabolism. We found an unexpected link between intracellular flavin adenine dinucleotide (FAD) levels and HCV replication. 
This study shows notable divergence in host-dependency factors between DENV and HCV, and illuminates new host targets for antiviral therapy.

Abstract

Endogenous intestinal microbiota have wide-ranging and largely uncharacterized effects on host physiology. Here, we used reverse-phase liquid chromatography-coupled tandem mass spectrometry to define the mouse intestinal proteome in the stomach, jejunum, ileum, cecum and proximal colon under three colonization states: germ-free (GF), monocolonized with Bacteroides thetaiotaomicron and conventionally raised (CR). Our analysis revealed distinct proteomic abundance profiles along the gastrointestinal (GI) tract. Unsupervised clustering showed that host protein abundance primarily depended on GI location rather than colonization state and specific proteins and functions that defined these locations were identified by random forest classifications. K-means clustering of protein abundance across locations revealed substantial differences in host protein production between CR mice relative to GF and monocolonized mice. Finally, comparison with fecal proteomic data sets suggested that the identities of stool proteins are not biased to any region of the GI tract, but are substantially impacted by the microbiota in the distal colon.

Abstract

The readout of histone modifications plays a critical role in chromatin-regulated processes. Dimethylation at Lys-36 on histone H3 (H3K36me2) is associated with actively transcribed genes, and global up-regulation of this modification is associated with several cancers. However, the molecular mechanism by which H3K36me2 is sensed and transduced to downstream biological outcomes remains unclear. Here we identify a PWWP domain within the histone lysine methyltransferase and oncoprotein NSD2 that preferentially binds to nucleosomes containing H3K36me2. In cells, the NSD2 PWWP domain interaction with H3K36me2 plays a role in stabilizing NSD2 at chromatin. Furthermore, NSD2's ability to induce global increases in H3K36me2 via its enzymatic activity, and consequently promote cellular proliferation, is compromised by mutations within the PWWP domain that specifically abrogate H3K36me2-recognition. Together, our results identify a pivotal role for NSD2 binding to its catalytic product in regulating its cellular functions, and suggest a model for how this interaction may facilitate epigenetic spreading and propagation of H3K36me2.

Abstract

Genomic technologies have significantly advanced our understanding of the composition and diversity of host-associated microbial populations. However, their spatial organization and functional interactions relative to the host have been more challenging to study. Here we present a pipeline for the assessment of intestinal microbiota localization within immunofluorescence images of fixed gut cross-sections that includes a flexible software package, BacSpace, for high-throughput quantification of microbial organization. Applying this pipeline to gnotobiotic and human microbiota-colonized mice, we demonstrate that elimination of microbiota-accessible carbohydrates (MACs) from the diet results in thinner mucus in the distal colon, increased proximity of microbes to the epithelium, and heightened expression of the inflammatory marker REG3β. Measurements of microbe-microbe proximity reveal that a MAC-deficient diet alters monophyletic spatial clustering. Furthermore, we quantify the invasion of Helicobacter pylori into the glands of the mouse stomach relative to host mitotic progenitor cells, illustrating the generalizability of this approach.

Abstract

The lysine methyltransferase (KMT) SETMAR is implicated in the response to and repair of DNA damage, but its molecular function is not clear. SETMAR has been associated with dimethylation of histone H3 lysine 36 (H3K36) at sites of DNA damage. However, SETMAR does not methylate H3K36 in vitro. This and the observation that SETMAR is not active on nucleosomes suggest that H3K36 methylation is not a physiologically relevant activity. To identify potential non-histone substrates, we utilized a strategy on the basis of quantitative proteomic analysis of methylated lysine. Our approach identified lysine 130 of the mRNA splicing factor snRNP70 as a SETMAR substrate in vitro, and we show that the enzyme primarily generates monomethylation at this position. Furthermore, we show that SETMAR methylates snRNP70 Lys-130 in cells. Because snRNP70 is a key early regulator of 5' splice site selection, our results suggest a model in which methylation of snRNP70 by SETMAR regulates constitutive and/or alternative splicing. In addition, the proteomic strategy described here is broadly applicable and is a promising route for large-scale mapping of KMT substrates.

Role of disease-associated tolerance in infectious superspreaders. Proceedings of the National Academy of Sciences of the United States of America. Gopinath, S., Lichtman, J. S., Bouley, D. M., Elias, J. E., Monack, D. M. 2014; 111 (44): 15780-15785

Abstract

Natural populations show striking heterogeneity in their ability to transmit disease. For example, a minority of infected individuals known as superspreaders carries out the majority of pathogen transmission events. In a mouse model of Salmonella infection, a subset of infected hosts become superspreaders, shedding high levels of bacteria (>10^8 cfu per g of feces) but remaining asymptomatic with a dampened systemic immune state. Here we show that superspreader hosts remain asymptomatic when they are treated with oral antibiotics. In contrast, nonsuperspreader Salmonella-infected hosts that are treated with oral antibiotics rapidly shed superspreader levels of the pathogen but display signs of morbidity. This morbidity is linked to an increase in inflammatory myeloid cells in the spleen followed by increased production of acute-phase proteins and proinflammatory cytokines. The degree of colonic inflammation is similar in antibiotic-treated superspreader and nonsuperspreader hosts, indicating that the superspreader hosts are tolerant of antibiotic-mediated perturbations in the intestinal tract. Importantly, neutralization of acute-phase proinflammatory cytokines in antibiotic-induced superspreaders suppresses the expansion of inflammatory myeloid cells and reduces morbidity. We describe a unique disease-associated tolerance to oral antibiotics in superspreaders that facilitates continued transmission of the pathogen.

Abstract

Overexpression of cyclin D1 and its catalytic partner, CDK4, is frequently seen in human cancers. We constructed a cyclin D1 and CDK4 protein interaction network in the human breast cancer cell line MCF7, and identified novel CDK4 protein partners. Among CDK4 interactors we observed several proteins functioning in protein folding and in complex assembly. One of the novel partners of CDK4 is FKBP5, which we found to be required to maintain CDK4 levels in cancer cells. An integrative analysis of the extended cyclin D1 cancer interactome and somatic copy number alterations in human cancers identified BAIAP2L1 as a potential novel human oncogene. We observed that in several human tumor types BAIAP2L1 is expressed at higher levels than in normal tissue. Forced overexpression of BAIAP2L1 augmented anchorage-independent growth, increased colony formation by cancer cells and strongly enhanced the ability of cells to form tumors in vivo. Lastly, we derived an Aggregate Expression Score (AES), which quantifies the expression of all cyclin D1 interactors in a given tumor. We observed that AES has prognostic value among patients with ER-positive breast cancers. These studies illustrate the utility of analyzing the interactomes of proteins involved in cancer to uncover potential oncogenes, or to allow better cancer prognosis.

Abstract

The ubiquitin-proteasome system (UPS) is a potential pathway for therapeutic intervention for pathogens such as Plasmodium, the causative agent of malaria. However, due to the essential nature of this proteolytic pathway, proteasome inhibitors must avoid inhibition of the host enzyme complex to prevent toxic side effects. The Plasmodium proteasome is poorly characterized, making rational design of inhibitors that induce selective parasite killing difficult. In this study, we developed a chemical probe that labels all catalytic sites of the Plasmodium proteasome. Using this probe, we identified several subunit-selective small molecule inhibitors of the parasite enzyme complex. Treatment with an inhibitor that is specific for the β5 subunit during blood stage schizogony led to a dramatic decrease in parasite replication while short-term inhibition of the β2 subunit did not affect viability. Interestingly, coinhibition of both the β2 and β5 catalytic subunits resulted in enhanced parasite killing at all stages of the blood stage life cycle and reduced parasite levels in vivo to barely detectable levels. Parasite killing was achieved with overall low host toxicity, something that has not been possible with existing proteasome inhibitors. Our results highlight differences in the subunit dependency of the parasite and human proteasome, thus providing a strategy for development of potent antimalarial drugs with overall low host toxicity.

Abstract

Calcium-dependent protein kinases (CDPKs) are conserved in plants and apicomplexan parasites. In Toxoplasma gondii, TgCDPK3 regulates parasite egress from the host cell in the presence of a calcium-ionophore. The targets and the pathways that the kinase controls, however, are not known. To identify pathways regulated by TgCDPK3, we measured relative phosphorylation site usage in wild type and TgCDPK3 mutant and knock-out parasites by quantitative mass-spectrometry using stable isotope-labeling with amino acids in cell culture (SILAC). This revealed known and novel phosphorylation events on proteins predicted to play a role in host-cell egress, but also a novel function of TgCDPK3 as an upstream regulator of other calcium-dependent signaling pathways, as we also identified proteins that are differentially phosphorylated prior to egress, including proteins important for ion-homeostasis and metabolism. This observation is supported by the observation that basal calcium levels are increased in parasites where TgCDPK3 has been inactivated. Most of the differential phosphorylation observed in CDPK3 mutants is rescued by complementation of the mutants with a wild type copy of TgCDPK3. Lastly, the TgCDPK3 mutants showed hyperphosphorylation of two targets of a related calcium-dependent kinase (TgCDPK1), as well as TgCDPK1 itself, indicating that this latter kinase appears to play a role downstream of TgCDPK3 function. Overexpression of TgCDPK1 partially rescues the egress phenotype of the TgCDPK3 mutants, reinforcing this conclusion. These results show that TgCDPK3 plays a pivotal role in regulating tachyzoite functions including, but not limited to, egress.

Abstract

The MRN (MRE11-RAD50-NBS1) complex has been implicated in many aspects of the DNA damage response. It has key roles in sensing and processing DNA double-strand breaks, as well as in activation of ATM (ataxia telangiectasia mutated). We reveal a function for MRN in ATR (ATM- and RAD3-related) activation by using defined ATR-activating DNA structures in Xenopus egg extracts. Strikingly, we demonstrate that MRN is required for recruitment of TOPBP1 to an ATR-activating structure that contains a single-stranded DNA (ssDNA) and a double-stranded DNA (dsDNA) junction and that this recruitment is necessary for phosphorylation of CHK1. We also show that the 911 (RAD9-RAD1-HUS1) complex is not required for TOPBP1 recruitment but is essential for TOPBP1 function. Thus, whereas MRN is required for TOPBP1 recruitment at an ssDNA-to-dsDNA junction, 911 is required for TOPBP1 "activation." These findings provide molecular insights into how ATR is activated.

Abstract

Cyclin D1 belongs to the core cell cycle machinery, and it is frequently overexpressed in human cancers. The full repertoire of cyclin D1 functions in normal development and oncogenesis is unclear at present. Here we developed Flag- and haemagglutinin-tagged cyclin D1 knock-in mouse strains that allowed a high-throughput mass spectrometry approach to search for cyclin D1-binding proteins in different mouse organs. In addition to cell cycle partners, we observed several proteins involved in transcription. Genome-wide location analyses (chromatin immunoprecipitation coupled to DNA microarray; ChIP-chip) showed that during mouse development cyclin D1 occupies promoters of abundantly expressed genes. In particular, we found that in developing mouse retinas (an organ that critically requires cyclin D1 function), cyclin D1 binds the upstream regulatory region of the Notch1 gene, where it serves to recruit CREB binding protein (CBP) histone acetyltransferase. Genetic ablation of cyclin D1 resulted in decreased CBP recruitment and decreased histone acetylation of the Notch1 promoter region, and led to decreased levels of the Notch1 transcript and protein in cyclin D1-null (Ccnd1(-/-)) retinas. Transduction of an activated allele of Notch1 into Ccnd1(-/-) retinas increased proliferation of retinal progenitor cells, indicating that upregulation of Notch1 signalling alleviates the phenotype of cyclin D1-deficiency. These studies show that in addition to its well-established cell cycle roles, cyclin D1 has an in vivo transcriptional function in mouse development. Our approach, which we term 'genetic-proteomic', can be used to study the in vivo function of essentially any protein.

Abstract

Accurate and precise methods for estimating incorrect peptide and protein identifications are crucial for effective large-scale proteome analyses by tandem mass spectrometry. The target-decoy search strategy has emerged as a simple, effective tool for generating such estimations. This strategy is based on the premise that obvious, necessarily incorrect "decoy" sequences added to the search space will correspond with incorrect search results that might otherwise be deemed to be correct. With this knowledge, it is possible not only to estimate how many incorrect results are in a final data set but also to use decoy hits to guide the design of filtering criteria that sensitively partition a data set into correct and incorrect identifications.
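The estimation step described above can be sketched in a few lines. This is a minimal illustration, not the published implementation: it assumes a concatenated target-decoy search in which each decoy hit above the threshold stands in for one undetected incorrect target hit, so the estimated number of incorrect matches is twice the decoy count. The function names and the `(score, is_decoy)` tuple layout are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

PSM = Tuple[float, bool]  # (search score, True if the match is to a decoy sequence)


def estimate_fdr(psms: Sequence[PSM], threshold: float) -> float:
    """Estimate the fraction of incorrect matches among PSMs scoring >= threshold.

    Assumes a concatenated target-decoy search, where incorrect matches are
    expected to fall on target and decoy sequences equally often.
    """
    accepted: List[bool] = [is_decoy for score, is_decoy in psms if score >= threshold]
    if not accepted:
        return 0.0
    n_decoy = sum(accepted)
    # Each decoy hit implies roughly one incorrect target hit as well.
    return min(1.0, 2.0 * n_decoy / len(accepted))


def loosest_threshold(psms: Sequence[PSM], max_fdr: float) -> Optional[float]:
    """Return the lowest score cutoff whose estimated FDR is <= max_fdr, if any."""
    for cutoff in sorted({score for score, _ in psms}):
        if estimate_fdr(psms, cutoff) <= max_fdr:
            return cutoff
    return None
```

Scanning candidate cutoffs in this way is one simple form of using decoy hits to guide filtering criteria; real pipelines typically filter on several score dimensions at once.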

Abstract

Beta-site APP cleaving enzyme 1 (BACE1) is a transmembrane aspartyl protease with a lumenal active site that sheds the ectodomains of membrane proteins through juxtamembrane proteolysis. BACE1 has been studied principally for its role in Alzheimer's disease as the beta-secretase responsible for generating the amyloid-beta protein. Emerging evidence from mouse models has identified the importance of BACE1 in myelination and cognitive performance. However, the substrates that BACE1 processes to regulate these functions are unknown, and to date only a few beta-secretase substrates have been identified through candidate-based studies. Using an unbiased approach to substrate identification, we performed quantitative proteomic analysis of two human epithelial cell lines stably expressing BACE1 and identified 68 putative beta-secretase substrates, a number of which we validated in a cell culture system. The vast majority were of type I transmembrane topology, although one was type II and three were GPI-linked proteins. Intriguingly, a preponderance of these proteins are involved in contact-dependent intercellular communication or serve as receptors and have recognized roles in the nervous system and other organs. No consistent sequence motif predicting BACE1 cleavage was identified in substrates versus non-substrates. These findings expand our understanding of the proteins and cellular processes that BACE1 may regulate, and suggest possible mechanisms of toxicity arising from chronic BACE1 inhibition.

Abstract

Recently, mass spectrometry has been employed in many studies to provide unbiased, reproducible, and quantitative protein abundance information on a proteome-wide scale. However, how instruments' limited dynamic ranges impact the accuracy of such measurements has remained largely unexplored, especially in the context of complex mixtures. Here, we examined the distribution of peptide signal versus background noise (S/N) and its correlation with quantitative accuracy. With the use of metabolically labeled Jurkat cell lysate, over half of all confidently identified peptides had S/N ratios less than 10 when examined using both hybrid linear ion trap-Fourier transform ion cyclotron resonance and Orbitrap mass spectrometers. Quantification accuracy was also highly correlated with S/N. We developed a mass precision algorithm that significantly reduced measurement variance at low S/N beyond the use of highly accurate mass information alone and expanded it into a new software suite, Vista. We also evaluated the interplay between mass measurement accuracy and S/N; finding a balance between both parameters produced the greatest identification and quantification rates. Finally, we demonstrate that S/N can be a useful surrogate for relative abundance ratios when only a single species is detected.

Abstract

The presenilin/gamma-secretase complex, an unusual intramembrane aspartyl protease, plays an essential role in cellular signaling and membrane protein turnover. Its ability to liberate numerous intracellular signaling proteins from the membrane and also mediate the secretion of amyloid-beta protein (Abeta) has made modulation of gamma-secretase activity a therapeutic goal for cancer and Alzheimer disease. Although the proteolysis of the prototypical substrates Notch and beta-amyloid precursor protein (APP) has been intensely studied, the full spectrum of substrates and the determinants that make a transmembrane protein a substrate remain unclear. Using an unbiased approach to substrate identification, we surveyed the proteome of a human cell line for targets of gamma-secretase and found a relatively small population of new substrates, all of which are type I transmembrane proteins but have diverse biological roles. By comparing these substrates to type I proteins not regulated by gamma-secretase, we determined that besides a short ectodomain, gamma-secretase requires permissive transmembrane and cytoplasmic domains to bind and cleave its substrates. In addition, we provide evidence for at least two mechanisms that can target a substrate for gamma cleavage: one in which a substrate with a short ectodomain is directly cleaved independent of sheddase association, and a second where a substrate requires ectodomain shedding to instruct subsequent gamma-secretase processing. These findings expand our understanding of the mechanisms of substrate selection as well as the diverse cellular processes to which gamma-secretase contributes.

Abstract

Activity-based protein profiling has emerged as a valuable technology for labeling, enriching, and assessing protein activities from complex mixtures. This is primarily accomplished via a two-step identification and quantification process. Here we show a highly quantitative and streamlined method, termed catch-and-release activity profiling of enzymes (CAPE), which reduces this procedure to a single step. Furthermore the CAPE approach has the ability to detect small quantitative changes that may have been missed by alternative mass spectrometry-based techniques.

Abstract

Protein phosphorylation is essential for numerous cellular processes. Large-scale profiling of phosphoproteins continues to enhance the depth and speed at which we understand these processes. The development of effective phosphoprotein and peptide enrichment techniques and improvements to mass spectrometric instrumentation have intensified phosphoproteomic research in recent years, leading to unprecedented achievements. Here, we describe a large-scale phosphorylation analysis of alpha-factor-arrested yeast. Using a multidimensional separation strategy involving preparative SDS-PAGE for prefractionation, in-gel digestion with trypsin, and immobilized metal affinity chromatography (IMAC) enrichment of phosphopeptides, followed by LC-MS/MS analysis employing a hybrid LTQ-Orbitrap mass spectrometer, we were able to catalog a substantial portion of the phosphoproteins present in yeast whole-cell lysate. This analysis yielded the confident identification of 2288 nonredundant phosphorylation sites from 985 proteins. The ambiguity score (Ascore) algorithm was utilized to determine the certainty of site localization for the entire data set. In addition, the size of the data set permitted extraction of known and novel kinase motifs using the Motif-X algorithm. Finally, a large number of members of the pheromone signaling pathway were found as phosphoproteins and are discussed.

Abstract

The relative quantification of protein expression levels in different cell samples through the utilization of stable isotope dilution has become a standard method in the field of proteomics. We describe here the development of a new reductively cleavable reagent which facilitates the relative quantification of thousands of proteins from only tens of micrograms of starting protein. The ligand features a novel disulfide moiety that links biotin and a thiol-reactive entity. The disulfide is stable to reductive conditions employed during sample labeling but is readily cleaved under mild conditions using tris(2-carboxyethyl)phosphine (TCEP). This unique chemical property allows for the facile use of immobilized avidin in a manner equivalent to the use of conventional reversible-binding affinity resins. Target peptides are bound to avidin resin, washed rigorously, then cleaved directly from the resin, resulting in simplified sample handling procedures and reduced nonspecific interactions. Here we demonstrate the stability of the linker under two different reducing conditions and show how this "catch-and-release (CAR)" reagent can be used to quantitatively compare protein abundances from two distinct cellular lysates. Starting with only 40 µg of protein from each sample, 1840 individual proteins were identified in a single experiment. Using in-house software for automated peak integration, 1620 of these proteins were quantified for differential expression.

Abstract

Mass spectrometers that provide high mass accuracy such as FT-ICR instruments are increasingly used in proteomic studies. Although the importance of accurately determined molecular masses for the identification of biomolecules is generally accepted, its role in the analysis of shotgun proteomic data has not been thoroughly studied. To gain insight into this role, we used a hybrid linear quadrupole ion trap/FT-ICR (LTQ FT) mass spectrometer for LC-MS/MS analysis of a highly complex peptide mixture derived from a fraction of the yeast proteome. We applied three data-dependent MS/MS acquisition methods. The FT-ICR part of the hybrid mass spectrometer was either not exploited, used only for survey MS scans, or also used for acquiring selected ion monitoring scans to optimize mass accuracy. MS/MS data were assigned with the SEQUEST algorithm, and peptide identifications were validated by estimating the number of incorrect assignments using the composite target/decoy database search strategy. We developed a simple mass calibration strategy exploiting polydimethylcyclosiloxane background ions as calibrant ions. This strategy allowed us to substantially improve mass accuracy without reducing the number of MS/MS spectra acquired in an LC-MS/MS run. The benefits of high mass accuracy were greatest for assigning MS/MS spectra with low signal-to-noise ratios and for assigning phosphopeptides. Confident peptide identification rates from these data sets could be doubled by the use of mass accuracy information. It was also shown that improving mass accuracy at a cost to the MS/MS acquisition rate substantially lowered the sensitivity of LC-MS/MS analyses. The use of FT-ICR selected ion monitoring scans to maximize mass accuracy reduced the number of protein identifications by 40%.
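For readers unfamiliar with how mass accuracy is expressed and used as a filter, the following is a minimal sketch. It is not the calibration procedure from the study above: it assumes a simple single-point proportional correction based on one background calibrant ion of known m/z observed in the same scan, and all function and parameter names are hypothetical.

```python
def mass_error_ppm(observed_mz: float, theoretical_mz: float) -> float:
    """Relative mass error in parts per million (positive means measured too high)."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def recalibrate(observed_mz: float,
                calibrant_observed_mz: float,
                calibrant_true_mz: float) -> float:
    """Rescale an observed m/z by the relative error seen on a calibrant ion.

    Assumption: the error is purely proportional to m/z within the scan.
    """
    return observed_mz * calibrant_true_mz / calibrant_observed_mz


def within_tolerance(observed_mz: float, theoretical_mz: float, tol_ppm: float) -> bool:
    """True if the candidate's theoretical m/z lies within +/- tol_ppm of the measurement."""
    return abs(mass_error_ppm(observed_mz, theoretical_mz)) <= tol_ppm
```

A tighter `tol_ppm` excludes more incorrect candidate peptides before scoring, which is one way accurate mass information can raise confident identification rates.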

Abstract

The primary goal of proteomics is to gain a better understanding of biological function at the protein expression level. As the field matures, numerous technologies are being developed to aid in the identification, quantification and characterization of protein expression and post-translational modifications on a near-global scale. Stable isotope labeling by amino acids in cell culture is one such technique that has shown broad biological applications. While we have recently shown the application of this technology to a model of metastatic prostate cancer, we now report a substantial improvement in quantitative analysis using a linear ion-trap Fourier transform ion cyclotron resonance mass spectrometer (LTQ FT) and novel quantification software. This resulted in the quantification of nearly 1400 proteins, a greater than 3-fold increase in comparison to our earlier study. This dramatic increase in proteome coverage can be attributed to (1) use of a double-labeling strategy, (2) greater sensitivity, speed and mass accuracy provided by the LTQ FT mass spectrometer, and (3) more robust quantification software. Finally, by using a concatenated target/decoy protein database for our peptide searches, we now report these data in the context of an estimated false-positive rate of one percent.

Abstract

Major analytical challenges encountered by shotgun proteome analysis include both the diversity and dynamic range of protein expression. Often new instrumentation can provide breakthroughs in areas where other analytical improvements have not been successful. In the current study, we utilized new instrumentation (LTQ FT) to characterize complex protein samples by shotgun proteomics. Proteomic analyses were performed on murine spleen tissue separated by magnetic beads into distinct CD45- and CD45+ cell populations. Using shotgun protein analysis we identified approximately 2,000 proteins per cell group by over 12,000 peptides with mass deviations of less than 4.5 ppm. Datasets obtained by LTQ FT analysis provided a significant increase in the number of proteins identified and greater confidence in those identifications and improved reproducibility in replicate analyses. Because CD45- and not CD45+ cells are able to regenerate functional pancreatic islet cells in a mouse model of type I diabetes, protein expression was further compared by a subtractive proteomic approach in search of an exclusive protein expression profile in CD45- cells. Characterization of the proteins exclusively identified in CD45- cells was performed using gene ontology terms via the Javascript GoMiner. The CD45- cell subset readily revealed proteins involved in development, suggesting the persistence of a fetal stem cell in an adult animal.

Abstract

Mutations in the DJ-1 gene cause early-onset autosomal recessive Parkinson's disease (PD), although the role of DJ-1 in the degeneration of dopaminergic neurons is unresolved. Here we show that the major interacting-proteins with DJ-1 in dopaminergic neuronal cells are the nuclear proteins p54nrb and pyrimidine tract-binding protein-associated splicing factor (PSF), two multifunctional regulators of transcription and RNA metabolism. PD-associated DJ-1 mutants exhibit decreased nuclear distribution and increased mitochondrial localization, resulting in diminished co-localization with co-activator p54nrb and repressor PSF. Unlike pathogenic DJ-1 mutants, wild-type DJ-1 acts to inhibit the transcriptional silencing activity of the PSF. In addition, the transcriptional silencer PSF induces neuronal apoptosis, which can be reversed by wild-type DJ-1 but to a lesser extent by PD-associated DJ-1 mutants. DJ-1-specific small interfering RNA sensitizes cells to PSF-induced apoptosis. Both DJ-1 and p54nrb block oxidative stress and mutant alpha-synuclein-induced cell death. Thus, DJ-1 is a neuroprotective transcriptional co-activator that may act in concert with p54nrb and PSF to regulate the expression of a neuroprotective genetic program. Mutations that impair the transcriptional co-activator function of DJ-1 render dopaminergic neurons vulnerable to apoptosis and may contribute to the pathogenesis of PD.

Abstract

Proper centrosome duplication and spindle formation are crucial for prevention of chromosomal instability, and BRCA1 plays a role in this process. In this study, transient inhibition of BRCA1 function in cell lines derived from mammary tissue caused rapid amplification and fragmentation of centrosomes. Cell lines tested that were derived from nonmammary tissues did not amplify the centrosome number in this transient assay. We tested whether BRCA1 and its binding partner, BARD1, ubiquitinate centrosome proteins. Results showed that centrosome components, including gamma-tubulin, are ubiquitinated by BRCA1/BARD1 in vitro. The in vitro ubiquitination of gamma-tubulin was specific, and function of the carboxy terminus was necessary for this reaction; truncated BRCA1 did not ubiquitinate gamma-tubulin. BRCA1/BARD1 ubiquitinated lysines 48 and 344 of gamma-tubulin in vitro, and expression in cells of gamma-tubulin K48R caused a marked amplification of centrosomes. This result supports the notion that the modification of these lysines in living cells is critical in the maintenance of centrosome number. One of the key problems in understanding the biology of BRCA1 has been the identification of a specific target of BRCA1/BARD1 ubiquitination and its effect on mammary cell biology. The results of this study identify a ubiquitination target and suggest a biological impact important in the etiology of breast cancer.

Abstract

Determining the site of a regulatory phosphorylation event is often essential for elucidating specific kinase-substrate relationships, providing a handle for understanding essential signaling pathways and ultimately allowing insights into numerous disease pathologies. Despite intense research efforts to elucidate mechanisms of protein phosphorylation regulation, efficient, large-scale identification and characterization of phosphorylation sites remains an unsolved problem. In this report we describe an application of existing technology for the isolation and identification of phosphorylation sites. By using a strategy based on strong cation exchange chromatography, phosphopeptides were enriched from the nuclear fraction of HeLa cell lysate. From 967 proteins, 2,002 phosphorylation sites were determined by tandem MS. This unprecedentedly large collection of sites permitted a detailed accounting of known and unknown kinase motifs and substrates.

Abstract

Tandem mass spectrometry is commonly used to identify peptides (and thereby proteins) that are present in complex mixtures. Peptide identification from tandem mass spectra is partially automated, but still requires human curation to resolve "borderline" peptide-spectrum matches (PSMs). SILVER is web-based software that assists manual curation of tandem mass spectra, using a recently developed intensity-based machine-learning approach to scoring PSMs (Elias et al.). In this method, a large training set of peptide, fragment, and peak-intensity properties for both matched and mismatched PSMs was used to develop a score measuring consistency between each predicted fragment ion of a candidate peptide and its corresponding observed spectral peak intensity. The SILVER interface provides a visual representation of match quality between each candidate fragment ion and the observed spectrum, thereby expediting manual curation of tandem mass spectra. SILVER is available online at http://llama.med.harvard.edu/Software.html.

Abstract

Tandem mass spectrometry (MS/MS) has emerged as a cornerstone of proteomics owing in part to robust spectral interpretation algorithms. Widely used algorithms do not fully exploit the intensity patterns present in mass spectra. Here, we demonstrate that intensity pattern modeling improves peptide and protein identification from MS/MS spectra. We modeled fragment ion intensities using a machine-learning approach that estimates the likelihood of observed intensities given peptide and fragment attributes. From 1,000,000 spectra, we chose 27,000 with high-quality, nonredundant matches as training data. Using the same 27,000 spectra, intensity was similarly modeled with mismatched peptides. We used these two probabilistic models to compute the relative likelihood of an observed spectrum given that a candidate peptide is matched or mismatched. We used a 'decoy' proteome approach to estimate incorrect match frequency, and demonstrated that an intensity-based method reduces peptide identification error by 50-96% without any loss in sensitivity.
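The relative-likelihood idea can be illustrated with a short sketch. It is a simplification under stated assumptions, not the published model: it takes as given the per-fragment probabilities that two already-trained models (matched and mismatched) assign to the observed intensities, and it treats fragments as independent so that log-probabilities can be summed. The function name and inputs are hypothetical.

```python
import math
from typing import Sequence


def log_likelihood_ratio(p_matched: Sequence[float],
                         p_mismatched: Sequence[float]) -> float:
    """Log-odds that a spectrum arose from a correct rather than incorrect match.

    p_matched[i] / p_mismatched[i] are the probabilities of the observed
    intensity of fragment i under the matched / mismatched model.
    Assumption: fragments are scored independently (a naive simplification).
    """
    if len(p_matched) != len(p_mismatched):
        raise ValueError("both models must score the same fragments")
    if any(p <= 0.0 for p in list(p_matched) + list(p_mismatched)):
        raise ValueError("probabilities must be strictly positive")
    return sum(math.log(pm) - math.log(pu)
               for pm, pu in zip(p_matched, p_mismatched))
```

A positive value favors the matched model and a negative value favors the mismatched one; a score of this kind can then be thresholded using decoy hits to estimate the incorrect match frequency.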

Abstract

There is a growing need for techniques that can identify and characterize protein modifications on a large or global scale. We report here a proteomics approach to enrich, recover, and identify ubiquitin conjugates from Saccharomyces cerevisiae lysate. Ubiquitin conjugates from a strain expressing 6xHis-tagged ubiquitin were isolated, proteolyzed with trypsin and analyzed by multidimensional liquid chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for amino acid sequence determination. We identified 1,075 proteins from the sample. In addition, we detected 110 precise ubiquitination sites present in 72 ubiquitin-protein conjugates. Finally, ubiquitin itself was found to be modified at seven lysine residues providing evidence for unexpected diversity in polyubiquitin chain topology in vivo. The methodology described here provides a general tool for the large-scale analysis and characterization of protein ubiquitination.

Abstract

Highly complex protein mixtures can be directly analyzed after proteolysis by liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS). In this paper, we have utilized the combination of strong cation exchange (SCX) and reversed-phase (RP) chromatography to achieve two-dimensional separation prior to MS/MS. One milligram of whole yeast protein was proteolyzed and separated by SCX chromatography (2.1 mm i.d.) with fraction collection every minute during an 80-min elution. Eighty fractions were reduced in volume and then re-injected via an autosampler in an automated fashion using a vented-column (100 µm i.d.) approach for RP-LC-MS/MS analysis. More than 162,000 MS/MS spectra were collected with 26,815 matched to yeast peptides (7,537 unique peptides). A total of 1,504 yeast proteins were unambiguously identified in this single analysis. We present a comparison of this experiment with a previously published yeast proteome analysis by Yates and colleagues (Washburn, M. P.; Wolters, D.; Yates, J. R., III. Nat. Biotechnol. 2001, 19, 242-7). In addition, we report an in-depth analysis of the false-positive rates associated with peptide identification using the Sequest algorithm and a reversed yeast protein database. New criteria are proposed to decrease false-positives to less than 1% and to greatly reduce the need for manual interpretation while permitting more proteins to be identified.
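A reversed protein database of the kind mentioned above can be built with very little code. The sketch below is illustrative only: it assumes protein sequences are held in a name-to-sequence dictionary and marks decoys with a hypothetical `REV_` prefix; it does not reproduce the exact database construction used in the study.

```python
from typing import Dict


def reverse_decoys(proteins: Dict[str, str], prefix: str = "REV_") -> Dict[str, str]:
    """Create one decoy per protein by reversing its amino acid sequence."""
    return {prefix + name: sequence[::-1] for name, sequence in proteins.items()}


def with_decoys(proteins: Dict[str, str], prefix: str = "REV_") -> Dict[str, str]:
    """Return a combined search space holding both target and reversed entries."""
    combined = dict(proteins)
    combined.update(reverse_decoys(proteins, prefix))
    return combined
```

Because reversed sequences keep the length and amino acid composition of the originals, matches to `REV_` entries give a rough count of how often the search algorithm assigns a spectrum incorrectly at a given set of score criteria.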