We implemented negative electron-transfer dissociation (NETD) on a hybrid ion trap/Orbitrap mass spectrometer to conduct ion/ion reactions using peptide anions and radical reagent cations. In addition to sequence-informative ladders of... more
We implemented negative electron-transfer dissociation (NETD) on a hybrid ion trap/Orbitrap mass spectrometer to conduct ion/ion reactions using peptide anions and radical reagent cations. In addition to sequence-informative ladders of a•- and x-type fragment ions, NETD generated intense neutral loss peaks corresponding to the entire or partial side-chain cleavage from amino acids constituting a given peptide. Thus, a critical step towards the characterization of this recently introduced fragmentation technique is a systematic study of synthetic peptides to identify common neutral losses and preferential fragmentation pathways. Examining 46 synthetic peptides with high mass accuracy and high resolution analysis permitted facile determination of the chemical composition of each neutral loss. We identified 19 unique neutral losses from 14 amino acids and three modified amino acids, and assessed the specificity and sensitivity of each neutral loss using a database of 1542 confidently identified peptides generated from NETD shotgun experiments employing high-pH separations and negative electrospray ionization. As residue-specific neutral losses indicate the presence of certain amino acids, we determined that many neutral losses have potential diagnostic utility. We envision this catalogue of neutral losses being incorporated into database search algorithms to improve peptide identification specificity and to further advance characterization of the acidic proteome.
Using a large set of high mass accuracy and resolution ETD tandem mass spectra, we characterized ETD-induced neutral losses. From these data we deduced the chemical formula for 20 of these losses. Many of them have been previously... more
Using a large set of high mass accuracy and resolution ETD tandem mass spectra, we characterized ETD-induced neutral losses. From these data we deduced the chemical formula for 20 of these losses. Many of them have been previously observed in electron-capture dissociation (ECD) spectra, such as losses of the side chains of arginine, aspartic acid, glutamic acid, glutamine, asparagine, leucine, histidine, and carbamidomethylated cysteine residues. With this information, we examined the diagnostic value of these amino acid-specific losses. Among 1285 peptide–spectrum matches, 92.5% have agreement between neutral loss-derived peptide amino acid composition and the peptide sequences. Moreover, we show that peptides can be uniquely identified by using only the accurate precursor mass and amino acid composition based on neutral losses; the median number of sequence candidates from an accurate mass query is reduced from 21 to 8 by adding side chain loss information. Besides increasing confidence in peptide identification, our findings suggest the potential use of these diagnostic losses in ETD spectra to improve false discovery rate estimation and to enhance the performance of scoring functions in database search algorithms.
Here we present the Coon OMSSA Proteomic Analysis Software Suite (COMPASS): a free and open-source software pipeline for high-throughput analysis of proteomics data, designed around the Open Mass Spectrometry Search Algorithm. We detail a... more
Here we present the Coon OMSSA Proteomic Analysis Software Suite (COMPASS): a free and open-source software pipeline for high-throughput analysis of proteomics data, designed around the Open Mass Spectrometry Search Algorithm. We detail a synergistic set of tools for protein database generation, spectral reduction, peptide false discovery rate analysis, peptide quantitation via isobaric labeling, protein parsimony and protein false discovery rate analysis, and protein quantitation. We strive for maximum ease of use, utilizing graphical user interfaces and working with data files in the original instrument vendor format. Results are stored in plain text comma-separated value files, which are easy to view and manipulate with a text editor or spreadsheet program. We illustrate the operation and efficacy of COMPASS through the use of two LC-MS/MS data sets. The first is a data set of a highly annotated mixture of standard proteins and manually validated contaminants that exhibits the identification workflow. The second is a data set of yeast peptides, labeled with isobaric stable isotope tags and mixed in known ratios, to demonstrate the quantitative workflow. For these two data sets, COMPASS performs equivalently or better than the current de facto standard, the Trans-Proteomic Pipeline.
Collision-activated dissociation and electron-transfer dissociation (ETD) each produce spectra containing unique features. Though several database search algorithms (e.g. SEQUEST, MASCOT, and Open Mass Spectrometry Search Algorithm) have... more
Collision-activated dissociation and electron-transfer dissociation (ETD) each produce spectra containing unique features. Though several database search algorithms (e.g. SEQUEST, MASCOT, and Open Mass Spectrometry Search Algorithm) have been modified to search ETD data, this consists chiefly of the ability to search for c- and z•-ions; additional ETD-specific features are often unaccounted for and may hinder identification. Removal of these features via spectral processing increased total search sensitivity by ∼20% for both human and yeast data sets; unique peptide identifications increased by ∼17% for the yeast data sets and ∼16% for the human data set.
Using both automated nanospray and online liquid chromatography mass spectrometry LC-MS strategies, 99 proteins have been newly identified by top-down tandem mass spectrometry (MS/MS) in Methanosarcina acetivorans, the methanogen with the... more
Using both automated nanospray and online liquid chromatography mass spectrometry LC-MS strategies, 99 proteins have been newly identified by top-down tandem mass spectrometry (MS/MS) in Methanosarcina acetivorans, the methanogen with the largest known genome [5.7 mega base pairs (Mb)] for an Archaeon. Because top-down MS/MS was used, 15 proteins were detected with mispredicted start sites along with an additional five from small open reading frames (SORFs). Beyond characterization of these more common discrepancies in genome annotation, one SORF resulted from a rare start codon (AUA) as the initiation site for translation of this protein. Also, a methylation on a 30S ribosomal protein (MA1259) was localized to Pro59–Val69, contrasting sharply from its homologue in Escherichia coli (rp S12) known to harbor an unusual β-thiomethylated aspartic acid residue.
Tandem mass spectra (MS/MS) produced using electron transfer dissociation (ETD) differ from those derived from collision-activated dissociation (CAD) in several important ways. Foremost, the predominant fragment ion series are different:... more
Tandem mass spectra (MS/MS) produced using electron transfer dissociation (ETD) differ from those derived from collision-activated dissociation (CAD) in several important ways. Foremost, the predominant fragment ion series are different: c- and z ·-type ions are favored in ETD spectra while b- and y-type ions comprise the bulk of the fragments in CAD spectra. Additionally, ETD spectra possess charge-reduced precursors and unique neutral losses. Most database search algorithms were designed to analyze CAD spectra, and have only recently been adapted to accommodate c- and z ·-type ions; therefore, inclusion of these additional spectral features can hinder identification, leading to lower confidence scores and decreased sensitivity. Because of this, it is important to pre-process spectral data before submission to a database search to remove those features that cause complications. Here, we demonstrate the effects of removing these features on the number of unique peptide identifications at a 1% false discovery rate (FDR) using the open mass spectrometry search algorithm (OMSSA). When analyzing two biologic replicates of a yeast protein extract in three total analyses, the number of unique identifications with a ∼1% FDR increased from 4611 to 5931 upon spectral pre-processing—an increase of ∼28. 6%. We outline the most effective pre-processing methods, and provide free software containing these algorithms.