Peptide mass maps: a highly informative approach to protein identification

J R Yates 3rd; S Speicher; P R Griffin; T Hunkapiller

doi:10.1006/abio.1993.1514

Peptide mass maps: a highly informative approach to protein identification

Anal Biochem. 1993 Nov 1;214(2):397-408. doi: 10.1006/abio.1993.1514.

Authors

J R Yates 3rd¹, S Speicher, P R Griffin, T Hunkapiller

Affiliation

¹ Department of Molecular Biotechnology, School of Medicine, University of Washington, Seattle 98195.

PMID: 8109726
DOI: 10.1006/abio.1993.1514

Abstract

A computer searching algorithm has been used to identify protein sequences in the Protein Information Resource (PIR) database with peptide mass information (mass map) obtained from proteolytic digests of proteins analyzed by microcapillary high-performance liquid chromatography electrospray ionization mass spectrometry. A theoretical analysis of the cytochrome c family demonstrates the ability to identify protein sequences in the PIR database with a high degree of accuracy using a set of six predicted tryptic peptide masses. This method was also applied to experimentally determined peptide masses for a small GTP-binding protein, a protein from pig uterus, the human sex steroid binding protein, and a thermostable DNA polymerase. The results demonstrate that a set of observed masses which is less than 50% of the total number of predicted masses can be used to identify a protein sequence in the database. For the analysis presented in this paper, a mass matching tolerance of 1 amu is used. Under these conditions, mass maps created by fast atom bombardment mass spectrometry and matrix-assisted laser desorption time-of-flight would also be applicable. In cases where multiple matches are observed or verification of the protein identification is needed, tandem mass spectrometry sequencing can be used to establish sequence similarity.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Amino Acid Sequence
Cytochrome c Group / chemistry*
Databases, Factual
Mass Spectrometry
Molecular Sequence Data
Molecular Weight
Peptide Mapping / methods*

Substances

Cytochrome c Group