Skip to main content
. Author manuscript; available in PMC: 2012 Feb 1.
Published in final edited form as: J Proteomics. 2010 Nov 3;74(2):199–211. doi: 10.1016/j.jprot.2010.10.005

Table 3.

Two sequences are used to demonstrate the structure of our processed information file. The text line after the “>” symbol contains accession numbers associated with the sequence. The other rows each contains six entries separated by tabs. The first column indicates the residue position. The second column indicates the modified residue(s) that can occur at the position specified in the first column. The third column, labeled by either SAP or PTM, indicates the modification type. The fifth column contains the accession number of the source of modification, this may be a protein sequence or mRNA. When the source of modification is obtained from proteotypic consensus peptide within a spectral library, the fifth column shows the proteotypic peptide instead. The fourth column explains the nature of the modification; a lower case letter indicates residue content in the source sequence, the upper case letter indicates the modified residue in the variant sequence. The notation, vI, indicates the source sequence with amino acid V can change into I, ie, a SAP. The notation, gT CA, is a short hand for codon change from gtc to atc, ie, a SNP that changes the coded amino acid from V to I as well. The sixth column contains additional information for the fourth column. It may include disease information, database entry index, or spectral indices when the modification is associated with a proteotypic peptide. As an example, the post-translational modification (M06) at position 59 of the second sequence is observed in the following proteotypic peptides: PDETM06VIGNYR, CFIEEIPDETM06VIGNYR, FHIGETEKKC-FIEEIPDETM06VIGNYR and CFIEEIPDETM06VIGNYR. The first three are obtained from spectra of GPM (human_cmp_20) spectral library with spectral indices 134010, 442918, 442918, 442920, and 589710, while the fourth one came from the NIST (#NIST_human_IT_v3.0) spectral library with spectral indices 25094 and 25102.

>NP_775259,NM_173167,Q8IWX7
60 I00 SAP gTC→A ∣ NM 173167 ∣ dbSNP:16970659
60 I00 SAP v→I Q8IWX7 dbSNP:16970659, FTId=VAR 027506
199 V00 SAP GcA→T ∣ NM 173167 ∣ dbSNP:35749208
377 R00 SAP AaG→G ∣ NM 173167 ∣ dbSNP:41389545
496 H00 SAP d→H Q8IWX7 (breast cancer). FTId=VAR 035870
>NP_059980,NM_017510
55 Q00 SAP CcG→A ∣ NM 017510 ∣ #dbSNP:11545866
59 M06 PTM m Hydroxylated PDETM06VIGNYR #human_cmp_20:134010
59 M06 PTM m Hydroxylated CFIEEIPDETM06VIGNYR #human_cmp_20:442918,442919,442920
59 M06 PTM m Hydroxylated FHIGETEKKCFIEEIPDETM06VIGNYR #human_cmp_20:589710
59 M06 PTM m Hydroxylated CFIEEIPDETM06VIGNYR #NIST_human_IT_v3.0:25094,25102
77 Q07 PTM q Deamidated EEYQ07PATPGLGMFVEVKDPEDK #human_cmp_20:524138
78 Q00 SAP CcG→ A ∣ NM 017510 ∣ #dbSNP:11545867
85 M06 PTM m Hydroxylated EEYQPATPGLGM06FVEVK #human_cmp_20:393423,393424,393425
85 M06 PTM m Hydroxylated EEYQPATPGLGM06FVEVK #NIST_human_IT_v3.0:50099