Data set for the proteomics analysis of the endomembrane system from the unicellular Entamoeba histolytica

Doranda Perdomo; Nawel Aït-Ammar; Sylvie Syan; Martin Sachse; Gagan Deep Jhingan; Nancy Guillén

doi:10.1016/j.dib.2014.08.007

. 2014 Sep 3;1:29–36. doi: 10.1016/j.dib.2014.08.007

Data set for the proteomics analysis of the endomembrane system from the unicellular Entamoeba histolytica

Doranda Perdomo ^a,^b,^c, Nawel Aït-Ammar ^a,^b, Sylvie Syan ^a,^b, Martin Sachse ^d, Gagan Deep Jhingan ^e, Nancy Guillén ^a,^b,^⁎

PMCID: PMC4459567 PMID: 26217682

Abstract

Entamoeba histolytica is the protozoan parasite agent of amebiasis, an infectious disease of the human intestine and liver. This parasite contact and kills human cells by an active process involving pathogenic factors. Cellular traffic and secretion activities are poorly characterized in E. histolytica. In this work, we took advantage of a wide proteomic analysis to search for principal components of the endomembrane system in E. histolytica. A total of 5683 peptides matching with 1531 proteins (FDR of 1%) were identified which corresponds to roughly 20% of the total amebic proteome. Bioinformatics investigations searching for domain homologies (Smart and InterProScan programs) and functional descriptions (KEGG and GO terms) allowed this data to be organized into distinct categories. This data represents the first in-depth proteomics analysis of subcellular compartments in E. histolytica and allows a detailed map of vesicle traffic components in an ancient single-cell organism that lacks a stereotypical ER and Golgi apparatus to be established. The data are related to [1].

Data are supplied here and have been deposited to the open access library of ProteomeXchange Consortium (http://www.proteomexchange.org) via the PRIDE partner repository [2] with the dataset identifier PXD000770

Specificationstable

Subject area	Biology, parasitology
More specific subject area	Proteomics on the endomembrane system of Entamoeba histolytica
Type of data	Proteome Discoverer and Maxquant results (.txt) and list of identified proteins as tables (.xls)
How data was acquired	Liquid chromatography mass spectrometry in tandem (LC–MS/MS). Proteins from the internal membrane fraction of E. histolytica trophozoites were treated to obtain tryptic peptides. These were separated by HPLC coupled to an LTQ-Orbitrap Velos mass spectrometer (Thermo Fisher Scientific)
Data format	Raw and analyzed
Experimental factors	Non applied
Experimental features	Cell fractionation of E. histolytica to obtain enriched endomembrane proteins as described before [4] with some modifications. Samples were then prepared for liquid chromatography–mass spectrometry (LC–MS/MS) analysis. (Fig. 1)
Data source location	Paris, France. Institut Pasteur.
Data accessibility	Data are supplied here and have also been deposited to the open access library of ProteomeXchange Consortium (http://www.proteomexchange.org) via the PRIDE partner repository [2] with the dataset identifier PXD000770

Open in a new tab

Value of the data

•
First in-depth proteomics analysis of subcellular compartments in E. histolytic.
•
Proteomics characterization of the endomembrane network in E. histolytica.
•
Strong iBAQ intensity values from the protein spectra indicative of abundance and relevant to the construction of amebic intracellular trafficking components.

1. Data, experimental design, materials and methods

1.1. Preparation of samples for proteomics analysis

Proteins from the internal membrane fraction (50 µg) were precipitated with the methanol–chloroform method [3] and the resulting dried pellet was dissolved in freshly prepared digestion buffer (8 M urea in 25 mM NH₄HCO₃). Sample were reduced with 5 mM TCEP (45 min, 37 °C) and alkylated with 50 mm iodoacetamide (60 min, 37 °C) in the dark. Sample were diluted with 25 mM NH₄HCO₃ to a final concentration of 1 M urea and digested overnight at 37 °C with sequencing grade trypsin gold (1 µg, Promega USA). After digestion, peptide mixtures were acidified to pH 2.8 with formic acid and desalted with minispin C18 columns (Nestgrp, USA). Samples were dried under vacuum and solubilized in 0.1% formic acid and 2% acetonitrile before mass spectrometric analysis.

1.2. Liquid chromatography–mass spectrometry (LC–MS/MS) analysis of proteins

The tryptic peptide samples (1 µl roughly containing 1 µg) were separated by reverse-phase chromatography for each experiment via Thermo Scientific Proxeon nano LC using a C18 picofrit analytical column (360 μm OD, 75 μm ID, 10 μm tip, Magic C18 resin, 5 µm size, Newobjective, USA). The HPLC was coupled to an LTQ-Orbitrap Velos mass spectrometer (Thermo Fisher Scientific). Peptides were loaded onto the column with Buffer A (2% acetonitrile, 0.1% formic acid) and eluted with 120 min linear gradient from 2 to 40% buffer B (80% acetonitrile, 0.1% formic acid). After the gradient the column was washed with 90% buffer B and finally equilibrated with buffer A for next run. The mass spectra were acquired in the LTQ Orbitrap velos with full MS scan (RP 30,000) followed by 10 data-dependent MS/MS scans with detection of the fragment ions in the FTMS HCD mode (RP 7500). Target values were 1×10⁶ for full FT-MS scans and 5×10⁴ for FT-MS MSn scans. Ion selection threshold was set to 5000 counts.

1.3. Proteomic data analysis

Data analysis was performed using Thermo Proteome Discoverer software suite (version 1.4). For the search engine SEQUEST, the peptide precursor mass tolerance was set to 10 ppm, and fragment ion mass tolerance was set to 0.6 Da. Carbamidomethylation on cysteine residues was used as fixed modification, and oxidation of methionine along with N-terminal acetylation was used as variable modifications. Spectra were queried against the E. histolytica uniProt database. In order to improve the rate of peptide identifications percolator node in proteome discoverer was utilized with the false discovery rate (FDR) set to 1% for peptide and protein identifications. The identified protein list was further arranged in protein groups based on common peptide matches. For a comparative analysis of all the identified peptide and protein lists among the three biological replicates (the three internal membrane samples) a common merger table was generated and provided in Supplemental material 1, Table 1 (Sheet 1). All the individually sample specific protein groups and their corresponding peptide list are also presented in Table 1 (Sheets 2–6). A summary of the identified proteins and their corresponding functional category is represented in Fig. 2. For detail analysis, each category group of proteins is listed in Supplemental material 1, Table 2 (ER, Golgi apparatus, heat shock, TGN-ER retrograde transport, Endosomes and MVBs), Supplemental material 1, Table 3 (proteins with potential enzymatic activity associated to internal membranes), Supplemental material 1, Table 4 (GTPAses) and Supplemental material 1, Table 5 (possible cargo proteins). Proteins of unknown function present in the endomembrane fractions are listed in Supplemental material 1, Table 6 and of multiple functions are presented in Supplemental material 1, Table 7.

Fig. 2 — LC MS/MS identified proteins indexed in their corresponding categories. (A) Categories present from proteins identified in the isolated internal membrane fraction. (B) Percentage of proteins related to endomembrane compartments. (Taken from Perdomo et al. [1]).

In order to determine the absolute abundance of different proteins within a single sample we used iBAQ feature of MaxQuant version 1.4.0.5 software using default search parameters [5,6]. The results of proteome discoverer and Maxquant searches were arranged together. The mass spectrometry proteomics data have been deposited to the open access library of ProteomeXchange Consortium (http://www.proteomexchange.org) via the PRIDE partner repository [2] with the dataset identifier PXD000770.

1.4. Bioinformatic analysis

Proteome discoverer annotation node, which is connected to ProteinCenter web based application, was used to download categorical GO database information in the form of biological process (BF), molecular function (MF), and cellular component (CC). Maxquant and Perseus were utilized for protein identification and assignment of Interpro, KEGG, Prosite annotations along with their iBAQ values. The iBAQ values were obtained by Maxquant software and are represented in.txt files representing protein and peptide identification results of the endomembrane enriched fractions.

A list showing dynamic range of internal membrane proteome of E. histolytica are represented as iBAQ values in Figs. 3–5.

Fig. 3 — Proteins identified by LC/MS/MS with measurable iBAQ values corresponding to the endomembrane-trafficking system. After applying the iBAQ algorithm on the three raw files containing 1531 proteins, 1015 proteins had measurable iBAQ values and were separated into categories related to the ER, Golgi apparatus, TGN-ER, endosomes, MVBs, mitosomes and unknown. The iBAQ values varied over 5 orders of magnitude with respect to the most abundant and least abundant proteins. iBAQ analysis showed calreticulin to be the most abundant protein among all.

Fig. 4 — Proteins identified by LC/MS/MS with measurable iBAQ values corresponding to the enzymatic activities. After applying the iBAQ algorithm on the three raw files containing 1531 proteins, 1015 proteins had measurable iBAQ values and were separated into categories related to lipid metabolism, glycosylation, detoxification and lysosome. The iBAQ values varied over 5 orders of magnitude with respect to the most abundant and least abundant proteins.

Fig. 5 — Proteins identified by LC/MS/MS with measurable iBAQ values corresponding to the G-ATPases family. After applying the iBAQ algorithm on the three raw files containing 1531 proteins, 1015 proteins had measurable iBAQ values and were separated into categories related to Rab, Ras, Rho, GEF and GAP. The iBAQ values varied over 5 orders of magnitude with respect to the most abundant and least abundant proteins.

Maxquant results and analysis are found as a folder in Supplemental material 2 (Fig. 6).

Conflict of interests

The authors declare that they have no competing interests.

Fig. 1 — Scheme of the procedure used for separation of endomembrane enriched fraction. *Entamoeba histolytica* subcellular fraction separation was performed as described before [4] from a trophozoite pellet corresponding to 2×10⁸ cells. The final yield of internal membrane proteins was of 50 μg/µl.

Acknowledgments

This work is supported by grants to NG from the French National Agency for Research (ANR-10-INTB-1301-PARACTIN) and from the French Parasitology Network of Excellence ParaFrap (Grant ANR-11-LABX0024). DP was supported by fellowships from the French Ministère de la Recherche et la Technologie (MRT) and from Fondation pour la Recherche Médicale (FRM). GDJ is supported by the Wellcome Trust–DBT India Alliance (Grant 500080/Z/09/Z).

Footnotes

^{Appendix A}

Supplementary data associated with this article can be found in the online version at doi:10.1016/j.dib.2014.08.007.

Supplementary materials

Supplementary data

mmc1.zip^{(4.9MB, zip)}

Supplementary data

mmc2.zip^{(26.7MB, zip)}

References

1.Perdomo D., Aït-Ammar N., Syan S., Sachse M., Jhingan D.G., Guillen N. Cellular and proteomics analysis of the endomembrane system from the unicellular Entamoeba histolytica. J. Proteomics. 2014 doi: 10.1016/j.jprot.2014.07.034. (in press) [DOI] [PubMed] [Google Scholar]
2.Vizcaino J.A., Cote R.G., Csordas A., Dianes J.A., Fabregat A., Foster J.M. The Proteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 2013;41:D1063–D1069. doi: 10.1093/nar/gks1262. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Wessel D., Flugge U.I. A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. Anal. Biochem. 1984;138:141–143. doi: 10.1016/0003-2697(84)90782-6. [DOI] [PubMed] [Google Scholar]
4.Aley S.B., Scott W.A., Cohn Z.A. Isolation of the plasma membrane of Entamoeba histolytica. Arch. Invest. Med. 1980;11:41–45. [PubMed] [Google Scholar]
5.Neuhauser N., Michalski A., Cox J., Mann M. Expert system for computer-assisted annotation of MS/MS spectra. Mol. Cell. Proteomics: MCP. 2012;11:1500–1509. doi: 10.1074/mcp.M112.020271. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Cox J., Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 2008;26:1367–1372. doi: 10.1038/nbt.1511. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data

mmc1.zip^{(4.9MB, zip)}

Supplementary data

mmc2.zip^{(26.7MB, zip)}

[bib1] 1.Perdomo D., Aït-Ammar N., Syan S., Sachse M., Jhingan D.G., Guillen N. Cellular and proteomics analysis of the endomembrane system from the unicellular Entamoeba histolytica. J. Proteomics. 2014 doi: 10.1016/j.jprot.2014.07.034. (in press) [DOI] [PubMed] [Google Scholar]

[bib2] 2.Vizcaino J.A., Cote R.G., Csordas A., Dianes J.A., Fabregat A., Foster J.M. The Proteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 2013;41:D1063–D1069. doi: 10.1093/nar/gks1262. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] 3.Wessel D., Flugge U.I. A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. Anal. Biochem. 1984;138:141–143. doi: 10.1016/0003-2697(84)90782-6. [DOI] [PubMed] [Google Scholar]

[bib4] 4.Aley S.B., Scott W.A., Cohn Z.A. Isolation of the plasma membrane of Entamoeba histolytica. Arch. Invest. Med. 1980;11:41–45. [PubMed] [Google Scholar]

[bib5] 5.Neuhauser N., Michalski A., Cox J., Mann M. Expert system for computer-assisted annotation of MS/MS spectra. Mol. Cell. Proteomics: MCP. 2012;11:1500–1509. doi: 10.1074/mcp.M112.020271. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] 6.Cox J., Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 2008;26:1367–1372. doi: 10.1038/nbt.1511. [DOI] [PubMed] [Google Scholar]

PERMALINK

Data set for the proteomics analysis of the endomembrane system from the unicellular Entamoeba histolytica

Doranda Perdomo

Nawel Aït-Ammar

Sylvie Syan

Martin Sachse

Gagan Deep Jhingan

Nancy Guillén

Abstract

1. Data, experimental design, materials and methods

1.1. Preparation of samples for proteomics analysis