Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2022 Feb 25;17(2):e0264467. doi: 10.1371/journal.pone.0264467

Modular automated bottom-up proteomic sample preparation for high-throughput applications

Yan Chen 1,2,3, Nurgul Kaplan Lease 1,2,3,¤, Jennifer W Gin 1,2,3, Tadeusz L Ogorzalek 1,2,3, Paul D Adams 1,4,5, Nathan J Hillson 1,2,3, Christopher J Petzold 1,2,3,*
Editor: John Matthew Koomen6
PMCID: PMC8880914  PMID: 35213656

Abstract

Manual proteomic sample preparation methods limit sample throughput and often lead to poor data quality when thousands of samples must be analyzed. Automated liquid handler systems are increasingly used to overcome these issues for many of the sample preparation steps. Here, we detail a step-by-step protocol to prepare samples for bottom-up proteomic analysis for Gram-negative bacterial and fungal cells. The full modular protocol consists of three optimized protocols to: (A) lyse Gram-negative bacteria and fungal cells; (B) quantify the amount of protein extracted; and (C) normalize the amount of protein and set up tryptic digestion. These protocols have been developed to facilitate rapid, low variance sample preparation of hundreds of samples, be easily implemented on widely-available Beckman-Coulter Biomek automated liquid handlers, and allow flexibility for future protocol development. By using this workflow 50 micrograms of protein from 96 samples can be prepared for tryptic digestion in under an hour. We validate these protocols by analyzing 47 Pseudomonas putida and Rhodosporidium toruloides samples and show that this modular workflow provides robust, reproducible proteomic samples for high-throughput applications. The expected results from these protocols are 94 peptide samples from Gram-negative bacterial and fungal cells prepared for bottom-up quantitative proteomic analysis without the need for desalting column cleanup and with protein relative quantity variance (CV%) below 15%.

Introduction

Proteomic sample preparation protocols consist of many liquid transfer steps that are well suited for automation with liquid handler systems. As the number of proteomic samples for biotechnological and clinical applications increases, automated solutions will be required to minimize human error, save time and resources, and improve the data quality. There have been a number of automated sample preparation protocols developed for both mammalian and bacterial cells that reduce processing time, variability, and overall cost [112]. Most of these methods automate the sample cleanup and tryptic digestion portions of the workflow whereas a few automate the entire workflow from cell lysis to digestion [5, 6, 11]. These automation methods show significant improvement in variability and time-savings over manual sample preparation methods. Additionally, high-quality, low variance results can be achieved by researchers without extensive experience in proteomic sample preparation. While automation methods for the full workflow are powerful and convenient they are not as flexible, consequently, when proteomic research projects incorporate new organisms, different amounts of cells, or other variations the entire automated process must be modified. The three protocols described here separate the steps of the fully automated protocol described in Chen et al. [6] to enable flexibility for changing research directions and needs. The modular protocols are much simpler to operate, enable flexible methods development, and process samples in half the time (<1 hour) of the fully-automated protocol due to manual intervention at various steps, such as centrifugation and protein resuspension. Furthermore, the modular automation protocols offer greater flexibility and adaptability without highly-specialized liquid handler systems.

These protocols detail three optimized step-by-step methods to: (A) lyse Gram-negative bacteria and fungal cells; (B) quantify the amount of protein extracted; and (C) normalize the amount of protein and set up tryptic digestion. Importantly, samples prepared through these protocols do not include salts that must be removed prior to LC-MS analysis, thus minimizing sample handling and the associated variance. These protocols have been developed to facilitate rapid, low variance sample preparation of hundreds of samples, be easily implemented on widely-available Beckman-Coulter Biomek automated liquid handlers that use disposable pipet tips, and allow flexibility for future protocol development. By using this modular workflow 96 samples can be prepared for tryptic digestion in under an hour. The tryptic digestion step can be optimized for the given application with many high-throughput digestion protocols such as microwave, elevated temperature, and ultrasonic methods [13, 14] or traditional overnight digestion.

Materials and methods

The protocol described in this peer-reviewed article is published on protocols.io (dx.doi.org/10.17504/protocols.io.b3gxqjxn) and is included for printing as S1 File with this article.” The individual protocols are published on protocols.io (Cell lysis: dx.doi.org/10.17504/protocols.io.b3gsqjwe, Protein quantification: dx.doi.org/10.17504/protocols.io.b3grqjv6, Protein normalization: dx.doi.org/10.17504/protocols.io.b3gtqjwn) and are included for printing as S2S4 Files with this article.

Expected results

The modular bottom-up proteomic sample preparation automation protocol (S1 File) is composed of three protocols that detail: (A) cell lysis, protein extraction, protein precipitation; (B) protein quantification; and (C) protein normalization and tryptic digestion. Using the chloroform-methanol protein extraction protocol (S2 File) described, we obtained median amounts of over 115 μg and 50 μg of protein from one OD*mLs (~1 x 109 cells) of P. putida and two OD*mLs R. toruloides, respectively (Fig 1). To demonstrate the inter-day variability of the protocol, a single overnight cell culture of P. putida and R. toruloides was harvested and distribute into two 96 deep well plates and the protocol was repeated on two separate days (Day 1 and Day 7) to demonstrate the reproducibility of the method. The protocol takes 20 minutes to process one 96-well plate, including centrifugation steps. The amount of protein scaled with the starting amount of biomass which provides flexibility for the desired application. This amount of protein is sufficient for typical nano- and standard-flow LC-MS data acquisition methods and can easily be adjusted for applications requiring larger amounts of protein. The upper limit on the amount of biomass that can be processed with this protocol is limited by the amount of chloroform and methanol that can be added to the PCR plate (~125 μL). For applications that require larger amounts of protein, such as multi-dimensional chromatography, the protocol can easily be adapted to extractions in 96 deep-well plates with more chloroform-methanol. The protocol can also be scaled down to lower cell amounts, but the amount of protein extracted becomes increasingly variable as the amount of cells decreases, so increasing the number of replicates would be advisable. Sample types other than microbial cell pellets, such as tissues and complex biofluids, haven’t been tested with this protocol and may need additional preparation steps. Proteins resulting from these samples however are readily suitable for the following two protocols in the workflow.

Fig 1. Violin plots with data points showing the total protein extracted by using the modular automated protocol on P. putida and R. toruloides from different amounts of biomass (n = 47).

Fig 1

D1 and D7 samples correspond to repeat analysis of a single culture of each organism seven days apart to demonstrate the inter-day variability of the protocol.

The protein quantification protocol (S3 File) takes 15 minutes and produces concentration data for two replicates of the samples in a 96-well plate by using the DC protein assay (Bio-Rad), a modified Lowry protein quantification method [15]. The protocol uses a total of 3 μL of each sample and requires aliquoting known concentrations of a BSA standard in a separate plate for calibration curve generation. Duplicate protein quantification was chosen based on previous experience as a balance between sample consumption and accurate concentration measurement. When protein samples are processed by the protein extraction protocol above, the concentration measured by this method falls within a calibration range of 0.125 to 2 μg/μL. For larger or smaller amounts of cells, the concentration may fall outside the calibration range described here, consequently, the dilution factor may need to be adjusted. Once the concentrations of the samples are known the third protocol (S4 File) described here is used to normalize the amount of protein for tryptic digestion and subsequent LC-MS analysis. This protocol takes 20 minutes on the Biomek NX-S8 liquid handler system because the concentration of each well must be adjusted individually. Trypsin, iodoacetic acid, and tris(2-carboxyethyl)phosphine (TCEP) are then added via the Biomek NX-S8 or multi-channel pipette. These protocols are being used for proteomic analysis of metabolically engineered bacteria and fungi. The expected quantitative proteomic results from samples prepared by the modular protocol is demonstrated in Figs 2 and 3 by using an Agilent 1290 UHPLC coupled to a Thermo Orbitrap Exploris 480 system operating in data-dependent acquisition (DDA) mode [16]. The LC-MS/MS method (15 minute total run time) identified over 900 proteins (>6000 peptides) from 14 μg load of P. putida protein digest and over 1000 proteins (>4500 peptides) from 10 μg load of R. toruloides protein digest. To demonstrate the inter-day variability of the protocol, a single overnight cell culture of P. putida and R. toruloides was harvested and distribute into two 96 deep well plates at a total cell amount of 1 OD* mL and 2 OD*mL per well, respectively, and subsequently processed via the modular automation workflow on different days. We used the MS1 ion intensity method with Skyline [17] to quantify over 900 proteins from P. putida and over 1000 proteins from R. toruloides samples. The median protein variance for the samples were between 18 and 22% CV on two separate days from automated sample preparation protocol (Figs 2 and 3). The high-throughput modular automated protocol enables one researcher to prepare thousands of bottom-up proteomic samples per week. Supporting publications and other organisms are under development.

Fig 2. Reproducibility of the modular automated sample preparation workflow as measured by label-free LC-MS/MS shotgun proteomics analysis.

Fig 2

(A) Violin plots showing the coefficient of variation of MS1 ion intensity quantification for over 900 and 1000 proteins from P. putida and R. toruloides, respectively (n = 47). The violin plots display the kernel density estimation of the CV and inside each violin plot is a box plot summarizing ranges (IQR, whiskers, outlier points) and individual medians (solid lines). The LCMS analysis raw data have been deposited to the ProteomeXchange Consortium data depository at http://www.proteomexchange.org/. They are publicly accessible with the dataset identifier PXD029122 and 10.6019/PXD029122.

Fig 3. Scatter plot display of the CV% for each protein (y-axis) vs the mean MS1 ion intensity detected for each protein (x-axis).

Fig 3

Supporting information

S1 File. Modular automated bottom-up proteomic sample preparation for high-throughput applications.

Also available on protocols.io.

(PDF)

S2 File. Automated chloroform-methanol protein extraction on the Biomek-FX liquid handler system.

Also available on protocols.io.

(PDF)

S3 File. Automated protein quantification with the Biomek-FX liquid handler system.

Also available on protocols.io.

(PDF)

S4 File. Automated protein normalization and tryptic digestion on a Biomek-NX liquid handler system.

Also available on protocols.io.

(PDF)

Acknowledgments

The authors thank Kristin Burnum-Johnson, Yuzian Gao, and Nathalie Muñoz for helpful discussions about the protocols and Stephen Tan for help with instrumentation.

Data Availability

All proteomic data are available via ProteomeXchange with identifier PXD029122 and 10.6019/PXD029122.

Funding Statement

The funders had and will not have a role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The views and opinions of the authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, expressed or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. The proof-of-concept work and resources were part of the Joint BioEnergy Institute (JBEI; http://www.jbei.org) and extension of the procedure and identification of the sources of error were part of the Agile BioFoundry (ABF; http://agilebiofoundry.org) supported through contract DE-AC02-05CH11231 between Lawrence Berkeley National Laboratory and the U. S. Department of Energy.

References

  • 1.Fu Q, Kowalski MP, Mastali M, Parker SJ, Sobhani K, van den Broek I, et al. Highly reproducible automated proteomics sample preparation workflow for quantitative mass spectrometry. J Proteome Res. 2018;17: 420–428. doi: 10.1021/acs.jproteome.7b00623 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Lu X, Wang Z, Gao Y, Chen W, Wang L, Huang P, et al. Autoproteome chip system for fully automated and integrated proteomics sample preparation and peptide fractionation. Anal Chem. 2020. doi: 10.1021/acs.analchem.0c00752 [DOI] [PubMed] [Google Scholar]
  • 3.Dayon L, Núñez Galindo A, Corthésy J, Cominetti O, Kussmann M. Comprehensive and Scalable Highly Automated MS-Based Proteomic Workflow for Clinical Biomarker Discovery in Human Plasma. J Proteome Res. 2014;13: 3837–3845. doi: 10.1021/pr500635f [DOI] [PubMed] [Google Scholar]
  • 4.Arul A-B, Byambadorj M, Han N-Y, Park JM, Lee H. Development of an Automated, High-throughput Sample Preparation Protocol for Proteomics Analysis. Bull Korean Chem Soc. 2015;36: 1791–1798. doi: 10.1002/bkcs.10338 [DOI] [Google Scholar]
  • 5.Müller T, Kalxdorf M, Longuespée R, Kazdal DN, Stenzinger A, Krijgsveld J. Automated sample preparation with SP3 for low-input clinical proteomics. Mol Syst Biol. 2020;16: e9111. doi: 10.15252/msb.20199111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Chen Y, Guenther JM, Gin JW, Chan LJG, Costello Z, Ogorzalek TL, et al. Automated “Cells-To-Peptides” Sample Preparation Workflow for High-Throughput, Quantitative Proteomic Assays of Microbes. J Proteome Res. 2019;18: 3752–3761. doi: 10.1021/acs.jproteome.9b00455 [DOI] [PubMed] [Google Scholar]
  • 7.Liang Y, Acor H, McCown MA, Nwosu AJ, Boekweg H, Axtell NB, et al. Fully Automated Sample Processing and Analysis Workflow for Low-Input Proteome Profiling. Anal Chem. 2020. doi: 10.1021/acs.analchem.0c04240 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Specht H, Harmange G, Perlman DH, Emmott E, Niziolek Z, Budnik B, et al. Automated sample preparation for high-throughput single-cell proteomics. BioRxiv. 2018. doi: 10.1101/399774 [DOI] [Google Scholar]
  • 9.Lee J, Kim H, Sohn A, Yeo I, Kim Y. Cost-Effective Automated Preparation of Serum Samples for Reproducible Quantitative Clinical Proteomics. J Proteome Res. 2019;18: 2337–2345. doi: 10.1021/acs.jproteome.9b00023 [DOI] [PubMed] [Google Scholar]
  • 10.Fu Q, Johnson CW, Wijayawardena BK, Kowalski MP, Kheradmand M, Van Eyk JE. A Plasma Sample Preparation for Mass Spectrometry using an Automated Workstation. J Vis Exp. 2020. doi: 10.3791/59842 [DOI] [PubMed] [Google Scholar]
  • 11.Switzar L, van Angeren J, Pinkse M, Kool J, Niessen WMA. A high-throughput sample preparation method for cellular proteomics using 96-well filter plates. Proteomics. 2013;13: 2980–2983. doi: 10.1002/pmic.201300080 [DOI] [PubMed] [Google Scholar]
  • 12.Potriquet J, Laohaviroj M, Bethony JM, Mulvenna J. A modified FASP protocol for high-throughput preparation of protein samples for mass spectrometry. PLoS ONE. 2017;12: e0175967. doi: 10.1371/journal.pone.0175967 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.López-Ferrer D, Capelo JL, Vázquez J. Ultra fast trypsin digestion of proteins by high intensity focused ultrasound. J Proteome Res. 2005;4: 1569–1574. doi: 10.1021/pr050112v [DOI] [PubMed] [Google Scholar]
  • 14.Zheng YZ, DeMarco ML. Manipulating trypsin digestion conditions to accelerate proteolysis and simplify digestion workflows in development of protein mass spectrometric assays for the clinical laboratory. Clinical Mass Spectrometry. 2017. doi: 10.1016/j.clinms.2017.10.001 [DOI] [Google Scholar]
  • 15.Lowry OH, Rosebrough NJ, Farr AL, Randall RJ. Protein measurement with the Folin phenol reagent. J Biol Chem. 1951;193: 265–275. doi: 10.1016/S0021-9258(19)52451-6 [DOI] [PubMed] [Google Scholar]
  • 16.Chen Y, Gin JW, Petzold CJ. Discovery proteomic (DDA) LC-MS/MS data acquisition and analysis. In: protocols.io [Internet]. 7 May 2021. [cited 11 Aug 2021]. Available: 10.17504/protocols.io.buthnwj6 [DOI] [Google Scholar]
  • 17.MacLean B, Tomazela DM, Shulman N, Chambers M, Finney GL, Frewen B, et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics. 2010;26: 966–968. doi: 10.1093/bioinformatics/btq054 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

John Matthew Koomen

29 Dec 2021

PONE-D-21-32984Modular automated bottom-up proteomic sample preparation for high-throughput applicationsPLOS ONE

Dear Dr. Petzold,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

 Review the critiques provided by the reviewers.  This journal ranks clarity, publicly available data, and a high level of detail to support reproducibility of methods.  Keep these criteria in mind as you address the points raised by the reviewers.

Please submit your revised manuscript by Feb 12 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

John Matthew Koomen, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match.

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

3. Thank you for stating the following financial disclosure:

“The authors have declared that no competing interests exist.”

At this time, please address the following queries:

a)        Please clarify the sources of funding (financial or material support) for your study. List the grants or organizations that supported your study, including funding received from your institution.

b)        State what role the funders took in the study. If the funders had no role in your study, please state: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.”

c)        If any authors received a salary from any of your funders, please state which authors and which funders.

d)        If you did not receive any funding for this study, please state: “The authors received no specific funding for this work.”

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

4. Thank you for stating the following in the Acknowledgments Section of your manuscript:

“The proof-ofconcept work and resources were part of the Joint BioEnergy Institute (JBEI; http://www.jbei.org) and extension of the procedure and identification of the sources of error were part of the Agile BioFoundry (ABF; http://agilebiofoundry.org) supported through contract DE-AC02-05CH11231 between Lawrence Berkeley National Laboratory and the U. S. Department of Energy”

We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

 “The authors have declared that no competing interests exist.”

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

5. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

6. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

7. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Does the manuscript report a protocol which is of utility to the research community and adds value to the published literature?

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the protocol been described in sufficient detail?

Descriptions of methods and reagents contained in the step-by-step protocol should be reported in sufficient detail for another researcher to reproduce all experiments and analyses. The protocol should describe the appropriate controls, sample sizes and replication needed to ensure that the data are robust and reproducible.

Reviewer #1: Yes

Reviewer #2: Partly

**********

3. Does the protocol describe a validated method?

The manuscript must demonstrate that the protocol achieves its intended purpose: either by containing appropriate validation data, or referencing at least one original research article in which the protocol was used to generate data.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. If the manuscript contains new data, have the authors made this data fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

**********

5. Is the article presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please highlight any specific errors that need correcting in the box below.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors present an optimized workflow to prepare samples for bottom-up proteomics using automated liquid handling devices. The expected result is less than 15% cv of relative protein abundance.

Specific comments:

1. Data availability statement needs to be edited to reflect accurately where the data has been made available.

2. The authors suggest the method to be a widely applicable proteomics sample preparation workflow, and demonstrate the utility with gram-negative and fungal cell culture. However, many proteomics applications utilize tissue and complex biofluids (serum, lavage fluid, etc) that will likely require significant additional sample preparation steps. The authors should comment on the broader applicability of the method, or revise language to describe suitable sample types

3. The document(s) should be carefully proofread to eliminate typographical errors, specifically with respect to subscripts in chemical formulas.

Reviewer #2: The manuscript by Chen et al. describes a protocol for “automated” sample processing of fungal and bacterial cells for proteomic analysis. Specifically, the authors report 3 separate protocols that are carried out in a commonly used Beckman liquid handler - protein extraction from cells, normalization of protein content, and then trypsin digestion. There is some value in the described approach that could be of interest to a targeted audience; however, there are several issues that need to be addressed prior to publication consideration:

1) The reported protocol seems to be an extension of a previously published research article from their lab, which is within the guidelines of this type of article. However, the authors need to further detail the processing/automation that was mentioned in the first paper but also distinguish any differences. The protocol itself could be more detailed.

2) The authors state that this protocol was very similar to the previously published work but that the updated version was more flexible since it could “operated independently”. This point should be described more clearly and highlighted further in the protocol section if applicable.

3) Some points in the “expected results” don’t match up to the online protocol. For example, the online protocol instructs use of 5ul of sample for protein assay; however, in the results, authors state it uses 3ul of each sample.

4) Description and/or assessment of lysis/lysis efficiency aren’t clear. The main concern is that there might not be adequate representation of the entire proteome (e.g., membrane proteins or other specific classes) or perhaps variability in proteome composition from sample to sample. The authors in the previous paper use MRM to assess specific targets and in this reported protocol show low-coverage, spectral counting data. I understand the potential time savings and benefit of automation, but the authors should expand on limitation of the approach in this regard if applicable.

5) The original paper addresses getting the protein precipitate back in solution by performing “80 cycles of pipetting mixing at the maximum allowable aspiration and dispensing speed” but this is not addressed at all in newer protocol. Based on the extraction procedure used, this seems like a critical point that is important for subsequent processing steps.

6) There is concern of cross contamination between samples and reagents based on the deep well reagent reservoir used (there doesn’t seem to be any separation/ wells in the reservoir). Are there any washes (e.g., needle washes) in between steps?

7) More specifics should be added in the step that describes mixing on deck “with user defined times”. The total step duration to add Methanol shown is 3 minutes – is this mixing only or include other aspects of this overall step?

8) The authors state “In case of large protein pellets, mixing with a multichannel pipette may be necessary”. Is there a particular range you can provide to help the reader assess when that would be required? Related to this, while eliminating most manual pipetting, there are still some manual steps at critical points preventing this from being fully automated.

9) The authors should explain what day 1 and day 7 represent in Figure 1 (how long the organisms were left to grow before collection or days between processing?).

10) In Figure 2, the authors state UHPLC-MRM; however, they describe spectral counting as their quant approach. I believe MRM is copied over from the original paper where they used the MRM approach.

11) Although the mass spectrometry analysis is not necessarily part of the processing protocol, it was used to assess the protocol in terms of protein ID and quant (CV measurement across proteome). A short gradient was used, but the instrument employed should be able to identify more proteins than reported (especially if the protein loading amount on-column reported is correct). A more appropriate quant approach could be used (rather than spectral counting) to determine CVs across the dataset, which could address reproducibility in protein representation across the proteome if the LC-MS system is set up for precise LFQ-based quant. Measurement of the (high-abundance) top few hundred proteins with spectral counting is not really that informative.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Feb 25;17(2):e0264467. doi: 10.1371/journal.pone.0264467.r002

Author response to Decision Letter 0


13 Jan 2022

Response to Reviewer Comments:

Reviewer #1: The authors present an optimized workflow to prepare samples for bottom-up proteomics using automated liquid handling devices. The expected result is less than 15% cv of relative protein abundance.

We thank the reviewer for your comments and suggestions. We have responded to your comments below and edited the corresponding text in our manuscript and protocols accordingly.

Specific comments:

1. Data availability statement needs to be edited to reflect accurately where the data has been made available.

Response: We clarified the text describing where to access the LCMS data in the Data Availability Statement at the end of the manuscript and included it in the caption for Figure 2. The text reads, “The LCMS analysis raw data have been deposited to the ProteomeXchange Consortium data depository at http://www.proteomexchange.org/. They are publicly accessible with the dataset identifier PXD029122 and 10.6019/PXD029122.”

2. The authors suggest the method to be a widely applicable proteomics sample preparation workflow, and demonstrate the utility with gram-negative and fungal cell culture. However, many proteomics applications utilize tissue and complex biofluids (serum, lavage fluid, etc) that will likely require significant additional sample preparation steps. The authors should comment on the broader applicability of the method, or revise language to describe suitable sample types

Response: We clarified the sample types that were used to test the protein extraction protocol and commented on the broader applicability of the protocols established in the workflow in the expected result section. We added text to the Abstract that now reads (new text in bold), “Here, we detail a step-by-step protocol to prepare samples for bottom-up proteomic analysis for Gram-negative bacterial and fungal cells.”

In the Expected Results section, the text now reads, “Sample types other than microbial cell pellets, such as tissues and complex biofluids, haven’t been tested with this protocol and may need additional preparation steps. Proteins resulting from these samples however are readily suitable for the following two protocols in the workflow.”

3. The document(s) should be carefully proofread to eliminate typographical errors, specifically with respect to subscripts in chemical formulas.

Response: Thanks for your comment. We went through all documents and corrected any typo errors we could find.

Reviewer #2: The manuscript by Chen et al. describes a protocol for “automated” sample processing of fungal and bacterial cells for proteomic analysis. Specifically, the authors report 3 separate protocols that are carried out in a commonly used Beckman liquid handler - protein extraction from cells, normalization of protein content, and then trypsin digestion. There is some value in the described approach that could be of interest to a targeted audience; however, there are several issues that need to be addressed prior to publication consideration:

Response: We appreciate your detailed review comments and suggestions for improving our manuscript. We have responded to your comments below and edited the corresponding texts in our manuscript and protocols accordingly.

1) The reported protocol seems to be an extension of a previously published research article from their lab, which is within the guidelines of this type of article. However, the authors need to further detail the processing/automation that was mentioned in the first paper but also distinguish any differences. The protocol itself could be more detailed.

Response: Our manuscript describes a collection of three detailed automation procedures that modularized the full automated proteomic sample preparation process we published previously. The fully automated method requires customized scripts that are difficult to adapt at other labs and requires a highly specialized hybrid liquid handling system with an integrated centrifuge and plate reader which further complicates adoption of the protocol by others. The strengths of the protocols described here are their robustness and application on commonly available liquid handling systems and non-integrated lab equipment. We clarified the text to emphasize the differences between the two methods.

The text now reads, “While automation methods for the full workflow are powerful and convenient, they are not as flexible, consequently, when proteomic research projects incorporate new organisms, different amounts of cells, or other variations the entire automated process must be modified. The three protocols described here separate the steps of the fully automated workflow described in Chen et al. [6] to enable flexibility for changing research directions and needs. The modular protocols are much simpler to operate, enable flexible methods development, and process samples in half the time (<1 hour) of the fully-automated protocol due to manual intervention at various steps, such as centrifugation and protein resuspension. Furthermore, the modular automation protocols offer greater flexibility and adaptability without highly-specialized liquid handler systems. ”

2) The authors state that this protocol was very similar to the previously published work but that the updated version was more flexible since it could “operated independently”. This point should be described more clearly and highlighted further in the protocol section if applicable.

Response: Thanks for your suggestion. We have added sentences in the introduction section to clarify the “operated independently” point and highlight its advantages in comparison to the fully automated procedure.

The text now reads, “The modular protocols are much simpler to operate, enable flexible methods development, and process samples in half the time (<1 hour) of the fully-automated protocol due to manual intervention at various steps, such as centrifugation and protein resuspension. Furthermore, the modular automation protocols offer greater flexibility and adaptability without highly-specialized liquid handler systems.”

3) Some points in the “expected results” don’t match up to the online protocol. For example, the online protocol instructs use of 5ul of sample for protein assay; however, in the results, authors state it uses 3ul of each sample.

Response: Thanks for pointing out the error. 5 ul was used in the proof-of-concept protocol version, and later was updated to 3 ul in the method. We updated the protocol to be consistent with the statement in the manuscript. In addition, we moved the Note containing additional information about the dilution to a more prominent place in the Step.

4) Description and/or assessment of lysis/lysis efficiency aren’t clear. The main concern is that there might not be adequate representation of the entire proteome (e.g., membrane proteins or other specific classes) or perhaps variability in proteome composition from sample to sample. The authors in the previous paper use MRM to assess specific targets and in this reported protocol show low-coverage, spectral counting data. I understand the potential time savings and benefit of automation, but the authors should expand on limitation of the approach in this regard if applicable.

Response: The Chloroform-methanol extraction of proteins is a well established method for protein extraction that disrupts lipid membrane and precipitate proteins very effectively without biases to certain classes of proteins. Likewise, the metabolite, protein, and lipid extraction (MPLEx) protocol, has been proven to be robust and applicable to a diverse set of sample types, including cell cultures, microbial communities, and tissues (DOI:https://doi.org/10.1128/mSystems.00043-16). It has proven useful for multi-omics profiling of microbial pathogens (DOI: 10.1039/c6an02486f). Chloroform-methanol extraction method was evaluated to be very effective for the study of membrane proteins of non-model plants (DOI:10.1007/s00425-010-1121-1). We also demonstrated the effectiveness of the chloroform-methanol method in over 50 publications, including using it to accurately quantify membrane electron transport chain proteins in E. coli (DOI: 10.1126/science.aat7925). Furthermore, here, we identified over 900 proteins from 14 μg load of P. putida protein digest and over 1000 proteins from 10 μg load of R. toruloides protein digest with a rapid 15 minutes total LC-MS/MS method, which represent proteins in a wide range of categories including multi-pass integral membrane proteins, cytosolic soluble proteins, lipoproteins, and others. As such, we don’t believe that this is a limitation of the method.

5) The original paper addresses getting the protein precipitate back in solution by performing “80 cycles of pipetting mixing at the maximum allowable aspiration and dispensing speed” but this is not addressed at all in newer protocol. Based on the extraction procedure used, this seems like a critical point that is important for subsequent processing steps.

Response: The extensive resuspension process in the fully-automated paper was one of the primary issues with process time and reproducibility that we overcame by switching to the modular protocols. We address this issue in the text by adding, “The modular protocols are much simpler to operate, enable flexible methods development, and process samples in half the time (<1 hour) of the fully-automated protocol due to manual intervention at various steps, such as centrifugation and protein resuspension.”

We highlight this aspect in the protein extraction protocol, step 22 states that the user could define mixing times (an adjustable variable) in the method to mix protein resuspension on deck. The variable mixing time provides users great flexibility to balance method time and efficient protein suspension. We also noted in the same step that users could implement manual pipetting after visual inspection of the resuspension to ensure that no visible protein aggregates in the samples.

6) There is concern of cross contamination between samples and reagents based on the deep well reagent reservoir used (there doesn’t seem to be any separation/ wells in the reservoir). Are there any washes (e.g., needle washes) in between steps?

Response: The BioMek liquid handling system uses disposable tips instead of fixed tips. New tips were used to address contamination concerns. We added the bold text, shown below, to convey that point.

“”These protocols have been developed to facilitate rapid, low variance sample preparation of hundreds of samples, be easily implemented on widely-available Beckman-Coulter Biomek automated liquid handlers that use disposable pipet tips, and allow flexibility for future protocol development.

7) More specifics should be added in the step that describes mixing on deck “with user defined times”. The total step duration to add Methanol shown is 3 minutes – is this mixing only or include other aspects of this overall step?

Response: We specified the mixing step with the number of mixing cycles we used in the demonstration experiments, and clarified that the number of mixing cycles is a variable that users could define themselves as needed. The time shown at each step is an estimated duration of all the components of a step. Specifically, the 3 minutes in your commented step includes tip box moving, tip loading, liquid transfer, and mixing.

8) The authors state “In case of large protein pellets, mixing with a multichannel pipette may be necessary”. Is there a particular range you can provide to help the reader assess when that would be required? Related to this, while eliminating most manual pipetting, there are still some manual steps at critical points preventing this from being fully automated.

Response: We noted in the step that users need to visually inspect samples after method finishes in case chunks of protein may present in samples of large protein pellets. Our established modular proteomic sample preparation protocols are semi-automatic.

9) The authors should explain what day 1 and day 7 represent in Figure 1 (how long the organisms were left to grow before collection or days between processing?).

Response: We added text to explain how samples were collected for day 1 and day 7 batches. The text now reads, “To demonstrate the inter-day variability of the protocol, a single overnight cell culture of P. putida and R. toruloides was harvested and distribute into two 96 deep well plates and the protocol was repeated on two separate days (Day 1 and Day 7) to demonstrate the reproducibility of the method.”

We also added “D1 and D7 samples correspond to repeat analysis of a single culture of each organism seven days apart to demonstrate the inter-day variability of the protocol.” to the Figure 1 caption.

10) In Figure 2, the authors state UHPLC-MRM; however, they describe spectral counting as their quant approach. I believe MRM is copied over from the original paper where they used the MRM approach.

Response: Thanks for pointing out the error. We have changed the analysis method to “label-free LC-MS/MS shotgun proteomics analysis”

11) Although the mass spectrometry analysis is not necessarily part of the processing protocol, it was used to assess the protocol in terms of protein ID and quant (CV measurement across proteome). A short gradient was used, but the instrument employed should be able to identify more proteins than reported (especially if the protein loading amount on-column reported is correct). A more appropriate quant approach could be used (rather than spectral counting) to determine CVs across the dataset, which could address reproducibility in protein representation across the proteome if the LC-MS system is set up for precise LFQ-based quant. Measurement of the (high-abundance) top few hundred proteins with spectral counting is not really that informative.

Response: We agree with the reviewer that the Exploris 480 Orbitrap instrument can identify more proteins if longer gradients or nano-flow chromatography are used, but the purpose of the analysis was to demonstrate the use of the protocol for high-throughput applications. As such, the proteome coverage of about 1000 proteins in a microbial sample using a 15 minutes total DDA shotgun method at a standard flow LC-MS system is expected. Users could choose nano-LC systems, longer LC gradients, and DIA workflows to achieve deeper proteome coverage, however, those choices are dependent on the proteomics applications of interest and are separate from the sample preparation protocols detailed here. We chose to increase the number of replicates instead of the depth of proteome coverage to show the power of the sample preparation method for our application. Our data shows high reproducibility for a large number of replicates indicating that this protocol is useful for quantitative proteomic applications.

Regarding the quantification method, spectral count-based label free quantification (LFQ) is a well-accepted approach that has been applied widely and is discussed in a review by Bantscheff et al (DOI:10.1007/s00216-012-6203-4). And, the Scaffold software used to process the label free quantification is also benchmarked in the paper published in Journal proteomic research (DOI:10.1021/acs.jproteome.6b00645). We agree with the reviewer that other LFQ methods are available and each has its own advantages and disadvantages to be considered. To address the reviewer’s point, we analyzed the data by using the MS1 ion intensity LFQ method in Skyline and observed CVs comparable to that of the spectral count analysis for the data that we use for demonstrating the protocol (updated figures 2 and 3 in the manuscript). We quantified more proteins and still observed a median CV of ~20% for the samples. As such, we replaced Figures 2 and 3 with the results of the MS1 ion intensity quantification analysis. Our data shows that the sample preparation protocols detailed in the manuscript produce data with coefficients of variation that are consistent with good analytical results.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

John Matthew Koomen

11 Feb 2022

Modular automated bottom-up proteomic sample preparation for high-throughput applications

PONE-D-21-32984R1

Dear Dr. Petzold,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

John Matthew Koomen, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Does the manuscript report a protocol which is of utility to the research community and adds value to the published literature?

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the protocol been described in sufficient detail?

Descriptions of methods and reagents contained in the step-by-step protocol should be reported in sufficient detail for another researcher to reproduce all experiments and analyses. The protocol should describe the appropriate controls, sample sizes and replication needed to ensure that the data are robust and reproducible.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Does the protocol describe a validated method?

The manuscript must demonstrate that the protocol achieves its intended purpose: either by containing appropriate validation data, or referencing at least one original research article in which the protocol was used to generate data.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. If the manuscript contains new data, have the authors made this data fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: N/A

**********

5. Is the article presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please highlight any specific errors that need correcting in the box below.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors' revised manuscript is acceptable as written. All comments/concerns have been addressed.

Reviewer #2: The authors addressed the comments from my original review.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

John Matthew Koomen

17 Feb 2022

PONE-D-21-32984R1

Modular automated bottom-up proteomic sample preparation for high-throughput applications

Dear Dr. Petzold:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. John Matthew Koomen

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Modular automated bottom-up proteomic sample preparation for high-throughput applications.

    Also available on protocols.io.

    (PDF)

    S2 File. Automated chloroform-methanol protein extraction on the Biomek-FX liquid handler system.

    Also available on protocols.io.

    (PDF)

    S3 File. Automated protein quantification with the Biomek-FX liquid handler system.

    Also available on protocols.io.

    (PDF)

    S4 File. Automated protein normalization and tryptic digestion on a Biomek-NX liquid handler system.

    Also available on protocols.io.

    (PDF)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    All proteomic data are available via ProteomeXchange with identifier PXD029122 and 10.6019/PXD029122.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES