Tutorial:StPeter1
From SPCTools
Revision as of 00:03, 24 October 2017 Mhoopman (Talk | contribs) ← Previous diff |
Current revision Mhoopman (Talk | contribs) |
||
Line 3: | Line 3: | ||
== About Tutorial == | == About Tutorial == | ||
- | This tutorial provides basic instruction on the use of StPeter for label-free quantification of proteins inferred from shotgun MS/MS spectra. It is recommended that users are familiar with performing data conversion, database searches, peptide validation, and protein inference within TPP. For additional instruction on how to perform these tasks, we recommend the broader [[TPP_Tutorial|TPP Tutorial]]. For simplicity, the tutorial makes use of the Petunia interface. However, advanced users can replicate all steps from the command line. | + | This tutorial provides basic instruction on the use of [[Software:StPeter|StPeter]] for label-free quantification of proteins inferred from shotgun MS/MS spectra. It is recommended that users are familiar with performing data conversion, database searches, peptide validation, and protein inference within TPP. For additional instruction on how to perform these tasks, we recommend the broader [[TPP_Tutorial|TPP Tutorial]]. For simplicity, the tutorial makes use of the Petunia interface. However, advanced users can replicate all steps from the command line. |
== Requirements and TPP Versions == | == Requirements and TPP Versions == |
Current revision
Contents |
Introduction
About Tutorial
This tutorial provides basic instruction on the use of StPeter for label-free quantification of proteins inferred from shotgun MS/MS spectra. It is recommended that users are familiar with performing data conversion, database searches, peptide validation, and protein inference within TPP. For additional instruction on how to perform these tasks, we recommend the broader TPP Tutorial. For simplicity, the tutorial makes use of the Petunia interface. However, advanced users can replicate all steps from the command line.
Requirements and TPP Versions
- Proteome Exchange dataset PXD001819 (RAW files only)
- This FASTA sequence database
- TPP Version 5.0 or newer
Description of Data
Tutorial
Searching the Data
- Log into the Petunia interface and set the pipeline to Comet.
- Navigate to TPP Tools > mzML/mzXML
- Convert all Thermo RAW files from PXD001819 to mzML.
- Navigate to TPP Tools > Comet Search
- Search all mzML files with the provided FASTA file and with the following Comet search parameters (leaving all other as default):
- add_C_cysteine = 57.021464
- variable_mod01 = 15.9949 M 0 3 -1 0 0
- peptide_mass_tolerance = 25
- peptide_mass_units = 2
- isotope_error = 1
- fragment_bin_tol = 1.0005
- fragment_bin_offset = 0.4
The results at this stage of the tutorial are pepXML files for each of the 27 input data files.
Validating PSMs and Inferring Proteins
- Navigate to TPP Tools > Analyze Peptides
- For each of the 27 pepXML files, individually run PeptideProphet, iProphet, and ProteinProphet.
- Give each output a unique name for identification.
- Under PEPTIDEPROPHET OPTIONS:
- Filter out results below this PeptideProphet probability: 0
- Use accurate mass binning, using PPM
- Only use Expect Score as the discriminant
- Use decoy hits to pin down the negative distribution. Decoy protein names begin with: reverse
- Use Non-parameteric model
- Report decoy hits with a computed probability.
- Under IPROPHET OPTIONS:
- Select RUN iProphet
- Select Also run ProteinProphet on these results
The results at this stage will add probabilities to the pepXML files. Additionally, ipro.pepXML and ipro.protXML files will be generated for each of the 27 original pepXML files. These ipro files contain iProphet adjusted probabilities for peptide identifications and protein inferences, respectively.
Quantifying Proteins
- Navigate to TPP Tools > Quantify Label-Free (MS2)
- Add the 27 ipro.protXML files.
- Set the StPeter options:
- Specify FDR cutoff = 0.01
- Tolerance for matching MS2 peaks = 0.4
- Amount of protein sample measured = 2.0
The results of this step appends protein quantities to all protein identifications in the ipro.protXMLs at the specified 1% FDR.