Main Page
From SPCTools
Revision as of 19:45, 10 September 2007 Jtasman (Talk | contribs) (removing prequips link) ← Previous diff |
Current revision Dshteynb (Talk | contribs) (→The Trans-Proteomic Pipeline (TPP): MS2 proteomics validation and analysis) |
||
Line 1: | Line 1: | ||
- | __NOTOC__ | + | The '''Seattle Proteome Center (SPC)''' is committed to providing free, open-source, software projects in support of cutting-edge proteomics research. The SPC maintains active support and education for the open-source proteomics community. |
- | == Seattle Proteome Center Software Tools Wiki == | + | |
- | Welcome to the Seattle Proteome Center (SPC) software tools wiki. This wiki is primarily intended as means of sharing user documentation for the SPC tools. We encourage outside contribution, but please make your edits responsibly. | + | ==Education, outreach, training and support== |
- | <div class="MainPageBG" style="border: 4px solid #ffad08; padding: .5em 1em; color: #000; background-color: #ffffff; margin: 3px 3px 0; text-align: left"> | + | |
- | * To edit or create pages, log in before you start editing pages. Thanks! | + | *SPC/TPP course: |
- | * The tools change rapidly, so make a note of what version you are using if you edit information. | + | ** No courses scheduled at the moment. Please check back soon! |
+ | ** [https://moritz.isbscience.org/courses/proteomics-informatics-course/ Information and materials from previous courses] | ||
- | '''Latest TPP version: 3.2.1 (June 20, 2007)''' | + | *Outreach at conferences: |
+ | ** None scheduled at the moment | ||
- | '''News:''' | + | *Active support: all users, both new and experienced, are encouraged to participate in our very active [http://groups.google.com/group/spctools-discuss spctools-discuss discussion group]. Daily discussions range from installation support to advanced data processing questions and feature requests. Please also see [http://groups.google.com/group/spctools-announce spctools-announce discussion group] for important notifications of updates to our software. |
- | *'''New wiki accounts are temporarily disabled'''. Due to high levels of spam we've temporarily halted creation of new accounts. We will open the wiki again when we have stronger anti-spam measures in place. In the meanwhile, anyone who wants an account can contact Josh (you should be able to find email contact through the spctools-discuss newsgroup website, see below.) | + | |
- | *For developers: The Sashimi project has migrated to Subversion (SVN) from CVS. Please see the spctools-discuss newsgroup for more info. | + | |
- | *The Windows/Cygwin TPP distribution now includes [[TPP:X!Tandem and the TPP|X!Tandem]] with the K-Score plugin! While the PeptideProphet model is still undergo refinement, this brings us closer to offering a completely open-source, free proteomics solution! | + | |
- | </div> | + | ==[[Formats:Overview|SPC open proteomics data formats]]: mzXML, pepXML, protXML== |
+ | Learn more about the SPC-developed open formats, which include [[Formats:mzXML|mzXML]], [[Formats:pepXML|pepXML]], and [[Formats:protXML|protXML]] on the [[Formats:Overview|Formats]] page. | ||
- | == SPC Software== | + | ==The [[Software:TPP|Trans-Proteomic Pipeline (TPP)]]: MS2 proteomics validation and analysis== |
+ | The Trans-Proteomic Pipeline (TPP) is a completely free software | ||
+ | solution for MS/MS-based shotgun proteomics analysis with a well-established | ||
+ | worldwide user community. The TPP is | ||
+ | available on Linux and Windows systems and builds on Mac OS X as well. The TPP spans | ||
+ | * conversion of raw data to open formats; | ||
+ | * support for searching spectra and peptide identification with various popular search engines, including the bundled X!Tandem search software (from The GPM) and [http://comet-ms.sourceforge.net/ Comet], as well as spectral library searching with [[SpectraST]], and cross-linking data with [http://www.kojak-ms.org/ Kojak]; | ||
+ | * conversion of search engine results to a unified format; | ||
+ | * statistical validation of peptide and protein identifications; | ||
+ | * quantitative proteomics analysis tools; | ||
+ | * and tools for interaction with and visualizing results. | ||
+ | * '''The latest TPP version is ''7.0.0,'' released ''March 2024''.''' | ||
+ | ** Download directly at: [https://sourceforge.net/projects/sashimi/files/Trans-Proteomic%20Pipeline%20%28TPP%29/TPP%20v7.0%20%28Arafel%29%20rev%200/TPP_7.0.0-src.tgz/download Sourceforge] | ||
+ | ** Release Notes: [[TPP:7.0.0_Release_Notes]] | ||
+ | *Additional [[Software:TPP | TPP documentation]] | ||
- | The SPC is committed to providing open-source, free software to support proteomics research. The | + | ==The [[Software:Firegoose, Gaggle, and PIPE|Protein Information and Property Explorer (PIPE)]]== |
- | [[Software:Overview| SPC Software Overview]] page lists (almost) all of the open-source software projects produced at the SPC. Major projects: | + | After data processing with the TPP, the PIPE is a starting point for the next step in analysis of proteomic experimental results. It is a launching pad from where simple operations can be performed on this data and messages can be passed to other more sophisticated analysis software such as the Gaggle. |
- | * '''TPP''': our proteomics pipeline software is described on the [[TPP:Documentation|Trans-Proteomic Pipeline (TPP) Overview]] page. | + | |
- | * '''Open Data Formats''': You can learn more about the SPC-developed open-formats on the [[Formats:Overview|Formats]] page. | + | |
- | * [[Software:SBEAMS|SBEAMS Project]]: SPC's high-powered LIMS database system. | + | |
- | ==Links== | + | Once protein identifiers have been established with the TPP, the PIPE can perform several different operations on the list of IDs, including: mapping to other IDs (entrez gene, unigene, uniprot, etc.), Gene Ontology enrichment calculations (to see what biological processes are overrepresented in the list), annotation of the list to specific GO terms, and more. The PIPE can also serve as a data management tool, saving lists with user specified metadata (i.e., description of the data). |
- | ===SPC Links=== | + | |
- | *newsgroups: '''subscriptions highly recommended for SPC Tools users''' | + | |
- | **[http://groups.google.com/group/spctools-discuss spctools-discuss discussion group]: very active, daily discussions ranging from installation to data processing. All users, new and experienced, encouraged to participate. | + | |
- | **[http://groups.google.com/group/spctools-announce spctools-announce discussion group]: infrequent, important notifications of updates to our software | + | |
- | *[http://www.sourceforge.net/projects/sashimi Sashimi Project Page on SourceForge]: Of interest to developers-- our code repository and software distribution site. | + | |
- | *[http://www.proteomecenter.org/course.php SPC Proteomics Course]: Next course is '''November 5-9, 2007''' | + | |
- | *[http://tools.proteomecenter.org/software.php Seattle Proteome Center (SPC) general website]: about our organization. | + | |
- | *[http://sashimi.sourceforge.net Sashimi Website]: an older documentation site, mainly containing useful information about the [[Formats:mzXML|mzXML]] format. | + | |
- | ===Non-SPC, related links=== | + | ==[[Software:Corra|Corra]]: Label-free MS1-based biomarker discovery pipeline== |
- | *[http://gaggle.systemsbiology.org The Gaggle project]: The ISB's data interaction, visualization, and exploration system, increasingly integrated with the [[Software:TPP|TPP]] | + | Corra is open source software for discovering biomarkers by detecting differentially expressed features/peptides using a MS1 based processing pipeline. Corra has an extensible software architecture to plug-in new MS1 feature extraction alignment modules as well as statistical packages. |
- | *[http://www.cytoscape.org Cytoscape Project]: Network visualization tool, integrated with the Gaggle. | + | |
- | *[http://www.peptideatlas.org PeptideAtlas]: The world's largest public repository of MS/MS spectra, developed and hosted at the ISB. | + | |
- | *[http://www.systemsbiology.org/Resources_and_Development/Current_Course_Offerings/Introduction_to_Systems_Biology Intro to Systems Biology course] offered at the ISB | + | |
- | == Wiki Guidlines == | + | ==[[Software:TIQAM|TIQAM]]: Targeted Identification for Quantitative Analysis by MRM== |
- | # These tools change rapidly. Please include the version when describing anything. | + | TIQAM supports hypothesis-driven biomarker validation by SRM (Selected Reaction Monitoring) for targeted proteomics. The TIQAM suite is a desktop application which assists in target selection and validation, and contains: |
- | # These pages should '''not''' be used to report bugs. Please use the appropriate support avenues | + | * TIQAM-PeptideAtlas, which interfaces with the public observed MS/MS database, PeptideAtlas; |
+ | * TIQAM-Digestor integrates TIQAM-PeptideAtlas output, pepXML files, and other data in order to select transition lists according to user-defined priorities; | ||
+ | * TIQAM-viewer provides a visual aid to analyze SRM-triggered MS/MS experiments, and allow users to catalog the validated transitions. | ||
- | Refer to [http://meta.wikimedia.org/wiki/MediaWiki_User's_Guide#For_editors MediaWiki user's guide] for an introduction to formatting and editing pages. | + | ==[[Software:SBEAMS|SBEAMS-Proteomics]]: Multi-experiment data management== |
+ | [[Software:SBEAMS|SBEAMS Project]]: SPC's high-powered LIMS database system has a proteomics module for managing and tracking large amounts (terrabytes) of data. | ||
+ | |||
+ | SBEAMS-Proteomics is part of the SBEAMS (Systems Biology Experiment Analysis | ||
+ | Management System) Project, which is a framework for collecting, storing, | ||
+ | and accessing data produced by a variety of different experiments. These | ||
+ | experiments can be managed separately but then correlated later under the | ||
+ | same framework. This integrated system combines a unified relational database management system (RDBMS) back end, a | ||
+ | collection of tools to store, manage, and query experiment information and | ||
+ | results in the RDBMS, a web front end for querying the database and | ||
+ | providing integrated access to remote data sources, and an interface to | ||
+ | existing programs for clustering and other analysis. The knowledge stored while analyzing experiments remains | ||
+ | easily accessible at later times and by other investigators to fully | ||
+ | capitalize on previous work. | ||
+ | |||
+ | ==[[Software:PeptideAtlas|PeptideAtlas]]== | ||
+ | PeptideAtlas is a multi-organism, publicly accessible compendium of peptides | ||
+ | identified in a large set of tandem mass spectrometry proteomics | ||
+ | experiments. Mass spectrometer output files are collected for human, mouse, | ||
+ | yeast, and several other organisms, and searched using the latest search | ||
+ | engines and protein sequences. All results of sequence and spectral library | ||
+ | searching are subsequently processed through the Trans Proteomic Pipeline to | ||
+ | derive a probability of correct identification for all results in a uniform | ||
+ | manner to insure a high quality database, along with false discovery rates | ||
+ | at the whole atlas level. Results may be queried and browsed at the | ||
+ | PeptideAtlas web site. The raw data, search results, and full builds can | ||
+ | also be downloaded for other uses. |
Current revision
The Seattle Proteome Center (SPC) is committed to providing free, open-source, software projects in support of cutting-edge proteomics research. The SPC maintains active support and education for the open-source proteomics community.
Education, outreach, training and support
- SPC/TPP course:
- No courses scheduled at the moment. Please check back soon!
- Information and materials from previous courses
- Outreach at conferences:
- None scheduled at the moment
- Active support: all users, both new and experienced, are encouraged to participate in our very active spctools-discuss discussion group. Daily discussions range from installation support to advanced data processing questions and feature requests. Please also see spctools-announce discussion group for important notifications of updates to our software.
SPC open proteomics data formats: mzXML, pepXML, protXML
Learn more about the SPC-developed open formats, which include mzXML, pepXML, and protXML on the Formats page.
The Trans-Proteomic Pipeline (TPP): MS2 proteomics validation and analysis
The Trans-Proteomic Pipeline (TPP) is a completely free software solution for MS/MS-based shotgun proteomics analysis with a well-established worldwide user community. The TPP is available on Linux and Windows systems and builds on Mac OS X as well. The TPP spans
- conversion of raw data to open formats;
- support for searching spectra and peptide identification with various popular search engines, including the bundled X!Tandem search software (from The GPM) and Comet, as well as spectral library searching with SpectraST, and cross-linking data with Kojak;
- conversion of search engine results to a unified format;
- statistical validation of peptide and protein identifications;
- quantitative proteomics analysis tools;
- and tools for interaction with and visualizing results.
- The latest TPP version is 7.0.0, released March 2024.
- Download directly at: Sourceforge
- Release Notes: TPP:7.0.0_Release_Notes
- Additional TPP documentation
The Protein Information and Property Explorer (PIPE)
After data processing with the TPP, the PIPE is a starting point for the next step in analysis of proteomic experimental results. It is a launching pad from where simple operations can be performed on this data and messages can be passed to other more sophisticated analysis software such as the Gaggle.
Once protein identifiers have been established with the TPP, the PIPE can perform several different operations on the list of IDs, including: mapping to other IDs (entrez gene, unigene, uniprot, etc.), Gene Ontology enrichment calculations (to see what biological processes are overrepresented in the list), annotation of the list to specific GO terms, and more. The PIPE can also serve as a data management tool, saving lists with user specified metadata (i.e., description of the data).
Corra: Label-free MS1-based biomarker discovery pipeline
Corra is open source software for discovering biomarkers by detecting differentially expressed features/peptides using a MS1 based processing pipeline. Corra has an extensible software architecture to plug-in new MS1 feature extraction alignment modules as well as statistical packages.
TIQAM: Targeted Identification for Quantitative Analysis by MRM
TIQAM supports hypothesis-driven biomarker validation by SRM (Selected Reaction Monitoring) for targeted proteomics. The TIQAM suite is a desktop application which assists in target selection and validation, and contains:
- TIQAM-PeptideAtlas, which interfaces with the public observed MS/MS database, PeptideAtlas;
- TIQAM-Digestor integrates TIQAM-PeptideAtlas output, pepXML files, and other data in order to select transition lists according to user-defined priorities;
- TIQAM-viewer provides a visual aid to analyze SRM-triggered MS/MS experiments, and allow users to catalog the validated transitions.
SBEAMS-Proteomics: Multi-experiment data management
SBEAMS Project: SPC's high-powered LIMS database system has a proteomics module for managing and tracking large amounts (terrabytes) of data.
SBEAMS-Proteomics is part of the SBEAMS (Systems Biology Experiment Analysis Management System) Project, which is a framework for collecting, storing, and accessing data produced by a variety of different experiments. These experiments can be managed separately but then correlated later under the same framework. This integrated system combines a unified relational database management system (RDBMS) back end, a collection of tools to store, manage, and query experiment information and results in the RDBMS, a web front end for querying the database and providing integrated access to remote data sources, and an interface to existing programs for clustering and other analysis. The knowledge stored while analyzing experiments remains easily accessible at later times and by other investigators to fully capitalize on previous work.
PeptideAtlas
PeptideAtlas is a multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments. Mass spectrometer output files are collected for human, mouse, yeast, and several other organisms, and searched using the latest search engines and protein sequences. All results of sequence and spectral library searching are subsequently processed through the Trans Proteomic Pipeline to derive a probability of correct identification for all results in a uniform manner to insure a high quality database, along with false discovery rates at the whole atlas level. Results may be queried and browsed at the PeptideAtlas web site. The raw data, search results, and full builds can also be downloaded for other uses.