Terry's compbio links
From SPCTools
Revision as of 19:50, 11 April 2014 by Tfarrah; revision as of 19:51, 11 April 2014 by Tfarrah (→Genomics stuff)
Line 41:
===Genomics stuff===
[http://picard.sourceforge.net/explain-flags.html SAM flags decoder]
[http://biobits.org/samtools_primer.html SAMtools Primer] revised July 2013
[http://arxiv.org/abs/1403.7486 SAMBLASTER] duplicate marking and structural variant read extraction
[https://www.broadinstitute.org/gatk/about/#typical-workflows Intro to GATK]
[http://supportres.illumina.com/documents/documentation/software_documentation/wgs/services_whole_genome_ug_15040892.pdf Illumina Whole-Genome Sequencing Services User Guide]
===Proteomics stuff===
Revision as of 19:51, 11 April 2014
Terry Farrah's Computational Biology Links
ISB specific stuff
Mail, Intranet, PubMed special link, ISB journal subscriptions, Reservations, paychex (0495 w123 bar w/o hyphen + pilipina), ISB contacts
Bus home: 64, 76, 71. To Hearthstone: 16. To ISB from home: 64, 71, 76 @65th; 74 @55th. To ISB from Trip: 66. To ISB from Betsy: 68, 73E.
Terry's notes on how to perform a Peptide Atlas build
Additional notes on PeptideAtlas software
Protein identification terminology
Using the PeptideAtlas SearchProteins tab
Dave Campbell's PABST peptide selector for targeted proteomics
Notes on searching and TPP usage from the Unix command line at ISB
Short TPP tutorial, Long TPP tutorial, TPP testimonials
Notes on database programming for SBEAMS
SBeams: SPC's experimental data storage system. Go here to run an SBeams test drive.
db.systemsbiology.net: Developer's back door into SBeams. Go here to contribute data.
db.systemsbiology.net/devTF/sbeams/cgi/main.cgi Terry's SBeams software development sandbox
Joe Slagel's Oct. 2009 notes on scheduling on the regis cluster
mspecLINE: linking disease to PeptideAtlas
Genomics stuff
SAMtools Primer revised July 2013
SAMBLASTER duplicate marking and structural variant read extraction
Illumina Whole-Genome Sequencing Services User Guide
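The SAMBLASTER entry above mentions duplicate marking. In a SAM file, a duplicate is recorded as one bit (0x400) of the integer FLAG field, alongside the other bits defined in the SAM specification. The following is a minimal Python sketch of decoding that field. The function names `decode_sam_flag` and `is_duplicate` are made up for illustration and do not come from SAMtools, Picard, or SAMBLASTER.

```python
# Flag bits and meanings as defined in the SAM format specification.
SAM_FLAGS = [
    (0x1, "read paired"),
    (0x2, "read mapped in proper pair"),
    (0x4, "read unmapped"),
    (0x8, "mate unmapped"),
    (0x10, "read reverse strand"),
    (0x20, "mate reverse strand"),
    (0x40, "first in pair"),
    (0x80, "second in pair"),
    (0x100, "not primary alignment"),
    (0x200, "read fails platform/vendor quality checks"),
    (0x400, "read is PCR or optical duplicate"),
    (0x800, "supplementary alignment"),
]


def decode_sam_flag(flag):
    """Return the descriptions of all bits set in a SAM FLAG value.

    Hypothetical helper for illustration only.
    """
    if not 0 <= flag < 0x1000:
        raise ValueError("SAM flag must be in the range 0..4095")
    return [name for bit, name in SAM_FLAGS if flag & bit]


def is_duplicate(flag):
    """True if the duplicate bit (0x400) is set."""
    return bool(flag & 0x400)


if __name__ == "__main__":
    for flag in (99, 1123):
        print(flag, decode_sam_flag(flag), "duplicate:", is_duplicate(flag))
```

A flag of 99 has bits 0x1, 0x2, 0x20 and 0x40 set; adding 0x400 (giving 1123) marks the same read as a duplicate.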
Proteomics stuff
JPR Jan 2014 C-HPP special issue
Peptide Atlas: a multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments.
TPP developers' group, TPP users' group
NIST Peptide Mass Spectral Libraries
Mayu: analysis of (large) mass spectrometry-based shotgun proteomics data sets. Mayu determines protein identification false discovery rates (protFDR), peptide identification false discovery rates (pepFDR) and peptide-spectrum match false discovery rates (mFDR) using a novel robust and fast strategy. Developed by Lukas Reiter.
X!Tandem parameters; X!Tandem home page
InsPecT sequence search engine
PRIDE PRoteomics IDEntification database
Terry's blog on improving proteomics peptide and protein identifications
Specifications for protXML, pepXML, and more
UniMod database of amino acid modifications
How to use the mProphet SRM results processing/validation pipeline
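The Mayu entry above refers to false discovery rates at the peptide-spectrum match, peptide, and protein levels. As a rough illustration of the idea at the PSM level only, the sketch below computes the simple target-decoy estimate (decoy hits divided by target hits above a score threshold). This is not Mayu's strategy, which the entry describes as novel. It assumes a concatenated search against target and decoy databases of equal size, and the function name `target_decoy_fdr` and the example scores are invented for illustration.

```python
def target_decoy_fdr(psms, score_threshold):
    """Estimate PSM-level FDR as (#decoy hits) / (#target hits).

    psms: iterable of (score, is_decoy) pairs, higher score = better match.
    Assumes equal-size target and decoy databases (an assumption of this
    sketch, not something stated in the source page).
    """
    targets = 0
    decoys = 0
    for score, is_decoy in psms:
        if score >= score_threshold:
            if is_decoy:
                decoys += 1
            else:
                targets += 1
    if targets == 0:
        return 0.0  # nothing accepted, so no false discoveries to estimate
    return min(1.0, decoys / targets)


# Made-up example data: (search engine score, matched a decoy sequence?)
EXAMPLE_PSMS = [
    (9.1, False),
    (8.7, False),
    (8.2, True),
    (7.9, False),
    (7.5, False),
    (6.0, True),
    (5.5, False),
]

if __name__ == "__main__":
    for threshold in (9.0, 7.0, 5.0):
        print(threshold, target_decoy_fdr(EXAMPLE_PSMS, threshold))
```

Lowering the threshold accepts more PSMs but also more decoy hits, so the estimated FDR rises. Peptide-level and protein-level rates need more care than this, because wrong PSMs do not spread evenly over peptides and proteins.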
General computational biology stuff
The Pipe Protein Information and Property Explorer, developed at ISB by Hector Ramos
Swiss-Prot curated protein sequence database
Amigo: browser for the Gene Ontology database
ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete: Download Swiss-Prot and other UniProt databases, updated every 3 weeks. UniProtKB/Swiss-Prot purportedly contains the canonical human proteome (The UniProt Consortium, "The Universal Protein Resource (UniProt) 2009", Nucleic Acids Research 2008, 1-6), consisting of 20,325 entries for UniProt release 14.0. Swiss-Prot release 56.6 of 16-Dec-08 contains 20,333 human proteins.
ftp://ftp.ncbi.nih.gov/repository/UniGene/Homo_sapiens: Download NCBI's human UniGene data. Contains chromosomal information, among many other data.
http://www.broad.mit.edu/~mclamp/alpheus/ Gene content of human genome: which Ensembl entries are real genes and which are not, according to Clamp et al., "Distinguishing protein-coding and noncoding genes in the human genome", PNAS, December 4, 2007, Vol. 104 No. 49, 19428-19433.
General programming stuff
Unix shell: bash/tcsh syntax differences
Bash: Bash shell reference manual, Bash tips, Bash tests and comparators.
Sed: Nice sed (stream editor) reference, sed one-liners
Vim: A Byte of Vim, the text editor. Sweet little Vim recipes. Line terminator info.
Perl: Steve Litt's Perls of Wisdom, Perl tutorials, and www.cs.cf.ac.uk/Dave/PERL/, another nice Perl reference.
PDL (Perl Data Language) quick start
Informative discussion of test module options
Python: www.python.org/doc: Helpful links for Python programmers.
svn: One-page version of the subversion book.
gnu screen: One-page ref manual. Screen allows you to run several terminal sessions from one window, and allows sessions to persist after you close the window. Very handy if you want to take your work home with you -- close windows, then resume screen from your home computer.
IDL: [1]
Drawing Venn Diagrams: [2]
Excel: String operations
General biology stuff
Awesome animation of the inner life of the cell
A few other Links
ReligiousTolerance.org: I saw one intelligently written article there. Their purpose is to provide unbiased essays on various topics.

