Pfam

From CLAB

Revision as of 21:51, 16 January 2009 by Jhannah (Talk | contribs)

Pfam is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs).

Attaching to EMBOSS

$ cd /clab_bdb/pfam
$ wget 'ftp://ftp.sanger.ac.uk/pub/databases/Pfam/current_release/*.*'
$ gzip -d Pfam_fs.gz Pfam_ls.gz
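
Before running any searches, it may help to confirm that both decompressed HMM libraries are actually in place. The following is a minimal sketch, not part of Pfam or EMBOSS: the function name check_pfam_files is hypothetical, and it assumes the files are named Pfam_fs and Pfam_ls as in the gzip step above.

```shell
# Hypothetical helper (not part of Pfam or EMBOSS): confirm that the
# decompressed HMM libraries exist and are non-empty.
# Returns 0 if both files are present, 1 otherwise.
check_pfam_files() {
    dir="${1:-/clab_bdb/pfam}"   # default: the directory used above
    status=0
    for name in Pfam_fs Pfam_ls; do
        if [ -s "$dir/$name" ]; then
            echo "ok: $dir/$name"
        else
            echo "missing or empty: $dir/$name" >&2
            status=1
        fi
    done
    return "$status"
}
```

Calling check_pfam_files /clab_bdb/pfam prints one line per file and returns a non-zero status if either library is missing or empty, which usually points to an interrupted download or a failed gzip -d.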

Verifying that it is functional:

$ ehmmpfam
HMMER hidden markov model file: /clab_bdb/pfam/Pfam_fs
Input (gapped) protein sequence(s): UniProt:194K_TRVSY