Pfam

From CLAB

Jump to: navigation, search

Pfam is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs).

[edit] Attaching to EMBOSS

$ cd /clab_bdb/pfam
$ wget ftp://ftp.sanger.ac.uk/pub/databases/Pfam/current_release/*.*
$ gzip -d Pfam_fs.gz Pfam_ls.gz

Verifying that it is functional:

$ ehmmpfam
MMER hidden markov model file: /clab_bdb/pfam/Pfam_fs
Input (gapped) protein sequence(s): UniProt:194K_TRVSY
Personal tools