TMHMM¶
Description¶
TMHMM is a program to predict the location of transmembrane helices in proteins, directly from sequence. ssbio provides a wrapper to execute and parse the “long” output format of TMHMM.
Installation instructions (Unix)¶
Note
These instructions were created on an Ubuntu 17.04 system.
Register for the software (academic license only) at the TMHMM download page
Receive instructions to download the software at your email address
Download the file tmhmm-2.0c.Linux.tar.gz
Extract it to a place where you store software
Install it according to the TMHMM installation instructions, repeated and annotated below…
- Insert the correct path for perl 5.x in the first line of the scripts bin/tmhmm and bin/tmhmmformat.pl (if not
/usr/local/bin/perl
). Usewhich perl
andperl -v
in the terminal to help find the correct path. - Make sure you have an executable version of decodeanhmm in the bin directory.
- Include the directory containing tmhmm in your path (how do I add something to my Path?)
- Read the TMHMM2.0.guide.html
- Insert the correct path for perl 5.x in the first line of the scripts bin/tmhmm and bin/tmhmmformat.pl (if not
Program execution¶
In the shell¶
To run the program on its own, execute the following command with your protein sequences contained in a FASTA file:
tmhmm my_sequences.fasta
With ssbio¶
To run the program using the ssbio Python wrapper, see: ssbio.protein.path.to.wrapper()
FAQs¶
How do I cite TMHMM?
- Krogh A, Larsson B, von Heijne G & Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 305: 567–580 Available at: http://dx.doi.org/10.1006/jmbi.2000.4315
I’m having issues running TMHMM…
- See the ssbio wiki for (hopefully) some solutions - or add yours in when you find the answer!
API¶
-
ssbio.protein.sequence.properties.tmhmm.
label_TM_tmhmm_residue_numbers_and_leaflets
(tmhmm_seq)[source]¶ Determine the residue numbers of the TM-helix residues that cross the membrane and label them by leaflet.
Parameters: tmhmm_seq – g.protein.representative_sequence.seq_record.letter_annotations[‘TM-tmhmm’] Returns: a dictionary with leaflet_variable : [residue list] where the variable is inside or outside TM_boundary dict: outputs a dictionar with : TM helix number : [TM helix residue start , TM helix residue end] Return type: leaflet_dict Todo
untested method!