Paper   IPM / Biological Sciences / 13170
School of Biological Sciences
  Title:   A Modified Hidden Markov Model And Its Application In Protein Secondary Structure Prediction
  Author(s): 
1.  sima
2.  V. Rezaei
3.  H. Pezeshk
4.  D. Matthews
  Status:   Published
  Journal: Proteomics Bioinform
  No.:  1
  Vol.:  5
  Year:  2012
  Supported by:  IPM
  Abstract:
One of the important tools in analyzing and modeling biological data is the Hidden Markov Model (HMM), which is used for gene prediction, protein secondary structure prediction and other essential tasks. An HMM is a stochastic process in which a hidden Markov chain, called the chain of states, emits a sequence of observations. Using this sequence, various questions about the underlying emission generation scheme can be addressed. Applying an HMM to any particular situation is an attempt to infer which state in the chain emits an observation. This is usually called posterior decoding. In general, the emissions are assumed to be conditionally independent of each other given the states. In this work we consider some dependencies among the states and emissions. The aim of our research is to study a certain relationship among emissions, with a focus on the Markov property. We assume that the probability of observing an emission depends not only on the current state but also on the previous state and one of the previous emissions. We also use additional environmental information, and classify amino acids into three groups using the Relative Solvent Accessibility (RSA). We investigate how this modification changes the current algorithms for ordinary HMMs, and introduce modified Viterbi and Forward-Backward algorithms for the new model. We apply our proposed model to an actual dataset concerning prediction of the protein secondary structure and demonstrate improved accuracy compared to the ordinary HMM. In particular, the overall accuracy of our modified HMM, which uses the RSA information, is 63.95
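To make the dependency structure described in the abstract concrete, here is a minimal Python sketch of a Viterbi-style recursion in which each emission depends on the current state, the previous state, and a previous emission. It is an illustration under stated assumptions, not the authors' algorithm: it assumes "one of the previous emissions" means the immediately preceding one, it omits the RSA grouping entirely, and the function name, parameter layout, two-state alphabet and all probabilities are hypothetical toy values.

```python
import math


def _log(p):
    # Log-probability that maps zero to -inf instead of raising.
    return math.log(p) if p > 0 else float("-inf")


def viterbi_modified(obs, states, start, trans, emit_first, emit):
    """Most probable state path under an assumed modified HMM.

    start[s]            : P(s_1 = s)
    trans[r][s]         : P(s_t = s | s_{t-1} = r)
    emit_first[s][x]    : P(x_1 = x | s_1 = s)
    emit[(r, s, y)][x]  : P(x_t = x | s_{t-1} = r, s_t = s, x_{t-1} = y)

    Returns (path, log-probability of that path together with obs).
    """
    if not obs:
        raise ValueError("obs must be non-empty")

    # Initialisation: the first emission has no predecessor.
    score = {s: _log(start[s]) + _log(emit_first[s][obs[0]]) for s in states}
    backpointers = []

    for t in range(1, len(obs)):
        prev_x, x = obs[t - 1], obs[t]
        new_score, pointer = {}, {}
        for s in states:
            # The emission term depends on the (previous state, current state)
            # pair, so it sits inside the maximisation over r.
            def cand(r):
                return (score[r] + _log(trans[r][s])
                        + _log(emit[(r, s, prev_x)][x]))

            best_r = max(states, key=cand)
            pointer[s] = best_r
            new_score[s] = cand(best_r)
        score = new_score
        backpointers.append(pointer)

    # Trace back from the best final state.
    last = max(states, key=lambda s: score[s])
    path = [last]
    for pointer in reversed(backpointers):
        path.append(pointer[path[-1]])
    path.reverse()
    return path, score[last]


# Hypothetical toy model: two states, two symbols.
states = ["H", "C"]
start = {"H": 0.6, "C": 0.4}
trans = {"H": {"H": 0.7, "C": 0.3}, "C": {"H": 0.2, "C": 0.8}}
emit_first = {"H": {"a": 0.8, "b": 0.2}, "C": {"a": 0.3, "b": 0.7}}
p_a = {  # P(x_t = "a" | previous state, current state, previous symbol)
    ("H", "H", "a"): 0.9, ("H", "H", "b"): 0.6,
    ("H", "C", "a"): 0.5, ("H", "C", "b"): 0.25,
    ("C", "H", "a"): 0.75, ("C", "H", "b"): 0.55,
    ("C", "C", "a"): 0.35, ("C", "C", "b"): 0.1,
}
emit = {key: {"a": p, "b": 1.0 - p} for key, p in p_a.items()}

path, logp = viterbi_modified("aabba", states, start, trans, emit_first, emit)
print(path, logp)
```

Because the previous emission is observed, the recursion still runs over single states; the only change from the ordinary Viterbi algorithm in this sketch is that the emission term moves inside the maximisation over the previous state.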
more info: http://www.omicsonline.org/0974-276X/JPB-05-024.php
