“School of Biological”
Back to Papers HomeBack to Papers of School of Biological
Paper IPM / Biological / 13289  


Abstract:  
A profile hidden Markov model (PHMM) is widely used in assigning protein sequences to protein families. In this model, the hidden states only depend on the previous hidden state and observations are independent given hidden states. In other words, in the PHMM, only the information of the left side of a hidden state is considered. However, it makes sense that considering the information of the both left and right sides of a hidden state can improve the assignment task. For this purpose, bidirectional profile hidden Markov model (BPHMM) can be used. Also, because of the evolutionary relationship between sequences in a protein family, the information of the corresponding amino acid in the preceding sequence of residues in the PHMM can be considered. For this purpose the hidden Markov random field on regular lattice (HMRFRL) is introduced. In a PHMM, the parameters are defined by the transition and emission probability matrices. The parameters are usually estimated using an EM (ExpectationMaximization) algorithm known as BaumWelch algorithm. In this paper, the bidirectional BaumWelch algorithm and theBaumWelch algorithm on regular lattice are defined for estimating the parameters of the BPHMM and the HMRFRL respectively. We also compare the performance of common BaumWelch algorithm, bidirectional BaumWelch algorithm and the BaumWelch algorithm on regular lattice by applying them to the real top ten protein families from Pfam database. Results show that using the lattice model for sequence assignment increases the number of correctly assigned protein sequences to profiles compared to BPHMM .
Download TeX format 

back to top 