17

Hidden Markov Models: applications

instructor: Ross A. Lippert

http://www-math.mit.edu/~lippert/18.417/

Announcements:
  • Problem set 6 available


Review of hidden Markov model quantities

T(i,j) : prob(state j from state i)

E(i,x) : prob(letter x emitted from state i)

Two traditionally important auxillary quantities

Viterbi algorithm, tropicalized f and b


A short trip to the tropical semi-ring

Statistical mechanics motivation: partition functions

defn of a semi-ring: (Set, id+, id*, op+, op*) closed and has distributive law

Example: (R>=0, 0, 1, +, *), the non-negative reals

Example: ({R,inf}, inf, 0, min_T, +), the ``Boltzmann'' semi-ring

Example: ({R,-inf}, -inf, 0, max, +), the tropical semi-ring

Interesting results:


Training an HMM

Maximum likelihood estimation (ML):

Maximum a posteriori (MAP) also used

Nonlinear optimization problem with constraints: M >= 0, E >= 0, M 1 = 1, E 1 = 1


Expectation maximization

Very commonly used in machine learning

An outgrowth of general considerations of gradient search for constrained polynomials

a^ = avg # of times 'a' happens

Baum-Welch EM algorithm

Converges to a local optimum


Global alignment

The maximum likelihood path in this HMM is equivalent to global alignment with affine gap penalties

HMM for some sequence y:

Equivalence:


A graphical depiction of the ML path


Extensions of alignment via HMMs

Using HMMs we can align to profiles:

Krogh et al: took this idea to an extreme

What can a multisequence HMM tell us?


GENSCAN

Chris Burge, et al

leading HMM for gene discovery


GenScan's HMM

A generalized HMM

Double stranded: model divided into forward and reverse strands