Professional Documents
Culture Documents
PA , PG , PC and PT PA + PG + PC + PT = 1 p0 = ( P A , P G , P C , P T )
M=
The chance that base A mutates to base C
PG|A
PC|A PT|A
PG|G
PC|G P+ T|G
=1
PG|C
PC|C PT|C
PG|T
PC|T PT|T
In each column are entries referring to the same ancestral base S0, and each row are entries referring to the same descendent base S1.
PA PG PC
PT|A
PT|G
PT|C
PT|T
PT
PA|A PA + M p0 =
PC|A PA +
PG|A PA + PT|A PA +
Using Eq. (4.1) page 132, and by applying similar reasoning: = P(S1 = T and S0 = A) + P(S1 = T and S0 = G) + P(S1 = T and S0 = C)+ P(S1 = T and S0 = T)
This is the sum of four probabilities of mutually exclusive events. By addition rule the probability of the union of the four events P(S1 = T )
M is a transition matrix
P(S0 = i) P(S1 = j)
i, j = A, G, C, and T
G 0 9 2 0
0 .818 .182 0
C 1 2 7 1
.091 .182 .636 .091
T 1 0 2 6
.111 0 .222 .667
p0 =
.111 0 .111
Important assumption:
What happens to the system over a given time step depends only on:
the state the system is in at the start of that step, (current state of the system) and the transition probabilities.
Markov Model/Chain/Process
Markov chain is a mathematical system that undergoes transitions from one state to another, between a finite number of possible states.
http://en.wikipedia.org/wiki/Markov_chain
Markov property is stated as the future is independent of the past given the present
http://www.columbia.edu/~ks20/stochastic-I/stochastic-I-TimeReversibility.pdf
The system is a site in a DNA sequence: the site is initially in one of 4 states (A,G,C, or T) p0 vector of initial probabilities that the system is in each of these states. (all entries 0) Markov/Transition matrix M (4 x 4) hold the conditional probabilities. (all entries 0 and the sum of each column = 1)
Assuming that each site in the sequence behaves identically and independently of every other site.
Markov Model
This assumption is not very reasonable for DNA in some genes. WHY?
The genetic code allows for many changes in the third site of each codon to have no effects on the product of the gene. Since genes may produce proteins, a change at one site may well be tied to changes at another (dependence).
Public Health
Useful Tutorial:
http://www.youtube.com/watch?v=7KGdE2AK_MQ&feature=relmfu