Solve Markov Chain Problem: Find Initial Probability Vector p(1)

  • Context: Graduate
  • Thread starter: celestra
  • Tags: Markov chain
SUMMARY

The discussion revolves around recovering the initial probability vector p(1) from a given realization sequence of a Markov chain defined by p(n+1)=Q*p(n), where Q is the transition probability matrix. Participants suggest Maximum Likelihood Estimation (MLE) and Maximum A Posteriori (MAP) estimation as potential methods. The transition matrix Q can be estimated from the sequence of ones and zeros by counting transitions, but determining p(1) remains difficult because the realization is generated stochastically. The conversation concludes that while p(1) is hard to pin down from a single realization, the stationarity assumption p = Qp and standard statistical methods can provide insights.

PREREQUISITES
  • Understanding of Markov chains and their mathematical representation.
  • Familiarity with transition probability matrices and their properties.
  • Knowledge of Maximum Likelihood Estimation (MLE) and Maximum A Posteriori (MAP) estimation techniques.
  • Basic concepts of probability theory and stochastic processes.
NEXT STEPS
  • Research how to apply Maximum Likelihood Estimation (MLE) in Markov chains.
  • Study the principles of Maximum A Posteriori (MAP) estimation and its applications in probability theory.
  • Explore the concept of stationary distributions in Markov processes.
  • Investigate the relationship between eigenvectors and transition matrices in Markov chains.
USEFUL FOR

Mathematicians, data scientists, and statisticians interested in Markov chain analysis, probability theory, and stochastic modeling will benefit from this discussion.

celestra
[SOLVED] Markov chain problem

Hi, all! This is not a homework thing or something like that. I'm just curious.

Suppose a Markov chain p(n+1)=Q*p(n), where Q=[q11, q12; q21, q22] is the transition probability matrix and p(n)=[p1(n);p2(n)] is the probability vector.

Given Q and p(1), we can generate a realization from it, i.e., a sequence of ones and zeros.

Now I want to do the inverse. Given a long enough sequence of ones and zeros, Q can be estimated by counting the transitions from zero to zero (which gives q11), one to zero (q12), zero to one (q21), and one to one (q22), and then normalizing each column so it sums to one. But how can I get the initial probability vector p(1) from this sequence? Maximum Likelihood Estimation, blah blah? I don't know about that. Please, someone give me a clue how I can do this. Or is it impossible to do from just one realization sequence?

Thanks in advance.
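As a minimal sketch of the forward direction described above (assuming NumPy; the helper name simulate_chain is illustrative, not taken from the thread): draw the first symbol from p(1), then draw each later symbol from the column of Q selected by the current state.

Code:
import numpy as np

def simulate_chain(Q, p1, n, rng=None):
    # Q[i, j] = probability of moving to state i given state j (columns sum to 1);
    # p1 is the distribution of the first symbol; returns a length-n 0/1 array.
    rng = np.random.default_rng() if rng is None else rng
    states = np.empty(n, dtype=int)
    states[0] = rng.choice(2, p=p1)                        # first symbol drawn from p(1)
    for t in range(1, n):
        states[t] = rng.choice(2, p=Q[:, states[t - 1]])   # column = current state
    return states

Q = np.array([[0.5, 0.5],
              [0.5, 0.5]])     # any 2x2 column-stochastic matrix works here
p1 = np.array([0.4, 0.6])      # 40% chance of a zero, 60% chance of a one
print(simulate_chain(Q, p1, 30))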
 
How is the realization r linked to p? If p1 > 0.5 then r = 1, otherwise r = 0?
 
Oh, I forgot that. The realization is not deterministic but stochastic through a kind of random number generation according to the probability vector. If p1=.4 and p2=.6, then the probability that a zero will come out is 40 percent and the probability that a one will come out is 60 percent.
 
So how do you construct your sequence of 1's and 0's? Toss a coin, with a 1 on one side and a 0 on the other?
 
It's a good idea. You can use any sequence if it is composed of zeros and ones.
 
For example, you are given this sequence: 010100101101001011010010111010.
Then, you can get Q as [3/14, 11/15; 11/14, 4/15].
And I have no idea how to get p(1) from this. Is it even possible?
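As a quick check of those numbers, here is a short sketch (assuming NumPy; not part of the original exchange) of the counting behind that matrix: tally the 0→0, 0→1, 1→0, and 1→1 transitions, then normalize each column by how often its "from" state occurs.

Code:
import numpy as np

seq = [int(c) for c in "010100101101001011010010111010"]

counts = np.zeros((2, 2))                  # counts[next, current]
for prev, nxt in zip(seq[:-1], seq[1:]):
    counts[nxt, prev] += 1

Q_hat = counts / counts.sum(axis=0)        # each column now sums to 1
print(Q_hat)                               # columns: [3/14, 11/14] and [11/15, 4/15]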
 
Two ideas:
1. assume uniform initial probability, p = 0.5
2. assume stationary Markov, then solve p = Qp.
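For idea 2, a small sketch (assuming NumPy; the helper name stationary is illustrative): the stationary vector is an eigenvector of Q for eigenvalue 1, rescaled so its entries sum to one.

Code:
import numpy as np

def stationary(Q):
    # a column-stochastic Q always has eigenvalue 1; take the matching eigenvector
    vals, vecs = np.linalg.eig(Q)
    k = np.argmin(np.abs(vals - 1.0))
    p = np.real(vecs[:, k])
    return p / p.sum()

Q = np.array([[3/14, 11/15],
              [11/14, 4/15]])               # matrix counted from the example sequence
print(stationary(Q))                        # about [0.483, 0.517], i.e. [14/29, 15/29]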
 
It's impossible to completely determine the probability matrix from samples, even if we stipulate that it's a Markov process, but it is possible to use statistics to find what the probabilities are likely to be.

Any probability matrix where all entries are non-zero can produce any sequence. For example, a process with the probability matrix
\[ \left[ \begin{array}{cc} \frac{1}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{1}{2} \end{array} \right] \]
will produce any particular sequence of length n with probability \( \frac{1}{2^n} \) (given a uniform initial distribution).

So, assuming an even initial distribution, it would generate 010100101101001011010010111010 about \( \frac{1}{2^{30}} \) of the time.

The probability matrix you produced,
\[ \left[ \begin{array}{cc} \frac{3}{14} & \frac{11}{15} \\ \frac{11}{14} & \frac{4}{15} \end{array} \right], \]
is just the one that has the highest likelihood of producing the sequence you gave, but because the sample is relatively small, there really isn't a whole lot of confidence that it's correct.

This 'reversing probability' stuff is a bit more advanced.
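To make the "highest likelihood" point concrete, here is a hedged sketch (assuming NumPy; not from the thread) that scores the example sequence under two candidate matrices, both with a uniform initial distribution; a larger (less negative) log-likelihood means that model assigns the sequence more probability.

Code:
import numpy as np

seq = [int(c) for c in "010100101101001011010010111010"]

def log_likelihood(Q, p1, seq):
    ll = np.log(p1[seq[0]])                    # initial symbol
    for prev, nxt in zip(seq[:-1], seq[1:]):
        ll += np.log(Q[nxt, prev])             # one factor per transition
    return ll

p1 = np.array([0.5, 0.5])
Q_counts = np.array([[3/14, 11/15], [11/14, 4/15]])   # transition-count estimate
Q_fair   = np.array([[0.5, 0.5], [0.5, 0.5]])         # fair-coin model

print(log_likelihood(Q_counts, p1, seq))   # higher: the counted matrix fits the data better
print(log_likelihood(Q_fair,   p1, seq))   # 30 * log(1/2), about -20.8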
 
What about MAP (Maximum A Posteriori) estimation, \( \hat{x} = \arg\max_x p(x \mid y) \), since we're interested in the random variable x, which is p(1) in this case, given the data y, which is the realization sequence? I feel like this will work. The a posteriori pdf p(x|y) can be calculated using Bayes's theorem, \( p(x \mid y) = \frac{p(y \mid x)}{p(y)} p(x) \), where p(x) is the a priori pdf, p(y) is the marginal pdf, and p(y|x) is the likelihood function. And we can put \( \left[ \begin{array}{c} .5 \\ .5 \end{array} \right] \) in the place of the a priori pdf p(x) using the maximum entropy principle, as EnumaElish did in his first idea. But I don't know how to calculate the rest. I've been looking for the calculation in some books, but I haven't found it yet. I guess it's just MLE (Maximum Likelihood Estimation) after all. But I don't know how to do that.
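One way to turn that MAP idea into a calculation is sketched below (an assumed setup, not a result from the thread): treat x as p1 = P(first symbol is 0), hold Q fixed at the counted estimate, put a uniform prior on p1, and take the argmax of prior times likelihood over a grid.

Code:
import numpy as np

seq = [int(c) for c in "010100101101001011010010111010"]
Q = np.array([[3/14, 11/15], [11/14, 4/15]])    # held fixed at the counted estimate

def likelihood(p1_zero, seq, Q):
    p1 = np.array([p1_zero, 1.0 - p1_zero])
    like = p1[seq[0]]                           # only this factor involves p(1)
    for prev, nxt in zip(seq[:-1], seq[1:]):
        like *= Q[nxt, prev]
    return like

grid = np.linspace(0.001, 0.999, 999)           # candidate values of p1
prior = np.ones_like(grid)                      # uniform (maximum-entropy) prior
posterior = prior * np.array([likelihood(g, seq, Q) for g in grid])
posterior /= posterior.sum()

print(grid[np.argmax(posterior)])
# With Q fixed, the likelihood depends on p(1) only through the first observed
# symbol, so a single realization pins p(1) down very weakly.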
 
I don't like the stationarity assumption, because I want to be able to handle even a sequence like this: 010100100100000000000000000000000110100101.
 
Isn't there a p that solves p = Qp for any arbitrary Q (and hence for an arbitrary sequence)?
 
I made a mistake. You're right, EnumaElish. :wink: Any stochastic Q has at least one eigenvector with eigenvalue 1, so your stationarity assumption will be fine. It seems like the best idea for my problem at the moment.
 
