Solve Markov Chain Problem: Find Initial Probability Vector p(1)

  • Context: Graduate 
  • Thread starter: celestra
  • Tags: Chain, Markov chain

Discussion Overview

The discussion revolves around the challenge of determining the initial probability vector p(1) of a Markov chain from a given realization sequence of binary outcomes (ones and zeros). Participants explore various methods and assumptions related to this problem, including Maximum Likelihood Estimation (MLE) and Maximum A Posteriori (MAP) estimation, while questioning the feasibility of deriving p(1) solely from the sequence.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant suggests that the transition probability matrix Q can be derived from the sequence by counting transitions between states.
  • Another participant questions how the realization is linked to the probability vector, proposing a stochastic interpretation of the outcomes based on the values of p1 and p2.
  • Some participants propose using a uniform initial probability assumption or a stationary Markov process to estimate p(1).
  • It is noted that it is impossible to completely determine the probability matrix from samples, but statistics can provide likely estimates.
  • One participant introduces the idea of using MAP estimation and discusses the application of Bayes's theorem to calculate the posterior probability of p(1) given the realization.
  • Concerns are raised about the stationarity assumption, with one participant expressing a desire to analyze non-stationary sequences.
  • A later reply acknowledges the validity of the stationarity assumption, noting that every stochastic Q has at least one eigenvector with eigenvalue 1, i.e., a stationary distribution.

Areas of Agreement / Disagreement

Participants express multiple competing views on how to approach the problem of determining p(1) from the realization sequence. There is no consensus on a definitive method or solution, and the discussion remains unresolved.

Contextual Notes

Participants highlight limitations in deriving p(1) from a single realization sequence, noting the dependence on assumptions about the initial distribution and the nature of the Markov process. The discussion also reflects uncertainty regarding the applicability of different estimation methods.

celestra
[SOLVED] Markov chain problem

Hi, all! This is not a homework thing or something like that. I'm just curious.

Suppose a Markov chain p(n+1)=Q*p(n), where Q=[q11, q12; q21, q22] is the transition probability matrix and p(n)=[p1(n);p2(n)] is the probability vector.

Given Q and p(1), we can generate a realization from it, i.e., a sequence of ones and zeros.
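For concreteness, here is a minimal sketch of that generation step in Python, assuming the column-stochastic convention implied by p(n+1) = Q p(n), i.e., Q[i, j] = P(next state = i | current state = j); the example values of Q and p(1) are made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate(Q, p1, n):
    """Draw a length-n realization of the two-state chain p(n+1) = Q p(n).

    Q is column-stochastic: Q[i, j] = P(next state = i | current state = j).
    p1 is the initial probability vector [P(state 0), P(state 1)].
    """
    state = rng.choice(2, p=p1)                # first symbol is drawn from p(1)
    seq = [state]
    for _ in range(n - 1):
        state = rng.choice(2, p=Q[:, state])   # column `state` holds the transition law
        seq.append(state)
    return seq

Q = np.array([[0.2, 0.7],    # columns sum to 1
              [0.8, 0.3]])
print(simulate(Q, np.array([0.4, 0.6]), 30))
```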

Now I want to do the inverse. Given a long enough sequence of ones and zeros, Q can be obtained by counting the transitions from zero to zero (which gives q11), zero to one (q21), one to zero (q12), and one to one (q22), and normalizing each count by the number of transitions out of that state. But how can I get the initial probability vector p(1) from this sequence? Maximum Likelihood Estimation, blah blah? I don't know about that. Please, someone give me a clue how I can do this. Or is it impossible from just one realization sequence?

Thanks in advance.
 
How is the realization r linked to p? If p1 > 0.5 then r = 1, otherwise r = 0?
 
Oh, I forgot to mention that. The realization is not deterministic but stochastic: each outcome is drawn at random according to the probability vector. If p1 = .4 and p2 = .6, then the probability that a zero will come out is 40 percent and the probability that a one will come out is 60 percent.
 
So how do you construct your sequence of 1's and 0's? Toss a coin, with a 1 on one side and a 0 on the other?
 
It's a good idea. You can use any sequence if it is composed of zeros and ones.
 
For example, you are given this sequence: 010100101101001011010010111010.
Then, you can get Q as [3/14, 11/15; 11/14, 4/15].
And I have no idea how I can get p(1) from this. Is it even possible?
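A minimal sketch of that counting step, which reproduces the matrix above (same column convention as before, with Q[i, j] estimated as the fraction of transitions out of state j that land in state i):

```python
import numpy as np

seq = [int(c) for c in "010100101101001011010010111010"]

# counts[i, j] = number of observed j -> i transitions (column convention)
counts = np.zeros((2, 2))
for prev, nxt in zip(seq, seq[1:]):
    counts[nxt, prev] += 1

Q_hat = counts / counts.sum(axis=0)   # normalize each column to sum to 1
print(Q_hat)                          # [[3/14, 11/15], [11/14, 4/15]]
```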
 
Two ideas:
1. assume a uniform initial probability, p(1) = [0.5; 0.5];
2. assume a stationary Markov process, then solve p = Qp.
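A sketch of idea 2 applied to the Q estimated earlier in the thread: impose p = Qp together with the normalization p1 + p2 = 1 and solve the resulting (overdetermined but consistent) linear system. numpy is assumed.

```python
import numpy as np

Q = np.array([[3/14, 11/15],
              [11/14, 4/15]])

# Stationarity means (Q - I) p = 0; append the constraint p1 + p2 = 1
# and solve the stacked linear system in the least-squares sense.
A = np.vstack([Q - np.eye(2), np.ones((1, 2))])
b = np.array([0.0, 0.0, 1.0])
p, *_ = np.linalg.lstsq(A, b, rcond=None)
print(p)   # [14/29, 15/29] ~ [0.483, 0.517]
```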
 
It's impossible to completely determine the probability matrix from samples, even if we stipulate that it's a Markov process, but it is possible to use statistics to find what the probabilities are likely to be.

Any probability matrix where all entries are non-zero can produce any sequence. For example, a process with the probability matrix
\[ \left[ \begin{array}{cc} \frac{1}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{1}{2} \end{array} \right] \]
will produce any particular sequence of length \(n\) with probability \(\frac{1}{2^n}\).

So, assuming an even initial distribution, it would generate 010100101101001011010010111010 about \(\frac{1}{2^{30}}\) of the time.

The probability matrix you produced,
\[ \left[ \begin{array}{cc} \frac{3}{14} & \frac{11}{15} \\ \frac{11}{14} & \frac{4}{15} \end{array} \right], \]
is just the one that has the highest likelihood of producing the sequence you gave; but because the sample is relatively small, there really isn't a whole lot of confidence that it's correct.
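Both claims are easy to check numerically. A sketch, with `sequence_likelihood` a hypothetical helper using the same column convention as before: the probability of a sequence is the initial-state probability times the product of the transition probabilities along it.

```python
import numpy as np

def sequence_likelihood(seq, Q, p1):
    """P(seq | Q, p1): first symbol from p1, every later symbol from Q's columns."""
    like = p1[seq[0]]
    for prev, nxt in zip(seq, seq[1:]):
        like *= Q[nxt, prev]
    return like

seq = [int(c) for c in "010100101101001011010010111010"]

Q_half = np.full((2, 2), 0.5)
print(sequence_likelihood(seq, Q_half, [0.5, 0.5]))   # exactly 1/2^30

Q_hat = np.array([[3/14, 11/15],
                  [11/14, 4/15]])
print(sequence_likelihood(seq, Q_hat, [0.5, 0.5]))    # much larger: Q_hat is the MLE
```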

This 'reversing probability' stuff is a bit more advanced.
 
What about MAP (Maximum A Posteriori) estimation, \(\hat{x} = \arg\max_x p(x|y)\), since we're interested in the random variable x, p(1) in this case, given data as the realization y, in this case the sequence? I feel like this will work. The a posteriori pdf p(x|y) can be calculated using Bayes's theorem, \(p(x|y) = \frac{p(y|x)}{p(y)}\,p(x)\), where p(x) is the a priori pdf, p(y) is the marginal pdf, and p(y|x) is proportional to the likelihood function. And we can put \(\left[\begin{array}{c} 0.5 \\ 0.5 \end{array}\right]\) in the place of the a priori pdf p(x) using the maximum entropy principle, as EnumaElish did in his first idea. But I don't know how to calculate the rest. I'm searching some books for the calculation, but I haven't found it yet. I guess it's just MLE (Maximum Likelihood Estimation) after all. But I don't know how to do that.
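A sketch of what that calculation reduces to, assuming (as above) a flat prior on p(1) and the estimated Q held fixed: the likelihood factors as p(y|x) = x_{y_1} × (transition terms that do not involve x), so the posterior over p(1) depends on the realization only through its first symbol.

```python
import numpy as np

seq = [int(c) for c in "010100101101001011010010111010"]

# With Q fixed, p(y | p1) = p1[y_0] * (factors independent of p1), so under a
# flat prior the posterior over p1 is proportional to p1[y_0] alone.
grid = np.linspace(0.0, 1.0, 101)            # candidate values of P(first symbol = 0)
posterior = grid if seq[0] == 0 else 1.0 - grid
posterior /= posterior.sum()                 # normalize over the grid

print(grid[np.argmax(posterior)])  # MAP estimate is 1.0: degenerate, since a single
                                   # realization constrains p(1) only via its first symbol
```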
 
I don't like the stationarity assumption, because I want to be able to deal even with a sequence like this: 010100100100000000000000000000000110100101.
 
Isn't there a p that solves p = Qp for any arbitrary Q (and hence for an arbitrary sequence)?
 
I made a mistake. You're right, EnumaElish. :wink: Any stochastic Q has at least one eigenvector with eigenvalue 1. So your stationarity assumption will be okay, and it seems the best idea for my problem currently.
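A quick numerical check of that statement for the estimated Q: the eigenvector with eigenvalue 1, rescaled to sum to 1, matches the stationary vector obtained earlier by solving p = Qp directly.

```python
import numpy as np

Q = np.array([[3/14, 11/15],
              [11/14, 4/15]])

vals, vecs = np.linalg.eig(Q)
i = np.argmin(np.abs(vals - 1.0))   # a stochastic Q always has eigenvalue 1
p = vecs[:, i] / vecs[:, i].sum()   # rescale the eigenvector into a probability vector
print(vals[i], p)                   # ~1.0, [14/29, 15/29]
```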
 
