Financial Physics - Probability of Winning

Homework Help Overview

The discussion revolves around a probability problem related to a game where the outcome at a given time depends on the results of the previous two time steps. Participants are tasked with finding expressions for certain probabilities in a steady state and determining conditions under which a player loses on average.

Discussion Character

  • Exploratory, Conceptual clarification, Mathematical reasoning, Problem interpretation

Approaches and Questions Raised

  • Participants explore modeling the problem as a Markov chain, discussing transition probabilities and steady-state distributions. Some express confusion regarding the underlying probability concepts and seek clarification on how to approach the problem.

Discussion Status

Several participants have offered insights into modeling the problem, including the use of transition matrices and steady-state equations. There is ongoing exploration of the relationships between different states and the probabilities associated with them, but no consensus has been reached on a complete solution.

Contextual Notes

Some participants mention a lack of prior knowledge in probability and related concepts, which may affect their understanding of the problem. The discussion includes references to coursework and previous exposure to related topics like binomial trees and Markov chains.

physicsoxford

Homework Statement



Question:
In game A the probability of winning at time t is determined by success (in any
game) at the previous two timesteps t-2 and t-1. A win (W) earns one unit of cash,
and a loss (L) results in paying one unit of cash. Following a sequence of outcomes (L;L)
at time steps (t - 2, t - 1), the probability of winning at timestep t is p1. Following
(L;W) it is p2, following (W;L) it is p3 and following (W;W) it is p4. Let D1(t) be
the probability of the sequence (L;L) at timesteps (t - 1, t), D2(t) be the probability
of (L;W), D3(t) be the probability of (W;L), and D4(t) be the probability of (W;W).
Find expressions for the Di in the steady state, for i = 1 to 4. Show that a player loses
on average when
p1p2 < (1 - p3)(1 - p4)

Homework Equations



No other equations are given!

The Attempt at a Solution



I'm taking a class on Financial Physics and have no previous knowledge of probability. I have not taken statistical mechanics or quantum yet. I am completely lost on this one. I've been learning more about it, but this is just over my head. Can someone help? I don't know where to start!
 
While it may not help (because you have no previous exposure to probability) you can model the system as a Markov chain, where the state at time t consists of the outcomes at times t and t-1. There are four states:
state 1 = (W,W), state 2 = (W,L), state 3 = (L,W) and state 4 = (L,L)
If we are in state i (= 1, 2, 3 or 4) at time t, what are the probabilities that we will be in state j at time t+1? These are the so-called one-step transition probabilities, typically denoted p_ij. We have p_ij ≥ 0 for all i, j and Σ_j p_ij = 1 for i = 1, 2, 3, 4.

In the present case:
\begin{array}{l}
P(LL \to LW) = p_1 \, , \; P(LL \to LL) = 1-p_1 \\
P(LW \to WW) = p_2 \, , \; P(LW \to WL) = 1-p_2 \\
P(WL \to LW) = p_3 \, , \; P(WL \to LL) = 1-p_3 \\
P(WW \to WW) = p_4 \, , \; P(WW \to WL) = 1-p_4
\end{array}
with all other transitions having P(i → j) = 0.

The reward r at time t is r = +1 in states LW and WW, and is r = -1 in states WL and LL. The expected long-run reward per unit time is
\text{average reward } = \bar{r} = (+1)( \pi_{WW} + \pi_{LW}) + (-1)(\pi_{WL} + \pi_{LL}),
where \pi_{WW}, etc., are the steady-state probabilities of states WW, WL, LW and LL. These can be found using standard methods for Markov chains, and you can find all the needed material through Google, for example.
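The recipe above is easy to check numerically. The sketch below is an editorial aside, not part of the thread, and the probability values in it are made up for illustration: it builds the one-step transition matrix for the four states (ordered as in the post: 1 = (W,W), 2 = (W,L), 3 = (L,W), 4 = (L,L)), approximates the steady-state distribution by repeatedly multiplying a starting distribution by P, and evaluates the average reward.

```python
# A minimal numerical sketch (not from the thread; the p-values are invented).
# States ordered WW, WL, LW, LL, matching the post above.

def steady_state(p1, p2, p3, p4, iters=5000):
    """Approximate the steady-state distribution by iterating pi <- pi * P."""
    P = [
        [p4, 1 - p4, 0,  0     ],  # from WW: win -> WW, lose -> WL
        [0,  0,      p3, 1 - p3],  # from WL: win -> LW, lose -> LL
        [p2, 1 - p2, 0,  0     ],  # from LW: win -> WW, lose -> WL
        [0,  0,      p1, 1 - p1],  # from LL: win -> LW, lose -> LL
    ]
    pi = [0.25] * 4                # arbitrary starting distribution
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(4)) for j in range(4)]
    return pi

def average_reward(p1, p2, p3, p4):
    """Long-run reward per step: +1 in WW and LW (a win at time t), -1 in WL and LL."""
    pi = steady_state(p1, p2, p3, p4)
    return (pi[0] + pi[2]) - (pi[1] + pi[3])

# Example values: p1*p2 = 0.12 < 0.25 = (1-p3)*(1-p4), so the reward should be negative.
r = average_reward(0.3, 0.4, 0.5, 0.5)
```

Power iteration converges here because the chain has self-loops (LL stays LL with probability 1-p1), so it is aperiodic; an exact linear solve works too.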

RGV
 
Out of curiosity, are you currently covering Markov chains in your class? Without that theory, I'm not aware of how you'd find long-term averages for a system like this, though I am pretty inexperienced with probability.
 
The only thing we did in class that could relate to this is the binomial tree model. After looking up Markov chains and reading a bit, it makes more sense, but I am still struggling. Here is an attempt:

So the underlying reasoning is that S0 P = S1, where P is the transition probability matrix, S0 is the initial state distribution (a row vector), and S1 is the distribution one step later.


P = Matrix:
p4, 1-p4, 0, 0
0, 0, p3, 1-p3
p2, 1-p2, 0, 0
0, 0, p1, 1-p1

As you showed in your response.

And S1 = [π_WW, π_WL, π_LW, π_LL]
and S0 = [0.25, 0.25, 0.25, 0.25]?

Plug this in and solve for π_WW, ...

Is this even close?
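One way to sanity-check any steady-state answer is to simulate the game directly and watch the bankroll. The sketch below is an editorial aside, not from the thread, and its parameter values are invented; it plays many rounds, conditioning each round's win probability on the previous two outcomes as the problem statement prescribes.

```python
import random

def simulate(p1, p2, p3, p4, rounds=100_000, seed=1):
    """Play the game for many rounds and return the average reward per round."""
    rng = random.Random(seed)
    # p_win[(outcome at t-2, outcome at t-1)], with W = True and L = False.
    p_win = {(False, False): p1, (False, True): p2,
             (True, False): p3, (True, True): p4}
    prev2, prev1 = False, False   # arbitrary initial history (L, L)
    cash = 0
    for _ in range(rounds):
        win = rng.random() < p_win[(prev2, prev1)]
        cash += 1 if win else -1  # a win earns one unit, a loss pays one unit
        prev2, prev1 = prev1, win
    return cash / rounds

# Made-up example where p1*p2 = 0.12 < 0.25 = (1-p3)*(1-p4): the average should be negative.
avg = simulate(0.3, 0.4, 0.5, 0.5)
```

The empirical average should match the exact steady-state reward up to Monte Carlo noise of order 1/sqrt(rounds).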
 
The steady-state probabilities πi depend on the transition matrix P = (pij). For an n-state chain with transition matrix P they are solutions of a set of linear equations:
\pi_j = \sum_{i} \pi_i p_{ij}, j=1,2, \ldots, n, \;\text{ and } \sum_{j} \pi_j = 1.
The first n equations above can be summarized as \pi = \pi P, where \pi = (\pi_1, \pi_2, \ldots, \pi_n) is a row vector. Because each row of P sums to 1, one of the equations \pi_j = \sum_{i} \pi_i p_{ij} is redundant (if n-1 of them hold, the nth holds automatically), so we omit any one of those equations and replace it by the normalization condition \sum_j \pi_j = 1. For the type of chain you have here (one with a single "recurrent class") the system has a provably unique solution. Never mind for now if you don't know exactly what I am referring to; it is enough to solve the equations and see what happens.

Let's do a little example, with three states:
P = \left[ \matrix{1/2 & 0 & 1/2 \\ 0 & 1/4 & 3/4 \\ 1/4 & 1/2 & 1/4} \right].
The steady-state equations are:
\begin{array}{rcl}
\pi_1 &=& \frac{1}{2} \pi_1 + \frac{1}{4} \pi_3 \\
\pi_2 &=& \frac{1}{4} \pi_2 + \frac{1}{2} \pi_3 \\
\pi_3 &=& \frac{1}{2} \pi_1 + \frac{3}{4} \pi_2 + \frac{1}{4} \pi_3
\end{array}
and \pi_1 + \pi_2 + \pi_3 = 1.
We leave out one of the first three equations (say the third one, but any one of them would do) and replace it by the sum condition. That gives the linear system
\begin{array}{ccl}
\pi_1 &=& \frac{1}{2} \pi_1 + \frac{1}{4} \pi_3 \\
\pi_2 &=& \frac{1}{4} \pi_2 + \frac{1}{2} \pi_3 \\
1 &=& \pi_1 + \pi_2 + \pi_3
\end{array}
The solution is \pi_1 = 3/13, \pi_2 = 4/13, \pi_3 = 6/13.

The theory behind all this can be found in textbooks and web pages.
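The replace-one-equation recipe can be sketched in code. The block below is an editorial aside, not from the thread; it solves the three-state example exactly using rational arithmetic, reproducing the 3/13, 4/13, 6/13 answer above.

```python
from fractions import Fraction as F

def solve(A, b):
    """Tiny Gauss-Jordan elimination with pivoting (exact with Fractions)."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        for r in range(n):
            if r != col and M[r][col] != 0:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]

# Transition matrix from the worked example above.
P = [[F(1, 2), F(0),    F(1, 2)],
     [F(0),    F(1, 4), F(3, 4)],
     [F(1, 4), F(1, 2), F(1, 4)]]
n = 3
# Balance equations pi_j = sum_i pi_i P_ij for j = 0, 1;
# the redundant third equation is replaced by sum(pi) = 1.
A = [[P[i][j] - (1 if i == j else 0) for i in range(n)] for j in range(n - 1)]
A.append([F(1)] * n)
b = [F(0)] * (n - 1) + [F(1)]
pi = solve(A, b)   # -> [3/13, 4/13, 6/13]
```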

RGV
 
Alright, let's see if I got this. The notation was killing me, so I changed it: π_WW = π1, π_WL = π2, π_LW = π3, π_LL = π4. Using these equations:

π1 = π1 p4 + π3 p2 -- Equation 1

π2 = π1(1-p4) + π3(1-p3) -- Equation 2

π4 = π2(1-p3) + π4(1-p1) -- Equation 3

π1 + π2 + π3 + π4 = 1 -- Equation 4
------

Equation 1: π1 = π3 p2/(1-p4) (A)

Equation 2 (subbing in (A)): π2 = π3 [p2 + (1-p3)] (B)

Equation 3 (subbing in (B)): π4 = π3 [1 - p2 p3 - (1-p3) p3] / p1 (C)

Equation 4 (subbing in (A), (B), (C)):

π3 = p1(1-p4)/δ

where δ = (1-p4)(1 + 2 p1 + p1 p2 - p1 p3 - p2 p3 - p3 + p3²) + p2 p3

We can then plug back in and find π1, π2, and π4.

Average reward = [p1 p2 + p1(1-p4) - p1 p2(1-p4) - p1(1-p4)(1-p3) - (1-p4)(1 - p2 p3 - (1-p3) p3)] / δ

Sure is messy. What am I doing wrong? Should it not simplify?
 
When I did it I used states 1 = LL, 2 = LW, 3 = WL, 4 = WW, giving a matrix
P = \left[ \matrix{1-p_1 & p_1 & 0 & 0 \\
0 & 0 & 1-p_2 & p_2 \\
1-p_3 & p_3 & 0 & 0 \\
0 & 0 & 1-p_4 & p_4} \right]
Letting v1 = π_1, v2 = π_2, etc., and writing q_i = 1 - p_i, the steady-state equations are:
v1 = q1*v1 + q3*v3, v2 = p1*v1 + p3*v3, v3 = q2*v2 + q4*v4, v1+v2+v3+v4 = 1.
Solving these (using Maple) gives expressions similar to yours. The long-run win probability is Pwin = v2 + v4:
Pwin = p1*(p2+1-p4)/D, where D = 2*p1 - 2*p4*p1 + p2*p1 + 1 - p3 - p4 + p4*p3.
We want Pwin < 1/2, i.e. 2*p1*(p2+1-p4) < D, or 2*p1*(p2+1-p4) - D < 0. That last form simplifies to exactly the condition you need, p1*p2 < (1-p3)*(1-p4).
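The claimed simplification can be spot-checked numerically. This sketch is an editorial aside, not part of the original post; it draws random p-values and verifies that 2*p1*(p2+1-p4) - D equals p1*p2 - (1-p3)*(1-p4), so that Pwin < 1/2 exactly when the losing condition holds.

```python
import random

# Spot-check of the identity 2*p1*(p2 + 1 - p4) - D == p1*p2 - (1 - p3)*(1 - p4),
# with D = 2*p1 - 2*p4*p1 + p2*p1 + 1 - p3 - p4 + p4*p3 as in the post above.
rng = random.Random(0)
for _ in range(1000):
    p1, p2, p3, p4 = (rng.uniform(0.05, 0.95) for _ in range(4))
    D = 2*p1 - 2*p4*p1 + p2*p1 + 1 - p3 - p4 + p4*p3
    lhs = 2*p1*(p2 + 1 - p4) - D
    rhs = p1*p2 - (1 - p3)*(1 - p4)
    assert abs(lhs - rhs) < 1e-12
    # D = (1-p3)(1-p4) + 2*p1*(1-p4) + p1*p2 > 0, so dividing by D is safe:
    Pwin = p1*(p2 + 1 - p4) / D
    assert (Pwin < 0.5) == (p1*p2 < (1 - p3)*(1 - p4))
```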

I have not checked your solution in detail, because your row/column ordering is different from mine.

RGV
 
