Stats question- likelihood function for a ratio

Zoe-b

Homework Statement


According to genetic linkage theory, observed frequencies of four phenotypes
resulting from crossing tomato plants are in the ratio 9/16 + a : 3/16 - a : 3/16 - a : 1/16 + a.
In 1931, J.W. MacArthur reported the following frequencies:
Phenotype             Observed frequency
Tall, cut-leaf        926
Tall, potato-leaf     288
Dwarf, cut-leaf       293
Dwarf, potato-leaf    104
Total                 1611

Write down the likelihood of a given these observations. Find the maximum likelihood
estimate of a, use it to calculate expected frequencies for the four phenotypes
and compare them with the observed frequencies. Does genetic linkage theory look
plausible?

Homework Equations


The likelihood of a data set (x1, ..., xn) is the product of the fX(xi), if the Xi are independent. Then I could find the maximum likelihood estimate as usual.


The Attempt at a Solution


Basically I just cannot start this problem at all. I have only ever found likelihood before as a function producing one value, here I seem to want to find the probability that the ratio is b:c:d:e which I have no idea how to do. Also surely I need to know something about the variance of a? At least then I could write down a probability to do with each bit of the ratio?
Sorry if I'm not making much sense/if this is obvious but I have spent a long time attempting this with no luck. Google hasn't helped at all either..
 
maybe let's simplify and see how we would do it for a two variable case and see if we can build up on that

so let's say you have a discrete probability distribution with two possible outcomes, B and C, with probabilities P(B)=b and P(C)=c=1-b

Now let's say you repeat the experiment n times, independently. The number of B events is then binomially distributed, and the probability of getting k B events, given the probability b, is
$$ P(N_B=k|b) = \frac{n!}{k!(n-k)!}(1-b)^{n-k}b^k $$

but read as a function of b, this is just the likelihood of the parameter b, so now
$$ L(b|N_B=k) = P(N_B=k|b) = \frac{n!}{k!(n-k)!}(1-b)^{n-k}b^k $$

Taking the logarithm
$$ \ln\{L(b|N_B=k)\} = \ln\left(\frac{n!}{k!(n-k)!}\right)+(n-k)\ln(1-b)+k\ln(b) $$

Differentiating to find the MLE gives
$$ -\frac{n-k}{1-b} +\frac{k}{b}=0 $$

Note that after taking the logarithm and differentiating, multiplicative constants such as the binomial coefficient fall away: they only normalise the likelihood and don't change where its maximum lies. Rearranging,
$$ k(1-b)=(n-k)b $$

Giving
$$ b=\frac{k}{n} $$

which is what we would have guessed anyway, but hopefully you can build from there.
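
As a quick numerical sanity check of the result above, here is a minimal sketch that maximises the binomial log-likelihood over a grid of b values. The numbers n = 20 and k = 7 are made up purely for illustration, and the grid search is just one convenient way to locate the maximum.

```python
import math

n, k = 20, 7  # hypothetical numbers of trials and of B events

def log_lik(b):
    # The log of the binomial coefficient does not depend on b, so it is dropped.
    return k * math.log(b) + (n - k) * math.log(1 - b)

# Candidate values of b strictly inside (0, 1).
grid = [i / 1000 for i in range(1, 1000)]
b_hat = max(grid, key=log_lik)

print("grid maximiser:", b_hat)
print("k / n         :", k / n)
```

The grid maximiser should agree with k/n up to the grid spacing, which is what the algebra predicts.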
 
the tricky part is finding the probabilities, and this is only a start, but let's call the four counts B, C, D, E; e.g. B=b means that out of n trials you found b "Tall, cut-leaf" plants

Let's deal with just the B information, the first observation. We can treat this as a binomial distribution with n trials and b successes
$$ P(B=b|a) = \frac{n!}{b!(n-b)!}\left(1-\frac{9}{16} - a\right)^{n-b}\left(\frac{9}{16} + a\right)^b $$

Now let's look at adding the C information. The outcome for C is not independent of the B outcome, so I'm thinking you can probably stack them up using conditional probability
$$ P(B=b,C=c|a) = P(C=c|B=b,a)P(B=b|a) $$

Note that if B & C were independent, ##P(C=c|B=b,a)=P(C=c|a)## and this would reduce to the multiplicative form above, but this is not the case and ##P(C=c|B=b,a)\neq P(C=c|a)##.

We just found P(B=b|a), now if we take B=b can you find P(C=c|B=b,a)?

then you can repeat and stack up the last piece of information as
$$ P(B=b,C=c,D=d|a) = P(D=d|C=c,B=b,a)P(C=c|B=b,a)P(B=b|a) $$
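
One way to check the stacking idea numerically is sketched below. It assumes that each plant independently falls into one of the four classes with the probabilities given by the ratio; under that assumption, once B=b is fixed, each of the remaining n-b plants is of type C with probability P(C)/(1-P(B)), and the chain of conditional binomials should multiply out to the multinomial probability. The value a = 0.01 and the small counts are hypothetical, chosen only so the numbers stay readable.

```python
import math

def binom_pmf(k, n, p):
    """Binomial probability of k successes in n trials with success probability p."""
    return math.comb(n, k) * p**k * (1 - p)**(n - k)

a = 0.01  # hypothetical value of a, for illustration only
pB, pC, pD, pE = 9/16 + a, 3/16 - a, 3/16 - a, 1/16 + a

b, c, d, e = 5, 3, 2, 1  # hypothetical small counts
n = b + c + d + e

# Chain rule: P(B=b) * P(C=c | B=b) * P(D=d | B=b, C=c); E is then determined.
chain = (binom_pmf(b, n, pB)
         * binom_pmf(c, n - b, pC / (1 - pB))
         * binom_pmf(d, n - b - c, pD / (1 - pB - pC)))

# Direct multinomial probability of the same counts.
coef = math.factorial(n) // (math.factorial(b) * math.factorial(c)
                             * math.factorial(d) * math.factorial(e))
multinomial = coef * pB**b * pC**c * pD**d * pE**e

print(chain, multinomial)
```

The two printed values should agree up to floating-point rounding, so the conditional route and the direct multinomial route give the same likelihood for a.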
 
Zoe-b said:
Basically I just cannot start this problem at all. I have only ever found likelihood before as a function producing one value, here I seem to want to find the probability that the ratio is b:c:d:e which I have no idea how to do.

Hi Zoe-b! :smile:

Can you deduce from your ratio what the probability of each of your phenotypes is (as a function of a)?

Suppose you have a specific set of those plants with in total nB, nC, nD, and nE of each type.
What would the probability of that specific set be?

Note that this probability, viewed as a function of a, defines your likelihood.
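
To make this concrete, here is a rough sketch that maximises the log-likelihood of a numerically for the observed counts and then computes the expected frequencies. It assumes the four counts are multinomial with cell probabilities 9/16+a, 3/16-a, 3/16-a, 1/16+a (read off from the ratio, which sums to 1), and it uses plain bisection on the derivative; the function names are just illustrative.

```python
import math

# Observed counts from the problem statement (MacArthur, 1931).
counts = [926, 288, 293, 104]
n = sum(counts)  # 1611

def cell_probs(a):
    """Cell probabilities implied by the ratio 9/16+a : 3/16-a : 3/16-a : 1/16+a."""
    return [9/16 + a, 3/16 - a, 3/16 - a, 1/16 + a]

def log_lik(a):
    """Log-likelihood of a, dropping the constant multinomial coefficient."""
    return sum(k * math.log(p) for k, p in zip(counts, cell_probs(a)))

def score(a):
    """Derivative of log_lik with respect to a."""
    signs = [1, -1, -1, 1]  # derivative of each cell probability w.r.t. a
    return sum(s * k / p for s, k, p in zip(signs, counts, cell_probs(a)))

# All four probabilities are positive only for -1/16 < a < 3/16.
lo, hi = -1/16 + 1e-9, 3/16 - 1e-9
# The log-likelihood is concave in a, so the score is decreasing: bisect for its root.
for _ in range(200):
    mid = (lo + hi) / 2
    if score(mid) > 0:
        lo = mid
    else:
        hi = mid
a_hat = (lo + hi) / 2

expected = [n * p for p in cell_probs(a_hat)]
print(f"a_hat = {a_hat:.5f}")
for obs, exp in zip(counts, expected):
    print(f"observed {obs:5d}   expected {exp:8.2f}")
```

With these counts the estimate comes out at roughly a = 0.0047, and the printed table puts the expected frequencies next to the observed ones for the comparison the question asks for.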
 
Zoe-b said:
Basically I just cannot start this problem at all. I have only ever found likelihood before as a function producing one value, here I seem to want to find the probability that the ratio is b:c:d:e which I have no idea how to do. Also surely I need to know something about the variance of a? At least then I could write down a probability to do with each bit of the ratio?
Sorry if I'm not making much sense/if this is obvious but I have spent a long time attempting this with no luck. Google hasn't helped at all either..

I'm not too sure the variance of a makes sense here, but at the end of the day you will have the likelihood function for a, which should help you decide whether the value of a needed to support the theory is reasonable given the data
 