Maximum likelyhood extimater(MLE)

  • Thread starter Thread starter semidevil
  • Start date Start date
  • Tags Tags
    Maximum
Click For Summary

Homework Help Overview

The discussion revolves around the concept of Maximum Likelihood Estimation (MLE) and the likelihood function, particularly in the context of probability density functions (pdf) and their evaluation at multiple data points. Participants are examining the transition from individual likelihoods to a combined likelihood for multiple observations.

Discussion Character

  • Exploratory, Conceptual clarification, Mathematical reasoning

Approaches and Questions Raised

  • Participants are attempting to understand the derivation of the likelihood function from the pdf and how to handle the product of likelihoods for multiple data points. Questions arise regarding the notation and the introduction of indices, as well as the manipulation of terms in the likelihood function.

Discussion Status

Some participants express confusion about the calculations involved in forming the likelihood function and taking its logarithm. There is an acknowledgment of the need for clarity on how terms are combined and simplified, with some guidance provided on the multiplication of terms and the use of indices.

Contextual Notes

Participants are grappling with the definitions and properties of likelihood functions, particularly in relation to the number of data points and their influence on the resulting expressions. There are indications of potential typos or misunderstandings in the original expressions being discussed.

semidevil
Messages
156
Reaction score
2
so by definition, the likelyhood function(w, theta) is the product of the pdf fw(w, theta) evalutated at n data points.

but I don't know how they do those calculations...

so for example:

fy(y, theta) = [tex]1/\theta^2 ye^{-y/\theta}[/tex]

L(theta) = [tex]\theta^{-2n}\prod y_{i}e^{-1/\theta}\sum y_{i}[/tex]

so first of all, I'm looking at this but I don't know how they went from this to that...I look at another problme

how did they go from [tex]e^{-(y-\theta)} [\tex]to [tex]\pro e^{-(y_{i}- \theta)}[/tex] [/theta]<br /> <br /> I dotn see a pattern...I compared it w/ the definitoin, but I just don't get it...<br /> I mean, when they did the L(theta) it seems that they added some "n' and i's somewhere...and I dotn know where they added these things.[/tex]
 
Last edited:
Physics news on Phys.org
semidevil said:
so by definition, the likelyhood function(w, theta) is the product of the pdf fw(w, theta) evalutated at n data points.

but I don't know how they do those calculations...

so for example:

fy(y, theta) = [tex]1/\theta^2 ye^{-y/\theta}[/tex]

L(theta) = [tex]\theta^{-2n}\prod y_{i}e^{-1/\theta}\sum y_{i}[/tex]

so first of all, I'm looking at this but I don't know how they went from this to that...I look at another problme

how did they go from [tex]e^{-(y-\theta)} [\tex]to [tex]\pro e^{-(y_{i}- \theta)}[/tex] [/theta]<br /> <br /> I dotn see a pattern...I compared it w/ the definitoin, but I just don't get it...<br /> I mean, when they did the L(theta) it seems that they added some "n' and i's somewhere...and I dotn know where they added these things.[/tex]
[tex] <br /> It looks like there are some typos in your expression.<br /> <br /> You are trying to estimate [tex]\theta[/tex] using n data points, labelled as [tex]y_i[/tex]. The single likelihood for [tex]\theta[/tex] given one data point[tex]y_i[/tex] is:<br /> <br /> [tex]1/\theta^2 y_{i}e^{-y_{i}/\theta}[/tex]<br /> <br /> In order to get the likelihood of [tex]\theta[/tex]for all n data points, then you need to multiply the single likelihoods together. And that's just a simple matter of multiplying terms and summing up what's in the exponential term.[/tex]
 
The indices are a shorthand for indeterminately long products. For example, given your definition of L, we evaluate the pdf at n values of y, {y1, y2, ..., yn} and take the product. So if
[tex]f(y) = \frac{1}{\theta^2} ye^{-\frac{y}{\theta}}[/tex]
we get
[tex]f(y_1)f(y_2)...f(y_n) &= \prod_{i=1}^n f(y_i)[/tex]

[tex]= \prod_{i=1}^n \frac{1}{\theta^2} y_i e^{-\frac{y_i}{\theta}}[/tex]

[tex]= \frac{1}{\theta^{2n}} \prod_{i=1}^n y_i e^{-\frac{y_i}{\theta}}[/tex]

[tex]= \frac{1}{\theta^{2n}} \left(y_1 e^{-\frac{y_1}{\theta}}...y_n e^{-\frac{y_n}{\theta}}\right)[/tex]

[tex]= \frac{1}{\theta^{2n}} \left(y_1...y_n e^{-\frac{y_1+y_2+...+y_n}{\theta}}\right)[/tex]

[tex]= \frac{1}{\theta^{2n}} \left(\prod_{i=1}^n y_i\right) \left(e^{-\frac{1}{\theta}\sum_{i=1}^n y_i}}\right)[/tex]
 
ok, thanx, now that makes a little bit more sense, but i'll think about it some more...still a bit confusing.

what about to get ln L(theta)? the book does some weird stuff an I don't know what it did.

ln L(theta) = [tex]-2n ln \theta + ln \prod yi - 1/\theta \sum yi[/tex]


how did that happen?

and also, I'm not understanding where they put the n's. maybe I'm having trouble w/ the definition. like, on the first problem, why did it become [tex]\theta^{-2n}[/tex]?

and for ths problem,

we have fy (y, theta) = [tex]e^{-(y-\theta)},[/tex] so L(theta) = [tex]e^{\sum y + n\theta}[/tex]
 
Last edited:
semidevil said:
and also, I'm not understanding where they put the n's. maybe I'm having trouble w/ the definition. like, on the first problem, why did it become [tex]\theta^{-2n}[/tex]?
Because [tex]\theta^{-2}[/tex] multiplied by itself n times is [tex]\theta^{-2n}[/tex]. They just took it out of the n-product by associativity.

semidevil said:
we have fy (y, theta) = [tex]e^{-(y-\theta)},[/tex] so L(theta) = [tex]e^{\sum y + n\theta}[/tex]
What happens when you multiply ea with eb ? That's all that's going on here. :)
 

Similar threads

Replies
1
Views
1K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
Replies
21
Views
3K
  • · Replies 12 ·
Replies
12
Views
3K