How do I normalize the likelihood distribution?

Click For Summary

Discussion Overview

The discussion revolves around the normalization of a likelihood distribution in the context of statistical analysis, specifically related to chi-squared calculations. Participants explore methods for achieving a normalized likelihood function, addressing challenges posed by unknown parameters and the integration of probability distributions.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant describes their approach to calculating the likelihood function and expresses a desire to normalize it to a maximum value of 1.
  • Another participant suggests that a factor of two may be missing in the exponent of the chi-squared definition and notes that normalization often requires manual integration of the probability distribution.
  • Concerns are raised about the challenge of having two unknowns (the parameter and the normalization constant) with only one equation available for solving.
  • There is a discussion about the integration of the likelihood function over the parameter space and the implications of Gaussian versus non-Gaussian distributions for normalization.
  • One participant reports attempting to normalize the likelihood function but finds that the maximum value remains greater than 1, prompting questions about potential errors in their approach.
  • Another participant emphasizes that the integral of the probability distribution being equal to one indicates normalization, regardless of the maximum value at specific points.
  • There is a suggestion that authors of referenced papers may have scaled their graphs to one without adhering to strict normalization principles.

Areas of Agreement / Disagreement

Participants express differing views on the normalization process and the implications of their findings. While some agree on the methods discussed, there is no consensus on the correct approach to normalization or the handling of unknown parameters.

Contextual Notes

Participants note limitations related to the dependence on the definitions of chi-squared and the nature of the distributions involved. The discussion highlights the complexity of normalizing likelihood functions when parameters are not fully determined.

aymer
Messages
16
Reaction score
0
Hey everyone,

i am new to using statistics and have come across a problem. I am trying to regenerate the result of a paper in which a theoretical parameter is constrained.
I have calculated the chi squared and plotted the likelihood function (exp[-chisquared])to see what value of the parameter has the highest likelihood.
I am getting almost the same value as the paper but my graph is not scaled like it. I would like the maximum value of likelihood to be 1. What normalization constants am i missing?

Also in the likelihood function do we use the reduced chisquared?
 
Space news on Phys.org
Well, first of all, there may be a factor of two error. The way the Chi square is usually defined, you should be dividing it by two in the exponent to the probability distribution.

That said, the key point here is that for most problems, the normalization is simply not known when the Chi square is generated, so you have to manually integrate the probability distribution to determine the normalization. If your distribution is Gaussian, you can do this analytically, though for real distributions we typically make use of Markov-Chain Monte Carlo to perform the integration numerically.

At any rate, you can read up on the multivariate normal distribution here:
http://en.wikipedia.org/wiki/Multivariate_normal_distribution
 
Thanx,
I thought of integrating the likelihood and equating it to find the normalization constant but the problem is my chi-squared itself is a function of an unknown parameter,the value of which I want to find. So we have two unknowns-the parameter and the normalization constant and only one equation!
 
aymer said:
Thanx,
I thought of integrating the likelihood and equating it to find the normalization constant but the problem is my chi-squared itself is a function of an unknown parameter,the value of which I want to find. So we have two unknowns-the parameter and the normalization constant and only one equation!
Well, no, the unknown parameter is left variable like so:

[tex]\int P(x) dx = 1[/tex]

Where [itex]x[/itex] here is the set of all parameters in the chi square, and the integral is over the entire available parameter space. If your chi square is Gaussian-distributed, this normalization is relatively straightforward. If not, you have to actually perform the integral numerically.

Basically, the unknown input parameters to the chi square are never solved for, we merely find their probability distributions.
 
hii,
i tried doing the normalization as you suggested by doing

[A*integral(exp[-chisquared/2])] over the parameter(-inf,inf) and equated it to 1 to calculate A.
then i plotted A*exp[-chisquared/2] vs the parameter. but still it is not normalized to 1. the maximum value the function takes is nearly 2.
what am i doing wrong?
 
i also tried normalizing each event(i have 17 data points) to 1 and find corresponding normalization constants. Then i plotted (product of constants)*exp[-chisquared/2],but even that is not scaled to 1.
 
aymer said:
hii,
i tried doing the normalization as you suggested by doing

[A*integral(exp[-chisquared/2])] over the parameter(-inf,inf) and equated it to 1 to calculate A.
then i plotted A*exp[-chisquared/2] vs the parameter. but still it is not normalized to 1. the maximum value the function takes is nearly 2.
what am i doing wrong?
What do you mean it isn't normalized to one? If the integral of the probability distribution is equal to one, then it is normalized to one. Depending upon the width of the distribution, it could easily take on values at any given point that are much larger than one. This isn't a problem.

Normalizing the individual events changes the problem significantly, and should not be done.
 
Chalnoth said:
What do you mean it isn't normalized to one? If the integral of the probability distribution is equal to one, then it is normalized to one. Depending upon the width of the distribution, it could easily take on values at any given point that are much larger than one. This isn't a problem.

Normalizing the individual events changes the problem significantly, and should not be done.

Thanx,

So I guess what i have done is correct and the authors of the paper must have just scaled the graph to one somehow.I was worried because I was getting the peak at the same point and my graph intersects the x-axis at the same coordinates as the paper.
 
aymer said:
Thanx,

So I guess what i have done is correct and the authors of the paper must have just scaled the graph to one somehow.I was worried because I was getting the peak at the same point and my graph intersects the x-axis at the same coordinates as the paper.
Right, so, a lot of the time people don't care to normalize probability distributions. So if they simply set the peak equal to one, that would explain their graph.
 

Similar threads

  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 6 ·
Replies
6
Views
1K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
1
Views
4K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 14 ·
Replies
14
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K