Curve fitting of summed normal distributions

Click For Summary

Discussion Overview

The discussion revolves around fitting a dataset to a model represented as a sum of normal distributions. Participants explore methods for robustly determining the parameters of these distributions, specifically the means and variances, while clarifying that this is not a convolution of normal distributions.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • One participant presents a model for the probability density function as a sum of normal distributions and seeks robust fitting methods for the parameters.
  • Another participant suggests a software tool that may assist in the fitting process, referencing its development and offering a trial evaluation.
  • A question is raised about the specific methods used by the suggested software, with a hypothesis that it might employ Expectation-Maximization (EM).
  • A later reply provides a mathematical formulation for maximizing the likelihood of the model parameters, indicating a focus on the EM algorithm as a potential solution.

Areas of Agreement / Disagreement

Participants have not reached a consensus on the specific methods for fitting the model, and multiple approaches, including the EM algorithm, are discussed without resolution.

Contextual Notes

The discussion includes assumptions about the nature of the dataset and the parameters involved, but these assumptions are not fully explored or defined. The mathematical formulation provided is subject to further clarification and exploration.

exmachina
Messages
42
Reaction score
0
Hi,

I have a dataset of a random variable whose probability density function can be fitted/modelled as a sum of N probability density functions of normal distributions:

<br /> F_X(x) = p(\mu_1,\sigma_1^2)+p(\mu_2,\sigma_2^2)+\ldots+p({\mu}_x,\sigma_x^2)<br />

I am interested in a fitting method can robustly determine the values of \mu_1,\sigma_1,\mu_2,\sigma_2, etc

Note this is NOT convolution of normal distributions.
 
Last edited:
Physics news on Phys.org
These folks have put a lot of time and thought into your problem

http://www.sigmaplot.com/products/peakfit/peakfit.php

and they have free 30 day trial evaluations.
 
Last edited by a moderator:
Interesting, any idea what method they use? Expectation-Maximization?
 
Edit: I guess in particular, this is the equation I'm trying to maximize, given an input vector:<br /> X = (x_1,x_2,...,x_n)<br />

Maximize:

<br /> \begin{equation}<br /> \prod_{j=1}^n\sum_{i=1}^k \frac{p_i}{\sqrt{2\pi} \sigma_i} \exp(-\frac{(x_j-\mu_i)^2}{2\sigma_i^2})<br /> <br /> Edit: I found a nice paper tackling this exact problem using EM.<br /> \end{equation}<br />

Subject to \sum_{i=1}^{k} p_i = 1

When I say maximize, I mean to find the model parameters \mu_i, \sigma_i, p_i
 
Last edited:

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
Replies
1
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 8 ·
Replies
8
Views
5K
Replies
6
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 25 ·
Replies
25
Views
3K