Can the pdf be determined numerically for a given data set?

  • Context: Undergrad 
  • Thread starter Thread starter rohitashwa
  • Start date Start date
  • Tags Tags
    Data Pdf Set
Click For Summary

Discussion Overview

The discussion revolves around the numerical determination of probability density functions (pdf) from given data sets. Participants explore methods for approximating the pdf and the underlying assumptions involved in these approaches.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification

Main Points Raised

  • One participant inquires about how the pdf can be derived from a data set, expressing understanding of the pdf's role in probability.
  • Another participant suggests that the pdf can be approximated by creating a bin structure and normalizing cumulative sums of the data.
  • A follow-up question seeks clarification on why cumulative sums serve as an approximation for the pdf.
  • It is noted that the assumption underlying statistical analysis is that the data was generated from the pdf, allowing the sample distribution to approximate the probability distribution.
  • Another participant mentions alternative methods for obtaining the pdf, such as fitting data to standard distributions or using numerical analysis techniques like interpolation.
  • There is a suggestion that using standard distributions may simplify analysis compared to deriving a pdf through numerical methods.

Areas of Agreement / Disagreement

Participants express various methods for approximating the pdf, but there is no consensus on a single approach. Multiple competing views on how to derive the pdf from data remain present.

Contextual Notes

The discussion does not resolve the assumptions required for the methods proposed, nor does it clarify the limitations of the numerical techniques mentioned.

rohitashwa
Messages
3
Reaction score
0
I have just been learning probability density function (pdf) and there is something I need to ask. I understand the idea that for any value v, the pdf (f(v)) gives the probability that a value picked from a data set is less than v. It seems ok to find mean,variance, skewness etc. when f(v) is known. However, how is the expression for f(v) arrived at.

If you have a data set given, can the pdf be found numerically?

Thank you.
 
Physics news on Phys.org
If you have a data set given, can the pdf be found numerically?
Yes. Actually you can approximate the density function (pdf derivative) by setting a bin structure (x intervals) and sort the data into the intervals. Cumulative sums (normalized by dividing by the total number of items) will be an approximation to the pdf.
 
Thanks. But, could you tell me why the cumulative sums approximate the pdf?
 
rohitashwa said:
Thanks. But, could you tell me why the cumulative sums approximate the pdf?
The underlying assumption is that the given data was generated from the pdf. All statistical analysis is based on this assumption, i.e. given enough sample data, the probability distribution can be approximated by the sample distribution.
 
Thank you. That was very helpful.
 
rohitashwa said:
I have just been learning probability density function (pdf) and there is something I need to ask. I understand the idea that for any value v, the pdf (f(v)) gives the probability that a value picked from a data set is less than v. It seems ok to find mean,variance, skewness etc. when f(v) is known. However, how is the expression for f(v) arrived at.

If you have a data set given, can the pdf be found numerically?

Thank you.

there's a couple of ways to get the pdf. In univariate distributions you could "fit" the results you get to a standard distribution (like say gaussian, lognormal, uniform etc) or you could use numerical analysis to come up with a distribution based on interpolation and other techniques.

If the data happened to fit a "stock" standard distribution then analyzing it would be a lot easier than analyzing a distribution based on numerical analysis since the assumptions of the stock standard distributions are easier understood.
 

Similar threads

  • · Replies 18 ·
Replies
18
Views
5K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 19 ·
Replies
19
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 6 ·
Replies
6
Views
3K