Creating PDFs to Converting Data into Histograms and Fitting Curves

  • Context: Undergrad 
  • Thread starter Thread starter member 428835
  • Start date Start date
  • Tags Tags
    Pdf
Click For Summary
SUMMARY

The discussion centers on the process of creating Probability Density Functions (PDFs) from data, emphasizing the importance of first constructing a Cumulative Distribution Function (CDF). Participants highlight that the CDF, being an integral of the PDF, provides a smoother representation and is less sensitive to histogram partitioning. The least squares method is mentioned as a technique for fitting curves to data, which is essential for estimating parameters of known distributions. Overall, the conversation underscores the significance of these statistical methods in accurately modeling data distributions.

PREREQUISITES
  • Understanding of Probability Density Functions (PDFs)
  • Knowledge of Cumulative Distribution Functions (CDFs)
  • Familiarity with least squares curve fitting techniques
  • Experience with statistical data analysis tools
NEXT STEPS
  • Research methods for constructing Cumulative Distribution Functions (CDFs)
  • Learn about least squares fitting techniques for statistical modeling
  • Explore tools for data visualization and histogram creation
  • Study parameter estimation for known distributions
USEFUL FOR

Data analysts, statisticians, researchers, and anyone involved in statistical modeling and data visualization will benefit from this discussion.

member 428835
hey all (again)!

how do people create pdf's? it seems like we start by collecting info and turning it into a histogram. then do we simply look at an array of curves, or is there something more, such as what is done of finding the least squares method for fitting data to a line?

thanks!
 
Physics news on Phys.org
Most of the time the data is assumed to fit some known distribution with a theoretical PDF and parameters. Then the parameters are estimated. If you are creating a PDF from scratch from the data, I would make a CDF first. The CDF is an integral of the PDF so it is smoother and less dependent on the choice of histogram partitions. Then a curve can be fit to the CDF and the PDF can be estimated.
 
Thanks!
 

Similar threads

  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 13 ·
Replies
13
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
28
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 26 ·
Replies
26
Views
3K
  • · Replies 22 ·
Replies
22
Views
4K
  • · Replies 5 ·
Replies
5
Views
9K
  • · Replies 9 ·
Replies
9
Views
4K
  • · Replies 12 ·
Replies
12
Views
3K