Parametric versus no parametric distributions

Click For Summary
SUMMARY

This discussion centers on the appropriateness of fitting non-parametric distributions to variables in a simulation of car travel patterns. The user is utilizing software that fits over 50 parametric distributions and is questioning the validity of applying non-parametric distributions without thorough checks. The conversation highlights the absence of mathematical theorems to definitively validate or invalidate distribution choices and emphasizes the importance of understanding the underlying physical model and parameters. Practical approaches such as bootstrap methods and sensitivity analysis are recommended for assessing the reliability of results.

PREREQUISITES
  • Understanding of non-parametric and parametric distributions
  • Familiarity with bootstrap methods in statistical analysis
  • Knowledge of sensitivity analysis techniques
  • Basic concepts of simulation modeling in data analysis
NEXT STEPS
  • Research the application of bootstrap methods for statistical inference
  • Explore sensitivity analysis techniques for model validation
  • Learn about Ogive distributions and their applications in simulations
  • Investigate the implications of using different distribution fitting methods on simulation outcomes
USEFUL FOR

Data scientists, statisticians, and simulation modelers working on predictive analytics and distribution fitting in complex systems.

bradyj7
Messages
117
Reaction score
0
Hi there,

I'm working on a simulation of the travel patterns of cars. There are many variables and conditional probabilities in the model.

My question is, is there anything wrong with fitting all non parametric distributions to variables (both continuous and discrete)? The software I'm using fits lots (50+) different parametric distributions to data and ranks them in order of best fit. Some are very good fits but some are not very good fits. But I can't check every single distribution in a simulation, so would it be reasonable to fit all non parametric distributions?

I believe it is called fitting an Ogive distribution http://www.vosesoftware.com/ModelRi...ntinuous_distributions/Ogive_distribution.htm

Thanks
 
Physics news on Phys.org
bradyj7 said:
My question is, is there anything wrong with fitting all non parametric distributions to variables (both continuous and discrete)?

would it be reasonable to fit all non parametric distributions?
There are no theorems in mathematics that answer those questions. So there is no proof that a given distribution is wrong or that it is reasonable. Applying math to moderately complicated real world problems almost always involves making assumptions. Some people make these assumptions in an organized manner and use mathematics to deduce the proper method from them. Other people simply make assumptions as they go along, They make a long sequence of arbitrary decisions about what methods they will use. I can't take the latter kind of analysis seriously unless the person doing the work can prove the method applied to one set of data worked to predict another set that wasn't involved in the orginal analysis.

There are several approaches you can take to investigate your question in a practical manner.

The first thing you should ask is whether there is any reaonable physical model for what causes a distribution and, if so, what parameters are involved in that model.

You can try a "bootstrap" approach. Pick an important bottom line result of your project - for example, perhaps it is the total yearly useage of electricity by electric cars or the distribution of miles per year driven by drivers of electric cars, etc. Take the data you have and reepatedly pick a smaller subset of it at random and apply your methods. See how sensitive your bottom line result is to this random variation of data that is used. If your bottom line is extremely sensitive to random selection of the input data then I'd suspect that either your methods are impractical or you are dealing with a problem that is too sensitive to inputs to be reliably analyzed. (To make quantitative conclusions from "bootstrap" methods isn't straightforward mathematically. But you don't have to know how to do that to get a "feel" for how sensitive your methods are to the input data.)

You can also see how sensitive your bottom line results are to using various methods to fit distributions to all of the data. For example, would you get a drastically different number for the total yearly useage of electricity by electric cars if you used an orgive for a particular distribution than if you used a lognomal distribution?
 
Thank you Stephen
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
Replies
2
Views
5K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
Replies
1
Views
2K