Distribution Fitting for Bus Arrival Times

  • Context: Undergrad 
  • Thread starter Thread starter Mark J.
  • Start date Start date
  • Tags Tags
    Distribution Fitting
Click For Summary

Discussion Overview

The discussion revolves around the fitting of statistical distributions to bus arrival time data collected over a week. Participants explore the appropriateness of various distributions, particularly the Poisson distribution, and seek guidance on methodology and literature for distribution fitting.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant expresses a need to fit a statistical distribution to bus arrival data, suggesting that the Poisson distribution may be theoretically appropriate but seeks to exclude other distributions systematically.
  • Another participant cautions against simply trying various distributions to find the best fit, arguing that this approach can yield random and meaningless results unless there is a strong theoretical basis for the chosen distribution.
  • A different participant emphasizes the importance of understanding the context of the data and suggests that if the independence of events is not guaranteed, a more complex model may be necessary.
  • Markov modeling is proposed as a potential approach for analyzing the data, with a recommendation of Sheldon M. Ross's "Introduction to Probability Models" for further reading.
  • One participant agrees with the previous points and requests additional theoretical guidance on making assumptions related to the bus arrival process.
  • Another participant suggests starting with the independence assumption, recommending the plotting of a histogram to visually assess the fit to a Poisson distribution and to calculate the mean and variance for parameter estimation.
  • They also mention that if events are conditional on prior events, a Markov chain model may be required.

Areas of Agreement / Disagreement

Participants generally agree on the need for a theoretical basis when selecting a distribution for fitting. However, there are differing views on the methodology for distribution fitting, particularly regarding the appropriateness of testing multiple distributions without a strong theoretical justification.

Contextual Notes

Participants note the importance of independence in the data and the potential need for more complex models if this assumption is violated. There are also references to the necessity of exploratory statistics and visual assessments in the fitting process.

Who May Find This Useful

This discussion may be useful for individuals interested in statistical modeling, particularly in the context of analyzing arrival times or similar time-series data in fields such as transportation or operations research.

Mark J.
Messages
81
Reaction score
0
Hi
I am stucked on a problem of distribution fitting.
I have collected data about bus arrivals on field in a week time and now I want to fit a statistical distribution to these data.
I know that theoretically should be Poisson or smthg but to make a correct step by step observation I need to exclude other distributions and to work on the most probable ones until I get only one distribution.
Can you suggest me any good links or literature on this?
Any step by step solved similar problem in literature would be very nice :)
Thank you in advance
 
Physics news on Phys.org
Mark J. said:
Hi
I am stucked on a problem of distribution fitting.
I have collected data about bus arrivals on field in a week time and now I want to fit a statistical distribution to these data.
I know that theoretically should be Poisson or smthg but to make a correct step by step observation I need to exclude other distributions and to work on the most probable ones until I get only one distribution.
Can you suggest me any good links or literature on this?
Any step by step solved similar problem in literature would be very nice :)
Thank you in advance


The way statistics works is you for some reason think the data follows the distribution, then make a test that could show that your hypothesis is unlikely. Trying a bunch of distributions and seeing which one fits best is not a good way to go about it, unless you have a supervisor who thinks its a good idea in which case it is the greatest thing in the world.

Trying a bunch of distributions and seeing which one fits best is likely to give you a random and meaningless result. Only try to fit a distribution if there is a good reason for it to be correct.
 
Mark J. said:
Hi
I am stucked on a problem of distribution fitting.
I have collected data about bus arrivals on field in a week time and now I want to fit a statistical distribution to these data.
I know that theoretically should be Poisson or smthg but to make a correct step by step observation I need to exclude other distributions and to work on the most probable ones until I get only one distribution.
Can you suggest me any good links or literature on this?
Any step by step solved similar problem in literature would be very nice :)
Thank you in advance

Hey MarkJ and welcome to the forums.

PatrickPowers makes some very good points. You need to put the data into context of what you are doing. It's not a matter of just using a goodness of fit to see which distribution is the best fit.

The reason Poisson distributions are used is because the foundational assumptions match those of modelling rates of physical phenomena which includes your bus example. If however independence of events is not gauranteed, you will need to use a more complicated model to take this into account.

In saying the above you need to look at the process and decide how to analyze the data based on this.

In terms of modelling these kinds of problems you should look at Markov modelling. One book that goes into a lot of depth on this topic is "Introduction to Probability Models" by Sheldon M. Ross. If you have models that have more complicated behaviors where independence is violated and where you have more general criteria, then you need to read this book or something like it.

The book itself is pretty comprehensive in terms of content, but if you need some worked out examples, you could probably get a Schaums worked out problems book in the right area or get another book that has a comprehensive solutions manual.
 
I totally agree with you.Thank you for your answer.Can you give me some other orientation in making theoretical assumptions on this bus example process.I had a look on Sheldon Ross and it is a very good book indeed.Anyway if you please can give me some advice on theory arguments I can rely on??
 
Mark J. said:
I totally agree with you.Thank you for your answer.Can you give me some other orientation in making theoretical assumptions on this bus example process.I had a look on Sheldon Ross and it is a very good book indeed.Anyway if you please can give me some advice on theory arguments I can rely on??

If your are dealing with rates where each measurement is independent from the next, it's probably a good idea to a) plot the histogram based on the data to see if it "looks like" a poisson distribution and then b) if it is then use a valid estimator to estimate the rate parameter from the mean and/or variance.

If it is more complicated where events are conditional on prior events, you will have to create some kind of markov chain model for a rate process.

I would start by doing it using the independence assumption. You should get a histogram as well as the first couple of moments and some other exploratory statistics to see whether the poisson is a relatively good model for your data.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 8 ·
Replies
8
Views
5K
  • · Replies 21 ·
Replies
21
Views
3K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 11 ·
Replies
11
Views
4K
  • · Replies 12 ·
Replies
12
Views
5K