Best procedure to determine Lambda to calculate Poisson probability

Click For Summary
SUMMARY

The discussion centers on determining the optimal procedure for calculating Lambda (λ) to assess the Poisson probability of accidents occurring on a given day. Participants suggest that averaging daily accidents over a longer period yields a more accurate λ, while acknowledging that shorter periods may be necessary if λ is expected to fluctuate. The exponential moving average (EMA) technique is proposed as a method to enhance λ relevance, although concerns are raised about its potential to amplify noise and violate Poisson distribution assumptions. A t-test is recommended for detecting significant changes in λ over time.

PREREQUISITES
  • Understanding of Poisson distribution and its parameters
  • Familiarity with statistical concepts such as confidence intervals and t-tests
  • Knowledge of exponential moving averages (EMA) and their application
  • Basic grasp of accident rate analysis and data averaging techniques
NEXT STEPS
  • Research the application of Poisson distribution in real-world scenarios
  • Learn about calculating confidence intervals using the chi-squared distribution
  • Explore the implications of using exponential moving averages in statistical analysis
  • Study the use of t-tests for assessing changes in statistical parameters over time
USEFUL FOR

Statisticians, data analysts, traffic safety researchers, and anyone involved in modeling accident probabilities using Poisson distribution techniques.

Aetius
Messages
3
Reaction score
0
What is the best procedure to determine Lambda to calculate the Poisson probability? Say I want to calculate P(X ≥1) of an accident occurring next day. For this I would calculate the average of daily accidents and divide it by 10. The question is, should I take the previous 10 days? Or calculate λ averaging i.e. 10 day periods for the last 200 days and divide them by 20? What would be the best?
Thank you much for any constructive comment.
 
Physics news on Phys.org
Aetius said:
What is the best procedure to determine Lambda to calculate the Poisson probability? Say I want to calculate P(X ≥1) of an accident occurring next day. For this I would calculate the average of daily accidents and divide it by 10.
Why? Where did "10" come from?

The question is, should I take the previous 10 days? Or calculate λ averaging i.e. 10 day periods for the last 200 days and divide them by 20? What would be the best?
Thank you much for any constructive comment.
The Poisson distribution gives the probability of "n" occurances in a given time interval. Lambda is the average number of occurances in that time interval. What time interval are you given?
 
Aetius said:
What is the best procedure to determine Lambda to calculate the Poisson probability? Say I want to calculate P(X ≥1) of an accident occurring next day. For this I would calculate the average of daily accidents and divide it by 10. The question is, should I take the previous 10 days? Or calculate λ averaging i.e. 10 day periods for the last 200 days and divide them by 20? What would be the best?
Thank you much for any constructive comment.

Hi Aetius, welcome to MHB! ;)

Assuming a constant accident rate we will get a more accurate $\lambda$ if we take the average over a longer period of time.
However, if $\lambda$ is expected to change over time, such as in traffic accidents, then we should take a shorter period in which we assume that $\lambda$ is more or less constant.
With a shorter period comes a wider confidence interval for $\lambda$ though, so we can't take it too short.

Intuitively I'd expect that the rate of traffic accidents does not change significantly in 200 days.
It would over decades though.
That is, unless systematic changes were made in the area with regard to traffic safety and the like.

A first step in finding an acceptable interval in which $\lambda$ is sufficiently constant could be to average it over a shifting interval in time and see if we get a more or less level curve.
Suppose we have a total of $k$ traffic accidents in a specific period of $n$ days.
Then we can quantify the confidence interval for the estimate $\hat\lambda = \frac kn$ by:
$$\frac 12\chi^2(\alpha/2;\ 2k) \le n\lambda \le \frac 12\chi^2(1-\alpha/2;\ 2k+2)$$
where $\chi^2(p;\ df)$ is the inverse $\chi^2$-distribution for the cumulative probability $p$ with $df$ degrees of freedom.
 
Klaas van Aarsen said:
Hi Aetius, welcome to MHB! ;)

Assuming a constant accident rate we will get a more accurate $\lambda$ if we take the average over a longer period of time.
However, if $\lambda$ is expected to change over time, such as in traffic accidents, then we should take a shorter period in which we assume that $\lambda$ is more or less constant.
With a shorter period comes a wider confidence interval for $\lambda$ though, so we can't take it too short.

Intuitively I'd expect that the rate of traffic accidents does not change significantly in 200 days.
It would over decades though.
That is, unless systematic changes were made in the area with regard to traffic safety and the like.

A first step in finding an acceptable interval in which $\lambda$ is sufficiently constant could be to average it over a shifting interval in time and see if we get a more or less level curve.
Suppose we have a total of $k$ traffic accidents in a specific period of $n$ days.
Then we can quantify the confidence interval for the estimate $\hat\lambda = \frac kn$ by:
$$\frac 12\chi^2(\alpha/2;\ 2k) \le n\lambda \le \frac 12\chi^2(1-\alpha/2;\ 2k+2)$$
where $\chi^2(p;\ df)$ is the inverse $\chi^2$-distribution for the cumulative probability $p$ with $df$ degrees of freedom.

Actually, in general you confirmed what I thought. Something I left out is that, in order to improve the relevance of Lambda I would apply the exponential moving averages technique. I am thinking that the ideal procedure would be to take, say the previous 10 days and apply EMA to it. How does it grab you?
Thank you kindly!
 
Aetius said:
Actually, in general you confirmed what I thought. Something I left out is that, in order to improve the relevance of Lambda I would apply the exponential moving averages technique. I am thinking that the ideal procedure would be to take, say the previous 10 days and apply EMA to it. How does it grab you?
Thank you kindly!

It sounds as if you're specifically interested in changes of $\lambda$.
That's what you get with a relatively short period that is exponentially weighted on the most recent days.
It also amplifies the noise in the last day.
But if $\lambda$ changes, then one of the assumptions of Poisson is violated. That is, it's not a Poisson distribution any more.

If you're interested in observing changes in lambda, perhaps a test statistic (e.g. t-test) is more appropriate to figure out if there is a significant change with respect to previous days.
 
Klaas van Aarsen said:
It sounds as if you're specifically interested in changes of $\lambda$.
That's what you get with a relatively short period that is exponentially weighted on the most recent days.
It also amplifies the noise in the last day.
But if $\lambda$ changes, then one of the assumptions of Poisson is violated. That is, it's not a Poisson distribution any more.

If you're interested in observing changes in lambda, perhaps a test statistic (e.g. t-test) is more appropriate to figure out if there is a significant change with respect to previous days.

Are you sure, Klaas? i say this because the only thing that a moving average (exponential and simple) does is giving more weight to the last values, because Lambda does change -as does in the real world- because the number of accidents occur randomly. Think now that instead of accidents, we are researching highway robberies, and Lambda will change. Wouldn't it be almost as taking another Lambda reading for another 10 days which would likely generate another Lambda value??
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 12 ·
Replies
12
Views
4K
Replies
8
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 1 ·
Replies
1
Views
9K
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 17 ·
Replies
17
Views
3K