Are Both Sensible Interpretations of Poisson Behavior?

In summary, there are two different interpretations of "Poisson" behavior, one involving a binomial distribution with a large number of trials and small probability of success, and the other involving an exponential distribution and a Poisson process. While both result in a Poisson distribution, the questions being asked and the assumptions made are slightly different. The first view takes into account the number of houses and the probability of a house burning down, while the second view looks at the time until a specific house burns down. However, in cases with a finite number of houses and a finite time frame, the second view may not accurately model the situation. Therefore, both applications of the Poisson distribution can be sensible, but they may not always give the
  • #1
nonequilibrium
1,439
2
Are both sensible (equivalent? contradictory?) interpretations of "Poisson" behavior?

I've come across two quite distinct notions (or so it seems to me, anyway) of Poisson behavior and I'm not sure if they're equally sensible or perhaps even equivalent. I'll apply both "views" to the same case to show you what I mean.

The first view is how I first met Poisson in my statistics textbook (and the way I did not like):
Say we have n houses, and we know that the probability of one house burning down in the course of one year is p. We can assume that n is very large and p is quite small. We're interested in knowing the probability of the number of houses that burn down in the course of one year. If we look more closely we see, as the burning down of every house is a Bernoulli experiment, that this follows a binomial distribution with variables n and p. As n is large and p is small, we can approximate this distribution by a Poisson distribution with parameter [itex]\lambda = np[/itex] (with [itex]\lambda[/itex] a "normal" size, way larger than p, way smaller than n).

The second "Poisson" view is called a Poisson process (it has got its own wiki page):
Clear your head of the previous case. We now regard the burning down of one specific house as an intrinsically random event, hence the time until burning down is modeled well by an exponential distribution. Call the exponential distribution parameter [itex]\lambda[/itex]. We want to know the probability distribution for the number of houses that burn down after a time t (afterwards we will take "t = one year"). As the probability of a house burning down in a given time interval dt is [itex]\lambda \mathrm d t[/itex] (you can see this as a consequence of the memorylessness of the exponential distr.), we can see that "the number of houses that burn down after a time t" as a sum of [itex]\frac{t}{\mathrm dt}[/itex] bernouilli experiments, each with probability [itex]\lambda \mathrm d t[/itex]. In the limit [itex]\mathrm d t \to 0[/itex], this is described by a Poisson distribution with parameter [itex]\lim_{\mathrm d t \to 0} \left(\lambda \mathrm dt \right) \left( \frac{t}{\mathrm d t} \right) = \lambda t[/itex].

You see that in both cases we arrive at a Poisson distribution but in quite distinct ways. (For ease, take the units of lambda to be "per year", then the lambda in both cases is equal.) But is the result equivalent? I think not, right? For one thing, in the former case, the end result depended on the number of houses, whereas in the latter case it didn't enter at all.
Even conceptually it is quite different, no? In the former case, it was truly a Binomial distribution which we mathematically approximated by a Poisson distribution (cause n was big and p was small), but in the latter case it was the Binomial distribution which was a temporary approximation, but by taking the limit we got the actual nature of the problem. But perhaps this note is too much philosophy and too little hardcore mathematical objection.

Anyway, I was wondering, are both sensible applications of the Poisson distribution? I think both derivations make sense, but on the other hand the results are different. Which of the two should a company use? Or should we expect them to make similar predictions? Is there any of the two more true than the other? Do they apply in distinct cases? And am I the only one who thinks the second view is somehow more pleasing? (I just encountered the second view in my physics Markov course, and only now do I feel I somehow understand Poisson behavior.)
 
Last edited:
Physics news on Phys.org
  • #2


Although similar, the questions being asked in the two cases are slightly different. In the first example, you are looking at the distribution of the number of houses burnt down, while the second is concentrating on the probability of a given house burning down.
 
  • #3


I don't think so. Both answer the question "how many houses can I expect to burn down in a year?".
 
  • #4


You are being confused by the fact that λ means different things in the two explanations and by a feature of the trial I shall come to later. The following should be clearer but lacks rigour:

In the first view we use the fact that a binomial distribution tends towards a poisson distribution as n->∞ (google binomial limit poisson for more details). With n trials the expected value of the number of houses burning down is np, and so the limit is a Poisson distribution with mean μ = np.

The explanation of the second view is somewhat confusing. It starts with the result that where the time between successes forms an exponential distribution (as it does in most cases because if you wait twice as long you expect twice as many successes) that the number of successes in time t forms a Poisson distribution with mean μ = ft where f is the frequency parameter of the exponential distribution. This equivalence can be derived directly, there is no need to introduce the binomial distribution and take it to the limit.

Combining the two expressions for μ we see that μ = np = ft, or f = np / t which is what we would expect.

But (here is the bit about the feature of the trial), with a finite set of houses and a finite time (in particular a time shorter than the time it takes to rebuild a house), this is not a Poisson process. In a year, either a house burns down or it doesn't: it can't burn down twice so the first model as a Bernoulli experiment with the associated binomial distribution is accurate and the result does depend on n. In the second model we are using the Poisson distribution to approximate the Binomial distribution: the Poisson distribution does not depend on n, and that is why it is not a precise model: the error is a function of n (and p).

If instead the trial was a house being struck by lightning, the first scenario would be the approximation: it is quite feasible (however improbable) for a given house to be struck by lightning twice in a year so this is not a Bernoulli experiment, but it is a Poisson process. But if p is small enough we can use the Binomial distribution as an approximation: the error is a function of p (and n).

So with these (abbreviated) facts:
  • Bernoilli experiments are accurately modeled by the binomial distribution
  • Events with an exponential distribution form a Poisson process
  • The differences between the Poisson distribution and the binomial distribution become small for large N (providing p is small enough)
you can answer your own questions.

I'll add one more thing: if N and p are sufficiently large, both the binomial distribution and the Poisson distribution tend towards the normal distribution (this can be derived directly or seen quickly from the Central Limit Theorem) which is often the easiest to use.
 
Last edited:
  • #5


I would say that both interpretations of "Poisson" behavior are sensible and valid, but they are also distinct and should not be considered equivalent. The first interpretation, where the Poisson distribution is used to approximate a binomial distribution, is more commonly used in statistics and has practical applications in fields such as finance and insurance. The second interpretation, known as a Poisson process, is used in physics and other fields to model events that occur randomly over time.

While both interpretations may result in a Poisson distribution, the underlying assumptions and methods used to arrive at this distribution are different. In the first interpretation, the number of events (e.g. houses burning down) is fixed and the probability of each event is small, while in the second interpretation, the number of events is variable and the probability of each event occurring is constant over time. Therefore, the results may differ and it is important to consider the context and assumptions of each interpretation when deciding which one to use in a specific situation.

In terms of which interpretation is more "true", it depends on the context and the specific phenomenon being studied. The Poisson distribution is a useful tool for modeling random events, but it may not be the most accurate or appropriate model in all cases. It is important to carefully consider the assumptions and limitations of each interpretation before applying it to a real-world problem.

Overall, both interpretations of "Poisson" behavior have their own strengths and applications, and it is up to the scientist to determine which one is most suitable for their specific research question or problem.
 

1. What is the Poisson distribution?

The Poisson distribution is a probability distribution that is used to model the number of times an event occurs in a given time period or space, when the event occurs independently and at a constant average rate. It is commonly used in fields such as statistics, physics, and engineering to analyze rare events.

2. How is the Poisson distribution related to Poisson behavior?

Poisson behavior refers to the characteristics of a system or process that can be described by the Poisson distribution. This means that the system or process has a constant average rate of occurrence, and the events are independent of each other.

3. What are the two sensible interpretations of Poisson behavior?

The two sensible interpretations of Poisson behavior are the frequency interpretation and the waiting time interpretation. The frequency interpretation focuses on the number of events that occur in a given time or space, while the waiting time interpretation focuses on the time between events.

4. How do you determine if data follows a Poisson distribution?

To determine if data follows a Poisson distribution, you can use statistical tests such as the Chi-square test or the Kolmogorov-Smirnov test. These tests compare the observed data to the expected values from a Poisson distribution and can determine if the data fits the distribution.

5. What are the limitations of using the Poisson distribution to model real-world events?

The Poisson distribution assumes that events occur independently and at a constant average rate, which may not always be true in real-world situations. In addition, the distribution is only appropriate for modeling rare events, so it may not be suitable for events that occur frequently. Other factors, such as external factors or human behavior, may also impact the accuracy of using the Poisson distribution to model events.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
12
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
16
Views
1K
  • Calculus and Beyond Homework Help
Replies
8
Views
607
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
768
  • Set Theory, Logic, Probability, Statistics
Replies
0
Views
963
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
Replies
1
Views
848
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
15
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
Back
Top