Normal distribution negative values

Click For Summary

Discussion Overview

The discussion revolves around the application of normal distribution to real-world scenarios, specifically addressing the implications of negative values in contexts such as bus travel times and particle detector responses. Participants explore the theoretical aspects of normal distributions and their limitations when applied to phenomena that cannot yield negative outcomes.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants express confusion regarding the application of normal distribution to bus travel times, noting that normal distributions theoretically allow for negative values.
  • Others clarify that while the standard normal distribution has a mean of 0, shifting the mean and standard deviation can result in distributions that are primarily positive, but still theoretically allow for negative values.
  • A participant mentions the challenge of modeling particle detector responses with normal distributions, highlighting the issue of generating negative energy values and discussing potential solutions like capping values or using a log-normal distribution.
  • Another participant emphasizes that if a model's parameters are set such that the minimum possible measurement is 0, then a normal distribution may not be appropriate, as it could lead to unrealistic simulation outcomes.
  • One participant points out that even with adjustments to the mean and standard deviation to minimize negative outcomes, there remains a small probability of generating negative values, which could be problematic in practical applications.

Areas of Agreement / Disagreement

Participants generally agree that normal distributions can theoretically yield negative values, which poses challenges for certain real-world applications. However, there is no consensus on the best approach to handle these issues, as various strategies are proposed and debated.

Contextual Notes

Participants note limitations regarding the assumptions of normal distributions, particularly in contexts where negative values are not realistic. The discussion highlights the need for careful consideration of model parameters to avoid nonsensical results.

DrunkenPhD
Messages
10
Reaction score
0
Hi
I read at this link http://www.eecs.berkeley.edu/~aude/papers/TRB2012_stat_traffic.pdf
something like bus travel times can be normally distributed.
Sounds strange to me because normal distribution presumes even negative values
Anything I am missing here?
Regards
 
Last edited by a moderator:
Physics news on Phys.org
Negative values for what?

The standard normal distribution has a mean value μ = 0. Different mean values will shift the distribution to the left or right of the vertical axis, just like the standard deviation σ makes the curve skinny or broad.
 
Thank you SteamKing.
But in case of not standard normal distribution smth like N(μ,σ) where both μ,σ are not equal to 0, I cannot really see smthg like -5 minutes which can be a value of simulation N(μ,σ)
Any ideas about that?
Regards
 
Even in statistics, sometimes you got to employ common sense.

-5 minutes can only be a realistic value if you allow for time travel to the past, which busses might make you feel is happening.

The pdf's used in the study appear to be constructed so that for values of travel time <= 0, the probability is identically 0, which is what would happen in the real world. The mean travel times are also >= 0, which implies that the pdf is shifted to the positive side of the vertical axis.
 
I run into this issue with my research when modeling particle detector responses with a normal distribution since it will on occasion try to produce a negative energy particle (clearly not realistic). There are options but they produce biases. One is to just cap the low end at zero (or nearly zero), which produces a positive bias. Another might be to cap the range from 0 to 2x the mean. This removes the bias but now the RMS is smaller than expected.

An option I've considered but haven't yet implemented would be to use something like a log-normal distribution. It has the feature of not going negative, but approaches a normal distribution when the mean is sufficiently larger than the sqrt(variance).
 
"But in case of not standard normal distribution smth like N(μ,σ) where both μ,σ are not equal to 0, I cannot really see smthg like -5 minutes which can be a value of simulation N(μ,σ)
Any ideas about that?"

Remember that a crude but simple descriptive property of normal distributions is that almost everything is contained between \mu - 3\sigma and \mu + 3 \sigma.

If your model (and that's what it is, a model, a mathematically convenient description of some phenomenon) has the distribution portion normal and mean and standard deviation restricted so that, say, \mu - 6 \sigma and \mu + 6 \sigma take in only "reasonable" values - values that are realistic - you won't have the type of non-sense simulation problems you describe.

On the other hand, if you see a situation in which the minimum possible measurement is 0, and are told the mean is 8 with standard deviation 5, you should know a normal distribution can't apply, as 0 is "too close" to the mean. Using a normal distribution there is a sure way to encounter problems.
 
DrunkenPhD said:
Hi
I read at this link http://www.eecs.berkeley.edu/~aude/papers/TRB2012_stat_traffic.pdf
something like bus travel times can be normally distributed.
Sounds strange to me because normal distribution presumes even negative values
Anything I am missing here?
Regards
In case the other responses didn't make this clear, what you said above is correct. Any normal distribution, regardless of its mean and standard deviation, has infinite tails and therefore nonzero probability of a negative outcome. So some physical phenomenon which cannot be negative (such as bus waiting times) cannot truly be normally distributed. However, it can be approximately true if the mean and standard deviation are chosen so that the probability of a negative outcome is very small. This is what statdad is referring to above.

Be aware, though, that even if you choose ##\mu## and ##\sigma## such that ##\mu - 3\sigma > 0##, you will still have a 0.15% probability that the outcome will be less than ##\mu - 3\sigma##. So if you run enough trials, you WILL get a nonsensical result. If you increase the margin to ##6\sigma##, then the chance of an outcome below ##\mu - 6\sigma## is only about one in a billion. This may or may not be an issue depending on how many times the experiment will be performed.

In practice, if you're writing simulation software where the outcome MUST be non-negative (e.g. if you're going to perform a square root), but the distribution MIGHT generate a negative number, you need to add an operation such as "if x < 0 then set x = 0" to prevent disaster.
 
Last edited by a moderator:

Similar threads

  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 4 ·
Replies
4
Views
4K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
7K
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 6 ·
Replies
6
Views
1K