Uncertainty of histogram bins from MC-simulation

  • Context: Undergrad 
  • Thread starter Thread starter minits
  • Start date Start date
  • Tags Tags
    Histogram Uncertainty
Click For Summary

Discussion Overview

The discussion revolves around calculating uncertainties for histogram bins derived from Monte Carlo (MC) simulations in the context of particle physics. Participants explore the application of the Poisson distribution to estimate uncertainties in differential cross sections, considering the nature of the data and the weighting of events.

Discussion Character

  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant describes calculating differential cross sections from MC simulations and seeks guidance on assigning uncertainties to histogram bins, noting that the differential cross sections do not carry uncertainties from analytical calculations.
  • Another participant questions the assertion that the process is not a counting experiment, suggesting that all histograms involve counting and that the Poisson distribution can provide uncertainty estimates for the number of MC results in each bin.
  • A participant expresses uncertainty about applying the Poisson distribution, particularly regarding the interpretation of event weights and how they affect error calculations for histogram bins.
  • One participant asserts that the Poisson distribution is appropriate for estimating uncertainties, explaining its relevance in the context of sampling from a larger set of outcomes.
  • Another participant notes that if events are weighted, the uncertainty calculation is more complex than simply using the square root of the number of events in a bin.
  • A participant seeks clarification on specific equations related to error propagation and the variance of weighted events, questioning the rationale behind certain mathematical expressions.

Areas of Agreement / Disagreement

Participants generally agree that the Poisson distribution is relevant for estimating uncertainties in this context, but there is disagreement regarding the specifics of applying it to weighted events and how to interpret the results. The discussion remains unresolved on the best approach to calculate uncertainties given the complexities introduced by event weights.

Contextual Notes

Participants express uncertainty about the assumptions underlying the application of the Poisson distribution, particularly in relation to weighted events and the implications for error calculations. There are unresolved mathematical steps regarding the variance of weighted events and how they relate to Poisson fluctuations.

minits
Messages
12
Reaction score
0
TL;DR
How to determine the uncertainty of histogram bins when the entries result from a MC-simulation
Hello everyone,

I calculated the matrix element of a parton level process and determined the total cross section via a MC-simulation. Then I wanted to look at some differential distributions like the differential cross section with respect to the energy of one of the particles in the final state. In order to do this I calculate the differential cross section for 10.000 phase-space points and then do a case analysis in which bin to add the respective result. Now I am stuck with assigning an uncertainty to each bin. How does this work? The differential cross sections do not carry any uncertainty since they are calculated analytically. One approach I often read is the use of the poissonian distibution but I do not think it is the proper way since I feel like I am not really doing a counting experiment. Thanks for any advice!
 
Physics news on Phys.org
Can you show us the histogram? If you want to protect your data, feel free to apply a blind to the axes. I'd just like a sense for how many points fall in each bin before I say something stupid.

I know next to nothing about the physics of what you're studying; however, I'm confused how you could be generating a histogram from MC results and not be doing a counting experiment? Can you explain why you think you're not doing counting? Where I'm going with this is that all histograms are counting. The Poissonian distribution tells you the estimated uncertainty on the number of MC results in each bin (in the appropriate limit of many MC results).
 
Here is the histogram you asked for. I am just not firm with the application of the poisson distribution since everyone says it kind of always applies in counting experiments but I still don't know why that is.
A second thing that I am unsure about is if the error from the MC ist poissonian I should take sqrt(N) as the errors for each bin but I have 10.000 events and in every event another number flows into one of the bins. So if I have 100 events flowing into one bin and the value of these events sum up to two the error won't be 10. So probably I should interpret the specific values as a weight which comlicates things.
 

Attachments

  • hist.PNG
    hist.PNG
    3.5 KB · Views: 253
Last edited:
Ok, seeing that I feel more confident saying Poisson is what you want.

I could give you my spiel about why the Poisson distribution is appropriate, but there are probably 1000's of folks online that have already given good explanations and put more time in than I would have. This video might be a good place to start.

The technical reason for why the poisson distribution is appropriate is that it is the limit of the binomial distribution as the number of trials goes to infinity and the mean number of successes is kept finite. Whenever you skim a few outcomes from a much larger pot of outcomes (raindrops on a roof, electrons over a threshold, photon shot noise, or points in a histogram bin), the Poisson distribution is what you need. If what you're asking is how you assign error bars to your histogram plot, take the square root of the number of points in each bin (that's the signature feature of the PD).
 
You are counting phase space points. If they are selected randomly then Poisson-like uncertainties should be right (your events have weights, so it's not a simple sqrt(N)). If they come from some grid then the uncertainty will be smaller. If computing power allows you can repeat the analysis with different grid patterns and compare the results to estimate the uncertainty.

Just from looking at your plot, bin-by-bin variations are probably very small.
 
  • Like
Likes   Reactions: Twigg
Thanks for your replies! I actually took the info for the uncertainties from the webseite you linked. However I did not understand the following equations:
var(w_i * 1 event) = w_i^2 * var(1 event) = w_i^2
Do you know why the these three expressions are equal? In the first relation why why is it the square of the weight? And after the second equal sign why is the variance of 1 event 1 and not 0? They also talk about
"poissonian fluctuation of the number of events" but where is this used explicitly?
 
I'm on my phone so forgive the lack of LaTeX.

The first equality is error propagation. Var[a*x] = a^2 Var[x]. The second is because for poissonian Var[N events] = N.
 

Similar threads

  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 21 ·
Replies
21
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 3 ·
Replies
3
Views
8K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 10 ·
Replies
10
Views
6K
  • · Replies 0 ·
Replies
0
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K