Increasing variance of weights in sequential importance sampling

Summary
Increasing variance of the importance weights in sequential importance sampling (SIS) leads to the degeneracy problem, where a few particles gain high normalized weights while many others become insignificant. This high variance is problematic because it means the estimate rests on only a small number of particles, reducing the effectiveness of the sampling process. Resampling addresses the issue: it eliminates low-weight samples and duplicates high-weight ones, which lowers the variance of the weights and improves exploration of the posterior density. The discussion also clarifies the terminology: SIR (sequential importance resampling) includes a resampling step, while SIS does not. Understanding these concepts is crucial for implementing sequential importance sampling techniques effectively.
sisyphuss
Hi all,

I know about these facts:
1- The variance of the importance weights increases in SIR (also known as the degeneracy problem).
2- It's bad (lol), because in practice there will be one particle with a high normalized weight and many particles with insignificant (normalized) weights.

But I cannot really understand what "increasing variance of importance weights" means in sequential importance sampling (SIR).

Can you please explain it to me? Why is the high variance bad? Also, is there an intuitive proof of that?

Thanks.
 
sisyphuss said:
Hi all,

I know about these facts:
1- The variance of the importance weights increases in SIR (also known as the degeneracy problem).
2- It's bad (lol), because in practice there will be one particle with a high normalized weight and many particles with insignificant (normalized) weights.

You may know those facts, but they haven't struck a chord with anyone else, based on those fragmentary descriptions. There are probably people on the forum who know enough probability theory to help you if you describe your question precisely.

But I cannot really understand what "increasing variance of importance weights" means in sequential importance sampling (SIR).

I certainly don't know what "increasing variance of importance weights" means. And why do you use the abbreviation "SIR" for "sequential importance sampling"? Did you mean "sequential importance re-sampling"?
 
sisyphuss said:
Hi all,

I know about these facts:
1- The variance of the importance weights increases in SIR (also known as the degeneracy problem).
2- It's bad (lol), because in practice there will be one particle with a high normalized weight and many particles with insignificant (normalized) weights.

But I cannot really understand what "increasing variance of importance weights" means in sequential importance sampling (SIR).

Thanks.

- The abbreviation for sequential importance sampling is SIS, as far as I know, and that approach does not include resampling, unlike sequential importance resampling (SIR).

- Basically, facts 1 and 2 say much the same thing. The variance is the second central moment of the normalized weights (informally, it measures the average squared deviation from the mean). The sample weights are normalized, and at the initialization of the algorithm they are distributed evenly (low variance). As time goes by and we proceed with the algorithm, some particles perform well and gain more and more weight, and since the variance is sensitive to such outliers, it grows.
The solution in SIR is to resample: eliminate samples with low importance weights and duplicate samples with high weights. This lowers the variance of the weights and keeps us from wasting effort on particles with negligible weight, so we explore the interesting regions of the posterior density (the ones with high weights). The resampling step is often followed by a Markov chain Monte Carlo (MCMC) move to reintroduce sample diversity without affecting the posterior density. A concrete sketch is given below.
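
As an illustration (not taken from the posts themselves; the toy model and all parameter values below are arbitrary choices), here is a minimal Python sketch. It runs plain SIS on a one-dimensional random-walk model, prints the variance of the normalized weights together with the usual effective-sample-size diagnostic ##N_{eff} \approx 1/\sum_i w_i^2##, and includes a multinomial resampling step of the kind SIR adds:

Code:
import numpy as np

# Toy 1-D model (arbitrary choices, for illustration only):
#   hidden state:  x_t = x_{t-1} + N(0, 1)
#   observation:   y_t = x_t + N(0, 0.5^2)
rng = np.random.default_rng(0)
N, T = 1000, 50
particles = rng.normal(0.0, 1.0, N)
weights = np.full(N, 1.0 / N)              # uniform weights -> zero variance

def ess(w):
    # effective sample size estimate: 1 / sum(w_i^2); equals N for uniform weights
    return 1.0 / np.sum(w ** 2)

true_x = 0.0
for t in range(T):
    true_x += rng.normal(0.0, 1.0)
    y = true_x + rng.normal(0.0, 0.5)

    # SIS step: propagate with the prior and multiply the weights by the likelihood
    particles = particles + rng.normal(0.0, 1.0, N)
    weights = weights * np.exp(-0.5 * ((y - particles) / 0.5) ** 2)
    weights = weights / np.sum(weights)

    print(f"t={t:2d}  var(w)={weights.var():.2e}  ESS={ess(weights):.1f}")

    # The step SIR adds: when the weights degenerate, resample and reset to uniform
    if ess(weights) < 0.5 * N:
        idx = rng.choice(N, size=N, p=weights)   # high-weight particles get duplicated
        particles = particles[idx]
        weights = np.full(N, 1.0 / N)            # weight variance drops back to zero

Commenting out the resampling block at the end of the loop reproduces the behaviour the original post asks about: var(w) grows and the ESS collapses towards 1 within a few steps, i.e. effectively only one particle still contributes.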

If you are interested, read the great book on the topic:

A. Doucet, N. de Freitas, and N. Gordon. Sequential Monte Carlo Methods in Practice. Springer-Verlag, 2001.
 