Need help from someone who knows about importance sampling.

  • Context: Graduate 
  • Thread starter Thread starter af_231
  • Start date Start date
  • Tags Tags
    Sampling
Click For Summary

Discussion Overview

The discussion revolves around the concept of Importance Sampling, particularly its application in combining sampling techniques and dividing probability distributions. Participants explore how to determine the division point (p*) in the distribution and seek clarification on the methodology and terminology associated with sampling methods.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested
  • Homework-related

Main Points Raised

  • One participant describes using Importance Sampling to divide a probability distribution into two parts and seeks guidance on determining the value of p*.
  • Another participant provides a link to external resources and states that the initial comment about p* is incorrect, suggesting that there is no simple answer to the question posed.
  • A participant expresses gratitude for the feedback and acknowledges their inexperience in statistics.
  • A later reply offers a brief explanation of how Importance Sampling works, emphasizing the need to oversample in areas where the function is high and undersample where it is low, while using weights to compensate for bias.
  • The original poster asks additional questions regarding the nature of sampling methods and whether generated random numbers should align with the best fitted distribution from frequency analysis.
  • Another participant points out a potential misunderstanding in the use of the term "random number," indicating a need for clarification on terminology.

Areas of Agreement / Disagreement

Participants express differing views on the interpretation of p* and the terminology used in sampling methods. The discussion remains unresolved regarding the correct understanding of these concepts.

Contextual Notes

There are limitations in the clarity of terminology and assumptions regarding the definitions of random numbers and sampling methods. The discussion reflects varying levels of understanding among participants.

Who May Find This Useful

Individuals interested in statistics, particularly those exploring sampling methods, Importance Sampling, and risk analysis may find this discussion relevant.

af_231
Messages
20
Reaction score
0
Hello!
My question is about Importance Sampling. I am trying to apply a complex sampling method, which combine two sampling techniques. To do this, the importance sampling is used ONLY to divide the probability distribution into two main parts: Part 1: from 0 to p*, and Part 2: from p* to 1. Then, the method continues with the application of stratified sampling.

I would appreciate if someone can help me or explain me Importance Sampling... how to use this technique only to divide the probability? How to know which is the p* value, where the distribution should be divided?

I really appreciate your help...Any information will helpful.

I found in internet the following text:
"Importance Sampling attempts to do more samples at the areas of the function that are more
important. The way it does this is by bringing in a probability distribution function (pdf). All this is, is a function that attempts to say which areas of the function in the interval should get more samples. It does this by having a higher probability in that area."

Does this mean that p* is the CDF value that corresponds to the largest PDF value? ... I don't know, I am confused.
 
Physics news on Phys.org
Thanks!... and I'm sorry about my mistake, I am new in statistics area, but I'm trying to learn.
 
af_231 said:
Thanks!... and I'm sorry about my mistake, I am new in statistics area, but I'm trying to learn.
No need to apologize.

A very short answer to your original question. Assume you have a random variable X and you want to estimate the average of f(X) using Monte Carlo. Then you would oversample X where f(X) is high and undersample where f(X) is low and compensate for the biased sapling by weghts (low for oversample and high for undersample). If done properly, the weighted average has a mean equal to the answer you are looking for, while the standard deviation is reduced in comparison to using unbiased samples.
 
Thanks for your help Mathman!

Can I ask you a favor?... maybe you can help me answering some questions about sampling methods and analysis of risk. I have these doubts and maybe you can help me to answer them.

Question 1) Is this correct?: The basic function of the sampling methods is to generate random numbers with similar characteristics or properties to the original sample. I mean, applying a sampling method, the output is a random sample?

Question 2) My original data is a time series data from which I selected the best fitted distribution through frequency analysis. As part of a risk analysis, I must apply a sampling method... my question is, the random numbers generated by this sampling method should be generated according to the best fitted distribution chosen on the frequency analysis?

Thanks! I really appreciate your help!
 
Last edited:
We need to clarify terminology. You are using the term "random number" in a non-standard fashion. I find it hard to understand what you are trying to do.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 13 ·
Replies
13
Views
3K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K