Need help from someone who knows about importance sampling.

  • Thread starter: af_231
  • Tags: Sampling
AI Thread Summary
Importance sampling is being used to divide a probability distribution into two parts for a complex sampling method, followed by stratified sampling. The user seeks clarification on how to determine the p* value for this division and the correct application of importance sampling. It is noted that importance sampling involves oversampling areas of higher function values while compensating for bias with weights. The discussion also touches on the fundamental purpose of sampling methods and the necessity for generated random numbers to align with the best-fitted distribution from frequency analysis. Overall, the thread emphasizes the need for clear terminology and understanding in statistical methods.
af_231
Hello!
My question is about importance sampling. I am trying to apply a complex sampling method that combines two sampling techniques. In this method, importance sampling is used ONLY to divide the probability distribution into two main parts: Part 1, from 0 to p*, and Part 2, from p* to 1. The method then continues with stratified sampling.

I would appreciate it if someone could explain importance sampling to me. How can this technique be used only to divide the probability? And how do I find the p* value at which the distribution should be divided?

I really appreciate your help. Any information will be helpful.

I found the following text on the internet:
"Importance Sampling attempts to do more samples at the areas of the function that are more important. The way it does this is by bringing in a probability distribution function (pdf). All this is, is a function that attempts to say which areas of the function in the interval should get more samples. It does this by having a higher probability in that area."

Does this mean that p* is the CDF value that corresponds to the largest PDF value? ... I don't know, I am confused.
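
As a rough illustration of the split described above (not necessarily the method the poster is following), the sketch below assumes p* is already known and draws one value per equal-probability stratum in each part via the inverse CDF. The function name `stratified_two_parts`, the standard normal stand-in for the fitted distribution, the value p* = 0.9, and the stratum counts are all illustrative assumptions, not taken from the thread.

```python
import random
from statistics import NormalDist

def stratified_two_parts(dist, p_star, n_low, n_high, rng):
    """Return (values, weights) drawn by inverse CDF from two parts of `dist`.

    Part 1 covers cumulative probabilities [0, p_star), Part 2 covers [p_star, 1).
    Each part is cut into equal-probability strata with one draw per stratum.
    """
    values, weights = [], []
    for lo, hi, n in ((0.0, p_star, n_low), (p_star, 1.0, n_high)):
        width = (hi - lo) / n  # probability mass of one stratum in this part
        for k in range(n):
            u = lo + (k + rng.random()) * width      # uniform draw inside stratum k
            u = min(max(u, 1e-12), 1.0 - 1e-12)      # keep inv_cdf away from 0 and 1
            values.append(dist.inv_cdf(u))
            weights.append(width)                    # mass this draw represents
    return values, weights

rng = random.Random(1)
fitted = NormalDist(mu=0.0, sigma=1.0)  # assumption: stand-in for the fitted distribution
p_star = 0.9                            # assumption: arbitrary split point
values, weights = stratified_two_parts(fitted, p_star, n_low=20, n_high=80, rng=rng)

# Weighted mean: each draw counts in proportion to the probability it represents.
weighted_mean = sum(v * w for v, w in zip(values, weights))
print(weighted_mean)
```

Putting more strata in Part 2 than its probability mass would suggest concentrates draws in the upper tail, and the unequal weights compensate for that. This sketch does not say how p* should be chosen.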
 
Thanks! And I'm sorry about my mistake. I am new to statistics, but I'm trying to learn.
 
af_231 said:
Thanks! And I'm sorry about my mistake. I am new to statistics, but I'm trying to learn.
No need to apologize.

A very short answer to your original question. Assume you have a random variable X and you want to estimate the average of f(X) using Monte Carlo. Then you would oversample X where f(X) is high and undersample where f(X) is low, and compensate for the biased sampling with weights (low for oversampled values and high for undersampled ones). If done properly, the weighted average has a mean equal to the answer you are looking for, while the standard deviation is reduced in comparison to using unbiased samples.
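
A minimal sketch of that idea, under assumptions that are not from the thread: X is standard normal, f(X) is the indicator of X > 3, and the oversampling is done by drawing from a normal distribution shifted to mean 3. Each draw is weighted by the ratio of the original density to the shifted density, which for these two normals simplifies to exp(-shift·x + shift²/2).

```python
import math
import random

def tail_indicator(x, threshold=3.0):
    """f(X): 1 if x lies in the rare upper tail, else 0."""
    return 1.0 if x > threshold else 0.0

def plain_monte_carlo(n, rng):
    """Unweighted average of f(X) with X drawn from N(0, 1)."""
    return sum(tail_indicator(rng.gauss(0.0, 1.0)) for _ in range(n)) / n

def importance_sampling(n, rng, shift=3.0):
    """Weighted average of f(X) with X drawn from N(shift, 1) instead of N(0, 1)."""
    total = 0.0
    for _ in range(n):
        x = rng.gauss(shift, 1.0)  # oversamples the region where f is nonzero
        # weight = density of N(0,1) at x divided by density of N(shift,1) at x
        weight = math.exp(-shift * x + 0.5 * shift * shift)
        total += tail_indicator(x) * weight
    return total / n

rng = random.Random(0)
exact = 0.5 * math.erfc(3.0 / math.sqrt(2.0))  # P(X > 3) for a standard normal
plain_estimate = plain_monte_carlo(20000, rng)
is_estimate = importance_sampling(20000, rng)
print(exact, plain_estimate, is_estimate)
```

With the same number of draws, the weighted estimate should typically land much closer to the exact tail probability than the plain one, because nearly every shifted draw contributes information about the tail.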
 
Thanks for your help Mathman!

Can I ask you a favor? I have some doubts about sampling methods and risk analysis, and maybe you can help me answer them.

Question 1) Is this correct? The basic function of a sampling method is to generate random numbers with characteristics or properties similar to those of the original sample. In other words, when a sampling method is applied, is the output a random sample?

Question 2) My original data is a time series, from which I selected the best-fitting distribution through frequency analysis. As part of a risk analysis, I must apply a sampling method. Should the random numbers generated by this sampling method follow the best-fitting distribution chosen in the frequency analysis?

Thanks! I really appreciate your help!
 
We need to clarify terminology. You are using the term "random number" in a non-standard fashion. I find it hard to understand what you are trying to do.
 