Replication Method for Survey Estimation

Click For Summary
SUMMARY

The discussion focuses on effective replication methods for estimating means and standard errors in survey data lacking design information. The user is working with a public dataset of 15,000 individuals from the Census, which includes survey weights but lacks detailed survey design variables. Key points include the necessity of using pseudo-samples that maintain the original sample's design for methods like Jackknife and the distinction between replication, re-sampling, and bootstrapping. The user expresses challenges in achieving consistent variance estimates compared to official GVF results.

PREREQUISITES
  • Understanding of survey weighting and its implications in statistical analysis.
  • Familiarity with replication methods such as Jackknife and bootstrapping.
  • Knowledge of variance computation techniques, specifically GVF (Generalized Variance Function).
  • Basic principles of survey design, including clustering and stratification.
NEXT STEPS
  • Research the application of Jackknife resampling techniques in survey data analysis.
  • Explore bootstrapping methods for variance estimation in datasets without design information.
  • Study the implications of survey weighting on statistical inference and estimation.
  • Investigate the Generalized Variance Function (GVF) and its application in survey estimation.
USEFUL FOR

Statisticians, data analysts, and researchers working with survey data, particularly those dealing with datasets lacking complete design information and seeking to improve estimation accuracy.

zli034
Messages
106
Reaction score
0
Hi all,

What is a good way to use replication methods for estimation? For a dataset with no design information, only weighting remain. I know the basic principles of replication. Are there any considerations to use replication effectively?

There is a public data of 15,000 people from the Census I'm using to estimate its mean and standard error for some variables. The dataset has its survey weighting, but the survey design variables are not released to the public. The clustering and stratification stage of the survey were based on geographic information, I have no where to find them.

I do have GVF to compute variance of some variable estimations. After using some replication methods, my estimations are very different from the official GVF results.
 
Physics news on Phys.org
You are using terms unfamiliar to me. By "replication", do you mean "re-sampling" or "bootstrapping"?

What kind of information is the "survey weighting" of the data?
 
To do Jackknife, each replication need to be a pseudo-sample has same design as the original sample. We can call replications re-samples, it is also similar to bootstrapping. Because of the survey design, each observation unit in the sample has to be weighted, the data have weights, which is good thing. However the data do not have design variables, which means it is impossible to re-sample like the original sample did.

Need to make re-samples has similar design to the original sample.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 5 ·
Replies
5
Views
5K
  • · Replies 2 ·
Replies
2
Views
628
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 13 ·
Replies
13
Views
4K
  • Poll Poll
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 101 ·
4
Replies
101
Views
15K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 7 ·
Replies
7
Views
4K