Replication Method for Survey Estimation

AI Thread Summary
Replication methods for estimation in survey data without design information can be challenging, particularly when only survey weights are available. Effective use of replication requires understanding the original survey's clustering and stratification, which are often based on geographic information. The lack of design variables complicates the ability to create pseudo-samples that accurately reflect the original sample's design. While techniques like Jackknife and bootstrapping can be employed, they may yield results that differ significantly from official estimates if not applied correctly. Overall, careful consideration of the survey weighting and the limitations of the available data is essential for accurate estimation.
zli034
Messages
106
Reaction score
0
Hi all,

What is a good way to use replication methods for estimation? For a dataset with no design information, only weighting remain. I know the basic principles of replication. Are there any considerations to use replication effectively?

There is a public data of 15,000 people from the Census I'm using to estimate its mean and standard error for some variables. The dataset has its survey weighting, but the survey design variables are not released to the public. The clustering and stratification stage of the survey were based on geographic information, I have no where to find them.

I do have GVF to compute variance of some variable estimations. After using some replication methods, my estimations are very different from the official GVF results.
 
Physics news on Phys.org
You are using terms unfamiliar to me. By "replication", do you mean "re-sampling" or "bootstrapping"?

What kind of information is the "survey weighting" of the data?
 
To do Jackknife, each replication need to be a pseudo-sample has same design as the original sample. We can call replications re-samples, it is also similar to bootstrapping. Because of the survey design, each observation unit in the sample has to be weighted, the data have weights, which is good thing. However the data do not have design variables, which means it is impossible to re-sample like the original sample did.

Need to make re-samples has similar design to the original sample.
 
I was reading a Bachelor thesis on Peano Arithmetic (PA). PA has the following axioms (not including the induction schema): $$\begin{align} & (A1) ~~~~ \forall x \neg (x + 1 = 0) \nonumber \\ & (A2) ~~~~ \forall xy (x + 1 =y + 1 \to x = y) \nonumber \\ & (A3) ~~~~ \forall x (x + 0 = x) \nonumber \\ & (A4) ~~~~ \forall xy (x + (y +1) = (x + y ) + 1) \nonumber \\ & (A5) ~~~~ \forall x (x \cdot 0 = 0) \nonumber \\ & (A6) ~~~~ \forall xy (x \cdot (y + 1) = (x \cdot y) + x) \nonumber...
I was reading documentation about the soundness and completeness of logic formal systems. Consider the following $$\vdash_S \phi$$ where ##S## is the proof-system making part the formal system and ##\phi## is a wff (well formed formula) of the formal language. Note the blank on left of the turnstile symbol ##\vdash_S##, as far as I can tell it actually represents the empty set. So what does it mean ? I guess it actually means ##\phi## is a theorem of the formal system, i.e. there is a...
Back
Top