Graduate "Population-averaged" regression on panel data using Stata

monsmatglad · May 1, 2019

Hey. I am running regressions on panel data and testing different approaches in Stata. When I use the "population-averaged" option, no R-squared measures are reported. The approach is equivalent to running a regular linear regression on the panel data, and according to my professor, an R-squared is statistically "allowed." When I run a regular linear regression on the data, the coefficients and significance levels are almost identical to the "population-averaged" results, but an R-squared and an adjusted R-squared are reported. Is there a reason why Stata does not provide an R-squared estimate (within, between, overall) when applying "population-averaged"? Is there a way to make it report such a measure? And if not, can I use the R-squared from a regular linear regression as a "substitute"?

Mons

Dale · May 1, 2019

I am not sure that R^2 makes sense for a population-averaged analysis. In general, R^2 measures the proportion of the variance in the data that is explained by fitting the model to the data. However, in a population-averaged analysis you don't really produce a model that explains the individual data points at all, so there isn't anything against which to measure the variance.
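For reference, this is the usual definition of R^2 for a linear model fitted with an intercept. The symbols are introduced here for illustration and do not appear elsewhere in the thread: y_i is an observed outcome, \hat{y}_i is its fitted value, and \bar{y} is the sample mean.

$$R^2 = 1 - \frac{\sum_i (y_i - \hat{y}_i)^2}{\sum_i (y_i - \bar{y})^2}$$

The numerator is the variance left unexplained at the level of individual observations, which is why the measure presupposes a model that predicts each data point.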

For example, suppose you have a control group and a treatment group of seeds, the seeds have several different characteristics, your outcome is sprouting or not sprouting, and you are doing a logit regression. A normal regression will give you the odds of a given control seed sprouting vs. the odds of that same seed sprouting under the treatment. So it is a statement about that individual seed and can be used to explain the actual outcome of that specific data point. In contrast, the population-averaged regression will give you the odds of an average control seed sprouting vs. the odds of an average treatment seed sprouting. It does not explain any of the individual data points, and if your experimental assignment is not random, the comparison can be biased by differences between the two populations.

I think that if you want an R^2 value you should not use a population-averaged regression. It just doesn't seem to make sense to me.
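For anyone who still wants to see the comparison described in the original question, here is a minimal Stata sketch. It is only an illustration: the variable names id, year, y, x1 and x2 are hypothetical, and the squared correlation between the outcome and the linear prediction is a descriptive stand-in, not an R^2 that Stata itself reports for the population-averaged estimator.

```stata
* Sketch only: id, year, y, x1 and x2 are hypothetical variable names.
xtset id year

* Population-averaged (GEE) fit; no R-squared appears in the output.
xtreg y x1 x2, pa

* Descriptive stand-in: squared correlation between y and the linear
* prediction, restricted to the estimation sample.
predict double yhat_pa if e(sample), xb
quietly correlate y yhat_pa
display "squared corr(y, yhat_pa) = " r(rho)^2

* Pooled linear regression for comparison; its R-squared is stored in e(r2).
regress y x1 x2, vce(cluster id)
display "pooled OLS R-squared = " e(r2)
```

With the default exchangeable working correlation, the two numbers should be close but need not match. If the population-averaged model is fitted with corr(independent), its point estimates coincide with pooled OLS, so the squared correlation reproduces the OLS R^2. Either way it describes fit to the pooled data and carries the caveats raised above.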
