Independent variable that is a distribution?

Click For Summary
SUMMARY

This discussion centers on the application of logistic regression analysis with binary responses, utilizing income distributions represented as histograms for individual observations within subgroups, or nests. The participant, Seth, proposes fitting a distribution to these histograms instead of using individual income data, aiming to mitigate the ecological fallacy in statistical modeling. This approach involves integrating over known distributions to estimate parameter values, which may be more computationally intensive than traditional random effects models. The discussion highlights the need for further exploration of methods to address the ecological fallacy in statistical analyses.

PREREQUISITES
  • Understanding of logistic regression analysis with binary responses
  • Familiarity with random effects models in statistical modeling
  • Knowledge of ecological fallacy and its implications in data analysis
  • Experience with statistical distributions and their fitting techniques
NEXT STEPS
  • Research methods for fitting statistical distributions to histogram data
  • Explore advanced techniques in logistic regression with random effects
  • Investigate alternative approaches to mitigate ecological fallacy in statistical models
  • Learn about integration techniques for parameter estimation in complex models
USEFUL FOR

Statisticians, data analysts, researchers in social sciences, and anyone involved in logistic regression modeling and ecological fallacy mitigation strategies.

wvguy8258
Messages
48
Reaction score
0
Hi,

I am attempting a logistic regression analysis with a binary response and explanatory variables measured at the individual level and also some measured within subgroups of the observations (nests). For each nest containing individual observations I know the distribution of income in the form of a histogram (but not the individual incomes). I could enter each histogram value as a separate variable (i.e. percent in class 1) but I would rather fit a distribution to each histogram for each nest. Then, similar to a regression with random effects (but in a sense the diametric opposite), a parameter estimate would be assigned to this variable. Analogous to integration over multiple versions of the random effect to find the one which maximizes likelihood, this would integrate over each known distribution for each nest when estimating the parameter estimate attached to it and other independent variables. I am wanting to do this because I believe that income interacts with the relationships between other independent variables and my dependent variable but do not know the actual income, only the distribution over a subset of observations. In doing so, I'm wanting to partially overcome what is called the ecological fallacy. Is this approach a known method? I have not run across it. It seems more computationally intensive than the random effects model as you would have to estimate all parameters in the model and by integration over the pre-specified distributions for each nest. At least the distribution will not also have to be estimated as well. Any food for thought on other ways to overcome the ecological fallacy (correlation)? Thanks. -Seth
 
Physics news on Phys.org

Similar threads

  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 13 ·
Replies
13
Views
5K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 6 ·
Replies
6
Views
5K