Leaving out a confounder in longitudinal regression?

  • Context: Graduate 
  • Thread starter Thread starter FallenApple
  • Start date Start date
  • Tags Tags
    Longitudinal Regression
Click For Summary
SUMMARY

This discussion focuses on the complexities of including confounders in longitudinal regression models, specifically in the context of analyzing the relationship between creatine doses and sports performance over time. The example provided illustrates the challenge of incorporating injury status as a confounder when its occurrence is not fixed throughout the study. The consensus is that while including such a variable may lead to invalid inferences, omitting it could also compromise the validity of the analysis. Ultimately, the responsibility for drawing accurate inferences lies with the subject matter expert, emphasizing the need for careful consideration of confounding variables in statistical modeling.

PREREQUISITES
  • Understanding of linear mixed models in statistical analysis
  • Familiarity with confounding variables and their impact on regression outcomes
  • Knowledge of longitudinal data analysis techniques
  • Experience with statistical software for regression modeling, such as R or Python
NEXT STEPS
  • Research the application of linear mixed models in longitudinal studies
  • Explore methods for identifying and adjusting for confounders in regression analysis
  • Learn about the implications of time-varying covariates in statistical modeling
  • Investigate stepwise multiple regression techniques for managing independent variable correlation
USEFUL FOR

Statisticians, data analysts, researchers in sports science, and anyone involved in longitudinal studies who seeks to understand the implications of confounding variables in regression analysis.

FallenApple
Messages
564
Reaction score
61
Say I want to analyze how a relation changes though time. Like usual, I would throw in the potential confounders into the regression model. But what if some of the confounders are not determined at the beginning of the study but at some point within time?

For example, say I want to analyze the relation between creatine doses and sports performance over time. So a basic linear mixed model would be: ##Performance_{ij} \sim Dose_{ij}+Time_{ij}+Dose_{ij}*Time_{ij}+PotentialConfounders_{i}+RandomEffects_{i}+error_{ij}##

Where i is the ith subject and j is within that subject. Say Dose and Performance are measure longitudinally. and the rest of covariates are just recorded once and is the same for all levels within the ith subject.

Say I suspect that gender might confound the relation between creatine and performance. Well, a persons gender is fixed throughout the study.

But what about say injury? Say I have an injury indicator variable that's says whether the athlete experiences injury during the study. Where I don't know when it happened in the study. This is different from say gender, where even if the data is collected after, we know it couldn't have changed. Anyways, If I consider Injury Status as potential confounder, should I include it? I mean, it might not be present at all time points. It seems that whatever inference obtained by including it would be invalid. But then again, if injury is a confounder, then not adjusting for it would also result in invalid inferences.
 
Last edited:
Physics news on Phys.org
So yes, you can only partially correct for confounding. Better than nothing, isn't it?
Also note that for a confounder to be a confounder, it has to correlate both with the effect and with the parameter of interest. E.g. injury may not correlate with dose (if subjects take the same dose independently of whether injured or not) but certainly correlates with time.
 
FallenApple said:
It seems that whatever inference obtained by including it would be invalid. But then again, if injury is a confounder, then not adjusting for it would also result in invalid inferences.
The inferences to draw from statistical analysis is always entirely the responsibility of the subject matter expert. One should be careful about thinking that statistics will do more than show correlation.

It should be noted that, unless a person has control of the independent variables, most independent variables will be a confounder. When a person puts a variable into a model, it is usually because he suspects some relationship or correlation between it and the dependent variable. It would be unusual for two such independent variables to not also show some correlation among themselves. So that is what procedures like stepwise multiple regression are designed to handle. The exception to this is when an experiment is specifically designed to minimize or even completely eliminate the correlation between the independent variables.
 
Last edited:

Similar threads

  • · Replies 11 ·
Replies
11
Views
8K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 2 ·
Replies
2
Views
1K
Replies
4
Views
2K
  • · Replies 11 ·
Replies
11
Views
5K
Replies
63
Views
11K
  • · Replies 6 ·
Replies
6
Views
7K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 20 ·
Replies
20
Views
3K
  • · Replies 51 ·
2
Replies
51
Views
5K