How Can I Adjust Regression Analysis for Commuting Patterns?

  • Context: Graduate 
  • Thread starter Thread starter mrburns404
  • Start date Start date
  • Tags Tags
    Data Regression
Click For Summary
SUMMARY

This discussion focuses on adjusting regression analysis for commuting patterns using statistical data on average wages and various influencing parameters. The user has 500 observations of wages affected by local taxes, education, and age, while also accounting for the percentage of residents commuting to work in different areas. The user is exploring weighted regressions in Excel to address the challenge of incomplete data regarding commuting patterns, with the goal of improving the model's reliability, currently indicated by a low R-squared value of ~0.3.

PREREQUISITES
  • Understanding of regression analysis concepts, specifically weighted regression techniques.
  • Proficiency in Excel for statistical modeling and data analysis.
  • Knowledge of statistical significance and R-squared metrics in regression models.
  • Familiarity with factors influencing wage determination, such as local taxes and education levels.
NEXT STEPS
  • Research advanced weighted regression techniques and their implementation in Excel.
  • Explore methods for estimating commuting patterns and their impact on wage data.
  • Learn about multilevel modeling to account for data from different geographical areas.
  • Investigate alternative statistical software tools for regression analysis, such as R or Python's statsmodels.
USEFUL FOR

Data analysts, economists, and researchers focusing on labor economics, particularly those interested in modeling wage influences and commuting patterns.

mrburns404
Messages
2
Reaction score
0
So I have this set of statistical data, which is not completely relevant to what I want to model, and I would like to compensate for that somehow since I do not have the more precise data.

I have about 500 observations of average wages in certain areas which are modeled as dependent on several other parameters (taxes in the area, education of people living in the area, age, etc). The problem is, for each one of those areas I know in percent (from about 5% up to 50%) amount of people traveling to other areas to work there (and ofc getting paid by that area's standard), while still living in home area (and ofc contributing to parameters in home area).

Any ideas how to deal with this kind of problem? I was thinking about weighted regressions but I got kinda stuck since they use standard deviations which is different from what I have.PS I am working with regressions in Excel but any help would be appreciated.
 
Physics news on Phys.org
Can you confirm whether this description of your problem is correct?

Ideally your model would be that a person's wages are a function of where they work, or possibly a function of where they work and where they live, but you only have data on where they live. You know what percentage of people work in different areas than where they live, but you don't know who they are or where they are working.

For example you have regions A, B and C. You know the wages of people living in region A, and you also know 10% of people living in A work in B or C, but you don't know who they are or how many work in B and how many in C?
 
Yes, that seems pretty accurate.

Ideally a model without those "travelling" people would be enough if there were statistical data over people who live and work in same area. So I am trying to somehow reduce the data which include everyone AND the "uncertainty coefficient" for each area expressed in % of people traveling to work (more travels = less reliable data) to this ideal model.

I am still getting meaningful results but the regression is very weak, R squared is ~0.3 or so.
 
Last edited:

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
17
Views
2K
Replies
15
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
1
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K