Multiple regression, why use categories

bradyj7 · Sep 6, 2012

Hello,

I have a question regarding multiple regression.

I am reading a paper in which the author performed a multiple regression to predict the energy consumption of an electric car based on a 27 variables measured during journeys, such as speed and acceleration etc.

The author categorised the variables into 4 groups as shown in this table. If 2 variables were correlated within the group he dropped one variable. At the end he has 16 nominated variables for the regression.

https://dl.dropbox.com/u/54057365/All/regtable.JPG

My questions are:

1. What is the advantage of using the categories?
2. What if two variables in separate groups are correlated?
3. Could he have put all the variables in one group and did a stepwise or best subsets regression?

The reason I am asking these questions is because, multicollinearity does not matter if your regression is for prediction. He is removing correlated variables within the categories but not between the categories.

I would of thought that leaving them all in one category, dropping one of two highly correlated variables and then doing a best subsets regression would be a better approach.

My main question is, what if any is the advantage of using the 4 categories?

Thank you

John

mfb · Sep 6, 2012

1. What is the advantage of using the categories?

I would guess that you look at correlations only where you expect them to come from general concepts of car/tours/drivers and not from specific routes chosen for the calibration.

Categories reduce the complexity of the analysis a bit - maybe it is just a question of computational power. The "best" concept (in an ideal world with test data of arbitrary size and infinite computation power) would be to use all variables, but that might be impractical.

Multiple regression, why use categories

Similar threads

Graduate Hypothesis testing: Defining H0, HA hypotheses so that ( H_A)_A' makes sense

Undergrad My basic understanding of set theory

Undergrad How do E[X] and E[|X|] relate?

Graduate Expected numbers of cards of a last color remaining

Undergrad How does axiom of foundation prevent infinite sequence of elements?

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight