Design Matrix: Restrictions & Definition - Mike

  • Context: Undergrad 
  • Thread starter Thread starter mikeph
  • Start date Start date
  • Tags Tags
    Design Matrix
Click For Summary
SUMMARY

The discussion clarifies that a design matrix is a mathematical entity used in statistical modeling, specifically in ANOVA and regression analysis. It cannot consist entirely of zeros, as demonstrated through a linear regression example involving points (2,5), (3,4), (5,12), and (7,13). The design matrix must include a column of ones and the corresponding x-values. Additionally, issues arise when observation vectors are nearly linearly dependent, which can lead to unpredictable results, necessitating the use of software tools to identify such vectors.

PREREQUISITES
  • Understanding of linear regression and ANOVA
  • Familiarity with design matrices in statistical modeling
  • Knowledge of multivariate and univariate analysis
  • Basic concepts of Principal Component Analysis (PCA)
NEXT STEPS
  • Research the construction of design matrices for linear regression
  • Learn about the implications of multicollinearity in regression analysis
  • Explore software tools for detecting linear dependence in observation vectors
  • Study Principal Component Analysis (PCA) for variable uncorrelation
USEFUL FOR

Statisticians, data analysts, and researchers involved in regression analysis and statistical modeling will benefit from this discussion.

mikeph
Messages
1,229
Reaction score
18
Hi,
Are there any restrictions on what a design matrix can be?

I have no background in this area, I'm just wondering from the wikipedia article, is it a mathematical entity or simply a name? Can it be all zeros, diagonal, or anything I want?

Thanks
Mike
 
Physics news on Phys.org
What type of problem are you considering? Design matrices can have different forms depending on whether you are doing a type of ANOVA or regression, whether the problem is multivariate or univariate.
The only easy answer to give is the one for "can it be all zeros?" No, it can't.

As one simple example: if you want to do a linear regression through the points (2,5), (3,4), (5,12), (7,13),
the "design matrix" that would be used to develop the fit with matrix methods has its first column all 1s and the second column the x-values of these points.
 
Hey MikeyW.

One thing you need to look out for is when you have observation vectors that are nearly linearly dependent. When this occurs you get all kinds of crazy behavior.

Most packages will pick this up and you can use software to find out the vectors that have this property.

Also if you have variables that are largely correlated, then you can use something like Principal Components to un-correlate them and use for a regression.
 

Similar threads

  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 14 ·
Replies
14
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 33 ·
2
Replies
33
Views
3K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 5 ·
Replies
5
Views
5K