Over constrained least squares analysis

Click For Summary

Discussion Overview

The discussion revolves around an over-constrained least squares problem, specifically focusing on how to analyze the contribution of individual vectors (rows of matrix A) to the correct solution. Participants explore methodologies for assessing the importance of these vectors in relation to the solution.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Benzun introduces the problem and suggests using eigen decomposition to analyze the contribution of vectors to the solution, seeking intuition on the methodology.
  • One participant questions whether the vectors in question are the rows of matrix A and proposes that the row producing the least deviation from Y might be the most reliable.
  • Another participant suggests that the most independent vector could be an outlier and emphasizes the need to clarify what it means for one vector to contribute more than another to the solution.
  • Concerns are raised about the lack of a clearly defined function for measuring vector contributions, noting that least squares treats all vectors as equally important.
  • A suggestion is made to consider small variations in one vector and observe the resulting variation in the answer, potentially leading to a Jacobian that could be collapsed into a scalar.

Areas of Agreement / Disagreement

Participants express differing views on how to define and measure the contribution of individual vectors to the solution, indicating that the discussion remains unresolved with multiple competing perspectives.

Contextual Notes

There are limitations in defining the contribution of vectors, as the least squares method treats all vectors equally, and the implications of doubling a row and its corresponding Y element are not fully explored.

benzun_1999
Messages
260
Reaction score
0
Hi,

I have a over constrained least squares problem. I also have the correct solution to the problem. Now I need to determine which of the vectors are contributing how much information that is close to the correct solution. I am sure there should be some methodology for this analysis, but I don't know. Maybe I can do a eigen decomposition to find which all vectors are unique? But would that say anything to if these vectors are helping me find the solution closer to the correct solution? I would be grateful if someone can give me some intuition into this problem.

Thanks,
Benzun
 
Physics news on Phys.org
Expressing the problem as AX=Y, A and Y given, what are these vectors? Are they the rows of A?
If so, it will not generally happen that a subset of these vectors is linearly dependent.
The row which produces the least deviation from the Y value could be considered the most reliable.
Or maybe you want the row which, in some sense, is 'most independent' of the other rows. Something like, that vector which subtends the greatest angle to the subspace generated by the rest?
 
Most independent vector can be an outliers. yes the vectors I am talking about are the rows of A matrix. I also know vector X and Y. I know the correct solution to the problem. I could have found X with a minimal definition of A since I have A over-defined, I want to find out how much contribution each vector had to my final solution. This analysis will help me identify the important vectors.
 
benzun_1999 said:
Most independent vector can be an outliers. yes the vectors I am talking about are the rows of A matrix. I also know vector X and Y. I know the correct solution to the problem. I could have found X with a minimal definition of A since I have A over-defined, I want to find out how much contribution each vector had to my final solution. This analysis will help me identify the important vectors.
Ok, but my point is that this is not a clearly defined function. What does it mean to say that one vector contributes more to the answer than another? As far as the least squares formula is concerned, they are all treated as equally important. What will you use the answer for? That might help clarify the concept.
Note that if you were to double a row and double the corresponding Y element then this would double the influence of the row on the answer. Is that consistent with your intuitive concept here?
Here's a possibility: consider small variations in one vector (counting the Y element as part of the vector) and observe the variation in the answer. You'd get a Jacobian in general, so you'd then need a way to collapse that into a scalar.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 5 ·
Replies
5
Views
1K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 4 ·
Replies
4
Views
5K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K