How can a vector be multiplied by itself ?

  • Context: Graduate 
  • Thread starter Thread starter ahmadka
  • Start date Start date
  • Tags Tags
    Vector
Click For Summary

Discussion Overview

The discussion revolves around the interpretation of a least squares equation involving vector and matrix multiplication, specifically addressing how a vector can be multiplied by itself in the context of dimensionality reduction. Participants explore the implications of the equation for transforming high-dimensional data to low-dimensional data.

Discussion Character

  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant questions how a column vector can be squared in the equation, suggesting it might refer to a dot product instead.
  • Another participant asserts that the result of the multiplication aTxi is a 1x1 matrix, implying it is a scalar.
  • A different participant clarifies that aT is a matrix with dimensions greater than 1, and discusses the implications of the equation leading to a column vector.
  • Some participants propose that the expression could represent the inner product of a vector with itself, leading to a sum of squares.
  • There is a suggestion that a should be interpreted as a matrix rather than a vector, emphasizing its role in transforming dimensionality.
  • One participant argues that for the equation to hold, a must be a transformation matrix that reduces dimensions from n to p, where n > p.
  • Another participant reinforces the idea that aTx must match the dimensionality of yi, suggesting that both must be scalars, thus supporting the interpretation of a as a row vector.
  • Participants discuss the context of the equation in relation to assessing independent variables affecting a measurable outcome, linking it to the minimization of residuals.

Areas of Agreement / Disagreement

Participants express differing views on the nature of the vector and matrix involved in the equation, with no consensus reached on whether a should be considered a vector or a matrix. The discussion remains unresolved regarding the interpretation of the multiplication and the dimensionality of the components.

Contextual Notes

There are unresolved assumptions regarding the definitions of the vectors and matrices involved, as well as the notation used in the equation. The discussion highlights the complexity of interpreting the mathematical expressions without a clear agreement on dimensionality and representation.

ahmadka
Messages
3
Reaction score
0
Hi all ... Well I'm reading a paper and this simple least squares equation is not making sense to me .. This is the equation (I tried using latex but it ain't working well for me):

http://img19.imageshack.us/img19/3046/equationk.png

A little background on this equation:

basically this equation is being used for finding a mapping which takes some high dimensionality data (x), and maps it to some low dimensional data (y). The mapping transformation matrix (a) is what we have to find. The mapping should be such that we minimize the error between the original and projected data (as you can probably tell) ..

Inside the summation, y and x are being accessed serially in a column by column manner, and so yi and xi are referring to individual columns of matrices y and x, respectively .. Also, as you can probably tell, both y and x contain the same number of columns, m. Only the number of rows they each have is different ..

Now here's my question .. As far as I can tell (and I'm virtually sure I'm correct here), the result of the evaluation being performed inside the brackets would result in a column vector ... And then this column vector is being squared !? That is, this is what the equation is suggesting:

( <column vector here> )2

How is this possible ? Or is the equation maybe referring to a dot product here ?
 
Last edited by a moderator:
Physics news on Phys.org
Doesn't aTxi result in a 1x1-matrix? I.e. a number
 
micromass said:
Doesn't aTxi result in a 1x1-matrix? I.e. a number

Nope ... aT is a matrix with both dimensions being greater than 1 .. xi and yi are column vectors ...

so this is basically what is happening here:

((matrix*column vector) - column vector) = (column vector - column vector) = column vector

(column vector)2 ... !??!
 
They probably mean the inproduct with itself. I.e.

(x_1,...,x_n)^2=x_1^2+...+x_n^2

I don't like the notation however...
 
micromass said:
Doesn't aTxi result in a 1x1-matrix? I.e. a number
Exactly.

The way to read this expression is that a and each xi are row vectors -- i.e., 1×N matrices. This makes aT an N×1 matrix, which in turn makes aTxi a scalar.
 
Last edited:
I agree with DH. Also see here.
 
D H said:
The way to read this expression is that a and each xi are row vectors -- i.e., 1×N matrices. This makes aT an N×1 matrix, which in turn makes aTxi a scalar.

The thing is with regards to the problem at hand, it makes more sense for a to be a matrix, and not a vector. I say this because a is a transformational matrix which takes a high dimensional column vector and turns it into a low dimensional column vector .. So for example, suppose x contains m column vectors which are each n dimensions long (so then x would be an nxm matrix), and we want to reduce the number of dimensions these column vectors use, from n to p let's say, and so n>p .. Let xi refer to anyone of the m column vectors in x, then ...

yi = aTxi

where yi contains the reduced number of dimensions, i.e. p ..

So for such a problem, a cannot be column vector .. rather is would be a pxn transformation matrix which transforms a nx1 column vector into a px1 column vector ...

So I really don't see how for the problem at hand, a can be a column vector ..

Landau said:
I agree with DH. Also see here.

Thats me again .. :)
 
ahmadka said:
The thing is with regards to the problem at hand, it makes more sense for a to be a matrix, and not a vector. ...
Ask yourself: Does that interpretation make any sense? Look at the equation to which you linked in the original post:

<br /> \mathbf a = \underset{\mathbf a}{\operatorname{arg\,max}}<br /> \sum_{i=1}^m \left(\mathbf a^T\mathbf x_i-y_i\right)^2<br />

This means that aTx must have the same dimensionality as yi. Since yi is written in non-bold text, one can assume that each yi is a scalar. This means that each xi must be a n-vector (a row vector), as must a.

Backing up a bit, suppose you are doing an experiment where you are need to assess how a bunch of different independent variables affect the outcome of the experiment, with the outcome expressed as a scalar measurable quantity. To keep things simple, your first guess is that this scalar quantity is some unknown linear function of the independent variables. One approach to finding this linear relation is to find the solution that minimizes the sum of the squares of the residuals. That is exactly what the above equation does.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 14 ·
Replies
14
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 14 ·
Replies
14
Views
6K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K