Confused by this result for the tensor product of two vectors

Click For Summary

Discussion Overview

The discussion revolves around the tensor product of two probability distributions represented as vectors. Participants explore different notations and definitions for the tensor product, particularly in the context of probability distributions and matrix representations. The scope includes theoretical aspects and notation conventions.

Discussion Character

  • Debate/contested
  • Technical explanation

Main Points Raised

  • One participant presents a tensor product definition from a paper, questioning the omission of a matrix representation that includes all combinations of elements from the two vectors.
  • Another participant argues that the authors of the paper have the authority to define their notation and clarifies that the tensor product maps two vectors to a single vector of dimension ##mn##.
  • A different participant expresses a preference for the matrix notation, suggesting it simplifies the tensor product to ordinary matrix multiplication, while acknowledging that writing a matrix as a column is also valid.
  • A later reply reiterates the initial question about the tensor product, providing a standard definition for the Kronecker product and discussing how different notations can lead to confusion.
  • The same reply mentions that constraining vectors to be column vectors affects the representation of the tensor product.
  • A resource is shared for further reading on Kronecker products, which may address the initial query directly.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the preferred notation for the tensor product, and multiple competing views regarding its representation and definition remain present throughout the discussion.

Contextual Notes

There are limitations regarding the definitions and notations used, which may depend on the context of the discussion. The mathematical steps and assumptions underlying the tensor product and Kronecker product are not fully resolved.

Prez Cannady
Messages
21
Reaction score
2
Given two probability distributions ##p \in R^{m}_{+}## and ##q \in R^{n}_{+}## (the "+" subscript simply indicates non-negative elements), this paper (page 4) writes down the tensor product as

$$p \otimes q := \begin{pmatrix}
p(1)q(1) \\
p(1)q(2) \\
\vdots \\
p(1)q(n) \\
\vdots \\
p(m)q(n)
\end{pmatrix}, $$

yet I expected to see

$$p \otimes q := \begin{pmatrix}
p(1)q(1) && p(1)q(2) && \cdots && p(1)q(n-1) && p(1)q(n) \\
p(2)q(1) && \ddots && && && p(2)q(n) \\
\vdots && && \ddots && && \vdots \\
p(m-1)q(1) && && && \ddots && p(m-1)q(n) \\
p(m)q(1) && p(m)q(2) && \cdots && p(m)q(n-1) && p(m)q(n)
\end{pmatrix}, $$

Am I missing something?
 
Physics news on Phys.org
Well, they define, so they call the shots. At least they explain clearly what they mean:
Above, we have introduced the notation ⊗ to denote the tensor product, which in general maps a pair of vectors with dimensions ##m, n## to a single vector with dimension ##mn##
so all the elements you expected to see are present.
 
Last edited:
  • Like
Likes   Reactions: Prez Cannady
I'd also prefer the matrix notation, because the tensor product then becomes an ordinary matrix multiplication: column times row. But who says you can't write a matrix as a column? I think it's more convenient for type setting than it is mathematically, but in the end it depends on what you want to do with it.
 
  • Like
Likes   Reactions: Prez Cannady
Prez Cannady said:
Given two probability distributions ##p \in R^{m}_{+}## and ##q \in R^{n}_{+}## (the "+" subscript simply indicates non-negative elements), this paper (page 4) writes down the tensor product as

$$p \otimes q := \begin{pmatrix}
p(1)q(1) \\
p(1)q(2) \\
\vdots \\
p(1)q(n) \\
\vdots \\
p(m)q(n)
\end{pmatrix}, $$

yet I expected to see

$$p \otimes q := \begin{pmatrix}
p(1)q(1) && p(1)q(2) && \cdots && p(1)q(n-1) && p(1)q(n) \\
p(2)q(1) && \ddots && && && p(2)q(n) \\
\vdots && && \ddots && && \vdots \\
p(m-1)q(1) && && && \ddots && p(m-1)q(n) \\
p(m)q(1) && p(m)q(2) && \cdots && p(m)q(n-1) && p(m)q(n)
\end{pmatrix}, $$

Am I missing something?

For the Kronecker product this is actually a very common definition. In fact if you use the standard definition for the Kronecker product of

##
\mathbf X \otimes \mathbf Y = \begin{bmatrix}
x_{1,1}\mathbf Y & \cdots & x_{1,n}\mathbf Y\\
\vdots & \ddots & \vdots \\
x_{m,1}\mathbf Y & \cdots & x_{m,n}\mathbf Y
\end{bmatrix}##

where ##\mathbf X## is ##\text{m x n}##
- - - -
and you then constrain ##\mathbf X## and ##\mathbf Y## to be column vectors, you really have no choice but to have

##\mathbf {xy}^* = \mathbf x \otimes \mathbf y^* ##

and
##\mathbf x^* \otimes \mathbf y = \mathbf y \mathbf x^* ##

but

##\mathbf x \otimes \mathbf y##

is a column vector.

Notation and definitions can be tweaked slightly to get very different results, which is unfortunately, confusing.

- - - -
There's a nice free 12 page sample chapter and walkthrough of Kronecker products in Laub's "Matrix Analysis for Scientists & Engineers" -- I gave an indirect link here:

https://www.physicsforums.com/threa...vector-subspace-of-r-2-2.929266/#post-5868072

(page 2 of said sample chapter addresses your question directly)
 
  • Like
Likes   Reactions: Prez Cannady

Similar threads

  • · Replies 31 ·
2
Replies
31
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 52 ·
2
Replies
52
Views
4K
  • · Replies 7 ·
Replies
7
Views
3K