Embedding Complex Matrices into Real Spaces

  • Thread starter Thread starter Kreizhn
  • Start date Start date
  • Tags Tags
    Complex Matrices
AI Thread Summary
Embedding complex matrices into real spaces involves mapping M_n(ℂ) to M_{2n}(ℝ) to preserve matrix multiplication and structure. Two methods are discussed: one using a direct mapping of complex entries to 2x2 real matrices, and another using block matrix forms. While both methods result in a higher-dimensional space, they maintain the necessary ring structure of complex numbers. Alternative representations, such as visualizing complex matrices as n x 2n matrices, do not preserve multiplication, which is essential for maintaining matrix properties. Ultimately, the preferred methods ensure that the matrix structure is preserved while embedding complex matrices into real spaces.
Kreizhn
Messages
714
Reaction score
1
Hey all,

I have a quick question that should hopefully be simple to answer.

Consider a the space of n \times n matrices over \mathbb C given by M_n(\mathbb C). In order to properly consider this as a real matrix, we have to embed M_n(\mathbb C) \to M_{2n}(\mathbb R), and I can give some books that cite this. In order to do this, we use one of the following methods:

Method 1
Define the map \rho: \mathbb C \to M_2(\mathbb R) by
\rho(x+iy) = \begin{pmatrix} x & -y \\ y & x \end{pmatrix}
which actually gives an identification of the complex numbers with a subspace of M_2(\mathbb R). Let Z \in M_n(\mathbb C) be given by the components Z=[Z]_{ij}. Then define \rho_n: M_n(\mathbb C) \to M_{2n}(\mathbb R) as \rho_n(Z) = [\rho(Z_{ij})]_{ij}. That is, we have just made each complex entry of Z into a 2x2 matrix.

Method 2

Similar to above, let Z = X+iY \in M_N(\mathbb C) where X,Y \in M_n(\mathbb R) and define \rho': M_n(\mathbb C) \to M_{2n}(\mathbb R) by
\rho'(Z) = \begin{pmatrix} X & -Y \\ Y & X \end{pmatrix}
where this is done in block-matrix form.

Now both of these methods are related, and the reason for the relation comes from the two different definitions of symplectic structures. However, this is not what I'm interested in. We notice that M_{2n}(\mathbb R) is a 4n^2 dimensional space and in particular we can identify it with \mathbb R^{4n^2}. What if instead of using one of these embeddings, we instead wrote Z = X + i Y and visualized it as an element of M_{n,2n}(\mathbb R) with
Z = \begin{pmatrix} X \\ Y \end{pmatrix}
which has dimension 2n^2 and so we identify it with \mathbb R^{2n^2}. Why do we prefer using one of the previous two methods if they give a higher dimensional space? Is it for some reason like "multiplication is properly preserved" or something along those lines?

Edit: Something in my head says that from a topological point of view, there's probably not a big difference. However, it would be something like "We want to preserve the ring structure of \mathbb C."
 
Mathematics news on Phys.org
As you suspect, it is to preserve multiplication. After all, you can't even multiply two n×2n matrices together, so the mapping you suggest certainly won't preserve the product of the two matrices. So to get a ring homomorphism, we have to take \mathbf{M}_{n}(\mathbb{C}) \rightarrow \mathbf{M}_{2n}(\mathbb{R})
 
Would it not be possible to "vectorize" the n \times 2n matrices and define some sort of really weird multiplication on \mathbb R^{2n^2} that would preserve the ring structure?
 
Sure you could, just take the injective linear map from \mathbf{M}_{n}(\mathbb{C}) \rightarrow \mathbb{R}^{2n^2} and transfer the multiplication from \mathbf{M}_{n}(\mathbb{C}) to \mathbb{R}^{2n^2}. However, having to define such nonstandard multiplication defeats the purpose of finding real matrix representations (which is to embed them in the real matrix ring with its usual multiplication).
 
Kreizhn said:
Hey all,
Why do we prefer using one of the previous two methods if they give a higher dimensional space? Is it for some reason like "multiplication is properly preserved" or something along those lines?

From a computational point of view, that is back to front. The advantage of transforming from real variable to complex (when it is possible) is to reduce the matrix dimensions, and also reduce the amount of computation by doing everything only once, rather than twice.

If you are going to define something that doesn't actually behave like a matrix, then it's not a matrix. (If something doesn't look like a duck or quack like a duck, then it's not a duck).

The "double sized" real formulation arises naturally in some physical problems, for example in the dynamics of an object with "cyclic symmetry" like a fan with a number of identical equally spaced blades. You can represent the motion of the complete fan as a Fourier series of the degrees of freedom of one blade. If the fan is in the XY plane with its axis along z = 0, by symmetry the equatons of motion of the complete fan are the same in X and Y directions if you replace X by Y and Y by -X.

Putting those two facts together is equivalent to the matrix transformation between n complex variables and 2n real ones.
 
So if I get what you're both saying, it's essentially that there is no theoretical issue with embedding M_n(\mathbb C) into \mathbb R^{2n^2}. However, for the purpose of keeping matrices as matrices, the only way to maintain the "matrix" structure is the transformation into M_{2n}(\mathbb R).

Edit: Keep in mind that the quotation marks around "matrix" structure is meant to be entirely colloquial. I figure someone might get pedantic and say something like "the matrix structure and the ring structure are the same" so I just wanted to avoid that.
 
Insights auto threads is broken atm, so I'm manually creating these for new Insight articles. In Dirac’s Principles of Quantum Mechanics published in 1930 he introduced a “convenient notation” he referred to as a “delta function” which he treated as a continuum analog to the discrete Kronecker delta. The Kronecker delta is simply the indexed components of the identity operator in matrix algebra Source: https://www.physicsforums.com/insights/what-exactly-is-diracs-delta-function/ by...
Suppose ,instead of the usual x,y coordinate system with an I basis vector along the x -axis and a corresponding j basis vector along the y-axis we instead have a different pair of basis vectors ,call them e and f along their respective axes. I have seen that this is an important subject in maths My question is what physical applications does such a model apply to? I am asking here because I have devoted quite a lot of time in the past to understanding convectors and the dual...
Back
Top