Prove the identity matrix is unique

In summary, the conversation revolves around proving the uniqueness of the identity matrix. The original poster struggles to understand how the equality of two matrices multiplied by a third matrix implies that the first two matrices are equal. They also mention trying to use invertibility to prove this, but that seems circular. The conversation ends with a question about whether a clever choice of matrix can help with the proof.
  • #1
askmathquestions
Homework Statement
Prove the identity matrix is unique.
Relevant Equations
I_1 * A = A, I_2 * A = A
I would appreciate help walking through this. I put solid effort into it, but there are roadblocks and questions that I can't seem to get past. This is homework I've assigned myself, because these are nagging questions that bother me and that I can't figure out. I'm studying purely on my own, with no professors, using the freely accessible MIT OpenCourseWare material on linear algebra.

I'm trying to prove, and figure out how and why, the identity matrix is unique, but I can't quite see how ##AC = BC## implies ##A = B##; I don't know how you can suddenly remove the ##C## from the equation. Here's where I'm at (I don't know if there's a LaTeX editor I can use):

Let ##I_1## and ##I_2## be two ##n \times n## matrices acting on an ##n \times p## matrix ##A##, such that ##I_1 A = A## and ##I_2 A = A##. Suppose ##A## is not identically the ##0## matrix.

How do we show ##I_1 = I_2##?

We have by equality that
##I_2 I_1 A = I_2 A = A,##
and so ##I_1 A = I_2 A.##

But how do I make the leap to saying ##I_1 = I_2##? Every other attempt I have is just some combinatoric mess of matrices; there's something fundamental I'm not getting, and I don't know what.

If we made additional assumptions in the framework, we could require that ##A## be invertible, but then we'd lose the identity's uniqueness on non-invertible matrices.

This reminds me of another question that's bothering me: are column vectors, like ##x = [[x_1],[x_2],[x_3]]##, "invertible" matrices? Conceivably (assuming every ##x_i## is nonzero) we could define a row vector ##y = (1/3)[[1/x_1, 1/x_2, 1/x_3]]## so that multiplying ##yx## gives ##1##. But historically we don't refer to vectors as "matrices," we refer to them as "vectors," so it's confusing to treat a vector as a matrix. Furthermore, this ##1## produced by multiplying ##y## and ##x## is just a scalar quantity, not a matrix, so I don't know whether to call ##y## the "left-inverse" of ##x##. I'm confused by how the dimensions of each component keep changing.
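As a numerical sanity check of the dimension bookkeeping in the question above, here is a sketch in plain Python (the `matmul` helper and the sample values are my own, not from the thread):

```python
# Hypothetical sketch of the row-vector-times-column-vector question above,
# using plain Python lists-of-rows as matrices. Sample values are mine.

def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    assert len(A[0]) == len(B), "inner dimensions must agree"
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))]
            for i in range(len(A))]

x = [[2.0], [4.0], [8.0]]  # a 3x1 column vector, all entries nonzero
y = [[1 / (3 * 2.0), 1 / (3 * 4.0), 1 / (3 * 8.0)]]  # (1/3)[1/x_1, 1/x_2, 1/x_3], a 1x3 row vector

yx = matmul(y, x)  # 1x1 matrix holding 1 (up to float rounding) -- a scalar, not an n x n identity
xy = matmul(x, y)  # 3x3 matrix of rank 1 -- not the identity matrix
```

So ##y## behaves like a left inverse only in the ##1 \times 1## sense; multiplying in the other order gives a rank-one ##3 \times 3## matrix, which is why the dimensions seem to "keep changing."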
 
  • #2
What happens if there are two matrices ##I_1## and ##I_2##, both having the properties of the identity matrix?

Can you show that ##I_1 = I_2##?

Is that sufficient to show that the identity matrix is unique?
 
  • #3
That's my general outline. If I suppose ##I_1 A = I_2 A##, then I feel like I should somehow be able to derive ##I_1 = I_2##, but I don't know how to get rid of this ##A## without making all these other assumptions about invertibility and how ##AA^{-1}## equals the identity. It's circular logic then, because how do I know "which" identity ##AA^{-1}## is equal to? I don't know how to show that such a statement is true independent of the choice of ##A##.
 
  • #4
Perhaps you need a clever choice of ##A##?
 
  • Like
Likes WWGD
  • #5
PeroK said:
Perhaps you need a clever choice of ##A##?
But ##A## is supposed to be any matrix of the correct dimensions. Picking and limiting yourself to only a specific ##A## seems like it would defeat the purpose of the proof.
 
  • Skeptical
Likes PeroK
  • #6
askmathquestions said:
Homework Statement:: Prove the identity matrix is unique.
Relevant Equations:: I_1 * A = A, I_2 * A = A

I would appreciate help walking through this. I put solid effort into it, but there are roadblocks and questions that I can't seem to get past. This is homework I've assigned myself, because these are nagging questions that bother me and that I can't figure out. I'm studying purely on my own, with no professors, using the freely accessible MIT OpenCourseWare material on linear algebra.

I'm trying to prove, and figure out how and why, the identity matrix is unique, but I can't quite see how ##AC = BC## implies ##A = B##; I don't know how you can suddenly remove the ##C## from the equation. Here's where I'm at (I don't know if there's a LaTeX editor I can use):

Let ##I_1## and ##I_2## be two ##n \times n## matrices acting on an ##n \times p## matrix ##A##, such that ##I_1 A = A## and ##I_2 A = A##.

How do we show ##I_1 = I_2##?

We have by equality that
##I_2 I_1 A = I_2 A = A,##
and so ##I_1 A = I_2 A.##

But how do I make the leap to saying ##I_1 = I_2##? Every other attempt I have is just some combinatoric mess of matrices; there's something fundamental I'm not getting, and I don't know what.

If we made additional assumptions in the framework, we could require that ##A## be invertible, but then we'd lose the identity's uniqueness on non-invertible matrices.

This reminds me of another question that's bothering me: are column vectors, like ##x = [[x_1],[x_2],[x_3]]##, "invertible" matrices? Conceivably (assuming every ##x_i## is nonzero) we could define a row vector ##y = (1/3)[[1/x_1, 1/x_2, 1/x_3]]## so that multiplying ##yx## gives ##1##. But historically we don't refer to vectors as "matrices," we refer to them as "vectors," so it's confusing to treat a vector as a matrix. Furthermore, this ##1## produced by multiplying ##y## and ##x## is just a scalar quantity, not a matrix, so I don't know whether to call ##y## the "left-inverse" of ##x##. I'm confused by how the dimensions of each component keep changing.
It is crucial to know where ##A## is from. E.g. if it is invertible, we could simply multiply with ##A^{-1}## from the right. If ##A=0## then there is more than one identity matrix. Sure, these are extreme examples, but they show that ##A\in ?## is crucial. And there are domains where left identity and right identity are different.
 
  • Skeptical
Likes PeroK
  • #7
I can see how if ##A = 0## then we don't have uniqueness. I just don't know about assuming invertibility: you'd have to assume an inverse exists, but this seems like circular logic, because the definition of an inverse is a matrix multiplication which returns the identity, the thing I'm trying to prove. So without uniqueness, how do you know "which" identity ##AA^{-1}## returns?
 
  • #8
askmathquestions said:
But ##A## is supposed to be any matrix of the correct dimensions. Picking and limiting yourself to only a specific ##A## seems like it would defeat the purpose of the proof.
There's no answer to that except to say that your thinking is illogical.
 
  • #9
fresh_42 said:
It is crucial to know where ##A## is from. E.g. if it is invertible, we could simply multiply with ##A^{-1}## from the right. If ##A=0## then there is more than one identity matrix. Sure, these are extreme examples, but they show that ##A\in ?## is crucial. And there are domains where left identity and right identity are different.
I don't follow what you are trying to say here.
 
  • Skeptical
Likes fresh_42
  • #10
PeroK said:
There's no answer to that except to say that your thinking is illogical.
Just because ##AX = B## doesn't mean that, if we pick a different matrix from ##X##, say ##Y##, then ##AY## is still equal to ##B## in the general case.

PeroK said:
I don't follow what you are trying to say here.
They are pointing out a point of contention in the assumptions of the problem. I didn't grab this problem out of a book; that's why it's ill-posed, and I need to figure out what minimal additional assumptions are needed to prove uniqueness, which I obviously didn't know when writing the problem. I amended my problem to require that ##A## not be identically equal to the ##0## matrix.
 
  • #11
askmathquestions said:
I can see how if ##A = 0## then we don't have uniqueness. I just don't know about assuming invertibility: you'd have to assume an inverse exists, but this seems like circular logic, because the definition of an inverse is a matrix multiplication which returns the identity, the thing I'm trying to prove. So without uniqueness, how do you know "which" identity ##AA^{-1}## returns?
No. If we have a multiplicative group, i.e. an associative [##A(BC)=(AB)C##] structure in which all elements have inverses and there is an identity element, then we are allowed to multiply from one side with an inverse element, and
\begin{align*}
I_1\cdot A =I_2\cdot A &\Longrightarrow (I_1\cdot A)\cdot A^{-1} =(I_2\cdot A)\cdot A^{-1}\\
&\Longrightarrow I_1\cdot (A\cdot A^{-1}) =I_2\cdot (A\cdot A^{-1})\\
&\Longrightarrow I_1\cdot I_1= I_1=I_2=I_2\cdot I_2
\end{align*}
This would be necessary if we only have a minimal number of axioms for a group. Then we would have demanded only the existence of an identity element, not its uniqueness or - even more important - that left and right identity are the same.

If we have no group, then it is still important where ##A## is from. I assume we have ##A\in \mathbb{M}_{(n,n)}(\mathbb{R}),## that is all possible real square matrices of a given finite dimension. In this case, you can look for appropriate matrices ##A## and test what you get. Hint: use matrices with a lot of zero entries. However, this would have been a guess on my part. There are domains where it is not automatically the case that there is only one identity. That's why I asked.
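The hint above can be made concrete with a small sketch in plain Python (the helper names `matmul`, `unit`, and `is_left_identity` are my own, not from the thread): probe a candidate left identity against matrices with a single ##1## and zeros elsewhere.

```python
# A sketch of the hint above: test a candidate left identity X against
# matrices with a lot of zero entries (here, a single 1 each).

def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))]
            for i in range(len(A))]

def unit(n, m, p, q):
    """The n x m matrix with a 1 at position (p, q) and 0 elsewhere."""
    return [[1 if (i, j) == (p, q) else 0 for j in range(m)] for i in range(n)]

def is_left_identity(X, n, m):
    """Check X * A = A on every unit matrix A; by linearity this covers all A."""
    return all(matmul(X, unit(n, m, p, q)) == unit(n, m, p, q)
               for p in range(n) for q in range(m))

n, m = 3, 2
I = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
X = [[1, 1, 0], [0, 1, 0], [0, 0, 1]]  # differs from I in one entry

# Only the genuine identity survives the probe:
# is_left_identity(I, n, m) is True; is_left_identity(X, n, m) is False.
```

The probe with ##X## fails on the unit matrix whose ##1## sits in row 1: multiplying moves that ##1## up into row 0 as well, exposing the extra entry of ##X##.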
 
  • #12
If ##I## works out as ##AI = A##, then it should do so for all matrices. If it's not unique for some matrices, it's just not unique overall, which is the end goal.
So, what if ##A## is ##n \times n## and invertible? Then ##AI_1 = AI_2##.
Can you prove uniqueness now?
 
  • #13
askmathquestions said:
Just because ##AX = B## doesn't mean that, if we pick a different matrix from ##X##, say ##Y##, then ##AY## is still equal to ##B## in the general case.

They are pointing out a point of contention in the assumptions of the problem. I didn't grab this problem out of a book; that's why it's ill-posed, and I need to figure out what minimal additional assumptions are needed to prove uniqueness, which I obviously didn't know when writing the problem. I amended my problem to require that ##A## not be identically equal to the ##0## matrix.
No additional assumptions are required. The identity element must be unique.

Considering the case where we have only the zero matrix and hence no identity is unnecessarily muddying the waters.
 
  • #14
WWGD said:
If ##I## works out as ##AI = A##, then it should do so for all matrices. If it's not unique for some matrices, it's just not unique overall, which is the end goal.
So, what if ##A## is ##n \times n## and invertible? Then ##AI_1 = AI_2##.
Can you prove uniqueness now?
Well, in that case is ##AA^{-1} = I_1## or ##I_2##?
 
  • #15
PeroK said:
Well, in that case is ##AA^{-1} = I_1## or ##I_2##?
My idea is to see what happens with ##A(I_1 - I_2) = 0## when we know ##A## is invertible. Here ##0## is the zero matrix. Then show ##I_1 - I_2## must be identically zero. We can do it without fancy results on the rank of products.
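A minimal ##2 \times 2## sketch of this idea in plain Python (assuming ##A## is invertible; the helpers `matmul` and `inv2` are my own, not from the thread):

```python
# From A * D = 0 with A invertible, left-multiplying by A^{-1} forces D = 0,
# where D plays the role of I_1 - I_2.

def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))]
            for i in range(len(A))]

def inv2(M):
    """Inverse of a 2x2 matrix via the adjugate formula."""
    (a, b), (c, d) = M
    det = a * d - b * c
    assert det != 0, "M must be invertible"
    return [[d / det, -b / det], [-c / det, a / det]]

A = [[2.0, 1.0], [1.0, 1.0]]   # det = 1, so invertible
Z = [[0.0, 0.0], [0.0, 0.0]]   # the zero matrix, standing in for A * (I_1 - I_2)

D = matmul(inv2(A), Z)         # D = A^{-1} * 0 = 0, so I_1 - I_2 would be 0
```

The catch the thread keeps circling back to: this route presupposes an inverse, and hence an identity, already exists.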
 
  • Skeptical
Likes PeroK
  • #16
fresh_42 said:
No. If we have a multiplicative group, i.e. an associative [##A(BC)=(AB)C##] structure in which all elements have inverses and there is an identity element, then we are allowed to multiply from one side with an inverse element, and
\begin{align*}
I_1\cdot A =I_2\cdot A &\Longrightarrow (I_1\cdot A)\cdot A^{-1} =(I_2\cdot A)\cdot A^{-1}\\
&\Longrightarrow I_1\cdot (A\cdot A^{-1}) =I_2\cdot (A\cdot A^{-1})\\
&\Longrightarrow I_1\cdot I_1= I_1=I_2=I_2\cdot I_2
\end{align*}
This would be necessary if we only have a minimal number of axioms for a group. Then we would have demanded only the existence of an identity element, not its uniqueness or - even more important - that left and right identity are the same.

If we have no group, then it is still important where ##A## is from. I assume we have ##A\in \mathbb{M}_{(n,n)}(\mathbb{R}),## that is all possible real square matrices of a given finite dimension. In this case, you can look for appropriate matrices ##A## and test what you get. Hint: use matrices with a lot of zero entries. However, this would have been a guess on my part. There are domains where it is not automatically the case that there is only one identity. That's why I asked.
Thanks for your reply. I'm interested in ##A## being a vector OR a matrix, so I'd like the proof to be broad enough to cover both cases. Are vectors invertible matrices if they are not the ##0## vector?
 
  • #17
askmathquestions said:
Thanks for your reply. I'm interested in ##A## being a vector OR a matrix, so I'd like the proof to be broad enough to cover both cases. Are vectors invertible matrices if they are not the ##0## vector?
A vector is a ##1 \times p## or ##p \times 1## matrix. As such it's not invertible on both sides, as only square matrices, i.e. ##n \times n## ones, can be.
 
  • Like
Likes malawi_glenn
  • #18
askmathquestions said:
Are vectors invertible matrices if they are not the 0 vector?
No.
 
  • #19
askmathquestions said:
Thanks for your reply. I'm interested in ##A## being a vector OR a matrix, so I'd like the proof to be broad enough to cover both cases. Are vectors invertible matrices if they are not the ##0## vector?
No. If you multiply two vectors (by the usual method), you get either a number (row times column) or a matrix (column times row) that isn't invertible. If you multiply differently, say column times column componentwise, then you can't get an inverse as soon as one component is zero, even though the vector need not be completely zero.
 
  • #20
WWGD said:
My idea is to see what happens with ##A(I_1 - I_2) = 0## when we know ##A## is invertible. Here ##0## is the zero matrix. Then show ##I_1 - I_2## must be identically zero. We can do it without fancy results on the rank of products.
Since ##A(I_1 - I_2) = 0##, it follows that each of the columns of ##I_1 - I_2## is in the nullspace of ##A##, and, since ##A## is invertible, it has trivial kernel, so each column of ##I_1 - I_2## must be the ##0## vector.
 
  • #21
WWGD said:
A vector is an 1xp or px1 matrix. As such it's not invertible on both sides, as only square, e.g. nxn matrices can be.
Well, that's a problem, because I'm trying to prove uniqueness for the case where ##A## is a vector.

What about my example? ##x = [[x_1],[x_2],[x_3]]##, a column vector, and a row vector ##y = (1/3)[[1/x_1, 1/x_2, 1/x_3]]##.

If you multiply ##yx## you get ##1##, the "scalar" identity matrix, as it were.

Maybe we need some additional specificity, like: ##A## is an ##n \times p## matrix (which includes vectors, ##n \times 1##) with entries drawn from the complex numbers, all of them nonzero. A vector would then be in the space ##\mathbb{C}^{n \times 1}##.

P.S. Does this website process LaTeX?
 
  • #22
PeroK said:
I don't follow what you are trying to say here.
I pointed to the lack of clarity in the question. The answer depends on the ring, group, or set. You simply assumed things and claimed to know better; I asked instead.

Have a look at:
askmathquestions said:
Well, that's a problem, because I'm trying to prove uniqueness for the case where ##A## is a vector.
 
  • #23
askmathquestions said:
Well, that's a problem, because I'm trying to prove uniqueness for the case where ##A## is a vector.
In that case: how is the multiplication defined? Are ##I_1,I_2## matrices?
 
  • #24
fresh_42 said:
In that case: how is the multiplication defined? Are ##I_1,I_2## matrices?
Maybe we're talking past each other. ##I_1## and ##I_2## are ##n \times n## matrices, acting on an ##n \times p## matrix ##A##. We have the usual definition of matrix multiplication.
 
  • Like
Likes fresh_42
  • #25
Then the path you should follow is to solve ##X \cdot A = A##, ##X = (x_{ij})##, for several ...
PeroK said:
Perhaps you need a clever choice of ##A##?
... and consider ...
fresh_42 said:
Hint: use matrices with a lot of zero entries.

Edit: From ##I_1A=I_2A=A## we get ##(I_1-I_2)\cdot A=0.## Thus solving for ##X=I_1-I_2## might be easier. A matter of taste, finally.
 
  • #26
fresh_42 said:
Then the path you should follow is to solve ##X \cdot A = A##, ##X = (x_{ij})##, for several ...

... and consider ... Edit: From ##I_1A=I_2A=A## we get ##(I_1-I_2)\cdot A=0.## Thus solving for ##X=I_1-I_2## might be easier. A matter of taste, finally.
That depends: if we assume ##A## is non-zero, how do we know that implies ##I_1 - I_2 = 0##?
 
  • #27
In the Relevant Equations, can you also say that ##AI_1=A## and ##AI_2=A## for all matrices ##A##?
If so, what can you say about ##I_1I_2##?
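For what it's worth, this hint already contains the standard proof; a sketch, assuming ##I_1## acts as a left identity and ##I_2## as a right identity on all ##n \times n## matrices (in particular on each other):

$$I_1 \;=\; I_1 I_2 \;=\; I_2,$$

where the first equality holds because ##I_2## is a right identity and the second because ##I_1## is a left identity. No inverses, and no special choice of ##A##, are needed.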
 
  • #28
askmathquestions said:
This depends, if we assume ##A## is non-zero, how do we know we can imply that ##(I_1 - I_2) = 0?##
You need to solve ##X\cdot A=0.## These are ##n^2## variables and ##n\cdot p## linear equations. It is immediately clear that
  1. there is possibly more than one solution, i.e. ##X## isn't unique if ##n<p.##
  2. there is possibly no solution, i.e. ##I_1## and ##I_2## do not exist if ##n>p.##
  3. there is only a unique solution guaranteed if ##n=p.##
The ##n\cdot p## many linear equations have variables ##x_{ij}## and parameters ##a_{ij}.##
I still do not know where ##A## is allowed to be from, but "identity" means that the equations have to hold for all entries from whatever that domain is. I assume we can plug in any real number. In that case, plug in ##a_{11}=1## and ##a_{ij}=0## elsewhere, then proceed with ##a_{12}=1## and ##a_{ij}=0## elsewhere, etc. This gives you tons of equations that all have to be true.
 
  • #29
FactChecker said:
In the Relevant Equations, can you also say that ##AI_1=A## and ##AI_2=A## for all matrices ##A##?
If so, what can you say about ##I_1I_2##?
Possibly that the matrices commute. I thought about that, but I'm not sure how it helps.
 
  • #30
askmathquestions said:
Possibly that the matrices commute. I thought about that, but I'm not sure how it helps.
It would help if you would multiply matrices!
\begin{align*}
I_1\cdot A= \begin{bmatrix}
x_{11}&x_{12}&\ldots&x_{1n}\\ \vdots &\vdots &\ldots&\vdots \\x_{n1}&x_{n2}&\ldots&x_{nn}
\end{bmatrix}\cdot
\begin{bmatrix}
a_{11}&a_{12}&\ldots&a_{1p}\\ \vdots &\vdots &\ldots&\vdots \\a_{n1}&a_{n2}&\ldots&a_{np}
\end{bmatrix} =
\begin{bmatrix}
a_{11}&a_{12}&\ldots&a_{1p}\\ \vdots &\vdots &\ldots&\vdots \\a_{n1}&a_{n2}&\ldots&a_{np}
\end{bmatrix}=A
\end{align*}

I like to consider the matrices
$$E_{ij}:= \begin{bmatrix}
0&0&\ldots&0\\
\vdots &\vdots &\ldots&\vdots \\
0&0&\ldots 1_{ij}\ldots &0\\
\vdots &\vdots &\ldots&\vdots \\
0&0&\ldots&0
\end{bmatrix}
$$
with a ##1## at position ##(i,j)## and ##0## elsewhere. Then
$$
E_{pq}\cdot E_{rs} =\begin{cases} 0&\text{ if }q\neq r\\ E_{ps} &\text{ if }q= r\end{cases}
$$
Now, set ##I_1-I_2=X## as an arbitrary matrix ##X=(x_{ij})=\sum_{i=1}^n\sum_{j=1}^nx_{ij}E_{ij}## and for ##A## all ##E_{pq}## within your given sizes.
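The multiplication rule for the ##E_{ij}## can be brute-force checked for small ##n## with a short plain-Python sketch (the helpers `matmul` and `E` are my own, not from the thread):

```python
# Verify E_pq * E_rs = 0 if q != r, and E_ps if q == r, for n = 3.

def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))]
            for i in range(len(A))]

def E(n, i, j):
    """The n x n matrix with a 1 at (i, j) and 0 elsewhere."""
    return [[1 if (r, c) == (i, j) else 0 for c in range(n)] for r in range(n)]

n = 3
zero = [[0] * n for _ in range(n)]
for p in range(n):
    for q in range(n):
        for r in range(n):
            for s in range(n):
                expected = E(n, p, s) if q == r else zero
                assert matmul(E(n, p, q), E(n, r, s)) == expected
```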
 
  • #31
askmathquestions said:
Possibly that the matrices commute, I thought about that but I'm not sure how it helps.
Which matrix/matrices does ##I_1I_2## equal?
 
  • #32
Hi @askmathquestions. It may be worth noting the following (if not already clear).

Suppose ##A## is any ##m \times n## matrix where ##m \ne n##.

There are two (each unique) different identity matrices, ##I_{m \times m}## and ##I_{n \times n}## such that
##I_{m \times m} A = A## and
##A I_{n \times n} = A##

Of course if ##m=n##, then only a single identity matrix exists.

Proof of the uniqueness should not use inverses, for various reasons. Not all matrices are invertible, but all matrices can be (left or right) multiplied by the appropriate identity matrix. More fundamentally, inverses are defined in terms of a unique identity matrix, so a proof of identity uniqueness using inverses is a circular argument.
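The rectangular case above can be checked with a quick plain-Python sketch (the helpers `matmul` and `eye` and the sample matrix are my own):

```python
# A 2x3 matrix A has a 2x2 left identity and a 3x3 right identity.

def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))]
            for i in range(len(A))]

def eye(n):
    """The n x n identity matrix."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

A = [[1, 2, 3], [4, 5, 6]]    # 2x3, m != n

left = matmul(eye(2), A)      # I_{2x2} * A, equals A
right = matmul(A, eye(3))     # A * I_{3x3}, also equals A
```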
 
  • #33
Steve4Physics said:
Hi @askmathquestions. It may be worth noting the following (if not already clear).

Suppose ##A## is any ##m \times n## matrix where ##m \ne n##.

There are two (each unique) different identity matrices, ##I_{m \times m}## and ##I_{n \times n}## such that
##I_{m \times m} A = A## and
##A I_{n \times n} = A##

Of course if ##m=n##, then only a single identity matrix exists.

Proof of the uniqueness should not use inverses, for various reasons. Not all matrices are invertible, but all matrices can be (left or right) multiplied by the appropriate identity matrix. More fundamentally, inverses are defined in terms of a unique identity matrix, so a proof of identity uniqueness using inverses is a circular argument.
Thanks for pointing that out; so there are two different, each unique, identities: one for left multiplication and one for right multiplication.

I guess I'm interested in left multiplication: that, for a given vector (or square matrix) ##x##, we have ##Ax## for some appropriately sized square matrix ##A##.
I'm not sure, though, why you required ##m \neq n##; isn't the identity unique when acting on square matrices too?
 
  • #34
askmathquestions said:
Thanks for pointing that out; so there are two different, each unique, identities: one for left multiplication and one for right multiplication.
That may happen, but usually doesn't. I guess if you ask for an example in a thread, it will take a while before someone has an answer, if at all, except for the case of unequal dimensions.
 
  • #35
fresh_42 said:
You need to solve ##X\cdot A=0.## These are ##n^2## variables and ##n\cdot p## linear equations. It is immediately clear that
  1. there is possibly more than one solution, i.e. ##X## isn't unique if ##n<p.##
  2. there is possibly no solution, i.e. ##I_1## and ##I_2## do not exist if ##n>p.##
  3. there is only a unique solution guaranteed if ##n=p.##
The ##n\cdot p## many linear equations have variables ##x_{ij}## and parameters ##a_{ij}.##
I still do not know where ##A## is allowed to be from, but "identity" means that the equations have to hold for all entries from whatever that domain is. I assume we can plug in any real number. In that case, plug in ##a_{11}=1## and ##a_{ij}=0## elsewhere, then proceed with ##a_{12}=1## and ##a_{ij}=0## elsewhere, etc. This gives you tons of equations that all have to be true.
Well, I'm interested in an argument that works for square matrices acting on vectors, as well as square matrices acting on other square matrices. I'm interested in both because both vectors and square matrices have many conventional properties and applications, which I generalized by saying ##A## is an ##n \times p## matrix in the original problem. So if you require ##n = p##, does that mean your domain of identities can no longer include vectors? I.e., does there not exist a unique identity for vectors?
 
