Coefficients of the characteristic equation

In summary, the characteristic polynomial of a matrix T is given by C_{n} \lambda^{n} + C_{n-1} \lambda^{n-1} + ... + C_{1} \lambda + C_{0} = 0, where C_{n} = (-1)^{n}, C_{n-1} = (-1)^{n-1} Tr(T), and C_{0} = det(T). The proof for C_{n-1} and C_{0} involves the fact that the determinant of a matrix is the product of its eigenvalues and the trace of a matrix is the sum of its eigenvalues. However, this proof assumes that the matrix is diagonalizable over
  • #1
spaghetti3451
1,344
33
let's say you are given the following matrix T:

T11 T12 . . . T1n
T21 T22 . . . T2n
. . .
. . .
. . .
Tn1 Tn2 . . . Tnn

You want to find the characteristic equation for the matrix, which is
[itex]C_{n} \lambda^{n} + C_{n-1} \lambda^{n-1} + ... + C_{1} \lambda + C_{0} = 0.[/itex]

Now, [itex]C_{n} = (-1)^{n}[/itex], [itex]C_{n-1} = (-1)^{n-1} Tr(T)[/itex], and [itex]C_{0} = det(T)[/itex].

I have been having a pretty hard proving these three formulae. The first one looks intuitively obvious and I think I have a fairly good understanding of why the second one has the structure that it has. But the complexity of the matrix notation and the fact that there are so many elements in the matrix is halting my attempts to precisely determine each of the formulae from the matrix. I would appreciate any help. Thanks.
 
Last edited:
Physics news on Phys.org
  • #2
failexam said:
let's say you are given the following matrix T:

T11 T12 . . . T1n
T21 T22 . . . T2n
. . .
. . .
. . .
Tn1 Tn2 . . . Tnn

You want to find the characteristic equation for the matrix, which is
[itex]C_{n} \lambda^{n} + C_{n-1} \lambda^{n-1} + ... + C_{1} \lambda + C_{0} = 0.[/itex]

Now, [itex]C_{n} = (-1)^{n}[/itex], [itex]C_{n-1} = (-1)^{n-1} Tr(T)[/itex], and [itex]C_{0} = det(T)[/itex].

I have been having a pretty hard proving these three formulae. The first one looks intuitively obvious and I think I have a fairly good understanding of why the second one has the structure that it has. But the complexity of the matrix notation and the fact that there are so many elements in the matrix is halting my attempts to precisely determine each of the formulae from the matrix. I would appreciate any help. Thanks.


The characteristic polynomial, not equation (unless equalled to zero or some other number) is the determinant of [itex]T-xI[/itex] .

If we also know that the determinant is the sum of products of n numbers, each one of which is formed with exactly one element from each row and each column of the matrix, we get at once that one of such products in the above determinant is [itex](-x)(-x)\cdot...\cdot (-x)=(-1)^nx^n[/itex] , and there is no other product as above with n x's, so this last is the greatest degree term in the pol., and you get [itex]C_n[/itex] .

Now you try to deduce the other two.

DonAntonio
 
  • #3
Co = Det (T) is easy. The last term Co is always the product of the roots. The roots of the characteristic equation for the matrix are the eigenvalues. As the product of the eigenvalues is equal to the Determinant, we are done. Proof of the last statement:

Det (T) = Det (P'EP) = Det(P'PE) = Det (I*E) = Det (E) = e1*e2*...

where E is a diagonal matrix with the eigenvalues on the diagonal of matrix T and P'P = I.

Similarly, the second term is always the minus sum of the roots. Note, rescaling the term becomes C(n-1)/C(n) = -1 as was required.
 
Last edited:
  • #4
ajkoer said:
Co = Det (T) is easy. The last term Co is always the product of the roots. The roots of the characteristic equation for the matrix are the eigenvalues. As the product of the eigenvalues is equal to the Determinant, we are done. Proof of the last statement:

Det (T) = Det (P'EP) = Det(P'PE) = Det (I*E) = Det (E) = e1*e2*...

where E is a diagonal matrix with the eigenvalues on the diagonal of matrix T and P'P = I.

Similarly, the second term is always the minus sum of the roots. Note, rescaling the term becomes C(n-1)/C(n) = -1 as was required.
One minor difficulty with this approach is that the OP didn't specify the field over which they're working, so one might have to enlarge it so that it contains the eigenvalues of T, and then one has to prove that det and Tr once taken over this enlarged field will give the same values as over the original field.

A more serious problem, however, is that your proof seems to assume that T is diagonalizable. This issue really needs to be addressed. (Suggestion: If we're working over a field that contains the eigenvalues of T then, although T may not be diagonalizable over this field, it's still possible to find a matrix P such that ##PTP^{-1}=E## is upper triangular. This would salvage your argument.)
 
  • #5


I understand your frustration with trying to prove these formulae. However, there are some steps you can take to make the process easier.

First, let's break down the characteristic equation and understand what each term represents. The characteristic equation is a polynomial equation that allows us to find the eigenvalues of a matrix, which are the values of lambda that satisfy the equation. The degree of the polynomial is equal to the size of the matrix (n), and the coefficients represent certain properties of the matrix.

Now, let's look at each of the coefficients:

1. C_{n}: This coefficient is the leading coefficient of the polynomial and is always equal to (-1)^n. This is because the characteristic equation is derived from the determinant of the matrix, which is always multiplied by (-1)^n based on the size of the matrix.

2. C_{n-1}: This coefficient is equal to (-1)^{n-1} times the trace of the matrix (Tr(T)). The trace of a matrix is the sum of its diagonal elements, and it represents the sum of the eigenvalues. This is why the coefficient is multiplied by (-1)^{n-1}, as it takes into account the sign of the eigenvalues.

3. C_{0}: This coefficient represents the determinant of the matrix (det(T)). The determinant of a matrix is a scalar value that represents the scaling factor of the matrix. It is also equal to the product of all the eigenvalues of the matrix.

Now, in order to prove these formulae, you can use mathematical induction. Start with a 2x2 matrix and prove the formulae for n=2, and then continue to larger matrices. You can also use properties of determinants and traces to prove these formulae.

I hope this helps and good luck with your proof! If you need further assistance, don't hesitate to seek out a math tutor or consult with a colleague who may have more expertise in this area.
 

What are coefficients of the characteristic equation?

The coefficients of the characteristic equation are the numerical values that appear in front of the variables in an equation that describes the behavior of a system or process. These coefficients are used to solve for the roots or solutions of the equation, which can provide important information about the system.

Why are coefficients of the characteristic equation important?

Coefficients of the characteristic equation are important because they allow us to understand the behavior of a system or process. By solving for the roots of the equation, we can determine important characteristics such as stability, convergence, and oscillation. These coefficients also play a crucial role in mathematical models and simulations.

How do you find the coefficients of the characteristic equation?

The coefficients of the characteristic equation can be found by simplifying the given equation and identifying the numerical values in front of the variables. In some cases, the coefficients may also be given directly in the problem or equation. It is important to ensure that the equation is in the correct form before identifying the coefficients.

What do the coefficients of the characteristic equation represent?

The coefficients of the characteristic equation represent the influence of each variable on the behavior of the system. For example, a larger coefficient for a variable may indicate a stronger impact on the behavior of the system compared to a smaller coefficient. These coefficients can also help us understand the relationships between different variables in the system.

Can the coefficients of the characteristic equation be negative?

Yes, coefficients of the characteristic equation can be negative. This can indicate a negative relationship between a variable and the behavior of the system. It is important to pay attention to the signs of the coefficients when solving for the roots of the equation, as they can affect the overall behavior of the system.

Similar threads

  • Linear and Abstract Algebra
Replies
5
Views
1K
  • Linear and Abstract Algebra
Replies
1
Views
779
  • Programming and Computer Science
Replies
1
Views
2K
  • Advanced Physics Homework Help
Replies
1
Views
707
  • Linear and Abstract Algebra
Replies
1
Views
3K
  • Linear and Abstract Algebra
Replies
15
Views
2K
Replies
3
Views
2K
  • Linear and Abstract Algebra
Replies
20
Views
1K
Replies
2
Views
1K
  • Linear and Abstract Algebra
Replies
4
Views
958
Back
Top