Proof of trace of density matrix in pure/mixed states

Click For Summary

Discussion Overview

The discussion revolves around proving the inequality tr(ρ²) ≤ 1 for density matrices, applicable to both pure and mixed states in quantum mechanics. Participants explore various mathematical approaches and properties of density matrices, including their diagonalization and the implications of the Cauchy-Schwarz inequality.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant seeks help to prove that tr(ρ²) ≤ 1, starting with the expression for ρ in terms of its eigenstates.
  • Another participant notes that density matrices can be diagonalized and that their trace equals one, suggesting basic algebra to show that the sum of the squares of the probabilities is less than or equal to one.
  • Some participants express confusion about the implications of diagonalization and the relationship between the eigenvalues and the trace of ρ².
  • Concerns are raised about the assumption that the states |ψ_i⟩ are orthogonal, with a suggestion to start from the definition of the trace using an arbitrary orthonormal basis.
  • There is a discussion about using the Cauchy-Schwarz inequality to complete the proof, with varying levels of clarity among participants regarding its application.
  • A participant expresses confusion about the meaning of probabilities p_i in the context of pure and mixed states, seeking clarification on their interpretation.
  • Another participant clarifies that p_i represents the probability of the ith system being in state |ψ_i⟩ and emphasizes the importance of the sum of probabilities equaling one.
  • Participants engage in a back-and-forth regarding the mathematical steps involved in the proof, with some expressing uncertainty about their calculations and others providing corrections or alternative approaches.

Areas of Agreement / Disagreement

Participants generally agree on the goal of proving tr(ρ²) ≤ 1, but there are multiple competing views on the methods to achieve this, particularly regarding the assumptions about orthogonality and the application of mathematical inequalities. The discussion remains unresolved with respect to the most straightforward proof method.

Contextual Notes

Some participants highlight limitations in their understanding of the mathematical steps involved, particularly in relation to the properties of density matrices and the implications of the Cauchy-Schwarz inequality. There is also a noted dependence on the definitions of the states and probabilities involved.

barnflakes
Messages
156
Reaction score
4
Can someone help me prove that [itex]tr(\rho^2) \leq 1[/itex] ?

Using that [tex]\rho = \sum_i p_i | \psi_i \rangle \langle \psi_i |[/tex]

[tex]\rho^2 = \sum_i p_i^2 | \psi_i \rangle \langle \psi_i |[/tex]

[tex]tr(\rho^2) = \sum_{i, j} p_i^2 \langle j | \psi_i \rangle \langle \psi_i | j \rangle[/tex]

Where do I go from here? Thanks guys.
 
Physics news on Phys.org
Density matrices may be diagonalised, and their trace is one:

[tex]Tr(\rho) = \sum_i p_i = 1[/tex]

Then you need a bit of basic algebra:

[tex]\sum_i p_i^2 \le \left(\sum_i p_i\right)^2 = 1[/tex]
 
Can you be a bit more explicit? So they can be diagonalised, ie. [tex]\rho = \sum_i \lambda_i p_i| i \rangle \langle i |[/tex]

[tex]\rho^2 = \sum_i \lambda_i^2 p_i^2 | i \rangle \langle i |[/tex]
[tex]tr(\rho^2) = \sum_j \lambda_j^2 p_j^2 \leq (\sum_j \lambda_j p_j)^2[/tex]

?? does that final term = 1?
 
barnflakes said:
[tex]\rho^2 = \sum_i p_i^2 | \psi_i \rangle \langle \psi_i |[/tex]
This isn't always true, since the [itex]|\psi_i\rangle[/itex] don't have to be orthogonal.

Start with [tex]\operatorname{Tr}\rho^2=\sum_n\langle n|\rho^2|n\rangle[/tex], where the [itex]|n\rangle[/itex] are members of an arbitrary orthonormal basis. Use the correct expression for [tex]\rho^2[/tex]. Then rearrange some stuff and recognize the identity operator in what you've got. Then you're almost done, but you'll need the Cauchy-Schwarz inequality to finish it.
 
In the diagonal basis, the eigenvalues of [tex]\rho^2[/tex] are just [tex]p_i^2[/tex], where [tex]p_i[/tex] are the eigenvalues of [tex]\rho[/tex].
 
Fredrik said:
This isn't always true, since the [itex]|\psi_i\rangle[/itex] don't have to be orthogonal.

Start with [tex]\operatorname{Tr}\rho^2=\sum_n\langle n|\rho^2|n\rangle[/tex], where the [itex]|n\rangle[/itex] are members of an arbitrary orthonormal basis. Use the correct expression for [tex]\rho^2[/tex]. Then rearrange some stuff and recognize the identity operator in what you've got. Then you're almost done, but you'll need the Cauchy-Schwarz inequality to finish it.

OK so I get to [tex]\sum_{i,j} p_i p_j \langle \psi_j | \psi_i \rangle \langle \psi_i | \psi_j \rangle[/tex]

and now I need to use cauchy schwartz you say?

[tex]\sum_{i,j} p_i p_j \langle \psi_j | \psi_i \rangle \langle \psi_i | \psi_j \rangle = \sum_{i,j} p_i p_j \frac{\langle \psi_j | \psi_i \rangle \langle \psi_i | \psi_j \rangle \langle \psi_i | \psi_i \rangle}{\langle \psi_i | \psi_i \rangle} \leq \sum_{i,j,n} p_i p_j \langle \psi_j | n \rangle \langle n | \psi_j \rangle \langle \psi_i | \psi_i \rangle = \sum_{i,j} p_i p_j \langle \psi_j | \psi_j \rangle \langle \psi_i | \psi_i \rangle[/tex]

Is that correct? Where do I go from here?
 
You're making it more complicated than it needs to be. I'm not even sure what you're doing, but you're getting the right result. Now you just need to use that the states are normalized. :smile:

This is the easy way to get the result you've got already:

[tex]\langle \psi_j | \psi_i \rangle \langle \psi_i | \psi_j \rangle=|\langle\psi_i|\psi_j\rangle|^2\leq \big\||\psi_i\rangle\big\|^2\big\||\psi_j\rangle\big\|^2=1[/tex]

You will of course also have to use what you know about the pi.
I posted a statement and proof of the Cauchy-Schwarz inequality in the Science Advisor forum some time ago. You probably don't need it, but since it's a related topic, and since I have only posted it in a restricted forum before, I'm reposting it here.Theorem:

If x and y are vectors in an inner product space X over [tex]\mathbb C[/itex], then<br /> <br /> <div style="margin-left: 20px">[tex]|\langle x,y\rangle| \leq \|x\|\|y\|[/tex]​</div><br /> where the norm is the standard norm on an inner product space.<br /> <br /> <b>Proof:</b><br /> <br /> Let t be an arbitrary complex number.<br /> <br /> <div style="margin-left: 20px">[tex]0 \leq \langle x+ty,x+ty\rangle=\|x\|^2+t\langle x,y\rangle+t^*\langle y,x\rangle+|t|^2\|y\|^2[/tex]​</div><br /> <div style="margin-left: 20px">[tex]=\|x\|^2+2\operatorname{Re}(t\langle x,y\rangle)+|t|^2\|y\|^2[/tex]​</div><br /> The inequality is obviously satisfied when the real part of t<x,y> is non-negative, so we can only learn something interesting when it's negative. Let's choose Arg t so that it is.<br /> <br /> <div style="margin-left: 20px">[tex]=\|x\|^2-2|t||\langle x,y\rangle|+|t|^2\|y\|^2[/tex]​</div><br /> Now let's choose |t| so that it minimizes the sum of the last two terms. (This should give us the most interesting result).<br /> <br /> <div style="margin-left: 20px">[tex]s=|t|,\ A=\|y\|^2,\ B=2|\langle x,y\rangle|[/tex]​</div><br /> <div style="margin-left: 20px">[tex]f(s)=As^2-Bs[/tex]​</div><br /> <div style="margin-left: 20px">[tex]f'(s)=2As-B=0\ \Rightarrow\ s=\frac{B}{2A} = \frac{|\langle x,y\rangle|}{\|y\|^2}[/tex]​</div><br /> <div style="margin-left: 20px">[tex]f''(s)=2A>0[/tex]​</div><br /> Continuing with this value of |t|...<br /> <br /> <div style="margin-left: 20px">[tex]=\|x\|^2-2\frac{|\langle x,y\rangle|}{\|y\|^2}|\langle x,y\rangle|+\frac{|\langle x,y\rangle|^2}{\|y\|^4}\|y\|^2[/tex]​</div><br /> <div style="margin-left: 20px">[tex]=\|x\|^2-\frac{|\langle x,y\rangle|^2}{\|y\|^2}[/tex]​</div>[/tex]
 
Thank you for the response Fredrik, you'll have to excuse me, I'm really rather new to quantum mechanics/information, so when you say "use what I know with regards to p_i and p_j" I have to confess my ignorance as to what I know. As far as I'm aware, it's the probability that the N_th quantum system of an N dimensional system is in the state [tex]\psi_i[/tex] So if the system is in a pure state we know exactly the state of the system. I find this confusing. Does it mean we know the state of the system overall, or the state of each individual qubit/quantum system?
 
It's the probability that the ith system has been prepared in state [itex]|\psi_i\rangle[/itex]. And you know that the sum of the probabilities is 1. That's what I meant you should use.

The density operator [itex]\rho=\sum_i p_i|\psi_i\rangle\langle\psi_i|[/itex] is a mathematical tool that we can use to calculate the expected average result when we perform a measurement of some observable A on every member of a large ensemble of identical systems, with a fraction [itex]p_i[/itex] of the systems in state [itex]|\psi_i\rangle[/itex]. (If the number of systems is small, we're going to have to repeat the procedure many times to get an accurate average. That's why I said that we're calculating the "expected" average). It doesn't matter if the members of the ensemble are different systems that all exist at the same time at different locations, or if they are states of a single system at a single location at different times, or if they are different possible states of a single system at a single time.

A pure state has [itex]p_i=\delta_{ij}[/itex] for some j. What that means is that every member of the ensemble has been prepared in state [itex]|\psi_i\rangle[/itex]. When you know that, it doesn't matter if the ensemble consists of a single system or 10^50 systems. What the information [itex]p_i=\delta_{ij}[/itex] is telling you is just what the "expected average" result of a measurement will be. ("Expected average" isn't a real term as far as I know. I just thought it seemed appropriate).
 
  • #10
Thank you Fredrik, I understand it much more now, one last thing about the proof. You say:

[tex]\operatorname{Tr}\rho^2=\sum_n\langle n|\rho^2|n\rangle[/tex]

So I have [tex]\sum_{i,j,n} p_i p_j \langle n|\langle \psi_j | \psi_i \rangle \langle \psi_i | \psi_j \rangle |n \rangle[/tex]

But since the expression [tex]\rho^2[/tex] is just a number then adding those orthonormal basis is making no difference? In other words, I can rearrange the above as follows:

[tex]\sum_{i,j,n} p_i p_j \langle \psi_i | \psi_j \rangle |n \rangle \langle n|\langle \psi_j | \psi_i \rangle =\sum_{i,j} p_i p_j \langle \psi_i | \psi_j \rangle \langle \psi_j | \psi_i \rangle =[/tex] and then use the caughy schwarz as above?
 
  • #11
barnflakes said:
You say:

[tex]\operatorname{Tr}\rho^2=\sum_n\langle n|\rho^2|n\rangle[/tex]
Yes, that's the definition of the trace. It's actually independent of the basis we use. (Proving that would be a good warm-up exercise).

barnflakes said:
So I have [tex]\sum_{i,j,n} p_i p_j \langle n|\langle \psi_j | \psi_i \rangle \langle \psi_i | \psi_j \rangle |n \rangle[/tex]
That's not what we get from [itex]\rho = \sum_i p_i | \psi_i \rangle \langle \psi_i |[/itex] and the definition of the trace.

barnflakes said:
But since the expression [tex]\rho^2[/tex] is just a number
It isn't (but I see that it has magically turned into one in what you wrote above :smile:).
 
  • #12
Haha oops, I see what I've done, sorry I forgot to check over my working since last time:

[tex]\sum_{i,j,n} p_i p_j \langle \psi_j | \psi_i \rangle \langle \psi_i | \psi_j \rangle[/tex] this is the expression I obtain after taking the trace and using the identity representation, I see now. Thank you Fredrik :)
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
Replies
2
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K