How observation leads to wavefunction collapse?

  • #251
Demystifier said:
The point is that the "Wigner-Weinberg" is not an appropriate name, because Weinberg does NOT propose to use the non-covariant Wigner position operator. In fact, Weinberg does not introduce any position operator, which makes his approach inconsistent with nonrelativistic QM. The question is: Is Weinberg aware of that?

You are right, Weinberg does not introduce position operator and position-space wavefunctions in the first 13 chapters of his book. However, he could easily do that without contradicting anything else said in his book.

The situation changes in chapter 14, where he discusses bound states and the Lamb shift. In eqs. (14.1.4) and (14.1.5) he introduces electron's position-space wavefunctions, which are equivalent to your earlier definition

\psi(x) = \langle 0 | \phi(x) |1 \rangle

where \psi(x) is the wavefunction and \phi(x) is the quantum field. I think, Weinberg is not fully consistent in his book. His "wavefunctions" are vulnerable to the counterargument that I suggested in post #241.

It would be great to ask Weinberg himself what he thinks about that. But I suspect he is not reading Physics Forums.

Eugene.
 
Physics news on Phys.org
  • #252
gptejms said:
It wasen't my assumption--it was Demystifier's.As far as evidence is concerned he said 'Because this provides that the equation of motion for psi (the Klein Gordon equation) will have the same form in all Lorentz frames.'I think what he said is true for the field(here too, I would say the condition should be slightly modified to \psi^\prime (x^\prime)=\psi (x) where x^\prime = \Lambda x) not the wavefunction.

This is true in the case of multicomponent (spinor, vector, etc.) quantum fields \phi_i(x). In this case, the transformation law is

\phi_i(x) \to \sum_j D(\Lambda^{-1})_{ij} \phi_j (\Lambda x)

where D_{ij} is a finite-dimensional (non-unitary) representation of the Lorentz group associated with the field \phi_i(x). In the case of scalar field D = 1.


gptejms said:
May be you can tell us what it should be for the wavefunction.

I don't have a closed formula, but the boost transformation of the wavefunction \psi(\mathbf{r}) \to U(\Lambda) \psi(\mathbf{r})can be obtained in the following three steps:

1. Change to the momentum representation

\psi(\mathbf{p}) = (2 \pi \hbar)^{-3/2}\int d^3r \psi(\mathbf{r}) \exp(-\frac{i}{\hbar} \mathbf{pr})

2. Apply boost transformation to the wavefunction momentum arguments

U(\Lambda) \psi(\mathbf{p}) = \psi(\Lambda \mathbf{p})

Transformations of momenta under boosts are given by simple formulas. For example, if \Lambda is a boost along the z-axis with rapidity \theta, then

\Lambda (p_x, p_y, p_z) = (p_x, p_y, p_z \cosh \theta + \frac{E_p}{c} \sinh \theta)

where E_p = \sqrt{m^2c^4 + p^2c^2}

3. Perform inverse transformation back to the position space

U(\Lambda) \psi(\mathbf{r}) = (2 \pi \hbar)^{-3/2}\int d^3p U(\Lambda) \psi(\mathbf{p}) \exp(\frac{i}{\hbar} \mathbf{pr})
 
  • #253
meopemuk said:
3. Perform inverse transformation back to the position space

U(\Lambda) \psi(\mathbf{r}) = (2 \pi \hbar)^{-3/2}\int d^3p U(\Lambda) \psi(\mathbf{p}) \exp(\frac{i}{\hbar} \mathbf{pr})

If \psi(\mathbf{p}) =1, then you get back U(\Lambda) acting on the wavefunction(which is a delta function in this case).Now what do you do?
 
  • #254
How do you transform the boundaries of the integrals during the boost? This is a critical part as the set of integration is the effective "event space" of the observer. The question is how does sets of simultaneous sampling transform, this seems to be the basis for the entire probabilistic reasoning, these sets seems generally not to be invariants. The observers doesn't possesses the same set of information. So it seems we are comparing apples to pears.

To consider infinity as sample space is a bit amgious in my thinking.

/Fredrik
 
  • #255
gptejms said:
If \psi(\mathbf{p}) =1, then you get back U(\Lambda) acting on the wavefunction(which is a delta function in this case).Now what do you do?

Oops! I am sorry. I missed an important part in the boost transformation of the momentum-space wave function. Please replace step 2. in my previous posty with

2. Apply boost transformation to the momentum space wavefunction

U(\Lambda) \psi(\mathbf{p}) = \sqrt{\frac{E_{\Lambda p}}{E_p}}\psi(\Lambda \mathbf{p}) (1)

The (previously missed) square root factor is important in order to guarantee the conservation of probabilities under boosts. For example, the wavefunction normalization is now preserved, because

\int d^3p |U(\Lambda) \psi(\mathbf{p}) |^2= \int d^3p\frac{E_{\Lambda p}}{E_p}|\psi(\Lambda \mathbf{p})|^2 (2)

where one can make the substitution \mathbf{q} = \Lambda \mathbf{p} and use the well-known property

\frac{d^3p}{E_p} = \frac{ d^3(\Lambda p)}{E_{\Lambda p}}


to show that the normalization integral (2) is equal to

\int d^3q|\psi( \mathbf{q})|^2 = 1

With this (now corrected) boost transformation law (1) the position-space delta function is not transformed to another position-space delta function. There is a wavefunction spreading caused by boosts.
 
  • #256
meopemuk said:
With this (now corrected) boost transformation law (1) the position-space delta function is not transformed to another position-space delta function. There is a wavefunction spreading caused by boosts.

Good!
But as I asked earlier also,what happens to the time coordinate?
 
  • #257
gptejms said:
Good!
But as I asked earlier also,what happens to the time coordinate?

The time coordinate is not present at all. These formulas relate wavefunction \psi(\mathbf{r}) seen by the observer at rest at time t=0 (measured by his clock) to the wavefuinction U(\Lambda)\psi(\mathbf{r}) seen by the moving observer at time t=0 (measured by her clock).

I guess you would like to condemn this transformation on the basis of unequal treatment of space and time coordinates. Before you do that, consider the following:

1. You haven't proved yet that space/time symmetry and covariant transformation laws follow necessarily from the principle of relativity. (I know, that's what most textbooks say. But I haven't seen a rigorous proof.)

2. It actually makes sense that the transformed wavefunction U(\Lambda)\psi(\mathbf{r}) depends on values of the initial wavefunction \psi(\mathbf{r}) at time t=0 only. Indeed, the latter wavefunction provides a complete description of the particle's state for the observer at rest at t=0. By knowing the state for one observer, we should be able to find exactly how the state looks like for any other (e.g., moving) observer. Therefore, it seems quite logical, that the transformed wavefunction U(\Lambda)\psi(\mathbf{r}) should have a unique expression in terms of the original wavefunction \psi(\mathbf{r}) at time t=0.

Eugene.
 
  • #258
meopemuk said:
It would be great to ask Weinberg himself what he thinks about that. But I suspect he is not reading Physics Forums.
Completely agree. :approve:
 
  • #259
meopemuk said:
2. It actually makes sense that the transformed wavefunction U(\Lambda)\psi(\mathbf{r}) depends on values of the initial wavefunction \psi(\mathbf{r}) at time t=0 only. Indeed, the latter wavefunction provides a complete description of the particle's state for the observer at rest at t=0. By knowing the state for one observer, we should be able to find exactly how the state looks like for any other (e.g., moving) observer. Therefore, it seems quite logical, that the transformed wavefunction U(\Lambda)\psi(\mathbf{r}) should have a unique expression in terms of the original wavefunction \psi(\mathbf{r}) at time t=0.

Eugene.

The problem here is that U(\Lambda)\psi(\mathbf{r}) that you have determined is not all at one t^{\prime},the time in the transformed frame.Is this even meaningful then?
 
  • #260
gptejms said:
The problem here is that U(\Lambda)\psi(\mathbf{r}) that you have determined is not all at one t^{\prime},the time in the transformed frame.Is this even meaningful then?

I am not sure I understand what you are saying. U(\Lambda)\psi(\mathbf{r}) is the wavefunction seen by the moving observer at one particular time instant t=0.

From your question it seems to me that you are still trying to interpret wavefunction transformations in terms of Minkowski space-time diagrams. I think that this is not a useful approach.

Eugene.
 
  • #261
meopemuk said:
I am not sure I understand what you are saying. U(\Lambda)\psi(\mathbf{r}) is the wavefunction seen by the moving observer at one particular time instant t=0.

From your question it seems to me that you are still trying to interpret wavefunction transformations in terms of Minkowski space-time diagrams. I think that this is not a useful approach.

Eugene.
Are you saying that two observers have the same time t?
 
  • #262
meopemuk said:
I am not sure I understand what you are saying. U(\Lambda)\psi(\mathbf{r}) is the wavefunction seen by the moving observer at one particular time instant t=0.

From your question it seems to me that you are still trying to interpret wavefunction transformations in terms of Minkowski space-time diagrams. I think that this is not a useful approach.

Eugene.

If t=0 ,then from Lorentz transformation, t' is not zero everywhere--so your transformed wavefunction is not defined at one t'(but multiple times).
 
  • #263
Demystifier said:
Are you saying that two observers have the same time t?

I don't understand the meaning of your question. One observer measures wavefunction at time t=0 by his clock. The other (moving) observer also measures (a different) wavefunction at time t=0 by her clock. There is a formula that connects these two wavefunctions. Is there anything wrong with that?

Eugene.
 
  • #264
> One observer measures wavefunction at time t=0 by his clock

Now it get's interesting. How does this measurement look like - I picture it to be a procedure, experimental procedure, possibly including calculations etc? Or perhaps the wavefunction isn't measurable?

I think it's safe to assume that only the observers own past can influence this "measurement", or?

But the two observers doesn't share the same past anyway, that's how it gets interesting to see how you can get the obvious connection of two different histories.

/Fredrik
 
  • #265
gptejms said:
If t=0 ,then from Lorentz transformation, t' is not zero everywhere--so your transformed wavefunction is not defined at one t'(but multiple times).

Wavefunction \psi(\mathbf{r}) refers to time t=0 from the point of view of observer O. Wavefunction U(\Lambda) \psi(\mathbf{r}) refers to time t'=0 from the point of view of the moving observer O'. Both wavefunctions are defined at a single time point for the respective observers.

What is the meaning of your statement "t' is not zero everywhere"?

I think you are trying to apply Lorentz transformations (formulated for times and positions of events viewed from different frames of reference) to arguments (\mathbf{r}, t) of wavefunctions. What makes you believe that Lorentz transformations are applicable in this case?

Eugene.
 
Last edited:
  • #266
Fra said:
> One observer measures wavefunction at time t=0 by his clock

Now it get's interesting. How does this measurement look like - I picture it to be a procedure, experimental procedure, possibly including calculations etc? Or perhaps the wavefunction isn't measurable?

I think it's safe to assume that only the observers own past can influence this "measurement", or?

But the two observers doesn't share the same past anyway, that's how it gets interesting to see how you can get the obvious connection of two different histories.

/Fredrik

Could you please be more precise? It seems that you find a contradiction in my statements. But I am not sure what is the contradiction that you see?

Eugene.
 
  • #267
meopemuk said:
Wavefunction \psi(\mathbf{r}) refers to time t=0 from the point of view of observer O. Wavefunction U(\Lambda) \psi(\mathbf{r}) refers to time t'=0 from the point of view of the moving observer O'. Both wavefunctions are defined at a single time point for the respective observers.

What is the meaning of your statement "t' is not zero everywhere"?

If t=0,then t'=0 only at x=0.Your wavefunction \psi(\mathbf{r^{\prime}}) is not defined at one particular t'.
 
  • #268
gptejms said:
If t=0,then t'=0 only at x=0.Your wavefunction \psi(\mathbf{r^{\prime}}) is not defined at one particular t'.

I don't understand what you mean by "then t'=0 only at x=0". In my understanding, the time shown by a clock does not depend on x. Could you please explain?

What I am saying is this: We have two observers O and O'. Both of them have clocks. Both of them perform measurements of the wavefunction of the (same) particle when their clocks show zero time. Can they do that? Yes they can. Both of them will obtain certain results for the position space wavefunction. Their results will be different and they will be related by the transformation I've outlined.

Eugene.
 
  • #269
meopemuk said:
Could you please be more precise? It seems that you find a contradiction in my statements. But I am not sure what is the contradiction that you see?

Eugene.

I guess it's difficult, it is not a direct contradiction since I'm not sure I understand how you define the wavefunction \psi(r,t).

Do you consider |\psi(r,t)|^2 to be the probability density to find whatever we are looking for at position r, at time t? If so, do you tehcnically consider the observer to be delocalized? Ie. does the observer have an array or detectors throughout the universe? Is t, the time where the observer gets informed, or the time in the observers frame, when the event happens simultaneously at another location, hitting one of the infinite number of detectors in the universe? or are the observer instantaneously informed?

The question is, how does the normalisation look like? on what data does the normalization take place? If you picture surfaces in minkowskispace your suggestions seems strange, so what is your thinking? Or are you thinking that the ensemble construction is observer invariant?

meopemuk said:
I don't understand what you mean by "then t'=0 only at x=0". In my understanding, the time shown by a clock does not depend on x. Could you please explain?

But your wavefunctions contains to datapoints, relating to position. And how does information about remote points propagate consistently to the observer and arrive as to allow simultaneous "sampling"? If you mention probability to see something at a certain time. One asks, at what time? Does the various events originate from a simulatenous events in the fram, or from different times? Either way may be fine, as long as it's consistent. In the first frame for example, did |\psi(r)|^2 refer to t=const in minkowski space? If not, how is if defined? \psi is just a symbol, I'd like to see a logic how it's induced from something that is at leat in principle measureable. I'm not saying it can't be done, I'm just not sure I understand your view. If you have some interesting ideas I'd like to understnad them.

/Fredrik
 
  • #270
Fra said:
Do you consider |\psi(r,t)|^2 to be the probability density to find whatever we are looking for at position r, at time t?

Yes, this is the standard definition of the wavefunction at time t.


Fra said:
If so, do you tehcnically consider the observer to be delocalized? Ie. does the observer have an array or detectors throughout the universe? Is t, the time where the observer gets informed, or the time in the observers frame, when the event happens simultaneously at another location, hitting one of the infinite number of detectors in the universe? or are the observer instantaneously informed?


I understand your concerns. Technically, it is difficult to collect information about extended wavefunctions at one time instant. However, in principle, it is not impossible.

Consider an array of detectors covering entire universe, and one processing unit in the middle. Suppose that each detector is connected to the processing unit by a cable of the same length L. For far away detectors the cables will be stretched. For detectors close to the processing unit we will need to fold their cables (in rings, or whatever). Assuming that signals propagate through all cables at the same speed c, it would take exactly the same time L/c to reach the central unit for signals from all detectors in the array. This arrangement will guarantee that our measurements will be done at the same time instant (or, if you like on the same "slice" of the Minkowski space-time) throughout entire universe.

This is for the observer at rest. The moving observer will have a similar array of detectors co-moving with her. Both observer at rest and moving observer can use their detector arrays to measure instantaneous (in their opinion) wavefunctions and then compare their notes. This comparison should agree with the 3-step wavefunction transformation that I outlined in previous posts. Does it make sense?


Eugene.
 
  • #271
meopemuk said:
I don't understand what you mean by "then t'=0 only at x=0". In my understanding, the time shown by a clock does not depend on x. Could you please explain?

Use t'=\frac{t-xv^2/c^2}{\sqrt{1-v^2/c^2}}.Your (transformed)wavefunction is defined at one particular t(not one t' as it should be).

In your post #252,you did Fourier transform over x--you need to take the time coordinate t also (and Fourier transf. over time as well) to complete the calculation.
 
Last edited:
  • #272
meopemuk said:
I understand your concerns. Technically, it is difficult to collect information about extended wavefunctions at one time instant. However, in principle, it is not impossible.

Consider an array of detectors covering entire universe, and one processing unit in the middle. Suppose that each detector is connected to the processing unit by a cable of the same length L. For far away detectors the cables will be stretched. For detectors close to the processing unit we will need to fold their cables (in rings, or whatever). Assuming that signals propagate through all cables at the same speed c, it would take exactly the same time L/c to reach the central unit for signals from all detectors in the array. This arrangement will guarantee that our measurements will be done at the same time instant (or, if you like on the same "slice" of the Minkowski space-time) throughout entire universe.

This is for the observer at rest. The moving observer will have a similar array of detectors co-moving with her. Both observer at rest and moving observer can use their detector arrays to measure instantaneous (in their opinion) wavefunctions and then compare their notes. This comparison should agree with the 3-step wavefunction transformation that I outlined in previous posts. Does it make sense?

I suspect this boils down to our views of probability concepts and what the point of the wavefunction is, or should be.

Your prescription of filling the universe with detectors and then adjusting cable length to maintain simultanous sampling means that the "simultaneous sampling" will be delayed an incredible amount of time if the "universe" is large. Which would mean that the simultanous sampling at time t1, is delayed and not given to use until t2.

So, you can not in principle, in your thinking, "measure" \psi(x,t_1)until t2, if we agree on that, I think the construction is useless.

So the wavefunction the observer writes down at t2, that is supposedly \psi(x,t2),is really based on \rho(x,t1). Clearly the information from nearby detector may have impacted the observer between t1 and t2. The whole point is that the observer needs his wavefunction for guidance. If this is not determined until the end of time, I see the whole objective beeing disrespected.

Anyway, I assumed that what the detectors would give you anyway, is particle position, so what we really get is \rho(x,t), where t is way back in history once we receive it.

So suppose we define \psi(x,t) = (\rho(x,t))^{1/2}, then we determine \psi up to the complex phase only.

I think the probabilistic constructions should be made on truly simultaneously available information. The relevant question I ask, is what is my _currently_ available best information, and how does that induce my "probabilistic" estimates? But then, my view of probability in this context is the bayesian one. Supposedly the wavefunction represents this information, right? If not, the wavefunction is of little use. The information that is in my hand tomorrow, does not help me today.

You seem to suggest(?) that your best bets at t2 are based on old simultanous information originating from t1? I don't understand the point in such construction?

I am still working on my thinking, but if I introduce a wavefunction it represents the simultanous information I have, at hand. I will from then one define related probability spaces. The relations will mutually induce priors, and there will also be a relation to mass. You can not accumulated arbitrary amounts of information without getting huge mass. This doesn't contain the information I get tomorrow. Neither can I ignore information at hand, just to synchronize with collection of data sent from remote locations. This is a bit difficult though, and in progress. But in my thinking the proper transformation is not as simple as a rotation or translation and so on in some space. It would ultimatley be an equilibration process, where equilibrium is where the two observers are in agreement.

I think the most fundamental difference to my thinking here is the treatise of probability theory, and what it is used for? IMO, the purpose of probability in reality is for any subject to formulate it's best educated guess, accuonting to all available information. This is entangled with the definition of wavefunction. The simple postulates of QM are far from satisfactory. They don't match the complexity I see in reality. In my thinking this analysis demands accounting for several things, the notion of space time, "entropy" concepts and relative probabilities and the concept of information vs mass vs inertia.

I find the standard formulations simplistic and unsatisfactory from the point of view of a scientific method. The motivation is that "looks it works". That may be true, but the real question is if we are just lucky, and still await a proper explanation?

/Fredrik
 
Last edited:
  • #273
gptejms said:
Use t'=\frac{t-xv/c^2}{\sqrt{1-v^2/c^2}}
Your (transformed)wavefunction is defined at one particular t(not one t' as it should be).


I think you should be careful when extending Lorentz transformations to transformations of wavefunction arguments. You may have extended them beyond their range of validity.

Recall the physical meaning of Lorentz transformations. They relate times and positions of actual events measured in different reference frames. If from the point of view of observer O there was a physical event (for example, a collision of two particles) at point x at time t, then from the point of view of the moving observer O' the same event occurs at space-time point (x',t'), where

t'=\frac{t-xv/c^2}{\sqrt{1-v^2/c^2}} (1)

x'=\frac{x-vt}{\sqrt{1-v^2/c^2}} (2)

These transformations apply to definite classical events measured with 100% probability.

The physical interpretation of wavefunction \psi(x,t) tells that if I place a particle detector at point x, then the probability (density) of registering a particle at this point at time t is |\psi(x,t)|^2. Your prescription is equivalent to the following statement. If the moving observer O' places a particle detector at point x', then the probability (density) of finding a particle there at time t' should be equal to |\psi(x,t)|^2 This statement doesn't necessarily follow from the original (classical) meaning of Lorentz transformations. At least, you haven't provided yet a convincing proof that one implies the other. If you want to sound convincing, I think you need to prove your statement from fundamerntal axioms of quantum mechanics and relativity.

On the other hand, I am ready to provide a complete proof of my wavefunction transformation law based on such axioms. The full proof would be quite lengthy, so I will outline just few important steps:

1. Particle states are described by unit vectors in a Hilbert space H.

2. Transformations between different inertial frames of reference (or observers) form the Poincare group (here I am considering not just boosts, but the full set of inertial transformations, which include also space translations, time translations, and rotations; all of them are treated on equal footing).

3. The action of inertial transformations on state vectors are such that probabilities are preserved.

4. It follows from 3. that inertial transformations of observers are represented by unitary operators in the Hilbert space H, and that there exists an unitary representation U_g of the Poincare group in H. This means that if |\Psi \rangle is a state vector of the particle from the point of view of observer O, then observer O' will describe the same state by the vector U_g|\Psi \rangle, where g is the boost transformation that connects O to O'. So, we found our desired transformation in the language of abstract state vectors. All we need to do is to translate this transformation in the language of position-space wavefunctions.

5. In order to do that we need to define the position operator \mathbf{R} and its eigenvectors |\mathbf{r} \rangle in the Hilbert space H. Then the wavefunction from the point of view of observer O will be \langle \mathbf{r}| \Psi \rangle, and the wavefunction from the point of view of O' will be \langle \mathbf{r}| U_g |\Psi \rangle

6. The position operator \mathbf{R} satisfying all relativistic requirements was constructed by Newton and Wigner in

T. D. Newton, E. P. Wigner, "Localized states for elementary systems" Rev. Mod. Phys. 21 (1949), 400.

If you put all these ideas together you'll obtain exactly the wavefunction transformation law given in post #252 (with correction in post #255).

You may wonder whether this transformation law agrees with classical Lorentz transformations? It appears that yes, they do agree with each other. It is possible to show (I will not do it here, but I am ready to reproduce calculations, if you are interested) that if we

1) take the classical limit of the above theory and consider trajectories of two non-interacting particles that intersect at the space-time point (x,t) from the point of view of observer O and
2) transform these trajectories according to commutators between the Newton-Wigner operator and generators of boost transformations U_g,

then

3) we will find that from the point of view of O', the intersection of trajectories occurs at point (x',t') which is related to (x,t) by usual Lorentz formulas (1) and (2).

Eugene.
 
  • #274
Hi Fredrik,

Yes, in my detector design the signal generated by the particle today and here will reach the central processing unit only after very long time L/c. So, I'll need to wait forever to analyze these results. But we are not talking about practical matters here, we are talking about matters of principle. So, in principle, it is possible to measure the wavefunction in entire space at a fixed time instant.

You are right that this will give us only the square of the position-space wavefunction, which doesn't determine the state uniquely. To overcome this problem, one can also build detectors measuring probability distributions of particle momentum, angular momentum, and other observables not commuting with position. So that the full set of data would allow us to reconstruct the particle state in an unique fashion.

I don't want to speculate how exactly these detectors should be built. Personally, I believe that nothing useful can be gained from considering exact designs of such detectors. I think that for theoretical purposes it is sufficient to postulate that each observer can somehow precisely determine the wavefunction of any state in any given basis at each fixed time instant t. This postulate is, of course, an idealization. But I believe that this is a reasonable price for getting a simple and transparent theory.

Eugene.
 
  • #275
meopemuk said:
I don't understand the meaning of your question. One observer measures wavefunction at time t=0 by his clock. The other (moving) observer also measures (a different) wavefunction at time t=0 by her clock. There is a formula that connects these two wavefunctions. Is there anything wrong with that?
Not necessarily. But can you write down a general mathematical relation (not only for the case when both are zero) between times of the two observers? If I understand you correctly, you suggest that it might NOT be the Lorentz transformation.
 
  • #276
meopemuk said:
I don't want to speculate how exactly these detectors should be built. Personally, I believe that nothing useful can be gained from considering exact designs of such detectors.

I agree the exact detector design may not be the biggest issues at this point, but still details like how information from detectors are transformed into possession of the observer is i think important. If we are careless here I think we are compromising our own endavours.

meopemuk said:
I think that for theoretical purposes it is sufficient to postulate that each observer can somehow precisely determine the wavefunction of any state in any given basis at each fixed time instant t. This postulate is, of course, an idealization. But I believe that this is a reasonable price for getting a simple and transparent theory.

I think we disagree about the validity of some of the idealizations that we admittedly all do at some level. But the exact concept of information, what it means, and what it's for, is I think so paramount that I disagree with your foundations here. I want to find more solid ground.

Though you are completely right that what I "want", will not be as simple, and I'm still fighting with "basics" that you already idealized away. We will see if I succeed in resolving it to a way that satisfies me. Meanwhile m view of the standard models is as a kind of effective theories, but whose rigorous formulation is yet to be seen. And with rigour I mean not just mathematical rigour of formalism, I mean rigour of reasoning.

/Fredrik
 
  • #277
Demystifier said:
Not necessarily. But can you write down a general mathematical relation (not only for the case when both are zero) between times of the two observers? If I understand you correctly, you suggest that it might NOT be the Lorentz transformation.

Lorentz transformations are understood as transformations between times and positions of events in different reference frames. When you have a single quantum-mechanical particle with its wavefunction, there is no specific event for which you could exactly pinpoint its space-time location. The wavefunction is diffuse, and measurements are not certain. So, Lorentz transformations need not to apply in this case.

The situation is different in the classical limit. Then particle's behavior can be described by a trajectory x(t) in the reference frame O and by a trajectory x'(t') in the reference frame O'. These trajectories can be regarded as series of events (x,t) and (x',t'). Within my theory x and x' can be represented as expectation values of the Newton-Wigner position operator in different reference frames. By applying boost transformations to the Newton-Wigner operator it can be shown that events (x',t') are related to (x,t) by usual Lorentz formulas. So, this theory does produce Lorentz transformations, but only in the classical (h -> 0) limit.

Eugene.

P.S. It is important to note that Lorentz formulas can be obtained in this fashion only if the particle is non-interacting, i.e., it has a straight-line trajectory. It appears that for interacting particles (with events defined as, for example, intersections of their trajectories) this approach leads to different transformations (x,t) -> (x',t'), which deviate from Lorentz formulas if the interaction is strong. So, in fact, predictions of my theory are different from those of ordinary special relativity in some cases. This is another "can of worms", and I am not sure if we are ready to open it at this moment.
 
  • #278
meopemuk said:
These transformations apply to definite classical events measured with 100% probability.

If the probability is not 100%,I don't see why it should follow a different law/transformation.

At least, you haven't provided yet a convincing proof...

I asked you to complete your calculation in post #252 with the time coordinate included.Anyway---if the wavefunction in one frame is \exp{\iota p_\mu x^\mu}(where \hbar=1),then you can see that the transformed wavefunction is going to be not much different.

It is possible to show (I will not do it here, but I am ready to reproduce calculations, if you are interested)

Go ahead!
 
Last edited:
  • #279
gptejms said:
If the probability is not 100%,I don't see why it should follow a different law/transformation.

I disagree with some of Eugene's thinking, but I agree on the fact that probabilities can't just be transformed like a lorentz scalar. I think Eugene's reason for thinking so is different than mine, but by reason is that probability is not an objective thing, it's not an object that everyone agress upong. It's intrinsically relative to the observer. What is probably for me, need not be probably to you for the simple reason that we devise our ensembles differently.

It does not make sense to think that there is someone, like a God, that knows everything and thus therefore can give objective status to probabilities.

This debate is not specific to quantum mechanics, it's a bit philosophical and has to do as much with your view of probability theory.

The frequentists and the bayesian thinking can probably be partly united with some abstractions, if you consider that the frequency of information is relative anyway. Two observers by definition doesn't see the same things. They live in the same universe, but it's a universe we all are fighting to understand, and no one has an omnipotent view of things. Thus anything claiming to find a simple connection between the views, by means of a mathematical transformation must incorporate assumptions about information they do not have. I suggest it's formulate in a bayesian sense, as degrees of information in different things, relative to what we know. This is essence, IMO, one of the few honest implementations of the scientific method, which I think of as finding the BEST guess, given the current evidence.

I think most people would agree that, just because we don't know, this doesn't mean we might as well guess randomly. The fact that we still know _something_ can induce a bayesian probability, as to guide us forward. This can be formalised. The proof is in the succcess and survival implying a selection, I think there are no formal proofs.

/Fredrik
 
  • #280
  • #281
olgranpappy said:
a single electron does not create an interference pattern

Sorry to bumb in this late but i think your statement is a bit inaccurate. It is indeed true we need to redo several times the double slit experiment to actually observe the interference pattern. But, 1 electron is interfering with itself. More specifically, the interference happens between all the possible paths when going from source to detector. So, not observing the interference pattern after 1 single measurement does NOT imply that the electron is NOT interfering !

Just my 2 $'s


marlon
 
  • #282
gptejms said:
If the probability is not 100%,I don't see why it should follow a different law/transformation.

I agree with Fra on this point:

Fra said:
... I agree on the fact that probabilities can't just be transformed like a lorentz scalar. ... probability is not an objective thing, it's not an object that everyone agress upong. It's intrinsically relative to the observer. What is probably for me, need not be probably to you for the simple reason that we devise our ensembles differently.

I think that your transformation law \psi(x) \to \psi(\Lambda x) is not obvious at all. If you think it is true, you should be able to prove it, rather that give a vague reference to analogies.

gptejms said:
I asked you to complete your calculation in post #252 with the time coordinate included.

This is done most easily in the momentum representation. If \psi(\mathbf{p}) is the wave function at time t=0 from the point of view of observer at rest O, then

e^{\frac{it}{\hbar} E_p}\psi(\mathbf{p})

is the wavefunction at time t from the point of view of O


\sqrt{\frac{E_{\Lambda p}}{E_p}}\psi(\Lambda \mathbf{p})

is the wavefunction from the point of view of moving observer O' at time t=0 (by her clock)

e^{\frac{it'}{\hbar} E_{\Lambda p}} \sqrt{\frac{E_{\Lambda p}}{E_p}}\psi(\Lambda \mathbf{p})

is the wavefunction seen by O' at time t' (by her clock). You can get position-space representations of these wavefunctions by making usual Fourier transformations. I don't think these integrals can be written in a nice analytical form.

The above formulas follow from the general fact that boost transformations are realized by unitary operators \exp(-\frac{ic}{\hbar} K \theta), where K is the generator of boosts; and time translations are realized by unitary operators \exp(\frac{i}{\hbar} Ht), where H is the Hamiltonian.


gptejms said:
Anyway---if the wavefunction in one frame is \exp{\iota p_\mu x^\mu}(where \hbar=1),then you can see that the transformed wavefunction is going to be not much different.

This is true, but the wavefunction you wrote is just a specific case of a plane wave, i.e., a state with definite momentum, where our formulas coincide. I was talking about transformations of general wavefunctions.


meopemuk said:
It is possible to show (I will not do it here, but I am ready to reproduce calculations, if you are interested) that if we

1) take the classical limit of the above theory and consider trajectories of two non-interacting particles that intersect at the space-time point (x,t) from the point of view of observer O and
2) transform these trajectories according to commutators between the Newton-Wigner operator and generators of boost transformations U_g,

then

3) we will find that from the point of view of O', the intersection of trajectories occurs at point (x',t') which is related to (x,t) by usual Lorentz formulas (1) and (2).
gptejms said:
Go ahead!

OK. To make things simple, I'll consider a one-dimensional problem in which the movement of the particles and the reference frame occur only along the x-axis. I will denote P, H, and K the generators of the Poincare group corresponding to space translations, time translations, and boosts, respectively. In the case of spinless particle, the Newton-Wigner position operator is R = -c^2 \frac{K}{H} (In fact, the correct quantum expression is R = -\frac{c^2}{2} (KH^{-1} + H^{-1}K), however, in the end we will take the classical limit anyway, so it is justified to ignore the non-commutativity of operators here.)

From the point of view of observer at rest O, the time dependence of R is

R(0,t) = \exp(\frac{i}{\hbar}Ht) R \exp(-\frac{i}{\hbar}Ht) = R + Vt

where V = Pc^2/H is velocity (Calculations performed in this post can be easily done with the knowledge of commutators between P, H, K, R, and V. I'll skip the detailed calculations, but, if you like, I can justify all steps)

From the point of view of moving observer O', the position at t'=0 (where time t' is measured by the clock of observer O') is given by

R(\theta,0) = \exp(-\frac{ic}{\hbar}K \theta) R \exp(\frac{ic}{\hbar}K \theta) = \beta \frac{R}{\cosh \theta}

(Here I will use a convenient notation \cosh \theta = (1-v^2/c^2)^{-1/2} and \sinh \theta = v/c (1-v^2/c^2)^{-1/2} and
\beta = (1 - Vv/c^2)^{-1}, where v is the velocity of the reference frame O')

The time dependence of position from the point of view of O' is

R(\theta,t') = \exp(\frac{i}{\hbar}H't') R (\theta,0)\exp(-\frac{i}{\hbar}H't') = \beta (\frac{R}{\cosh \theta} + (V-v)t'} ) (1)

where

H' = \exp(-\frac{ic}{\hbar}K \theta) H \exp(\frac{ic}{\hbar}K \theta)

is the Hamiltonian in the reference frame O'.


Eq. (1) is the transformation of position obtained by quantum-mechanical Poincare-group analysis. On the other hand, standard Lorentz transformations of special relativity would predict

R(\theta,t') = R (0,t)\cosh \theta - ct \sinh \theta (2)

t' = t \cosh \theta - \frac{R (0,t)}{c} \sinh \theta (3)

Formulas (1) and (2) do not look identical. However, if we take their difference, replace t' by eq. (3) and use R (0,t) = R + Vt, we will get exactly zero.

This means that space-time coordinates of free classical particles transform exactly by Lorentz formulas within relativistic quantum mechanics. This also means that if there is a localized event defined as intersection of trajectories of two non-interacting particles, then space-time coordinates of this event will transform by Lorentz formulas as well.

One important remark. Here we obtained Lorentz transformation formulas, which are usually interpreted as an evidence of equivalence of space and time coordinates. However, there is no such symmetry in our approach: Position is described by a Hermitian operator, but time is just a classical parameter that labels reference frames.

Another important point is that by this method we will not be able to prove Lorentz formulas (2) and (3) for the point of collision if the two particles interact with each other.

Eugene.
 
Last edited:
  • #283
meopemuk said:
This is done most easily in the momentum representation. If \psi(\mathbf{p}) is the wave function at time t=0 from the point of view of observer at rest O, then

e^{\frac{it}{\hbar} E_p}\psi(\mathbf{p})... (1)

is the wavefunction at time t from the point of view of O


\sqrt{\frac{E_{\Lambda p}}{E_p}}\psi(\Lambda \mathbf{p})

is the wavefunction from the point of view of moving observer O' at time t=0 (by her clock)

e^{\frac{it'}{\hbar} E_{\Lambda p}} \sqrt{\frac{E_{\Lambda p}}{E_p}}\psi(\Lambda \mathbf{p})...(2)

is the wavefunction seen by O' at time t' (by her clock). You can get position-space representations of these wavefunctions by making usual Fourier transformations. I don't think these integrals can be written in a nice analytical form.

You have now transformed t to t',E to E'(in addition to p to p' which you were doing earlier)--now you add to this x to x' and you'll end up saying exactly what I am saying!



This is true, but the wavefunction you wrote is just a specific case of a plane wave, i.e., a state with definite momentum, where our formulas coincide. I was talking about transformations of general wavefunctions.

A general wavefunction is nothing but a sum of such plane waves, which is what you (would be) ultimately doing above,or in your post #252.


Formulas (1) and (2) do not look identical. However, if we take their difference, replace t' by eq. (3) and use R (0,t) = R + Vt, we will get exactly zero.

Another important point is that by this method we will not be able to prove Lorentz formulas (2) and (3) for the point of collision if the two particles interact with each other.

Eugene.

Interesting operator,but with limitations.
 
  • #284
gptejms said:
You have now transformed t to t',E to E'(in addition to p to p' which you were doing earlier)--now you add to this x to x' and you'll end up saying exactly what I am saying!


A general wavefunction is nothing but a sum of such plane waves, which is what you (would be) ultimately doing above,or in your post #252.

In order to find the position-space wavefunction for the moving observer O' at time t' one should first find eigenvectors |\mathbf{r}, t' \rangle of the corresponding position operator

R(\theta,t') = \exp(\frac{i}{\hbar}H't') R (\theta,0)\exp(-\frac{i}{\hbar}H't') = \beta (\frac{R}{\cosh \theta} + (V-v)t'} )

R(\theta,t') |\mathbf{r}, t' \rangle = \mathbf{r}|\mathbf{r}, t' \rangle

and then take the inner product of the state vector |\Psi \rangle with these eigenvectors

\psi (\mathbf{r}, t') = \langle \mathbf{r}, t' |\Psi \rangle ... (1)



I haven't done this calculation, but I am pretty sure that the result will be different from your proposal \psi (\Lambda x) , where x \equiv (\mathbf{r}, t). One reason is this: if the state |\Psi \rangle is localized from the point of view of observer O (this means that \psi ( \mathbf{r}, 0) is zero everywhere except point \mathbf{r}= 0), then transformed wavefunction (1) is not localized (neither at t'=0 nor at any other time), but function \psi (\Lambda x) is localized at t'=0.


Eugene.
 
Last edited:
  • #285
meopemuk said:
In order to find the position-space wavefunction for the moving observer O' at time t' one should first find eigenvectors |\mathbf{r}, t' \langle of the corresponding position operator.
Eugene,

When will you finally read those first two pages of Jun John Sakurai's chapter 3?It shows that the probability density transforms as the 0th component of the
four vector probability current density:

\frac{i}{2m}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

In this way you can account for the fact that the probability density goes
from one Lorentz contracted state to another:

\frac{E}{m}\psi(x) \to \frac{E'}{m}\psi(\Lambda x)

Where E, E' and m can be local densities in the most general case.Regards, Hans
 
Last edited:
  • #286
Sakurai also proofs very simply the unitarity of the probability density of
the Klein Gordon equation.

Given the probability current density:s_\mu\ =\ \frac{i}{2m}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

We have the requirement for unitarity that the four divergence of this
vanishes. (The continuity relation)

\frac{\partial s_\mu}{\partial x_\mu}\ =\ \frac{i}{2m}\left( \frac{\partial \phi^*}{\partial x_\mu}\frac{\partial \phi}{\partial x_\mu} - (\Box\phi^*)\phi - \phi^*\Box\phi -\frac{\partial \phi^*}{\partial x_\mu}\frac{\partial \phi}{\partial x_\mu} \right)\ =\ 0

If phi obeys the Klein Gordon equation then we can replace this by:

\frac{\partial s_\mu}{\partial x_\mu}\ =\ \frac{i}{2m}\left( \frac{\partial \phi^*}{\partial x_\mu}\frac{\partial \phi}{\partial x_\mu} - m^2\phi^*\phi - \phi^*m^2\phi -\frac{\partial \phi^*}{\partial x_\mu}\frac{\partial \phi}{\partial x_\mu} \right)\ =\ 0

Which trivially vanishes.Regards, Hans
 
Last edited:
  • #287
Hans de Vries said:
Eugene,

When will you finally read those first two pages of Jun John Sakurai's chapter 3?It shows that the probability density transforms as the 0th component of the
four vector probability current density:

\frac{i}{2m}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

In this way you can account for the fact that the probability density goes
from one Lorentz contracted state to another:

\frac{E}{m}\psi(x) \to \frac{E'}{m}\psi(\Lambda x)

Where E, E' and m can be local densities in the most general case.Regards, Hans

But this isn't probability density, is it? I haven't checked it with with some solution of the KG equation myself, but if I assume that Demystifier knows what he is talking about in his "QM Myths"-paper, there exists positive frequency solutions, for which this KG current density is negative (somewhere).

Demystifier, judging by your paper, and by responses to it that I saw here, I might guess that this
I was quite suprised when I noticed last night, that the Klein-Gordon equation actually does conserve probability, if only positive frequency solutions are considered. Are you aware of this? :cool:
(which I posted in the "Mathematical signature of electronmagnetic field and electron's deBroglie functions" thread) is not very well known. The calculation isn't even very difficult. For a given initial wavefunction \phi(0,\boldsymbol{x}), the positive frequency time evolution given by KG equation is

<br /> \phi(t,\boldsymbol{x}) = \int\frac{d^3p\;d^3x&#039;}{(2\pi)^3} \phi(0,\boldsymbol{x}&#039;) e^{-i(E_{\boldsymbol{p}}t - \boldsymbol{p}\cdot (\boldsymbol{x}-\boldsymbol{x}&#039;))}<br />

(Actually I'm not sure about the sign convention, but if that is not the standard positive frequency solution, it probably isn't ruining anything important here.) Assume

<br /> \int d^3x\; |\phi(0,\boldsymbol{x})|^2 = 1<br />

and using the usual delta function representations you can verify that

<br /> \int d^3x\; |\phi(t,\boldsymbol{x})|^2 = 1<br />

is true for any fixed t&gt;0. With the same calculation you can also show, that probability is conserved for negative frequency solutions. However, the linear combinations of positive and negative frequency solutions do not conserve the probability, and this is why Klein-Gordon equation does not conserve probability for arbitrary solutions, and is also why it is impossible to prove the conservation of probability out of Klein-Gordon equation alone.

meopemuk, I think you were ignoring the paradox I mentioned earlier. The solutions of your relativistic Shrodinger's equation

<br /> i\partial_t \psi(t,\boldsymbol{x}) = \sqrt{-\nabla^2 + m^2} \psi(t,\boldsymbol{x})<br />

are also solutions of the Klein-Gordon equation. Since fixing the sign of the frequency of the solution of Klein-Gordon equation determines its solution uniquely, defining time evolution of a wave function with relativistic Shrodinger's equation is equivalent to defining it with the Klein-Gordon equation and demanding only positive frequency solutions to be accepted. The paradox stemmed from assumptions that Shrodinger's equation was conserving the probability and Klein-Gordon equation was not, simultaneously! But this one I solved quite quickly.
 
Last edited:
  • #288
jostpuur said:
1. But this isn't probability density, is it? I haven't checked it with with some solution of the KG equation myself, but if I assume that Demystifier knows what he is talking about in his "QM Myths"-paper, there exists positive frequency solutions, for which this KG current density is negative (somewhere).

2. Demystifier, judging by your paper, and by responses to it that I saw here, I might guess that this

(which a posted in the "Mathematical signature of electronmagnetic field and electron's deBroglie functions" thread) is not very well known. The calculation isn't even very difficult. For a given initial wavefunction \phi(0,\boldsymbol{x}), the positive frequency time evolution given by KG equation is

<br /> \phi(t,\boldsymbol{x}) = \int\frac{d^3p\;d^3x&#039;}{(2\pi)^3} \phi(0,\boldsymbol{x}&#039;) e^{-i(E_{\boldsymbol{p}}t - \boldsymbol{p}\cdot (\boldsymbol{x}-\boldsymbol{x}&#039;))}<br />

(Actually I'm not sure about the sign convention, but if that is not the standard positive frequency solution, it probably isn't ruining anything important here.) Assume

<br /> \int d^3x\; |\phi(0,\boldsymbol{x})|^2 = 1<br />

and using the usual delta function representations you can verify that

<br /> \int d^3x\; |\phi(t,\boldsymbol{x})|^2 = 1<br />

is true for any fixed t&gt;0.
1. Yes, that is exactly what the problem is.

2. I do not think that you obtain a delta function. This is because energy E_p also depends on the 3-momentum p, so integration over d^3p is not so trivial. Check it once again.
 
  • #289
jostpuur said:
But this isn't probability density, is it? I haven't checked it with with some solution of the KG equation myself, but if I assume that Demystifier knows what he is talking about in his "QM Myths"-paper, there exists positive frequency solutions, for which this KG current density is negative (somewhere).
You always end up with this expression. (Just read Sakurai chapter 3.1 and 3.5)

j_\mu\ =\ \frac{i}{2m}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

for the probability current density. One does so with the Klein Gordon equation,
with the Schrödinger equation after the prescribed substitution \phi=\Psi \exp(-imc^2t/\hbar)
and with the Dirac equation after the the spin current has been separated out
via the Gordon decomposition.Yes, The 0th component which represents the probability density is
negative in the case of anti particles. Here we arrive at the 1934
Pauli-Weisskopf interpretation of j as the charge current density.

Both the probabilistic interpretation and the Pauli-Weisskopf interpretation
are valid and have been proved in countless experiments. The 0th component
of j is conserved either as the probability or the electric charge.

The non conservation issues can occur if the initial wavefunction at t=0 is
not combined with the correct \partial_t \psi at t=0 which is rather logical because
one then programs a violation of the continuity relation right into the
boundary conditions.


Regards, Hans
 
Last edited:
  • #290
Demystifier said:
2. I do not think that you obtain a delta function. This is because energy E_p also depends on the 3-momentum p, so integration over d^3p is not so trivial. Check it once again.

So we want to calculate

<br /> \int d^3x\; \phi^*(t,\boldsymbol{x})\phi(t,\boldsymbol{x})<br />

Substituting the general solutions to this gives

<br /> =\int\frac{d^3x\;d^3p\;d^3y\;d^3p&#039;\;d^3y&#039;}{(2\pi)^6} \phi^*(0,\boldsymbol{y}) e^{i(E_{\boldsymbol{p}}t - \boldsymbol{p}\cdot(\boldsymbol{x}-\boldsymbol{y}))} \phi(0,\boldsymbol{y}&#039;) e^{-i(E_{\boldsymbol{p}&#039;}t - \boldsymbol{p}&#039;\cdot(\boldsymbol{x}-\boldsymbol{y}&#039;))}<br />

This can be rearranged to be

<br /> =\int\frac{d^3x\;d^3p\;d^3y\;d^3p&#039;\;d^3y&#039;}{(2\pi)^6} \phi^*(0,\boldsymbol{y})\phi(0,\boldsymbol{y}&#039;) e^{i(E_{\boldsymbol{p}} - E_{\boldsymbol{p}&#039;})t} e^{i(\boldsymbol{p}\cdot\boldsymbol{y} - \boldsymbol{p}&#039;\cdot\boldsymbol{y&#039;})} e^{i(\boldsymbol{p}&#039;-\boldsymbol{p})\cdot\boldsymbol{x}}<br />

Integration of variable x can be carried out, and it gives \delta^{3}(\boldsymbol{p}&#039;-\boldsymbol{p}). Then you can integrate over the variable p', and this will remove energy terms.
 
  • #291
Hans de Vries said:
Yes, The 0th component which represents the probability density is
negative in the case of anti particles. Here we arrive at the 1934
Pauli-Weisskopf interpretation of j as the charge current density.

The fact that it can be negative also for the positive frequency solutions is the true problem.
 
  • #292
jostpuur said:
The fact that it can be negative also for the positive frequency solutions is the true problem.

This is simply not true. It's never negative for a positive frequency solution.
It can only be negative there where there is a negative energy density.
This does never occur for free particle solutions.

It also stays positive in the case of an interacting electron in a very deep
potential well where -eV is larger as the rest mass energy of the electron.
You have to take the interaction into account as is done in Sakurai chapter
3.5 in the paragraph handling the Gordon decomposition. (see 3.204)

For the Klein Gordon equation you get:

j_\mu\ =\ \frac{i\hbar}{2m}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)\ -\ \frac{e}{mc}A_\mu\phi^*\phi

where j_0 is the probability density.

Sakurai shows that unitarity also holds in the case of interacting fields.


Regards, Hans
 
Last edited:
  • #293
Hans de Vries said:
This is simply not true. It's never negative for a positive frequency solution.

All right. I tried a superposition of two plane waves in one dimension, and the Klein-Gordon density was still positive everywhere. Perhaps Demystifier explains this.
 
  • #294
Hans de Vries said:
Eugene,

When will you finally read those first two pages of Jun John Sakurai's chapter 3?

Hi Hans,

Following your suggestion I went to the library and found Sakurai's book. I wasn't impressed. Like many other authors Sakurai takes a heuristic approach to relativistic quantum mechanics, like "deriving" KG equation by analogy from E^2 = m^2c^4 + p^2c^2. This kind of guessing was forgivable in 1927, when people didn't know exactly what quantum mechanics is about. But we are in 2007 now. I think we can demand a more rigorous derivation of our theories from fundamental axioms of relativity and quantum mechanics.


Hans de Vries said:
It shows that the probability density transforms as the 0th component of the
four vector probability current density:

\frac{i}{2m}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

There are certain laws of quantum mechanics that we are not allowed to violate. For example, if you want to find a wavefunction corresponding to a state vector |\Psi \rangle you need to perform the following steps:

1. Specify a full set of mutually commuting operators (F_1, F_2, F_3, \ldots) in the Hilbert space

2. Find common eigenvectors |f_1, f_2, f_3, \ldots \rangle of this set

3. Then the wave function is

\psi(f_1, f_2, f_3, \ldots) = \langle f_1, f_2, f_3, \ldots | \Psi \rangle

4. and the probability (density) is

|\psi(f_1, f_2, f_3, \ldots)|^2

The same rules are valid for position-space wavefunctions. Your "probability current densities" are not built by these rules. So, you'll have to do some extra work to justify that they can be interpreted as probability densities. The fact that they satisfy the continuity equation is not a proof.

Eugene.
 
  • #295
Hans de Vries said:
Sakurai also proofs very simply the unitarity of the probability density of
the Klein Gordon equation.

Given the probability current density:


s_\mu\ =\ \frac{i}{2m}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

We have the requirement for unitarity that the four divergence of this
vanishes. (The continuity relation)

No, this is not the definition of unitarity in quantum mechanics. According to Wigner theorem, the preservation of probabilities requires that state vectors are transformed by unitary operators. If you want to be consistent with quantum mechanics, you should prove that your "four divergence" definition agrees with Wigner theorem. Otherwise, you are not doing quantum mechanics. This is not a crime, of course. Anybody is allowed to look for alternatives to QM. But, at least, you should acknowledge that that's what you are doing.

Eugene.
 
  • #296
meopemuk said:
Hi Hans,

Following your suggestion I went to the library and found Sakurai's book. I wasn't impressed.

Sakurai work is a classic and represents the mainstream ideas in relativistic
quantum mechanics.

meopemuk said:
There are certain laws of quantum mechanics that we are not allowed to violate. For example,
4. and the probability (density) is

|\psi(f_1, f_2, f_3, \ldots)|^2

The relativistic expression does just this! Just use the prescribed substitution
\phi=\Psi \exp(-imc^2t/\hbar) in the relativistic expression: (explicitly expressing h and c)

j_\mu\ =\ \frac{i\hbar}{2mc^2}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

And you see that it becomes the classic expression for the Schrödinger
equation. The probability density HAS to be the 0th component of a four-
vector probability current density TO BE ABLE to hold under Lorentz
transformation.


Regards, Hans
 
  • #297
meopemuk said:
No, this is not the definition of unitarity in quantum mechanics. According to Wigner theorem, the preservation of probabilities requires that state vectors are transformed by unitary operators. If you want to be consistent with quantum mechanics, you should prove that your "four divergence" definition agrees with Wigner theorem. Otherwise, you are not doing quantum mechanics. This is not a crime, of course. Anybody is allowed to look for alternatives to QM. But, at least, you should acknowledge that that's what you are doing.

Eugene.

This is a well established mainstream expression in relativistic quantum
mechanics!

I'm sure Wigner did agree with this and not with your ideas.Regards, Hans
 
  • #298
Hans de Vries said:
The probability density HAS to be the 0th component of a four-
vector probability current density TO BE ABLE to hold under Lorentz
transformation.

Do you know for sure if the total Klein-Gordon charge remains constant in Lorentz transformations?

On the other hand, do you know about the same question concerning quantity
<br /> \int d^3x |\phi|^2<br />
Does the normalization change in Lorentz transformations?

I cannot see easily why the probability density should be a first component of the four vector, instead of being scalar. Can it be proven with some calculation reasonably?

Hopefully Hans isn't already answering (I'm editing this here:). I can see that probability current should be a four vector so that the continuity equation gets satisfied manifestly in all frames. However, I just showed that also the normalization of |\phi|^2 remains constant in positive frequency time evolution, so I cannot see what's the problem with such scalar probability density.

In my previous questions I started thinking about the total charge and normalization in Lorentz transformations. If the discussion wasn't about then in the first place, then my comments were perhaps a bit confusing.
 
Last edited:
  • #299
Hans de Vries said:
Just use the prescribed substitution
\phi=\Psi \exp(-imc^2t/\hbar) in the relativistic expression: (explicitly expressing h and c)

j_\mu\ =\ \frac{i\hbar}{2mc^2}\left( \phi^*\frac{\partial \phi}{\partial x_\mu} - \frac{\partial \phi^*}{\partial x_\mu}\phi \right)

And you see that it becomes the classic expression for the Schrödinger
equation.


The time dependence \phi=\Psi \exp(-imc^2t/\hbar) is characteristic for eigenfunctions of the energy operator, for which the probability density is time-independent. This time dependence cannot be valid in the general case.

Hans de Vries said:
The probability density HAS to be the 0th component of a four-
vector probability current density TO BE ABLE to hold under Lorentz
transformation.

Can you define the exact meaning of "to hold under Lorentz transformation"?
Can you prove your statement as a theorem? What would be your axioms in this case?

Eugene.
 
Last edited:
  • #300
Hans de Vries said:
This is a well established mainstream expression in relativistic quantum
mechanics! :

Hi Hans,

I agree that what you wrote is "mainstream". I disagree that it is "well-established". Textbooks usually justify KG equation, "probability current densities", etc. by handwavings and vague analogies. Those who complain will be told that one-particle relativistic quantum mechanics does not make sense anyway, and should be replaced by the full-blown QFT approach. Or that \mathbf{r} should not be interpreted as position, because position is not measurable in the relativistic world anyway. There are many ways to weasel out of hard questions. That's why I am insisting on an axiomatic approach.

Eugene.
 
Back
Top