Orthonormality of a complete set of eigenvectors

burakumin · Jan 22, 2014

hello

How to you rigorously express the orthonormality of a complete set of eigenvectors [itex](|q\rangle)_q[/itex] of the position operator given that these are necessarily generalized eigenvectors (elements of the distribution space of a rigged hilbert space)?
The usual unformal condition [itex]\langle q|q'\rangle=\delta(q-q')[/itex] does not make sense as inner product is not defined for a pair of these vectors.

thank you

ChrisVer · Jan 22, 2014

you have some eigenvectors, and you want an orthonormality relation between them.
To do that, you have to define an inner product.
Once you define the inner product, you want it to be equal to 1.
Once you impose that, you'll get the delta of Kroenicker or Dirac's delta...
Is there something unclear?

George Jones · Jan 22, 2014

burakumin said:

hello

How to you rigorously express the orthonormality of a complete set of eigenvectors [itex](|q\rangle)_q[/itex] of the position operator given that these are necessarily generalized eigenvectors (elements of the distribution space of a rigged hilbert space)?
The usual unformal condition [itex]\langle q|q'\rangle=\delta(q-q')[/itex] does not make sense as inner product is not defined for a pair of these vectors.

If you want to do this rigourously, you should look at rigged Hilbert spaces (also called Gelfand triples).

dextercioby · Jan 22, 2014

Burakumin, the topological dual of a Hilbert space is a Hilbert space. That's where the whole rigged Hilbert space story (or if you prefer enchilada) starts.

strangerep · Jan 22, 2014

burakumin said:

How to you rigorously express the orthonormality of a complete set of eigenvectors [itex](|q\rangle)_q[/itex] of the position operator given that these are necessarily generalized eigenvectors (elements of the distribution space of a rigged hilbert space)?
The usual unformal condition [itex]\langle q|q'\rangle=\delta(q-q')[/itex] does not make sense as inner product is not defined for a pair of these vectors.

The inner product is "extended" to be what you wrote (so-called "delta normalization"), in which case it becomes "distribution-valued" on those elements -- although such an extension does not necessarily encompass arbitrary elements of the distribution space. The primary purpose of this is so that arbitrary vectors in the small space (Schwartz space) can be expanded in terms of the ##|q\rangle##'s.

In that sense, the "delta normalization" is already rigorous provided one recognizes the nature (and limitations) of this extended meaning of "inner product".

burakumin · Jan 23, 2014

George Jones said:

If you want to do this rigourously, you should look at rigged Hilbert spaces (also called Gelfand triples).

As you might have noticed, my question already mentions rigged hilbert spaces. So yes I have already read docs about this topic (for example Quantum Mechanics beyond Hilbert space). But I haven't find anything about orthonormality in this context. Do you have references?

strangerep said:

The inner product is "extended" to be what you wrote (so-called "delta normalization"), in which case it becomes "distribution-valued" on those elements -- although such an extension does not necessarily encompass arbitrary elements of the distribution space. The primary purpose of this is so that arbitrary vectors in the small space (Schwartz space) can be expanded in terms of the ##|q\rangle##'s.

In that sense, the "delta normalization" is already rigorous provided one recognizes the nature (and limitations) of this extended meaning of "inner product".

Thanks strangerep. I can perfectly accept that a notion is extended so that it encompasses new situations. But this requires a general definition. Even if it might not be possible for all elements in the distribution space, it must at least be defined for a subset. And we agree that formula [itex]\langle q|q'\rangle=\delta(q-q')[/itex] is certainly not a definition, right? The only reference Google gives me for "distribution-valued inner product" is a discussion on physicsforums. "delta normalization" gives more results but the few I've read only seems to describe the formal aspect, sweeping the mathematical aspect under the carpet. Do you already know better (=mahematically rigorous) references?

Thank you

vanhees71 · Jan 23, 2014

A good book, dealing with the formal aspects of quantum theory is

A. Galindo and P. Pascual. Quantum Mechanics. Springer Verlag, Heidelberg, 1990. 2 Vols.

burakumin · Jan 23, 2014

vanhees71 said:

A good book, dealing with the formal aspects of quantum theory is

A. Galindo and P. Pascual. Quantum Mechanics. Springer Verlag, Heidelberg, 1990. 2 Vols.

Thanks vanhees71, but does it specifically deal with the orthonormality issue ?

I've found this document : http://www.researchgate.net/publication/210189116_Dirac-orthogonality_in_the_space_of_tempered_distributions/file/50463529fdaae0189d.pdf. At least its existence proves that the topic requires investigations.

strangerep · Jan 23, 2014

burakumin said:

I can perfectly accept that a notion is extended so that it encompasses new situations. But this requires a general definition. Even if it might not be possible for all elements in the distribution space, it must at least be defined for a subset. And we agree that formula [itex]\langle q|q'\rangle=\delta(q-q')[/itex] is certainly not a definition, right?

Not exactly. I think you should switch to reading about the Schwartz theory of distributions, since this gives a more rigorous treatment of the underlying ideas.

E.g., if the bra and ket are the functions ##e^{-iqx}## and ##e^{iq'x}## and ##\langle q|q'\rangle## is understood to mean:
$$
\int e^{-iqx} e^{iq'x} dx
$$
then this is a well-known tempered distribution known as the Dirac delta distribution.

The Dirac bra-ket notation tends to obscure the fact that one is really working in a framework of dual pairs here, and not necessarily a true inner product. (For the finite-dimensional case, this distinction is often lost, of course.)

Rigged Hilbert space can be regarded as a generalization of Schwartz distribution theory, so one should acquire proficiency in the latter first. I like Appendix A of Nussenzveig's book "Causality & Dispersion Relations", but this is more of an extended summary than an extensive rigorous treatment.

Edit 1: Take those papers by David Carfi with a grain of salt. I don't think his other papers achieve what he claims.

Edit 2: This paper by Rafael de la Madrid gives another introduction to RHS. Searching Google (Scholar?) should also lead you to his PhD thesis which contains a lot more detail. But... my advice is to become familiar with distribution theory first.

George Jones · Jan 23, 2014

burakumin said:

As you might have noticed, my question already mentions rigged hilbert spaces. So yes I have already read docs about this topic (for example Quantum Mechanics beyond Hilbert space). But I haven't find anything about orthonormality in this context. Do you have references?

I don't have these books with me right now, and I can't remember if they do what you want, but, in the past, I wrote

George Jones said:

For a rigourous overview of rigged Hilbert spaces (Gelfand triples) and Dirac notation, I recommend highly sections 11.2, 11.3, and 12.2 from Quantum Field Theory I: Basics in Mathematics and Physics (A Bridge Between Mathematicians and Physicists) and subsection 7.6.4 from Quantum Field Theory II: Quantum Electrodynamics (A Bridge Between Mathematicians and Physicists) by Eberhard Zeidler.

burakumin · Jan 28, 2014

George Jones said:

I don't have these books with me right now, and I can't remember if they do what you want, but, in the past, I wrote

Ok, I'll try to find these references.

strangerep said:

Not exactly. I think you should switch to reading about the Schwartz theory of distributions, since this gives a more rigorous treatment of the underlying ideas.

Thanks but I'm not sure there's a problem on this aspect. I studied distribution theory far before I learned any basic concept of quantum mechanics (the former was part of my engineering student curriculum). And as far as I can remember there is no notion of orthogonality within the distribution space itself.

strangerep said:

E.g., if the bra and ket are the functions ##e^{-iqx}## and ##e^{iq'x}## and ##\langle q|q'\rangle## is understood to mean:
$$
\int e^{-iqx} e^{iq'x} dx
$$

I don't think this expression is valid in distribution theory. The fact that an expression looks meaningful does not prove it actually is. If we consider these two objects as functions, this integral does not converge. As distributions, this integral is meaningless because distribution theory does not define any scalar-valued product of two distributions. And I do not think you can define spaces where this precise pair can be interpreted as a distribution and a test function as neither is "nice enough" (rapidly decreasing) to be a test function.

Now I was wondering if you were thinking of the product of distributions, the result of which is a distribution again. This product can sometimes be defined and it is a well know fact that it is impossible in general. But as Mr Carfi's article points it, we cannot be looking for the product of two distributions in this context. We must look for the product of two families (which is in fact already the case for the loose formula ⟨q|q′⟩=δ(q−q′) ). It is arguably a rather different concept from the standard inner product or even from the "duality" product of a distribution and a test function.

strangerep said:

Edit 1: Take those papers by David Carfi with a grain of salt. I don't think his other papers achieve what he claims

Thank you for the warning but I don't intend to read anything else from Mr Carfi. So my only question is "do you consider this article wrong/invalid?" I may be wrong but his approach looks rigorous to me (but we agree formal appearance proves nothing).

strangerep · Jan 28, 2014

burakumin said:

I studied distribution theory far before I learned any basic concept of quantum mechanics (the former was part of my engineering student curriculum).

OK, good. But did you also study the theory of Fourier transforms in the more general context of distributions? From your other remarks, it seems not.

And as far as I can remember there is no notion of orthogonality within the distribution space itself.

Yes -- that's sort-of what I've been saying. Although one can define a "Dirac delta normalization" for particular distributions, it fails to extend sensibly to the entire space of distributions.

strangerep said:

E.g., if the bra and ket are the functions ##e^{-iqx}## and ##e^{iq'x}## and ##\langle q|q'\rangle## is understood to mean: $$\int e^{-iqx} e^{iq'x} dx$$

I don't think this expression is valid in distribution theory.

Then you need to study distribution theory at a more advanced level. The integral I gave above is ##2\pi \delta(q-q')##.

The fact that an expression looks meaningful does not prove it actually is. If we consider these two objects as functions, this integral does not converge.

It does have meaning as a distribution. (Sorry, but this is kinda basic.)

As distributions, this integral is meaningless because distribution theory does not define any scalar-valued product of two distributions. And I do not think you can define spaces where this precise pair can be interpreted as a distribution and a test function as neither is "nice enough" (rapidly decreasing) to be a test function.

(Sigh.) It has the properties of the Dirac delta distribution. Again, this is elementary.

Now I was wondering if you were thinking of the product of distributions, the result of which is a distribution again. This product can sometimes be defined and it is a well know fact that it is impossible in general.

I know this, of course. But any regular function (of x, say) which grows at infinity no faster than a polynomial, (i.e., a so-called function of "slow growth"), qualifies as a distribution (since when integrated against a Schwartz function, the result is finite). Such functions can of course be multiplied, and the product also grows no faster than a polynomial, hence also qualifies as a distribution. However, if one takes their respective Fourier transforms (which are still distributions), these cannot be simply multiplied (but rather must be convolved).

Do you have access to a University library? If so, try Appendix A of H.M. Nussenzveig's textbook "Causality and Dispersion Relations". It presents a lot of useful material on distributions quite efficiently.

So my only question is "do you consider [Carfi's] article wrong/invalid?" I may be wrong but his approach looks rigorous to me (but we agree formal appearance proves nothing).

It's been a while since I studied them, and I'm disinclined to put any more time into that. The reason I used the phrase "grain of salt" earlier is that his treatment of generalized spectral theorems don't actually show that every operator on the space of distributions can be decomposed in the way he describes.

burakumin · Jan 30, 2014

strangerep said:

OK, good. But did you also study the theory of Fourier transforms in the more general context of distributions? From your other remarks, it seems not.

Then you need to study distribution theory at a more advanced level. The integral I gave above is ##2\pi \delta(q-q')##.

It does have meaning as a distribution. (Sorry, but this is kinda basic.)

(Sigh.) It has the properties of the Dirac delta distribution. Again, this is elementary.

I know this, of course. But any regular function (of x, say) which grows at infinity no faster than a polynomial, (i.e., a so-called function of "slow growth"), qualifies as a distribution (since when integrated against a Schwartz function, the result is finite). Such functions can of course be multiplied, and the product also grows no faster than a polynomial, hence also qualifies as a distribution. However, if one takes their respective Fourier transforms (which are still distributions), these cannot be simply multiplied (but rather must be convolved).

Hi strangerep,

I now understand more precisely what you are talking about. And all I can say is that IMHO you are mislead. I'm going to try to explain why.

If I’m correct, you are interpreting the expression ##\int \exp(2 \pi i q x) \cdot \exp(-2 \pi i q' x) \cdot dx## as ##\mathcal{F}1(q'-q)## with ##\mathcal{F}## the Fourier transform and 1 the constant function equal to 1. This interpretation is, in a way, possible but you seem to forget that this is an abuse of notation. How do you define the Fourier transform of a tempered distribution ##T##? By showing that there is one and only one distribution (##\mathcal{F}T##) such that:

##\mathcal{F}T(\phi)= T(\mathcal{F}\phi)##

for all rapidly decreasing functions ##\phi##. There exists no integral definition for it. Now we both agree that it is correct that if ##T## is regular (associated with the function I will call ##\hat{T}## even if generally we intentionally do not make the distinction) and if the function ##\hat{T}## has a Fourier transform, ##\mathcal{F}T## is also a regular distribution, and is associated with the function ##t \mapsto \int \exp(-2 \pi i t x) \cdot \phi(x) \cdot dx##. This is not the case for function 1 !

We tend to use abuses of notations for their evocative properties but we all know that they are always tricky somewhere. One may write :

##\delta(t) = \int \exp(-2 \pi i t x) \cdot 1 \cdot dx##

but one should keep in mind that in distribution context, ##\int \exp(-2 \pi i t x) ... dx## is purely symbolic. There is no « actual » exponential function involved here. Just as we all know that ##\frac{\partial ...}{\partial ...}## is a single symbol and certainly not a fraction.

Now if we leave Fourier transforms and choose some ##t##, what can we say about the function ##E_t : x \mapsto \exp(2 \pi i t x)##? It is not even a member of ##\mathcal{L}_2(\mathbb{R})## so it cannot be a test function. Can we interpret it as a tempered distribution ? We know that we can, as the distribution :

##E_t(\phi) = \int \exp(2 \pi i t x) \cdot \phi(x) \cdot dx##

But this expression only makes sense if ##\phi## is rapidly decreasing function. So according to distribution theory, an expression like :

##E_q(x \mapsto \exp(-2 \pi i q’ x)) = \int \exp(2 \pi i t q) \cdot \exp(-2 \pi i q’ x) \cdot dx##

just does not make sense and it has nothing to do with any Fourier transform. Similarly if one considers ##|q\rangle## to be an abstract generalized vector, distribution theory says ##\langle q|q’\rangle## is not a meaningful expression and Fourier transform is of no help here.

I said and I repeat that if we want a definition of orthonormality we cannot rely on a particular family of generalized eigenvectors (unlike you do when considering only Dirac deltas or only complex exponentials). We must think of a undefinite family, potentially containing weird elements (for example Dirac combs or whatever you want) and define concepts that can apply to all families (or at least enough families). This is precisely what Mr Carvi is trying to do and (I suppose) why he has felt necessary to write an article about it. This article would be totally pointless if the problem could be solved as easily as you pretend, right? It might be, but I seriously doubt it and the previous fact should incite to caution.

strangerep · Jan 30, 2014

burakumin said:

If I’m correct, you are interpreting the expression ##\int \exp(2 \pi i q x) \cdot \exp(-2 \pi i q' x) \cdot dx## as ##\mathcal{F}1(q'-q)## with ##\mathcal{F}## the Fourier transform and 1 the constant function equal to 1.

That's not what I said. Hence no need to reply to most of the rest of your post.

I said and I repeat that if we want a definition of orthonormality we cannot rely on a particular family of generalized eigenvectors (unlike you do when considering only Dirac deltas or only complex exponentials).

Again, you create here a straw man. I don't respond to straw men.

burakumin · Feb 3, 2014

strangerep said:

That's not what I said. Hence no need to reply to most of the rest of your post.

I'm very sorry if I have misinterpreted your comments. That was not intentional. But in that case I sincerely do not understand your point.

strangerep said:

Do you have access to a University library? If so, try Appendix A of H.M. Nussenzveig's textbook "Causality and Dispersion Relations". It presents a lot of useful material on distributions quite efficiently.

I do not have access to a university library but I could finally find this book and read the appendix, just in case I would have ignored some parts of the theory. It is true that I didn’t know of ##\mathcal{E}’##, the space of distribution with compact support and that Fourier transform can be expressed slightly more easily than in ##\mathcal{S}’##. But except that, it contains the standard framework of distribution theory and I cannot see how these usual concepts can be trivially used to express orthonormality, except again by identifying distinct object by means of abuses of notations.

strangerep said:

Again, you create here a straw man. I don't respond to straw men.

I may be too stupid to understand, who knows? But at least contrary to what you seem to pretend, I’m not dishonest.

strangerep · Feb 3, 2014

burakumin said:

strangerep said:

[...] straw man [...]

But at least contrary to what you seem to pretend, I’m not dishonest.

You're doing the straw man thing again: you formed an incorrect extrapolated interpretation of something I said, and then you react to that interpretation as if it was fact. But in reality I never thought that you are dishonest.

Take another look at the "structure" section of the Wikipedia link I gave, i.e., straw man.

One way to avoid building straw men is, instead of reacting defensively to your interpretation of something I say, you could ask further questions to clarify what I said. This gives the conversation a better chance of proceeding constructively.

In any case, this thread seems to be going nowhere, so if you wish to continue constructively, you could perhaps re-state clearly whatever questions still remain.

burakumin · Feb 4, 2014

strangerep said:

In any case, this thread seems to be going nowhere, so if you wish to continue constructively, you could perhaps re-state clearly whatever questions still remain.

I hope you will admit that it is difficult for me to "re-state clearly whatever questions still remain" as our point of disagreement is that you are pretending something is possible (orthonormality of a family of generalized eigenvectors can be stated within the distribution theory in an elementary way) and I’m pretending it is not. I cannot ask precise questions on your position as according to you I don't understand it.

So we can try to restart from the very beginning and try to identify where we disagree. I’m a bit worried that we might not be on the same wavelength and that this conversation remains a long mutual misunderstanding, but at least I want to try. First I would like to expose general fact about the context so we can check we are speaking about the same thing. Tell me if you don't consider this Worth it and we will close the discussion.

We have a hilbert space. As done before, let's consider it is ##\mathcal{L}_2(\mathbb{R})## for simplification. We have a pair of test function space and distribution space associated with it. I think ##(\mathcal{S},\mathcal{S}')=(\mathcal{S}( \mathbb{R} ),\mathcal{S}'( \mathbb{R} ))##, i.e. rapidly decreasing functions and tempered distributions, is the usual choice in this context but feel free to tell me if another pair is more appropriate. I propose, following Mr. Nussenzveig, to use notation ##(T,\phi)## for the duality product of the distribution ##T## with the test function ##\phi##.

To my knowledge we do not have any inner product on ##\mathcal{S}'## (= function ##\mathcal{S}' \times \mathcal{S}' \mapsto \mathbb{C}## antilinear on first argument and linear on second) but we have some operations:

a partial product of distributions ##T\cdot U \in \mathcal{S}'## (partial because it might make no sense for arbitrary pairs of tempered distributions)
a partial convolution of distributions ##T * U \in \mathcal{S}'## (partial for the same reason)
a Fourier transform that is an isomorphism of ##\mathcal{S}'## and such that ##\mathcal{F}(T * U) = \mathcal{F}T \cdot \mathcal{F}U## and ##\mathcal{F}(T \cdot U) = \mathcal{F}T * \mathcal{F}U## whenever these products make sense
a tensor product of distributions ##T\otimes U \in \mathcal{S}'(\mathbb{R}^2)##

Let’s suppose now that we are given a real-indexed family ##(T_q)_{q\in\mathbb{R}}## of elements of ##\mathcal{S}’##. To remain general I prefer not to suppose this family is a set of eigenvectors of some operator. What I am looking for is a way to extend the usual definition of orthonormality so that one can decide if ##(T_q)_{q\in\mathbb{R}}## is or is not orthonormal. A priori this does not necessarily requires that one must extend the notion of hilbert inner product but it is a possibility.

I consider this issue to be non-trivial if it must be solved in a mathematically rigorous way. I also consider Mr. Carfi’s article to be a plausible manner of solving it. You claim that it is useless because there exists a trivial manner to solve it. Am I wrong somewhere on all previous statements?

Now what I understand about your opinion (and that is certainly wrong but I do my best to understand) is that according to you:

there exists a sort of partial product between families such that ##(T_q)_{q\in\mathbb{R}}\cdot (U_{q’})_{q’\in\mathbb{R}}## is a distribution again (partial again because it might not exist for arbitrary pairs of families)
this operation is just a trivial consequence of previously listed operations
the value of ##(T_q)_{q\in\mathbb{R}}\cdot (T_{q’})_{q’\in\mathbb{R}}## can be used to confirm or disconfirm the orthonormality of ##(T_q)_{q\in\mathbb{R}}##.
it is what is implied by informal physicist equation ##\langle q|q’\rangle = \delta(q-q’)##

Is there something misinterpretated here and where?

strangerep · Feb 4, 2014

burakumin said:

We have a hilbert space. As done before, let's consider it is ##\mathcal{L}_2(\mathbb{R})## for simplification. We have a pair of test function space and distribution space associated with it. I think ##(\mathcal{S},\mathcal{S}')=(\mathcal{S}( \mathbb{R} ),\mathcal{S}'( \mathbb{R} ))##, i.e. rapidly decreasing functions and tempered distributions, is the usual choice in this context but feel free to tell me if another pair is more appropriate. I propose, following Mr. Nussenzveig, to use notation ##(T,\phi)## for the duality product of the distribution ##T## with the test function ##\phi##.

To my knowledge we do not have any inner product on ##\mathcal{S}'## (= function ##\mathcal{S}' \times \mathcal{S}' \mapsto \mathbb{C}## antilinear on first argument and linear on second) but we have some operations:

a partial product of distributions ##T\cdot U \in \mathcal{S}'## (partial because it might make no sense for arbitrary pairs of tempered distributions)

a partial convolution of distributions ##T * U \in \mathcal{S}'## (partial for the same reason)

a Fourier transform that is an isomorphism of ##\mathcal{S}'## and such that ##\mathcal{F}(T * U) = \mathcal{F}T \cdot \mathcal{F}U## and ##\mathcal{F}(T \cdot U) = \mathcal{F}T * \mathcal{F}U## whenever these products make sense

Yes (to all of the above).

[*] a tensor product of distributions ##T\otimes U \in \mathcal{S}'(\mathbb{R}^2)##

No. A tensor product would be in ##\mathcal{S}'(\mathbb{R}^2) \otimes \mathcal{S}'(\mathbb{R}^2)##, by definition.

Let’s suppose now that we are given a real-indexed family ##(T_q)_{q\in\mathbb{R}}## of elements of ##\mathcal{S}’##. To remain general I prefer not to suppose this family is a set of eigenvectors of some operator.

Well, this makes things more difficult, but let us continue. (BTW, since we're talking about the specific example of Schwartz space and tempered distributions, the usual operators of position and momentum are applicable here. But ok, let's proceed without mentioning those operators specifically.)

I'm also surprised that you now wish to exclude this family being a set of eigenvectors of some operator, since your original post in this thread specifically asked about orthnormality of eigenvectors of the position operator.

What I am looking for is a way to extend the usual definition of orthonormality so that one can decide if ##(T_q)_{q\in\mathbb{R}}## is or is not orthonormal. A priori this does not necessarily requires that one must extend the notion of hilbert inner product but it is a possibility.

Noted.

I consider this issue to be non-trivial if it must be solved in a mathematically rigorous way.

I also consider it non-trivial.

I also consider Mr. Carfi’s article to be a plausible manner of solving it. You claim that it is useless because there exists a trivial manner to solve it.

No, I do not say that. In the Carfi article that you linked earlier, he only gets to "Dirac orthonormality" near the end, in section 5. That section contains a definition and a couple of examples, but no theorems or other results. Thus he does not "solve" anything in that section. Nevertheless, I have no problem with Carfi's definition 5.1, since that's nothing new -- it's just a more rigorous statement of the usual physicist's "Dirac-delta orthogonality" concept, (and which the framework of Rigged Hilbert Space generalizes to other sets of operators besides position and momentum).

Now what I understand about your opinion (and that is certainly wrong but I do my best to understand) is that according to you:

[*] there exists a sort of partial product between families such that ##(T_q)_{q\in\mathbb{R}}\cdot (U_{q’})_{q’\in\mathbb{R}}## is a distribution again (partial again because it might not exist for arbitrary pairs of families)

More precisely, I say that there might exist a partial product on such families. Carfi's examples show such a case. (If one re-admits operators into the discussion, and the family constitutes a set of generalized eigenvectors of suitably-extended self-adjoint operators, then one could say more. This is the content of the Gelfan-Maurin nuclear spectral theorem.)

[*] this operation is just a trivial consequence of previously listed operations

No, I do not claim that.

the value of ##(T_q)_{q\in\mathbb{R}}\cdot (T_{q’})_{q’\in\mathbb{R}}## can be used to confirm or disconfirm the orthonormality of ##(T_q)_{q\in\mathbb{R}}##.

I do not claim that in the context of this discussion, since you have excluded discussion of operators.

it is what is implied by informal physicist equation ##\langle q|q’\rangle = \delta(q-q’)##

I do not claim that in the context of this discussion, since you have excluded discussion of operators.

If you want to know what I do think about the concept of Dirac-delta orthogonality, I refer you to this paper on the RHS in QM by Rafael de la Madrid, specifically eqns(2.28a-d), and the context leading up to them. If you do a search for his other works on Google Scholar, you can also find his PhD Thesis in which these things are discussed more extensively.

burakumin · Feb 6, 2014

strangerep said:

No. A tensor product would be in ##\mathcal{S}'(\mathbb{R}^2) \otimes \mathcal{S}'(\mathbb{R}^2)##, by definition.

Would I be misinterpreting you again if I suggested that there is a typo and that you certainly mean ##\mathcal{S}'( \mathbb{R})\otimes \mathcal{S}'( \mathbb{R})## ? That said, to my knowledge ##\mathcal{S}'( \mathbb{R})\otimes \mathcal{S}'( \mathbb{R})## canonically injects in ##\mathcal{S}'( \mathbb{R}^2)## using injection ##i## such that ##(i(T\otimes U), \chi) = (T, x \mapsto (U, y\mapsto \chi(x,y)))## with ##\chi\in\mathcal{S}( \mathbb{R}^2)##. But I guess tensor product of distributions might not be the most important tool for this discussion anyway.

strangerep said:

Well, this makes things more difficult, but let us continue. (BTW, since we're talking about the specific example of Schwartz space and tempered distributions, the usual operators of position and momentum are applicable here. But ok, let's proceed without mentioning those operators specifically.)

I'm also surprised that you now wish to exclude this family being a set of eigenvectors of some operator, since your original post in this thread specifically asked about orthnormality of eigenvectors of the position operator.

Actually this was intended because I precisely don’t want that we focus on a specific operator. My initial question referred to the position operator but that was just an example and I never intended to focus on it. I was looking for a general definition and to me [itex]\langle q|q'\rangle=\delta(q-q')[/itex] is just an example. So your answer

strangerep said:

Not exactly. [...]

was (and is still) pretty confusing to me.

If needed it is possible to consider an abstract operator ##X## (densily defined and (essentially) self-adjoint) and the set ##B_X## of all possible complete families of eigenvectors. I suppose that the way to go may be to define the distribution-valued product ##(T_q)_{q\in\mathbb{R}}\cdot (T_{q’})_{q’\in\mathbb{R}}## at least for some families ##(T_q)_{q\in\mathbb{R}} \in B_X## and among them to select the orthonormal ones based on the result of this product. What is still not clear to me is how to define the product in the general case (at least as often as it can makes sense) and what value is expected for orthonormal families.

If you want to know what I do think about the concept of Dirac-delta orthogonality, I refer you to this paper on the RHS in QM by Rafael de la Madrid, specifically eqns(2.28a-d), and the context leading up to them. If you do a search for his other works on Google Scholar, you can also find his PhD Thesis in which these things are discussed more extensively.

I’m not sure to understand how these equations could be an answer. From what I understand after a glance at Mr de la Madrid’s article, it looks more like « we would like objects that formally behave like this » rather than defining them. That said, I’ve downloaded and started to read his phd. His mathematical focus and care for rigor have been very enjoyable so far so I plan to read it entirely. I guess it’s better that I come back with precise questions after I’ve finished.

strangerep · Feb 6, 2014

burakumin said:

Would I be misinterpreting you again if I suggested that there is a typo and that you certainly mean ##\mathcal{S}'( \mathbb{R})\otimes \mathcal{S}'( \mathbb{R})## ?

Oops. Yes, it was a typo.

I’ve downloaded and started to read his phd. His mathematical focus and care for rigor have been very enjoyable so far so I plan to read it entirely. I guess it’s better that I come back with precise questions after I’ve finished.

Yes. It would be better to discuss the content of specific references, rather than my attempted answers that are necessarily brief and incomplete.

BTW, if you want more completeness and rigor, try Gel'fand & Vilenkin, vol 4.

burakumin · Feb 17, 2014

I've finally read parts of Mr de la Madrid's phd (at least I hope what was needed for my problem). To be honest, I'm a bit skeptic about the manipulation of some of his equations. If we take formula 3.5.15, it asserts that

##|\phi\rangle = \int |x\rangle \langle x|\phi\rangle d\mu(x)##

Then we can say that

##(|x\rangle, |\phi\rangle) = \left( |x\rangle, \int |y\rangle \langle y|\phi\rangle d\mu(y) \right) ##

But I do not follow Mr de la Madrid when he deduces 3.5.35

##(|x\rangle, |\phi\rangle) = \int (|x\rangle,|y\rangle) \langle y|\phi\rangle d\mu(y)##

To me it's equivalent to the situation of considering a linear form ##F## on ##\mathbb{R}^2\times \{0\}##, two vectors ##a=(1,0,-1)## and ##b=(-2,2, 1)## of ##\mathbb{R}^3## and pretending that because ##F(a+b)## is a defined quantity so must be ##F(a)+F(b)## and that both are then equal.

I don't mean that it is impossible to develop a framework where this makes sense. I mean that Mr. de la Madrid's approach consists here in proceeding as if the step were allowed a priori and in trying to reinterpret his new formula a posteriori so that symbols can fit something apparently meaningful: "Equation (3.5.38) therefore says that the mathematical quantity ##\langle x|y \rangle## has the property that [...]". Furthemore, the nature of this post-interpreted quantity (3.5.39: ##\langle x|y \rangle = \delta(x-y)##) is not very clear. What are ##|x \rangle## and ##|y \rangle## ? You cannot really interprete them as the eigenvector for some given real numbers ##x## and ##y## because that would suggest that something like ##\langle 5|5 \rangle = \delta(0)## is meaningful. So at least either one of these symbols represent a family. ##|x \rangle## ? ##|y \rangle## ? Both ? Note that if both are families ##\delta(x-y)## has to be interpreted as a distribution of ##\mathcal{S}'( \mathbb{R} ) \otimes \mathcal{S}'( \mathbb{R} )##. In the end I just feel that Mr de la Madrid does not care so much about this possible ambiguity.

That said, I have actually realized that equation 3.5.94 of the Gelfand Maurin theorem

##(\psi, \phi) = \int \langle \phi | \lambda \rangle \langle \lambda | \psi \rangle d\mu(\lambda)##

might be sufficient to express orthonormality (provided ##\mu## has known form). Indeed, if ##(|v_i \rangle)_{0<i\leq n}## is a base of a n-dimentional Hilbert space, an equivalent condition for orthonormality is that for all ## | \phi \rangle## and ## | \psi \rangle## we have ## \langle \psi | \phi \rangle = \sum_i \langle \psi | v_i \rangle \langle v_i | \phi \rangle ##. If I'm not mislead by this fact, I'm not sure there is so much value in trying to give a meaning to 3.5.39. I don't know if we would agree on this but at least I feel that 3.5.94 offers a valid orthonormality criteria.

Now, if you allow me I have an additional question. In fact I didn't think so much about what follows in the beginning so I erroneously considered that my only problem was with defining orthonormality when it was just a part of it. Following Ballentine that you recommended me in another thread, the position representation of a state ##| \phi \rangle## is defined as the square integrable function ##\phi(x) = \langle x | \phi \rangle ## with ##(\langle x |)_{x \in \mathbb{R}}## an (orthonormal) complete family of generalized eigenvectors of ##Q## (in one dimension). But if ##(\langle x |)_{x \in \mathbb{R}}## is a valid family, I cannot see why ##(\widehat{\langle x|})_{x \in \mathbb{R}} = \left(e^{i\theta(x)}\cdot\langle x |\right)_{x \in \mathbb{R}} ## would not satisfy the Gelfand Maurin theorem for any real function ##\theta##. This would make phase and phase variation (and then Fourier transform) completely irrelevant. What am I missing?

strangerep · Feb 17, 2014

burakumin said:

I've finally read parts of Mr de la Madrid's phd (at least I hope what was needed for my problem). To be honest, I'm a bit skeptic about [...]

Perhaps you should try to allocate time to study the whole thesis carefully? It's easy to be skeptical about something one has not yet studied thoroughly.

[...] I do not follow Mr de la Madrid when he deduces 3.5.35 [...]

At the bottom of p95, Rafa says quite clearly:

Rafael de la Madrid said:

To explain the Gelfand-Maurin Theorem in detail requires much more mathematics. These mathematics are provided in Section 3.5.2. In this section, we just give an intuitive statement, which can be accepted in analogy to (3.5.10).

I.e., Rafa is just trying to provide an intuitive motivation (for physicists) of the basic ideas and notations for Rigged Hilbert Space, in a way that relates it to the more familiar (for physicists) Dirac bra-ket notation.

Thus, it is quite unfair to criticize him for not providing full rigor in that section 3.5.1, when he does offer a higher level of rigor in section 3.5.2.

burakumin said:

That said, I have actually realized that equation 3.5.94 of the Gelfand Maurin theorem
[...]
might be sufficient to express orthonormality (provided ##\mu## has known form).
[...]
at least I feel that 3.5.94 offers a valid orthonormality criteria.

Well, that's at least 1 step forward.

[...] I cannot see why ##(\widehat{\langle x|})_{x \in \mathbb{R}} = \left(e^{i\theta(x)}\cdot\langle x |\right)_{x \in \mathbb{R}} ## would not satisfy the Gelfand Maurin theorem for any real function ##\theta##. This would make phase and phase variation (and then Fourier transform) completely irrelevant. What am I missing?

Perhaps you are missing the fact that one is trying to find a space on which (arbitrary powers of) both ##Q## and ##P## are well-defined, and can be extended by duality to the (anti-)dual space. Perhaps you should study (all of) section 3.6 of Rafa's Phd Thesis carefully (if you have not already done so), to see how the Gelfand-Maurin theorem is applied in a simple case of the harmonic oscillator.

Orthonormality of a complete set of eigenvectors

1. What is the definition of orthonormality in the context of a complete set of eigenvectors?

2. How is orthonormality related to eigenvalues and eigenvectors?

3. Can a set of non-orthonormal eigenvectors still form a basis for a vector space?

4. How can you determine if a set of eigenvectors is orthonormal?

5. Why is orthonormality important in linear algebra and other fields of science?

Similar threads

Hot Threads

Recent Insights