Unbounded operators in non-relativistic QM of one spin-0 particle

Fredrik · Apr 7, 2009

strangerep said:

For QFT in infinite-dimensions, even the rigged Hilbert space is not big enough.
The singular behaviours arising from the interaction terms in the Hamiltonian
become far more pathological than mere delta distributions.

Interesting...and strange. The strange part is that we can actually make predictions in spite of all of this.

strangerep said:

Secondly, there are general theorems about how not all operators are sensible
observables. There's a little more on this at the end of a previous thread:
https://www.physicsforums.com/showthread.php?t=262821&page=2

I don't see anything about it in that thread.

but I recognize the claim from http://en.wikipedia.org/wiki/Density_matrix#C.2A-algebraic_formulation_of_states", but I don't really understand it. I think I understood one thing though. A self-adjoint operator is only an observable if it preserves each superselection sector. For example, it can't take a bosonic state to a fermionic state.

strangerep · Apr 7, 2009

Fredrik said:

I don't see anything about it in that thread.

OK. There's some papers by Gotay, e.g., paper math-ph/9809011,
that talk about this in excruciating detail. It's peripheral to the
main subject of the current thread, so you probably won't want
to wade through 50-pages of nontrivial math. Maybe just look
at p18: the paragraph starting with

In particular, one can see
at the outset that it is impossible for a prequantization to satisfy
the “product → anti-commutator” rule.

and the manipulations that follow. And maybe look through the
early sections to get the context.

EDIT: A better reference might be math-ph/9809015.

But like I said, this goes off on an tangent from the current thread.

strangerep · Apr 7, 2009

Fredrik said:

So what I had in mind is the topological dual of a vector space with the topology induced by the inner product. What's confusing me is the definition of continuity. It seems natural to me to define the standard inner product, and use the concept of continuity induced by it. But instead, we choose to define convergence of a sequence in D, and use that to define continuity. These definitions may be equivalent, but it's not obvious that they are.

I'm starting to lose track of what you're referring to, hence unable to offer a pinpoint
answer. Maybe you should re-summarize?

It gets even more weird in the case of tempered distributions. We have defined an infinite
number of norms, which should give us infinitely many kinds of continuity.

Note that each of these norms defines a subspace. I.e., the sequence of norms defines
a sequence of spaces, each densely nested in the previous (larger) one. So "continuity"
applies in the context of the norm topology on each subspace. The tempered distribution
case applies to the (dual of) the "inductive limit" of this sequence of spaces (afaiu).

Fredrik · Apr 7, 2009

strangerep said:

I'm starting to lose track of what you're referring to, hence unable to offer a pinpoint
answer. Maybe you should re-summarize?

OK, let's focus on "distributions" first, not "tempered distributions". We define the test function space (I'll call it \mathcal D) as the set of C^\infty functions from \mathbb R^n into \mathbb C that have compact support. I would like to define a "distribution" as a member of the dual space \mathcal D^*, defined as the set of continuous linear functions T:\mathcal D\rightarrow\mathbb C. But to do that, we need to define what "continuous" means. There are at least two ways to do that.

Option 1: Define the usual inner product. The inner product gives us a norm, and the norm gives us a metric. Now we can use the definition of continuity that applies to all metric spaces.

T:\mathcal D\rightarrow\mathbb C is continuous at g\in\mathcal D if for each \epsilon>0 there's a \delta>0 such that \|f-g\|<\delta\implies|T(f)-T(g)|<\epsilon. T is continuous on a set U\subset\mathcal D if it's continuous at each point in U. Alternatively, and equivalently, T is continuous on U\subset\mathcal D if T^{-1}(E) is open for every open E\subset\mathbb C.

Weird, it takes a lot less to cause a database error now than a couple of weeks ago. I'll continue in the next post.

Fredrik · Apr 7, 2009

Option 2: This is the option that every text on the subject seems to prefer, and I don't see why. We define what it means for a sequence in \mathcal D to be convergent by saying that \phi_n\rightarrow\phi if there's a compact set K in \mathbb R^n that contains the supports of all the \phi_n, and D^\alpha \phi_n converges uniformly on K to D^\alpha\phi, for all \alpha. Recall that D^\alpha is defined by

|\alpha|=\alpha_1+\cdots+\alpha_n

D^\alpha f(x)=\frac{\partial^{|\alpha|}}{\partial x_1^{\alpha_1}\cdots\partial x_n^{\alpha_n}}f(x)

Now we define "continuous" by saying that T:\mathcal D\rightarrow\mathbb C is continuous at \phi\in\mathcal D if

\lim_{n\rightarrow\infty}T(\phi_n)=T(\phi)

for every sequence \{\phi_n\} that converges to \phi. Maybe these two options are actually equivalent, but I would expect that they're not, mostly because I think option 1 is more natural and intuitive. I don't see why these authors would all go for option 2 if they are equivalent.

Fredrik · Apr 7, 2009

strangerep said:

Note that each of these norms defines a subspace. I.e., the sequence of norms defines
a sequence of spaces, each densely nested in the previous (larger) one. So "continuity"
applies in the context of the norm topology on each subspace. The tempered distribution
case applies to the (dual of) the "inductive limit" of this sequence of spaces (afaiu).

Ah, that's actually very helpful. I just re-read a few statements from the part of Streater & Wightman where they define tempered distributions, keeping in mind what you just said, and suddenly what they're saying makes a lot more sense.

Unfortunately I have to go to bed now, but I'll continue tomorrow.

Fredrik · Apr 8, 2009

The answer to my concerns in #34-35 (or at least a partial answer) is probably that there's nothing weird or unexpected about the fact that the concept of continuity depends on what topology you define on the set you're working with. The space of distributions isn't the dual space of the test functions. It's a dual space of the test functions. Or to be more accurate, the topological dual space of a set isn't defined. Only the topological dual space of a topological vector space is defined, so we have to choose a topology first, and then we get a dual space.

It still makes me wonder why their version of continuity is more desirable. It probably has something to do with the idea that derivatives of distributions should always exist, but I haven't thought that idea through yet.

Edit: I'm satisfied that I understand the definition of distributions and tempered distributions now, so we don't have to discuss them unless someone else wants to. I'm not saying that I understand every detail perfectly, but my understanding is good enough.

By they way, a thought occurred to me when I was replying to another thread. Isn't it weird to start the formulation of the simplest meaningful quantum theory (the non-relativistic QM of one spin-0 particle) by postulating that the states are rays in L^2(\mathbb R^3) when the next postulate says that the time evolution is given by the Schrödinger equation? The Schrödinger equation only makes sense if the partial derivatives exist, and that implies continuity, but L^2(\mathbb R) contains ridiculous functions like the one Hurkyl thought of in another thread to disprove a claim I made there:

Hurkyl said:

Consider the function:

 \psi(x) = \begin{cases} 0 & x < 1 \\ 1 & x \in [n, n + n^{-2}) \\ 0 & x \in [n + n^{-2}, n+1) \end{cases} 

where n ranges over all positive integers. \psi(x) does not converge to zero at +\infty. However,

 \int_{-\infty}^{+\infty} |\psi(x)|^2 \, dx = \sum_{n = 1}^{+\infty} \int_{n}^{n + n^{-2}} 1 \, dx = \sum_{n = 1}^{+\infty} n^{-2} = \pi^2 / 6

strangerep · Apr 9, 2009

Fredrik said:

Maybe these two options are actually equivalent, but I would expect that they're not, mostly because I think option 1 is more natural and intuitive. I don't see why these authors would all go for option 2 if they are equivalent.

First, you can probably separate out the differentiability stuff from the topological
continuity stuff.

I think (in the absence of precise definitions of those two cases) that both notions
of convergence are equivalent. The first is just saying that you can construct a
Cauchy sequence and the second is saying that a sequence converges.
(Topologically, a sequence "converges to a point z" if, given an arbitrary open
set O containing z, all points of the sequence are eventually "in" O -- after some
integer n in the sequence.

The thing about restricting to infinitely differentiable functions is so that you
can do integral calculus, i.e., solve a DE given an initial condition, in such
a way that as to be compatible with the (rigged) Hilbert space structure.

the concept of continuity depends on what topology you define on
the set you're working with

Yes. This is a crucial point to understand in order to work with other
topologies (weak, weak*, and others). Generally speaking, convergence
is easier in weaker (coarser) topologies, but more theorems can be
proven with stronger (finer) topologies. Bit of a tradeoff depending on
exactly what one is trying to achieve.

Fredrik · Apr 9, 2009

I think I'm going to have to study some more functional analysis before I post a bunch of new questions. I'll read some more of Conway first, and maybe I'll read Maurin's lecture notes. Strangerep and Jostpuur, thanks for the answers so far. George, feel free to bump the thread when you're less busy. I'm interested in what you have to say about all of this.

dextercioby · Apr 12, 2009

Brilliant discussion on a very interesting topic, I must say. I was wondering, how would I build the RHS for the hydrogen atom ? The hamiltonian for the H atom has a mixed spectrum and I've never seen an application of the RHS formalism for an operator with mixed spectrum.

jostpuur · Apr 12, 2009

Fredrik said:

OK, let's focus on "distributions" first, not "tempered distributions". We define the test function space (I'll call it \mathcal D) as the set of C^\infty functions from \mathbb R^n into \mathbb C that have compact support. I would like to define a "distribution" as a member of the dual space \mathcal D^*, defined as the set of continuous linear functions T:\mathcal D\rightarrow\mathbb C. But to do that, we need to define what "continuous" means. There are at least two ways to do that.

Option 1: Define the usual inner product. The inner product gives us a norm, and the norm gives us a metric. Now we can use the definition of continuity that applies to all metric spaces.

Equipping the \mathcal{D} with a norm

 \|\psi\|^2 = \int\limits_{\mathbb{R}^n} dx\; |\psi(x)|^2 

and defining \mathcal{D}^* as a topological dual

 \mathcal{D}^* = \big\{ \phi^*\in \mathbb{C}^{\mathcal{D}} \;|\; \phi^*\;\textrm{is linear},\; \sup_{\|\psi\|\leq 1} |\phi^*(\psi)| < \infty\big\} 

is probably not what anyone wants, because for example this collection of linear forms doesn't contain delta-functions \delta_x:\mathcal{D}\to\mathbb{C}, \delta_x(\psi)=\psi(x).

Was that what you meant by your option 1?

Fredrik said:

Option 2: This is the option that every text on the subject seems to prefer, and I don't see why. We define what it means for a sequence in \mathcal D to be convergent by saying that \phi_n\rightarrow\phi if there's a compact set K in \mathbb R^n that contains the supports of all the \phi_n, and D^\alpha \phi_n converges uniformly on K to D^\alpha\phi, for all \alpha.

Now we define "continuous" by saying that T:\mathcal D\rightarrow\mathbb C is continuous at \phi\in\mathcal D if

\lim_{n\rightarrow\infty}T(\phi_n)=T(\phi)

for every sequence \{\phi_n\} that converges to \phi. Maybe these two options are actually equivalent, but I would expect that they're not, mostly because I think option 1 is more natural and intuitive. I don't see why these authors would all go for option 2 if they are equivalent.

I'm not 100% sure of this, but I've heard that it is possible prove the existence of such topology in \mathcal{D}, that the convergence \psi_n\to\psi in that topology is equivalent with this definition that you described here. When \mathcal{D} is equipped with such topology, then the standard collection of distributions \mathcal{D}^* can be defined according to the definition of topological dual.

jostpuur · Apr 12, 2009

meopemuk said:

In my understanding, the reason why standard Hilbert space formalism is not suitable for QM is rather simple. Let's say I want to define an eigenfunction of the momentum operator. In the position space such an eigenfunction has the form (I work in 1D for simplicity)

\psi(x) = N \exp(ipx)

where N is a normalization factor. This wavefunction must be normalized to unity, which gives

1 =\int \limits_V |\psi(x)|^2 dx = N^2V

where V is the "volume of space", which is, of course, infinite. This means that the normalization factor is virtually zero

N = 1/\sqrt{V}

While I have understood this problem, it has not yet become clear to me how this problem is supposed to be solved in a more sophisticated formalism. The only solution that I have understood so far is this:

jostpuur said:

If we ask a question that what is the probability for a momentum to be in an interval [p_0-\Delta, p_0+\Delta], we get the answer from the expression

 \frac{1}{2\pi\hbar} \int\limits_{p_0-\Delta}^{p_0+\Delta} |\hat{\psi}(p)|^2 dp. 

I'm not convinced that it is useful to insist on being able to deal with probabilities of precise eigenstates. Experimentalists cannot measure such probabilities either.

Avodyne said:

I don't see why it's necessary to go beyond Hilbert space. Rather than defining a position operator, we could define projection operators with eigenvalue 1 if the particle is in some particular volume V, and 0 otherwise; heuristically, these would be

P_{x\in V} \equiv \int_V d^3\!x\,|x\rangle\langle x|

Similarly for a volume in momentum space. Then, instead of defining a hamiltonian whose action could take a state out of the Hilbert space, we could define a unitary time evolution operator.

I'm not convinced that the distributions are very useful. The distributions are usually defined by using rather strange topologies or norms. The Swartz norm seems to be totally different from the original idea of a Hilbert space norm. However, the Hilbert space norm is highly relevant for the probability interpretation. How are we supposed to deal with quantum mechanical probabilities if the Hilbert space norm has been replaced with some of these distribution norms?

Fredrik · Apr 12, 2009

jostpuur said:

is probably not what anyone wants, because for example this collection of linear forms doesn't contain delta-functions \delta_x:\mathcal{D}\to\mathbb{C}, \delta_x(\psi)=\psi(x).

Was that what you meant by your option 1?

Ahh, excellent point. Yes, that's what I meant. I get it now. (I didn't until I read your post). Delta functions wouldn't be bounded (and therefore not continous) linear functionals if we go for option 1. For delta to be bounded, there must exist M>0 such that

|\delta f|\leq M\|f\|

for all test functions f, but the inequality is equivalent to

\frac{|f(0)|}{\|f\|}\leq M

and we don't have to look hard to find an f that violates this. Even some constant functions will do. For example, define f_r(x)=1 when |x|<r, and f_r(x)=0 outside that region. Now shrink that region (i.e. choose a smaller r) until the norm of f_r gets small enough to violate the inequality.

Fredrik · Apr 12, 2009

jostpuur said:

I'm not 100% sure of this, but I've heard that it is possible prove the existence of such topology in \mathcal{D}, that the convergence \psi_n\to\psi in that topology is equivalent with this definition that you described here. When \mathcal{D} is equipped with such topology, then the standard collection of distributions \mathcal{D}^* can be defined according to the definition of topological dual.

I don't think the construction of such a topology is very difficult. It would be harder to prove that the result is equivalent to what we already have, but even that looks doable. (I'm not sure I care enough to give it a try though

).

We can e.g. define a subset of \mathcal D to be open if it can be expressed as T^{-1}(U) where U is an open subset of the complex numbers, and T is continuous in the sense defined above.

Alternatively, I think we can define a subset E of \mathcal D to be open if for every sequence in \mathcal D that converges to a a point in E (in the sense defined above), E contains all but a finite number of members of the sequence.

I'm guessing that these definitions are adequate (in the sense that they both define a topology on \mathcal D), and equivalent (in the sense that those topologies are the same), but I haven't made any attempt to prove it or disprove it.

strangerep · Apr 12, 2009

Fredrik said:

We can e.g. define a subset of \mathcal D to be open if it can be expressed as T^{-1}(U) where U is an open subset of the complex numbers, and T is continuous in the sense defined above.

Yes, that defines the "T-weak" topology on \mathcal D. (Weak topologies are
defined implicitly/indirectly by a demand that a certain set of functions be continuous
under it.)

Alternatively, I think we can define a subset E of \mathcal D to be open if for every sequence in \mathcal D that converges to a a point in E (in the sense defined above), E contains all but a finite number of members of the sequence.

I don't follow this. If you haven't yet specifed a topology on \mathcal D
then there's no meaning in the statement that a sequence of elements of
\mathcal D "converges". But maybe I misunderstood and you
meant something else?

strangerep · Apr 13, 2009

bigubau said:

I was wondering, how would I build the RHS for the hydrogen atom ? The hamiltonian for the H atom has a mixed spectrum and I've never seen an application of the RHS formalism for an operator with mixed spectrum.

The discrete part of the spectrum doesn't pose the same difficulties as encountered
with the continuous part. One has a resolution of unity as usual, consisting of an
integral over the continuous part, plus a sum over the discrete part. Rigged Hilbert
space and the nuclear spectral theorem give meaning to the continuous part.
(Not sure whether that's what you were asking, though.)

BTW, for anyone who's still a bit perplexed about the role of all this
rigged Hilbert space stuff, I noticed this pedagogical introductory paper:

Rafael de la Madrid,
"The role of the rigged Hilbert space in Quantum Mechanics",
Available as: quant-ph/0502053

It's written in a very physicist-friendly way, using the example of a rectangular
potential barrier to make everything concrete. It also explains the connections
between the bras and kets of rigged Hilbert space theory and distributions
quite clearly (while minimzing the heavy pure math that probably turns some
people off).

Fredrik · Apr 13, 2009

strangerep said:

I don't follow this. If you haven't yet specifed a topology on \mathcal D
then there's no meaning in the statement that a sequence of elements of
\mathcal D "converges". But maybe I misunderstood and you
meant something else?

You understood me right. We define convergence of sequences first, and use that to define the topology. Convergence is defined as in post #35. I got that definition and the idea that it defines a topology on the test function space from the Wikipedia article on distributions. This is a direct link to the relevant section. Note in particular the sentence "It can be given a topology by defining the limit of a sequence of elements of D(U)." Unfortunately, Wikipedia doesn't say how the definition of convergence defines a topology. That's why I came up with those two guesses about how it can be done.

Fredrik · Apr 13, 2009

strangerep said:

BTW, for anyone who's still a bit perplexed about the role of all this
rigged Hilbert space stuff, I noticed this pedagogical introductory paper:

Rafael de la Madrid,
"The role of the rigged Hilbert space in Quantum Mechanics",
Available as: quant-ph/0502053

Thanks. I have read the first 9 pages now, and I'm very pleased with it so far. I'm definitely going to read the rest later.

Fredrik · Apr 13, 2009

I found a review of Maurin's books, and the reviewer is complaining a lot about how careless Maurin's presentations are. Link. I'm guessing it would be better to try to find the original articles than to read his books.

DarMM · Apr 14, 2009

Fredrik said:

strangerep said:

For QFT in infinite-dimensions, even the rigged Hilbert space is not big enough.
The singular behaviours arising from the interaction terms in the Hamiltonian
become far more pathological than mere delta distributions.

Interesting...and strange. The strange part is that we can actually make predictions in spite of all of this.

I thought I'd add a bit to this since it's very interesting.

As you know the Hilbert Space for 1 particle spin-0 QM is L^2(\mathbb R^3, dx). Where dx is the Lesbesgue measure.

\mathbb R^3[/tex] coming from the fact that a particle can occupy any point in three dimensional space. So the set of all points \mathbb R^3[/tex] is the classical configuration space. For Quantum Field Theory the classical configurations are the set of all possible field configurations. This set turns out to be \mathcal{D}^*(\mathbb R^3), the space of distributions. So the Hilbert space of QFT is: L^2(\mathcal{D}^*(\mathbb R^3), d\nu), the space of square integrable functions over the space of distributions with respect to some measure d\nu. A free QFT and an interacting QFT differ by their choice of d\nu. Quantum Fields \phi(x) are then objects that when integrated against a function f(x) give an object \int{\phi(x)f(x)}dx = \phi(f), which is an unbounded operator on L^2(\mathcal{D}^*(\mathbb R^3), d\nu). Who said rigorous QFT was hard?

strangerep · Apr 14, 2009

Fredrik said:

I got that definition and the idea that it defines a topology on the test function space from the Wikipedia article on distributions. This is a direct link to the relevant section. Note in particular the sentence "It can be given a topology by defining the limit of a sequence of elements of D(U)." Unfortunately, Wikipedia doesn't say how the definition of convergence defines a topology.

OK, thanks. I think I get it now.
The key item is the use of uniform convergence. E.g., given two functions f,g it's
sensible to ask how "close" they are to each other, as follows: If we can say that
|f(x) - g(x)| < \epsilon for all x, then \epsilon is an expression
of the closeness of f and g (independently of x).

So one could define open sets in a manner reminiscent of open balls in a metric
topology: pick a function f and a "radius" r and then say that one particular
open set centered on f, of radius r, is all the other functions g s.t. |f(x) - g(x)| < r
for all x. Then take unions, etc, to get a full topology.

Uniform convergence just extends this idea to sequences, the crucial bit
being how it's independent of x.

The fact that it's expressed in terms of arbitrary derivatives is just because we
want to deal with infinitely-differentiable functions only, and is a bit a red-herring
when one is focusing just on topological matters.

strangerep · Apr 14, 2009

Fredrik said:

I found a review of Maurin's books, and the reviewer is complaining a lot about how careless Maurin's presentations are. Link.

Hmmm, that reviewer uses rather savage language, but I have to say I agree with
his remarks about lack of care. I noticed the same thing as soon as I started to
try and follow Maurin's proof of the nuclear spectral theorem. I had to keep flipping
backwards tediously to double-check notations, often finding things that I felt
sure were typos. Worse, even though the book came with an errata list at the
back, I had trouble matching it with what was in the text(!) (sigh).

Look's like I'll have to try harder to obtain a copy of Gelfand & Vilenkin and
see whether their presentations is any better.

TBH, I'm quite surprised and disappointed that I couldn't find a clear pedagogical
proof in a more modern text, given it's importance in modern quantum theory.

I'm guessing it would be better to try to find the original articles than
to read his books.

Dunno. Original papers tend to be written for people with a greater level
of knowledge. Books are supposed to be for lesser mortals.

strangerep · Apr 14, 2009

DarMM said:

I thought I'd add a bit [...]

Hmmm, thanks for joining this thread. You just made me think about
a couple of things from a direction I hadn't explored adequately...

For Quantum Field Theory the classical configurations are the set of all possible field configurations. This set turns out to be \mathcal{D}^*(\mathbb R^3), the space of distributions. So the Hilbert space of QFT is:
L^2(\mathcal{D}^*(\mathbb R^3), d\nu), the space of square integrable functions over the space of distributions with respect to some measure d\nu.

I guess this gets into the problematic area of "multiplication of distributions"?
I hope you can say a bit more about that.

A free QFT and an interacting QFT differ by their choice of d\nu.
Quantum Fields \phi(x) are then objects that when integrated against a function f(x) give an object \int{\phi(x)f(x)}dx = \phi(f), which is an unbounded operator on L^2(\mathcal{D}^*(\mathbb R^3), d\nu).

Let me see... in the simpler case of rigged Hilbert space, translating into
Dirac bra-ket notation, we have
 \phi(f) ~=~ \int{\phi(x)f(x)}dx ~=~ \langle\phi| \int{|x\rangle\langle x|}dx ~ |f\rangle ~=~ \langle \phi | f\rangle 
with
 \langle x|x'\rangle ~=~ \delta(x - x') ~,~~~\mbox{etc.} 

So in the more general case we're dealing with functions over distributions (which
involving products of distributions in general), and then trying to figure out
how to construct (higher-order?) functionals over these functions?

DarMM · Apr 15, 2009

strangerep said:

I guess this gets into the problematic area of "multiplication of distributions"?
I hope you can say a bit more about that.

Yes, in fact it's one of the big issues in rigorous QFT. Your question below will naturally lead to it.

strangerep said:

So in the more general case we're dealing with functions over distributions (which
involving products of distributions in general), and then trying to figure out
how to construct (higher-order?) functionals over these functions?

Let me give a simple example of a function over distributions.
As you know a distribution T(x) is a continuous linear functional over the space of test functions \mathcal{D}. However one can reverse this and consider a test function f as a function on distributions. A given test function f maps any distribution T to T(f), resulting in a map over the whole space of distributions.
f:\mathcal{D}^*(\mathbb R^3) \rightarrow \mathbb R.

So one writes this function over distributions as f(T). One can then immediately arrive at more general functions for instance:
f(T)g(T)
f(T)g(T)h(T)
or e^{f(T)}

Since one can choose a any test function, this results in an enormous space of functions over Distributions.

In the case of a free quantum field, the quantum field operator acts as a "multiplication operator". For example:
\phi(g) = \int{\phi(x)g(x)}dx
Then,
\phi(g)f(T) = g(T)f(T)

The conjuagate momentum field \pi(x) acts like the momentum operator from regular QM by being a derivative. In rough language:
\pi(x)f(T) = -i\frac{\delta}{\delta T}f(T)

The Hamiltonian is composed of \pi(x) and \phi(x) and so it acts in a natural way.

Like all quantum mechanical theories though, we don't allow just any functions, but only square integrable ones. So one requires a notion of integration for this functions, formally something like:
\int{f(T)}dT. This can be done, although the details are a bit involved. For the purpose of the remainder of this post let's assume we have done this for the free theory. We have the correct measure dT_{free}, the Hilbert space of square integrable functions with respect to it and the Hamiltonian, field and conjugate momentum operators acting in the way I described above. This space is basically Fock space.

However when one comes to \phi(x)^4[/tex] theory for example, the Hamiltonian gains an additional term: \int{\phi(x)^4}dx It is very difficult to know exactly how one should interpret this. (\int{\phi(x)^4}dx)f(T) = ? There are two problems here. \phi(x) is a operator valued distribution. When integrated it gives operators on L^2(\mathcal{D}^*(\mathbb R^3), dT_{free}). Due to its distributional nature no sense can be made of \phi(x)^4. One could attempt to come up with some way of powering distributions, however this powering system must respect the fact that (\int{\phi(x)^4}dx) is meant to be a linear operator and respect certain commutation relations. Let's approach this problem first in two dimensions then in three dimensions. Two dimensions It turned out (again avoiding details) that there was one unique notion of powering which respects the structure of (\int{\phi(x)^4}dx) as a quantum operator. This notion was actually already known to physicists as Wick ordering. So we replace: (\int{\phi(x)^4}dx) with (\int{:\phi(x)^4:}dx). One can then prove that the full Hamiltonian is self-adjoint and semi-bounded (stable, no negative energy), the time evolution operator is unitary, e.t.c. Everything one wants from a QFT basically. This was done by James Glimm and Arthur Jaffe in the period 1968-1973. Three dimensions Here things get significantly harder. Where as one can put the proof of the two dimensional case into a semester long graduate course, the proof of the three dimensional case is a monster. However let me paint a picture. Basically we try what we did before and replace (\int{\phi(x)^4}dx) with (\int{:\phi(x)^4:}dx). However one finds that this does not result in the full Hamiltonian being a self-adjoint operator. If you are very clever you can prove that this problem can not be overcome, unless you leave L^2(\mathcal{D}^*(\mathbb R^3), dT_{free}), that is leave Fock space and move to another space of square integrable functions over distributions. Let me call it: L^2(\mathcal{D}^*(\mathbb R^3), dT_{Glimm}). However finding this space is incredibly diffcult, so one uses the condition that on this correct space the physical mass of a particle should be finite. This condition show you how to get to this new space. Basically, cutoff the fields so that you do the analysis in a more clear cut way, then use the condition of finite mass. This condition tells you that you should add the term: \delta m^2(\int{:\phi(x)^2:}dx) to the Hamiltonian. The \delta m^2 is a constant that will go infinite when the cutoff is removed, whose explicit form is given by the finite mass condition. As the cutoff is removed, this extra term does the job of "pushing" us into the correct space L^2(\mathcal{D}^*(\mathbb R^3), dT_{Glimm}), which is a lot easier than finding it abstractly. Physicists will know this procedure as mass renormalization. We then take the remove the cutoff. James Glimm proved that this moves us into the correct space and that the full Hamiltonian is self-adjoint there. Glimm and Jaffe later proved that the Hamiltonian is also semibounded (in what is considered the most difficult paper in mathematical physics, lots of heavy analysis). Shortly after came all the the other properties one wants, Lorentz invariance, e.t.c. Thus proving the rigorous existence of \phi^4 in three dimensions. As you can imagine the four dimensional case is extremely difficult. For five dimensions and greater, Jürg Fröhlich has shown that there is no space where the (\int{\phi(x)^4}dx) term makes sense, unless you set the interaction to zero, that is make it free. People have also shown the existence of Yukawa theories in two and three dimensions, as well as gauge and Higgs theories in two and three dimensions. Tadeusz Balaban has gotten pretty far although with Yang-Mills theory in two and three dimensions. He has even gotten some results for the four dimensional case. One final thing I wish to emphasize is that renormalization is not the difficulty in rigorous quantum field theory. Renormalization is a necessary technique, as it has been known for a long time the most interacting theories simply cannot live in the same space as the free theory (Haag's theorem). The actual difficulty is with standard analytic stuff, proving convergence and obtaining estimates which tell you something about a limit besides the fact that it exists. I hope this helps.

Fredrik · Apr 15, 2009

DarMM said:

For Quantum Field Theory the classical configurations are the set of all possible field configurations. This set turns out to be \mathcal{D}^*(\mathbb R^3), the space of distributions. So the Hilbert space of QFT is:
L^2(\mathcal{D}^*(\mathbb R^3), d\nu), the space of square integrable functions over the space of distributions with respect to some measure d\nu. A free QFT and an interacting QFT differ by their choice of d\nu.

I'm really confused by this. I thought the Hilbert space of (say) Klein-Gordon theory was supposed to be the Fock space constructed from the Hilbert space H₁ of one-particle states, which is defined as the subset of L^2(\mathbb R^3) that consists of square-integrable positive-frequency solutions of the Klein-Gordon equation. Is this wrong, or just another way of saying the same thing? Note btw that a square-integrable solution of the field equation must be continuous, because its partial derivatives must exist, so this H₁ isn't L^2(\mathbb R^3). It's just a "small" subset of it.

I don't understand why the space of distributions is the classical configuration space. Can you elaborate on that?

Also, if it's possible to explicitly write down the measure of the free theory and explain why it's the right one, I'd appreciate it.

DarMM said:

Who said rigorous QFT was hard?

Some moron probably

Seriously, I think it's hard just to find out where you can learn this stuff. I took three classes on quantum mechanics and two on quantum field theory at my university, and they didn't say anything about these things. (They mentioned the word "distribution" and said that you can use that concept to make sense of the delta function, but that's where they stopped).

DarMM said:

\mathcal{D}. However one can reverse this and consider a test function f as a function on distributions. A given test function f maps any distribution T to T(f), resulting in a map over the whole space of distributions.
f:\mathcal{D}^*(\mathbb R^3) \rightarrow \mathbb R.

So one writes this function over distributions as f(T). One can then immediately arrive at more general functions for instance:
f(T)g(T)
f(T)g(T)h(T)
or e^{f(T)}

Since one can choose a any test function, this results in an enormous space of functions over Distributions.

I don't understand what you're saying about these "more general" functions. Isn't f(T)g(T) defined as (fg)(T) and exp(f(T)) as \exp\circ f(T)? In that case, how are they more general? The product of two test functions is another test function. Same thing with compositions.

I'll stop here because it seems dumb to ask about advanced stuff when I'm still struggling with the "easy" stuff.

DarMM · Apr 16, 2009

Fredrik said:

I'm really confused by this. I thought the Hilbert space of (say) Klein-Gordon theory was supposed to be the Fock space constructed from the Hilbert space H₁ of one-particle states, which is defined as the subset of L^2(\mathbb R^3) that consists of square-integrable positive-frequency solutions of the Klein-Gordon equation. Is this wrong, or just another way of saying the same thing?

It's just another way of saying the same thing. L^2(\mathcal{D}^*(\mathbb R^3), d\nu) can be decomposed quite easily into direct sums of tensor products of square-integrable positive-frequency solutions of the Klein-Gordon equation. However in rigorous QFT this is sometimes avoided because for interacting theories the Hilbert space L^2(\mathcal{D}^*(\mathbb R^3), d\nu) is not a Fock space over positive-frequency solutions.

Fredrik said:

I don't understand why the space of distributions is the classical configuration space. Can you elaborate on that?

Basically you know from QM that the Hilbert space is the square integrable functions over the classical configuration space. It is of course difficult to know what this space should be, when does a given function count as a sensible classical configuration. However it turns out that if you start with any set of functions, in order to define a measure on this space you'll have to make the space larger to the point where it encompesses distributions. If you choose a larger space of functions it turns out that a lot of the space has measure zero, except the subspace of distributions. So either way no matter how big or small a space you pick it has to be the space of distributions.

Fredrik said:

I don't understand what you're saying about these "more general" functions. Isn't f(T)g(T) defined as (fg)(T) and exp(f(T)) as \exp\circ f(T) In that case, how are they more general? The product of two test functions is another test function. Same thing with compositions.

I was unclear before, f(T)g(T) \neq (fg)(T). It's actually simpler, you just evaluate f(T) and g(T) and then multiply them.
By more general functions I just meant I can define more functions than only the linear ones.

Fredrik said:

Also, if it's possible to explicitly write down the measure of the free theory and explain why it's the right one, I'd appreciate it.

Why it's the right one is easy. Basically instead of defining the free theory on continuous spacetime, put it on a lattice. On the lattice you can find the correct measure easily, since you're now back in a system of finite degrees of freedom. It's basically just L^2(\mathbb R^n) with n a very large number equal to the number of lattice points. If you take the limit as the lattice returns to a continuum you'll get the free theory measure I'm talking about.
You can also solve the Klien-Gordon equation to obtain the free field and work out the generating functional for the theory. If you have the generating functional then by a theorem called Minlos theorem, you know there exists a measure on the space of distributions which reproduces this generating functional and this the correct measure.

I'll define the measure in my next post, however you actually already know the measure and how to integrate with it.

Fredrik · Apr 17, 2009

Thank you DarMM. This is good stuff, and very interesting. Maybe you can also tell me what I would have to read to learn this stuff?

Fredrik · Apr 17, 2009

Strangerep, I skimmed the rest of the RHS article that you recommended. The interesting stuff was on the pages I had already read, so I didn't study the details of his specific example. One thing in particular that I had failed to understand before I read his paper is that the RHS definition of a ket is as an anti-linear functional on the nuclear space. (Is that the correct term for \Omega?)

I liked his motivation for the definition of \Omega. Short version: If we define a position operator Q:D(Q)\rightarrow H by Qf(x)=xf(x) for all x, then its domain can't be all of H, because Qf is only square integrable if

\int|xf(x)|^2\ dx < \infty

So the idea is to find a subspace of H on which the entire algebra of operators generated by Q,P and H (including operators of the form Q^iP^jH^k where i,j,k are arbitrary positive integers) is well-defined.

One thing that still seems weird to me is that people keep claiming that you need the RHS approach specifically to motivate bra-ket notation. I mean, all the problems that the RHS stuff is supposed to solve are there even if you don't use bra-ket notation. The first QM book I studied didn't use bra-ket notation, and it still pretended that Q and P had eigenvectors. It just ignored the fact that plane waves aren't square integrable and that delta functions aren't even functions.

If you ignore those problems, the Riesz representation theorem is all you need to justify the notation. We can define a ket to be a member of H, and a bra to be a member of H*. The bra written as \langle f| is the linear functional |g\rangle\mapsto(|f\rangle,|g\rangle).

strangerep · Apr 17, 2009

Fredrik said:

[...] the RHS definition of a ket is as an anti-linear functional on
the nuclear space. (Is that the correct term for \Omega?)

Yes, what I wrote earlier as \Omega is a nuclear space. In the paper
quant-ph/0502053, the author denotes it as \Phi instead.

So the idea is to find a subspace of H on which the entire algebra of operators generated by Q,P and H (including operators of the form Q^iP^jH^k where i,j,k are arbitrary positive integers) is well-defined.

Precisely. We want a representation of the enveloping algebra of observables,
and/or the full Lie group (not just the basic Lie algebra).

One thing that still seems weird to me is that people keep claiming that you need the RHS approach specifically to motivate bra-ket notation. I mean, all the problems that the RHS stuff
is supposed to solve are there even if you don't use bra-ket notation. [...]

Yes. It would be clearer to say "...motivate the use of Dirac's improper eigenvectors of
position and momentum...". I.e., to be able to write (eg)
 1 ~=~ \int\!\! dx ~ |x\rangle\langle x| 
and have it actually mean something.

If you ignore those problems, the Riesz representation theorem is all you need to justify the notation. We can define a ket to be a member of H, and a bra to be a member of H*. [...]

Yes, up to a point, but note that H is self-dual, i.e., H is isomorphic to H*.
So we don't really see the problems until unbounded operators are in play.

P.S., I finally got hold of the books by Gelfand and Vilenkin: "Generalized Functions
vol 4 -- Applications of Harmonic Analysis." Also vol 5. After some cursory reading
it looks better than Maurin's book. (Possibly because the G-V book was originally
Russian and then translated by a technically competent person. I.e., more people
have been over it with a fine tooth comb.)

P.P.S., Looks like you started are really good thread. Several days later, I'm still
trying to relate the stuff DarMM said with what I already know about QFT. :-)

Fredrik · Apr 23, 2009

Yes, this has been a good discussion. I have learned a lot from it.

I'm still confused though, and I'd like to go back to the simplest possible QM theory for a while. (See the thread title). How can we state its axioms in a way that makes sense? They are usually stated in a way that ignores all the complications. Something like this:1. The possible states of a physical system are represented by the unit rays of a separable Hilbert space over the complex numbers, on which there exists a self-adjoint operator H called the Hamiltonian.

2. The time evolution of an isolated system which is initially in state F is given by f(t)=exp(-iHt)f, where f is any vector in the ray F.

3. Measurable quantities are represented by self-adjoint operators. Self-adjoint operators are therefore called "observables". If a system is in state F when a measurement of an observable B is performed, the state of the system will change to state |b> (an eigenvector of B with eigenvalue b) with probability |<f|b>|², where f is any vector in F. The result of a measurement of B that leaves the system in state |b> is b. This is actually a less sloppy formulation than the ones that were shown to me when I first studied QM, but there are many flaws here. These are some of my thoughts:

* If we're dealing with the QM of one spin-0 particle, we can be more specific in axiom 1 and say that the Hilbert space is L²(R³) rather than just some Hilbert space. On the other hand, since all separable infinite-dimensional Hilbert spaces are isomorphic to each other, it doesn't really matter. Maybe we should at least add the requirement that the Hilbert space is infinite-dimensional.

* Instead of postulating the existence of the Hamiltonian directly, we should be postulating that we're dealing with an irreducible representation of the covering group of the Galilei group. Actually, now that I think about it, once we start talking about representations, the axioms of non-relativistic QM of one spin-0 particle aren't significantly more complicated than the the axioms of special relativistic QM of one particle with arbitrary spin, so we might as well go for a slightly more general set of axioms (but stick with one-particle theories for now).

* Axiom 2 can be dropped from the list, since it's implicit in the definition of the Hamiltonian as the generator of translations in time. That doesn't mean that we delete axiom 2. We just reinterpret it as the definition of an "isolated system".

* Axiom 3 is still giving me a headache. It ignores several important issues, including: a) An observable can have several eigenvectors with the same eigenvalues, b) Position and momentum aren't even observables if we define them as self-adjoint operators on the Hilbert space, c) If we generalize the definition of observables to densely defined operators, some observables don't have eigenvectors, d) A generalized observable may have a continuous spectrum, a discrete spectrum, or a combination of both.

* The first of those issues seems easy enough to deal with. Axiom 3 above is equivalent to saying that a density operator ρ changes to ∑_b P_b ρ P_b when we measure B. (Sorry, LaTeX doesn't work). Here P_b=|b><b|, but if we instead define P_b to be the projection operator onto the eigenspace of B corresponding to eigenvalue b, then (I think) the the same rule holds even when the spectrum of B is degenerate.

* The C*-algebra approach seems unnecessarily radical for the one-particle theories. You (Strangerep) suggested that we need this approach in those cases where we can't take all self-adjoint operators to be observables, and Mackey's book seems to be saying that this happens if and only if the theory includes a superselection rule (and one-particle theories don't). Hm, I guess that means that we should be using this approach when we get to SR QM of N non-interacting particles of arbitrary types.

* There probably is no real need for the RHS approach either, but I think I might prefer to use it anyway, mostly because it clearly states which subspace represent the physical states. But I'm not sure yet if using a RHS will make the axioms less awkward, or more awkward.

I'm going to post an improved version of the axioms, but I'll do that later (probably tomorrow). I still have to think some more about some of the details. If someone wants to suggest a better way to express axiom 3 in the meantime, feel free to do so.

It's the only one that I'm still having problems with.

Unbounded operators in non-relativistic QM of one spin-0 particle

Similar threads

Undergrad Phase and group velocity for the wave function

Undergrad Schrödinger equation and classical wave equation

Undergrad Antilinear Operators

Undergrad Angular Momentum Vector and Its Magnitude

Undergrad Elementary question about comparing notations of inner product

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers