perturbation theory



physics, mathematical physics, philosophy of physics

Surveys, textbooks and lecture notes

theory (physics), model (physics)

experiment, measurement, computable physics




Perturbation theory is a general method of finding (or even defining) the solution of equations of mathematical physics by expanding them with respect to a small parameter in the vicinity of known, defined or well-understood solution (for which the small parameter is 00). It is used in the study of PDEs involving operators depending on small parameter, in classical and celestical mechanics, in quantum mechanics, and in the statistical and quantum field theory.

One of the varieties of perturbation theory provides a method to make sense of and handle the path integral involved in the quantization of classical field theory to quantum field theory.

It is based on the observation that the quantization of free classical field theories, whose action functional contains only the kinetic term, is well understood; therefore, the quantization of a functional consisting of a kinetic term and polynomial interaction terms may be expanded like a Taylor series in the interaction terms, thus yielding what looks like a series of correlators in a free field theory. If the coupling constant – the parameter in front of the interaction terms – is small enough, one says one is in the weakly coupled regime of the theory and expects this perturbation series to approximate the desired answer. Usually, even for that to work the action functional first has to be subjected to renormalization.

More details

Suppose we’re working with a quantum system that’s nearly a quantum harmonic oscillator, but not quite; that is, the quadratic potential V 0=12kx 212V_0 = \frac{1}{2}k x^2 - \frac{1}{2} is only a good local approximation to the real potential V 0+λVV_0 + \lambda V. Then we can write the Hamiltonian as H=H 0+λV,H = H_0 + \lambda V, where VV is a function of the position xx and the momentum pp (or equivalently, of z=p+ix2z = \frac{p+i x}{\sqrt{2}} and ddz\frac{d}{dz}) and λ\lambda is small.

Now we solve Schrödinger’s equation perturbatively. We know that

(1)ψ(t)=e itHψ(0),\psi(t) = e^{-itH} \psi(0),

and we assume that

(2)e itHψ(t)e itH 0ψ(t)e^{-itH}\psi(t) \approx e^{-itH_0} \psi(t)

so that it makes sense to solve it perturbatively. Define

(3)ψ 1(t)=e itH 0e itHψ(t)\psi_1(t) = e^{itH_0} e^{-itH}\psi(t)


(4)V 1(t)=e itH 0λVe itH 0.V_1(t) = e^{itH_0} \lambda V e^{-itH_0}.

After a little work, we find that

(5)ddtψ 1(t)=iV 1(t)ψ 1(t),\frac{d}{dt}\psi_1(t) = -i V_1(t) \psi_1(t),

and integrating, we get

(6)ψ 1(t)=i 0 tV 1(t 0)ψ 1(t 0)dt 0+ψ(0).\psi_1(t) = -i\int_0^t V_1(t_0) \psi_1(t_0) dt_0 + \psi(0).

We feed this equation back into itself recursively to get

(7)ψ 1(t) = i 0 tV 1(t 0)[i 0 t 0V 1(t 1)ψ 1(t 1)dt 1+ψ(0)]dt 0+ψ(0) = [ψ(0)]+[ 0 ti 1V 1(t 0)ψ(0)dt 0]+[ 0 t 0 t 0i 2V 1(t 0)V 1(t 1)ψ 1(t 1)dt 1dt 0] = n=0 tt 0t n10i nV 1(t 0)V 1(t n1)ψ(0)dt n1dt 0 = n=0 (λi) n tt 0t n10e i(tt 0)H 0Ve i(t 0t 1)H 0VVe i(t n10)H 0ψ(0)dt n1dt 0.\array{ \psi_1(t) & = & -i \int_0^t V_1(t_0) \left[-i\int_0^{t_0} V_1(t_1) \psi_1(t_1) dt_1 + \psi(0) \right] dt_0 + \psi(0) \\ & = & \left[\psi(0)\right] + \left[\int_0^t i^{-1} V_1(t_0)\psi(0) dt_0\right] + \left[\int_0^t\int_0^{t_0} i^{-2} V_1(t_0)V_1(t_1) \psi_1(t_1) dt_1 dt_0\right] \\ & = & \sum_{n=0}^{\infty} \int_{t \ge t_0 \ge \ldots \ge t_{n-1} \ge 0} i^{-n} V_1(t_0)\cdots V_1(t_{n-1}) \psi(0) dt_{n-1}\cdots dt_0 \\ & = & \sum_{n=0}^{\infty} (-\lambda i)^n \int_{t \ge t_0 \ge \ldots \ge t_{n-1} \ge 0} e^{-i(t-t_0)H_0} V e^{-i(t_0-t_1)H_0} V \cdots V e^{-i(t_{n-1}-0)H_0} \psi(0) dt_{n-1}\cdots dt_0. }

So here we have a sum of a bunch of terms; the nnth term involves nn interactions with the potential interspersed with evolving freely between the interactions, and we integrate over all possible times at which those interactions could occur.

Here’s an example Feynman diagram for this simple system, representing the fourth term in the sum above:

Three interactions with the perturbation.

The lines represent evolving under the free Hamiltonian H 0H_0, while the dots are interactions with the potential VV.

As an example, let’s consider V=(z+ddz)V = (z + \frac{d}{dz}) and choose λ=12\lambda = \frac{1}{\sqrt{2}} so that λV=p.\lambda V = p. When VV acts on a state ψ=z n,\psi = z^n, we get Vψ=z n+1+nz n1.V \psi = z^{n+1} + nz^{n-1}. So at each interaction, the system either gains a photon or changes phase and loses a photon.



Despite what one might naively expect, the perturbation series of natural quantum field theories have a vanishing radius of convergence, they are asymptotic series.

Roughly this can be understood as follows: since the pertrubation is in the coupling constant about vanishing coupling, a non-zero radius of convergence would imply that the theory is finite also for negative coupling (where “things fly apart”), which will not happen in realistic theories.

More in detail, theories with non-perturbative effects such as instantons field configurations (such as Yang-Mills theory, hence QCD, QED), branes (such as string theory), etc., are expected to have a path integral which as a function of the coupling constant gg schematically looks like

Z(g)= na ng n+e A/g na n (1)g n+𝒪(e 2A/g), Z(g) = \sum_n a_n g^n + e^{-A/g} \sum_n a_n^{(1)} g^n + \mathcal{O}(e^{-2A/g}) \,,

where the first sum is the perturbation series itself and where the terms with a prefactor of the form exp(A/g)\exp(-A/g) are the contributions of the instantons (AA is the contribution of the instanton action functional). Since all the derivatives of the function ge 1/gg \mapsto e^{-1/g} vanish at coupling constant g=0g = 0, the Taylor series of this part of the path integral does not appear in perturbation series, even though it is present. Therefore this is called a non-perturbative effect.

See the references below for details.

(See also at string theory FAQ – Isn’t it fatal that the string perturbation series does not converge?.)



A solid mathematical formulation of perturbation theory has been given in

  • K. Hepp.: Théorie de la Renormalisation Lect. Notes in Phys. Springer (1969)

  • O. Steinmann, Perturbation expansion in axiomatic field theory Lect. Notes in Phys. 11, Springer (1971)

On Divergence/Convergence and Non-perturbative effects

A general introduction on divergence of perturbation theory, asymptotic series and non-perturbative effects is for instance on the first pages of

Further discussion is for instance in


Perturbation theory in the context of AQFT (or at least in the spirit) is discussed in the following articles.

The observation that in perturbation theory the Stückelberg-Bogoliubov-Epstein-Glaser local S-matrices yield a local net of observables was first made in

  • V. Il’in, D. Slavnov, Observable algebras in the S-matrix approach Theor. Math. Phys. 36 , 32 (1978) 578-585

which was however mostly ignored and forgotten. It is taken up again in

(a quick survey is in section 8, details are in section 2).

Further developments along these lines are then

(relation to deformation quantization)

(relation to renormalization)

(relation to gauge theory and QED)

Reviews includes

In BV-BRST formalism

Perturbative quantization in BV-BRST formalism is nicely systematically discussed in section 5 of

in the broad context of factorization algebras (see there for further references). In particular the relation to Feynman diagrams is discussed in

Revised on November 12, 2015 17:47:07 by Urs Schreiber (