The Variational Principle

I’m currently working on an animation package for visualizing quantum circuits. I had hoped to finish it sooner so I could complete Part 2 of visualizing entanglement. Until then, I’d like to talk about something I learned in a lecture that may be one of the most useful results in quantum mechanics, while also being embarrassingly simple. It’s called the Variational Principle, and it is the basis for how variational quantum algorithms work, specifically the variational quantum eigensolver (VQE).

I’ll boldly state the result here and build up to it as we go on. Pick any normalized wave function $ψ$ whatsoever. Then

$E_{gs} \leq ⟨ ψ | \hat{H} | ψ ⟩ \equiv ⟨ \hat{H} ⟩$

which says the expectation value of the Hamiltonian $\hat{H}$ in any normalized state $ψ$ can never lie below the ground state energy of $\hat{H}$; it is always an upper bound. Of course, if $ψ$ just happens to be the ground state wave function, then the above relation is an equality. If $ψ$ happens to be any of the excited states, then $E_{gs} < ⟨ \hat{H} ⟩$ strictly. The point is that this bound holds for any $ψ$ whatsoever.

The proof is actually quite simple.

Proof: Since $\hat{H}$ is Hermitian, its (unknown) eigenstates can be chosen to be orthonormal (i.e. $⟨ ψ_{i} | ψ_{j} ⟩ = δ_{i j}$ for any two eigenstates), and they form a complete set. Therefore, we can express any arbitrary $ψ$ as a linear combination of them:

$ψ = \sum_{n} c_{n} ψ_{n}, \quad \text{with} \quad \hat{H} ψ_{n} = E_{n} ψ_{n} .$

Since $ψ$ is normalized,

$1 = ⟨ ψ | ψ ⟩ = ⟨ \sum_{m} c_{m} ψ_{m} | \sum_{n} c_{n} ψ_{n} ⟩$ $= \sum_{m} \sum_{n} c_{m}^{*} c_{n} ⟨ ψ_{m} | ψ_{n} ⟩ = \sum_{n} | c_{n} |^{2} .$

(because any two eigenstates are orthonormal: $⟨ ψ_{m} | ψ_{n} ⟩ = δ_{m n}$ ).

Calculating $⟨ \hat{H} ⟩$ ,

$⟨ \hat{H} ⟩ = ⟨ \sum_{m} c_{m} ψ_{m} | \hat{H} | \sum_{n} c_{n} ψ_{n} ⟩ = ⟨ \sum_{m} c_{m} ψ_{m} | \sum_{n} c_{n} E_{n} ψ_{n} ⟩$ $= \sum_{m} \sum_{n} c_{m}^{*} E_{n} c_{n} ⟨ ψ_{m} | ψ_{n} ⟩ = \sum_{n} E_{n} | c_{n} |^{2} .$

But the ground state energy $E_{gs}$ is, by definition, the lowest eigenvalue, $E_{gs} \leq E_{n} \forall n$ . Hence,

$⟨ \hat{H} ⟩ = \sum_{n} E_{n} | c_{n} |^{2} \geq E_{gs} \sum_{n} | c_{n} |^{2} = E_{gs}$

$⟨ \hat{H} ⟩ \geq E_{gs} .$
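To make the inequality concrete, here is a minimal numerical sketch. It assumes a small, hypothetical 2×2 real symmetric Hamiltonian (the matrix entries are made up for illustration) and samples random normalized real states; none of the sampled expectation values should fall below the lowest eigenvalue.

```python
import math
import random

# Hypothetical 2x2 real symmetric (Hermitian) Hamiltonian, chosen only for illustration.
H = [[1.0, 0.5],
     [0.5, -1.0]]

def ground_state_energy(h):
    """Lowest eigenvalue of a 2x2 real symmetric matrix (closed form)."""
    mean = 0.5 * (h[0][0] + h[1][1])
    half_gap = math.hypot(0.5 * (h[0][0] - h[1][1]), h[0][1])
    return mean - half_gap

def expectation(h, psi):
    """<psi|H|psi> for a real, normalized 2-component vector psi."""
    return sum(psi[i] * h[i][j] * psi[j] for i in range(2) for j in range(2))

def random_state(rng):
    """A random normalized real 2-component state (a point on the unit circle)."""
    theta = rng.uniform(0.0, 2.0 * math.pi)
    return [math.cos(theta), math.sin(theta)]

rng = random.Random(0)  # fixed seed so the run is reproducible
e_gs = ground_state_energy(H)
energies = [expectation(H, random_state(rng)) for _ in range(1000)]
print(f"E_gs = {e_gs:.4f}, smallest sampled <H> = {min(energies):.4f}")
```

Restricting to real states is enough here because the matrix is real and symmetric, so its ground state can be chosen real. The smallest sampled value approaches $E_{gs}$ from above but never crosses it.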

Helium

I think the best example to showcase the power of the Variational Principle is calculating the ground state energy of Helium.

Despite Helium being the second simplest atom, consisting of only two electrons bound to a nucleus containing two protons (with some neutrons), its Schrödinger equation has no known exact closed-form solution.

Figure: The Helium atom.

Like most many-body systems, its differential equation cannot be solved exactly. When astrodynamicists calculate trajectories of satellites traveling to, say, Jupiter, they would have to account for the gravitational attraction of the Sun, Earth, and Jupiter at a minimum to be exact. However, the three-body problem has no general closed-form solution. They instead split the trajectory into pieces, treating the Earth-satellite system when the satellite is near Earth, and the Jupiter-satellite system when the satellite is near Jupiter. (Helium's electrons are always close together, unfortunately, so you can’t really do this.) Elsewhere, they employ numerical techniques and possibly some perturbation theory.

The Hamiltonian for Helium (ignoring fine structure) is given by

$\hat{H} = - \frac{ℏ^{2}}{2 m} (\nabla_{1}^{2} + \nabla_{2}^{2}) - \frac{e^{2}}{4 π ϵ_{0}} (\frac{2}{r_{1}} + \frac{2}{r_{2}} - \frac{1}{| \vec{r_{1}} - \vec{r_{2}} |}) .$

The ground state energy of Helium has been measured experimentally to be $E_{gs} \approx - 78.975 \text{ eV}$ (experimental).

This is the quantity we’ll do our best to reproduce mathematically.

The issue with trying to solve this Hamiltonian comes from the electron-electron repulsion potential,

$V_{e e} = \frac{e^{2}}{4 π ϵ_{0}} \frac{1}{| \vec{r_{1}} - \vec{r_{2}} |} .$

We could employ time-independent perturbation theory using $\hat{H}^{'} = V_{e e}$. However, because $V_{e e}$ is not a small perturbation, a low-order approximation would not be very accurate.

If we just ignore $V_{e e}$ , the Helium Hamiltonian simplifies into two hydrogenic Hamiltonians

$\hat{H} - V_{e e} = (- \frac{ℏ^{2}}{2 m} \nabla_{1}^{2} - \frac{e^{2}}{4 π ϵ_{0}} \frac{2}{r_{1}}) + (- \frac{ℏ^{2}}{2 m} \nabla_{2}^{2} - \frac{e^{2}}{4 π ϵ_{0}} \frac{2}{r_{2}}),$

each with a nuclear charge of $2 e$ instead of $e$. The exact ground state of this simplified Hamiltonian is just the product of hydrogenic wave functions:

$ψ_{0} (\vec{r_{1}}, \vec{r_{2}}) \equiv ψ_{100} (\vec{r_{1}}) ψ_{100} (\vec{r_{2}}) = \frac{8}{π a^{3}} e^{- 2 (r_{1} + r_{2}) / a},$

and the energy is $8 E_{1} \approx - 109$ eV, which is very far from $- 78.975$ eV. Let’s instead use the Variational Principle with the hydrogenic $ψ_{0}$ as the ansatz wave function.

As we’ll see soon, the closer the ansatz is to the actual ground state wave function, the better an approximation the Variational Principle gives. So this ansatz makes sense, since it’s an eigenfunction of most of the Hamiltonian:

$\hat{H} ψ_{0} = (8 E_{1} + V_{e e}) ψ_{0} .$

Hydrogenic Ansatz

The Variational Principle tells us $E_{gs} \leq ⟨ \hat{H} ⟩$ . Hence, with the hydrogenic ansatz, we are tasked with solving

$⟨ \hat{H} ⟩ = 8 E_{1} + ⟨ V_{e e} ⟩ .$

where $⟨ V_{e e} ⟩ = ⟨ ψ_{0} | \frac{e^{2}}{4 π ϵ_{0}} \frac{1}{| \vec{r_{1}} - \vec{r_{2}} |} | ψ_{0} ⟩$ $= (\frac{e^{2}}{4 π ϵ_{0}}) {(\frac{8}{π a^{3}})}^{2} \int \frac{e^{- 4 (r_{1} + r_{2}) / a}}{| \vec{r_{1}} - \vec{r_{2}} |} d^{3} \vec{r_{1}} d^{3} \vec{r_{2}} .$

Solving the $\vec{r_{2}}$ integral first, we align $\vec{r_{1}}$ to be along the polar axis in the $\vec{r_{2}}$ frame.

By the law of cosines, $| \vec{r_{1}} - \vec{r_{2}} | = \sqrt{r_{1}^{2} + r_{2}^{2} - 2 r_{1} r_{2} \cos θ_{2}}$ . Hence,

$I_{2} \equiv \int \frac{e^{- 4 r_{2} / a}}{| \vec{r_{1}} - \vec{r_{2}} |} d^{3} \vec{r_{2}} = \int \frac{e^{- 4 r_{2} / a}}{\sqrt{r_{1}^{2} + r_{2}^{2} - 2 r_{1} r_{2} \cos θ_{2}}} r_{2}^{2} \sin θ_{2} \, d r_{2} \, d θ_{2} \, d φ_{2} .$

The integrand has no $φ_{2}$ dependence, so its contribution is just $2 π$. The $θ_{2}$ integral can be calculated,

$\int_{0}^{π} \frac{\sin θ_{2}}{\sqrt{r_{1}^{2} + r_{2}^{2} - 2 r_{1} r_{2} \cos θ_{2}}} d θ_{2} = \frac{\sqrt{r_{1}^{2} + r_{2}^{2} - 2 r_{1} r_{2} \cos θ_{2}}}{r_{1} r_{2}} \Big|_{0}^{π}$ $= \frac{1}{r_{1} r_{2}} [(r_{1} + r_{2}) - | r_{1} - r_{2} |] = \begin{cases} 2 / r_{1}, & r_{2} < r_{1}, \\ 2 / r_{2}, & r_{2} > r_{1} . \end{cases}$

Therefore,

$I_{2} = 4 π (\frac{1}{r_{1}} \int_{0}^{r_{1}} e^{- 4 r_{2} / a} r_{2}^{2} d r_{2} + \int_{r_{1}}^{\infty} e^{- 4 r_{2} / a} r_{2} d r_{2})$ $= \frac{π a^{3}}{8 r_{1}} [1 - (1 + \frac{2 r_{1}}{a}) e^{- 4 r_{1} / a}] .$

The $r_{1}$ part of the integral is still left. Putting everything together to evaluate $⟨ V_{e e} ⟩$,

$⟨ V_{e e} ⟩ = (\frac{e^{2}}{4 π ϵ_{0}}) (\frac{8}{π a^{3}}) \int [1 - (1 + \frac{2 r_{1}}{a}) e^{- 4 r_{1} / a}] e^{- 4 r_{1} / a} r_{1} \sin θ_{1} d r_{1} d θ_{1} d φ_{1}$

$= (\frac{e^{2}}{4 π ϵ_{0}}) (\frac{8}{π a^{3}}) \cdot 4 π \int_{0}^{\infty} [r e^{- 4 r / a} - (r + \frac{2 r^{2}}{a}) e^{- 8 r / a}] d r$

$= \frac{5}{4 a} (\frac{e^{2}}{4 π ϵ_{0}}) = - \frac{5}{2} E_{1} = 34 \text{ eV} .$

And hence,

$⟨ \hat{H} ⟩ = - 109 \text{ eV} + 34 \text{ eV} = - 75 \text{ eV} .$
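The radial integral above is easy to sanity-check numerically. The sketch below assumes atomic units ($a = 1$ and $\frac{e^{2}}{4 π ϵ_{0}} = 1$, so energies come out in hartree, with 1 hartree $= 2 |E_{1}| \approx 27.2114$ eV) and uses a plain composite Simpson's rule. The cutoff at $r = 20$ and the number of subintervals are arbitrary choices of mine, not part of the derivation.

```python
import math

def simpson(f, a, b, n=20000):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def radial_integrand(r):
    # Atomic units: a = 1, so this is r e^{-4r} - (r + 2 r^2) e^{-8r}.
    return r * math.exp(-4.0 * r) - (r + 2.0 * r * r) * math.exp(-8.0 * r)

HARTREE_IN_EV = 27.2114  # assumed conversion factor: 1 hartree = 2 |E_1|

# Prefactor (8 / (pi a^3)) * 4 pi = 32 in atomic units.
# The integrand is negligible beyond r = 20, so the upper limit is truncated there.
v_ee_hartree = 32.0 * simpson(radial_integrand, 0.0, 20.0)
v_ee_ev = v_ee_hartree * HARTREE_IN_EV
e1_ev = -HARTREE_IN_EV / 2.0
h_expect_ev = 8.0 * e1_ev + v_ee_ev
print(f"<V_ee> = {v_ee_hartree:.4f} hartree = {v_ee_ev:.1f} eV, <H> = {h_expect_ev:.1f} eV")
```

The numerical value of $⟨ V_{e e} ⟩$ should agree with $\frac{5}{4}$ hartree, which is the $- \frac{5}{2} E_{1} \approx 34$ eV found analytically.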

Given the experimental value is $\approx - 79$ eV, we’re already within about 5% of the answer to an equation we cannot solve exactly. Let’s keep going!

To beat this approximation, we have to find a better ansatz wave function. The closer the ansatz is to the true Helium ground state, the better.

Helium is a system wherein electrons (charge $- e$) not only get pulled by the nuclear charge $+ 2 e$ (that is, $Z = 2$), but also repel each other. This repulsion acts against the inward pull from the nucleus. In this manner, we can think of each electron as partially “shielding” the nucleus from the other, making the net effective nuclear charge $Z$ slightly less than 2. This suggests an ansatz of the form

$ψ_{1} (\vec{r_{1}}, \vec{r_{2}}) \equiv \frac{Z^{3}}{π a^{3}} e^{- Z (r_{1} + r_{2}) / a},$

where we treat $Z$ as a variational parameter, rather than fixing it at 2. This wave function is an eigenstate of a hydrogenic Hamiltonian with $Z$ in place of 2 in the Coulomb terms. With that in mind, we rewrite $\hat{H}$ as

$\hat{H} = - \frac{ℏ^{2}}{2 m} (\nabla_{1}^{2} + \nabla_{2}^{2}) - \frac{e^{2}}{4 π ϵ_{0}} (\frac{Z}{r_{1}} + \frac{Z}{r_{2}})$

$+ \frac{e^{2}}{4 π ϵ_{0}} (\frac{Z - 2}{r_{1}} + \frac{Z - 2}{r_{2}} + \frac{1}{| \vec{r_{1}} - \vec{r_{2}} |}) .$

We’ll solve for $⟨ \hat{H} ⟩$ in the same way as before, which gives an expression involving an arbitrary $Z$:

$⟨ \hat{H} ⟩ = 2 Z^{2} E_{1} + 2 (Z - 2) (\frac{e^{2}}{4 π ϵ_{0}}) ⟨ \frac{1}{r} ⟩ + ⟨ V_{e e} ⟩ .$

For hydrogenic wave functions with nuclear charge $Z$, the expectation value $⟨ \frac{1}{r} ⟩ = \frac{Z}{a}$ ($a$ being the Bohr radius). The expectation value $⟨ V_{e e} ⟩$ is the same as before, just with $Z$ instead of 2. Because the characteristic radius of a hydrogenic orbital scales inversely with $Z$, we obtain it from the $Z = 2$ result by substituting $a \mapsto \frac{2}{Z} a$:

$⟨ V_{e e} ⟩ = \frac{5 Z}{8 a} (\frac{e^{2}}{4 π ϵ_{0}}) = - \frac{5 Z}{4} E_{1} .$

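Using $\frac{e^{2}}{4 π ϵ_{0}} \frac{1}{a} = - 2 E_{1}$, the middle term becomes $- 4 Z (Z - 2) E_{1}$. Hence,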

$⟨ \hat{H} ⟩ = [2 Z^{2} - 4 Z (Z - 2) - \frac{5}{4} Z] E_{1} = [- 2 Z^{2} + \frac{27}{4} Z] E_{1} .$

We’re getting close now. Because $E_{gs} \leq ⟨ \hat{H} ⟩$ for every value of $Z$, the tightest upper bound on $E_{gs}$ is obtained by minimizing $⟨ \hat{H} ⟩$ with respect to $Z$:

$\frac{d}{d Z} ⟨ \hat{H} ⟩ = [- 4 Z + \frac{27}{4}] E_{1} = 0.$

It directly follows that

$Z = \frac{27}{16} = 1.6875 .$

So the electrons in Helium shield the nucleus by roughly 16%. Plugging this value of $Z$ back into $⟨ \hat{H} ⟩$ ,

$⟨ \hat{H} ⟩ = \frac{1}{2} {(\frac{3}{2})}^{6} E_{1} = - 77.5 \text{ eV} .$

This is within 2% of Helium’s actual ground state.
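The minimization over $Z$ can also be done numerically, which is closer in spirit to how variational methods are used when no closed form is available. This is a minimal sketch that takes the expression $⟨ \hat{H} ⟩ = [- 2 Z^{2} + \frac{27}{4} Z] E_{1}$ from above with $E_{1} = - 13.6$ eV and minimizes it with a golden-section search. The search interval $[1, 2]$ and the choice of golden-section search are my own assumptions for illustration.

```python
E1 = -13.6  # hydrogen ground-state energy in eV

def energy(z):
    """<H>(Z) = (-2 Z^2 + 27/4 Z) E_1 for the screened hydrogenic ansatz, in eV."""
    return (-2.0 * z * z + 6.75 * z) * E1

def golden_section_minimize(f, lo, hi, tol=1e-10):
    """Minimize a unimodal function f on [lo, hi] by golden-section search."""
    inv_phi = (5 ** 0.5 - 1) / 2
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    while hi - lo > tol:
        if f(c) < f(d):
            hi = d  # minimum lies in [lo, d]
        else:
            lo = c  # minimum lies in [c, hi]
        c = hi - inv_phi * (hi - lo)
        d = lo + inv_phi * (hi - lo)
    return 0.5 * (lo + hi)

z_best = golden_section_minimize(energy, 1.0, 2.0)
print(f"Z = {z_best:.4f}, <H> = {energy(z_best):.2f} eV, <H>(Z=2) = {energy(2.0):.2f} eV")
```

The search should land on $Z = 27/16$ and an energy near $- 77.5$ eV, lower than (and therefore a tighter bound than) the $Z = 2$ value.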

While this was a bit of math, it does showcase the power of the Variational Principle and how easy it is to use, despite the complicated integrals. This procedure is the basis for quantum algorithms like VQE, which quantum chemists use to approximate the ground state energies of molecular systems. VQE generates its ansatzes using parameterized quantum circuits. In this way, the Variational Principle turns into an optimization problem much like those in machine learning. By iteratively adjusting the parameters of the quantum circuit, for example with gradient descent on the measured energy, more accurate ansatzes are formed, which yield better ground state energy approximations.
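As a rough illustration of that loop, here is a toy, purely classical sketch; it does not use any quantum hardware or quantum SDK. It assumes a hypothetical single-qubit Hamiltonian $\hat{H} = h_{z} Z + h_{x} X$ (here $Z$ and $X$ are Pauli matrices, and the coefficients are made up) and a one-parameter ansatz $(\cos \frac{θ}{2}, \sin \frac{θ}{2})$, for which $⟨ \hat{H} ⟩ = h_{z} \cos θ + h_{x} \sin θ$. The gradient is taken with the parameter-shift rule, and the starting point and learning rate are arbitrary.

```python
import math

# Hypothetical single-qubit Hamiltonian H = HZ * Z + HX * X (coefficients chosen for illustration).
HZ, HX = 1.0, 0.5

def energy(theta):
    """<psi(theta)|H|psi(theta)> for the ansatz (cos(theta/2), sin(theta/2))."""
    return HZ * math.cos(theta) + HX * math.sin(theta)

def gradient(theta):
    """Parameter-shift rule: the derivative from two shifted energy evaluations."""
    return 0.5 * (energy(theta + math.pi / 2) - energy(theta - math.pi / 2))

theta = 0.3          # arbitrary starting parameter
learning_rate = 0.2  # arbitrary step size
for _ in range(500):
    theta -= learning_rate * gradient(theta)  # plain gradient descent on the energy

exact = -math.hypot(HZ, HX)  # lowest eigenvalue of HZ * Z + HX * X
print(f"variational energy = {energy(theta):.6f}, exact ground state = {exact:.6f}")
```

In this toy case the ansatz can represent the true ground state, so the optimized energy reaches the exact value. For a realistic molecule the ansatz generally cannot, and the Variational Principle only guarantees that the result is an upper bound.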