The Variational Principle

I’m currently working on writing an animation package for visualizing quantum circuits. I had hoped it would have been finished sooner so I could complete Part 2 of visualizing entanglement. Until then, I’d like to talk about something I learned in a lecture that may be one of the most useful results in quantum mechanics, while also being embarrassingly simple. It’s called the Variational Principle and is the basis for how variational quantum algorithms work, specifically the variational quantum eigensolver (VQE).

I’ll boldly state the result here and build up to it as we go on. Pick any normalized wave function ψ whatsoever. Then

Egsψ|H^|ψH^

which says the expectation value of the Hamiltonian H^ with any arbitrary wave function ψ will always overestimate the ground state energy of H^. Of course, if ψ just happens to be the ground state wave function, then the above equation is an equivalence. If ψ happens to be any of the excited states, then of course EgsH^. The point is that this overestimation holds for any ψ whatsoever.

The proof is actually quite simple.

Proof: Since the (unknown) eigenstates of H^ are orthonormal and linearly independent (i.e. ψi|ψj=δij) for any two eigenstates, they form a complete set. Therefore, we can express any arbitrary ψ as a linear combination of them:

ψ=ncnψn,with H^ψn=Enψn,

Since ψ is normalized,

1=ψ|ψ=mcmψm|ncnψn =mncmcnψm|ψn=n|cn|2.

(because any two eigenstates are orthonormal: ψm|ψn=δmn).

Calculating H^,

H^=mcmψm|H^|ncnψn=cmψm|H^ncnψn =mncmEncnψm|ψn=nEn|cn|2.

But the ground state energy Egs is, by definition, the lowest eigenvalue, EgsEnn. Hence,

H^=nEn|cn|2Egsn|cn|2=Egs

H^Egs.

Helium

I think the best example to showcase the power of the Variational Principle is calculating the ground state energy of Helium.

Despite Helium being the second simplest elemental Hamiltonian, consisting of only two electrons in orbit around a nucleus containing two protons (with some neutrons), its Schrödinger equation is actually unsolvable.

The Helium Atom.

Like any many-body system, its differential equation is impossible to solve exactly. When astrodynamicists calculate trajectories of satellites traveling to, say, Jupiter, they would have to account for the gravitational attraction of the Sun, Earth, and Jupiter at a minimum to be exact. However, there exists no solvable equation to account for the pull of a three-body system. They instead discretize the system, calculating the Earth-satellite system when the satellite is near Earth, and the Jupiter-satellite system when the satellite is near Jupiter. (Helium electrons are always close together, unfortunately, so you can’t really do this.) Elsewhere, they employ numerical techniques and possibly some perturbation theory.

The Hamiltonian for Helium (ignoring fine structure) is given by

H^=22m(12+22)e24πϵ0(2r1+2r21|r1r2|).

The ground state of Helium has been measured experimentally to be Egs78.975 eV (experimental).

This is the quantity we’ll do our best to reproduce mathematically.

The issue with trying to solve this Hamiltonian comes from the electron-electron repulsion potential,

Vee=e24πϵ01|r1r2|.

We could employ time-independent perturbation theory using H^=Vee. However, because the Vee is not a small perturbation, this approximation would be quite far off.

If we just ignore Vee, the Helium Hamiltonian simplifies into two hydrogenic Hamiltonians

H^=(22m12e24πϵ02r1)(22m22e24πϵ02r2).

with a nuclear charge of 2e instead of e. The exact solution is just the product of hydrogenic wave functions:

ψ0(r1,r2)ψ100(r1)ψ100(r2)=8πa3e2(r1+r2)/a,

and the energy is 8E1109 eV, which is very far from 78.975 eV. Let’s instead use the Variational Principle with the hydrogenic ψ0 as the ansatz wave function.

As we’ll see soon, the closer the ansatz is to the actual ground state wave function, the better an approximation the Variational Principle gives. So this ansatz make sense since it’s an eigenfunction for most of the Hamiltonian:

H^ψ0=(8E1+Vee)ψ0.

Hydrogenic Ansatz

The Variational Principle tells us EgsH^. Hence, with the hydrogenic ansatz, we are tasked with solving

H^=8E1+Vee.

where Vee=ψ0|e24πϵ01|r1r2||ψ0. =(e24πϵ0)(8πa3)2e4(r1+r2)/a|r1r2|d3r1d3r2.

Solving the r2 integral first, we align r1 to be along the polar axis in the r2 frame.

By the law of cosines, |r1r2|=r12+r222r1r2cosθ2. Hence,

I2e4r2/a|r1r2|d3r2=e4r2/ar12+r222r1r2cosθ2r22dθ2dφ2

The integrand has no φ2 dependence, so the its contribution is just 2π. The θ2 integral can be calculated,

0πsinθ2r12+r222r1r2cosθ2dθ2=r12+r222r1r2cosθ2r1r2|0π =1r1r2[(r1+r2)|r1r2|]={2/r1,r2<r1,2/r2,r2>r1.

Therefore,

I2=4π(1r10r1e4r2/ar22dr2+r1e4r2/ar2dr2) =πa38r1[1(1+2r1a)e4r1/a].

The r1 part of the integral is still left. Finally solving for Vee,

Vee=(e24πϵ0)(8πa3)[1(1+2r1a)e4r1/a]e4r1/ar1sinθ1dr1dθ1dφ1

=(e24πϵ0)(8πa3)4π0[re4r/a(r+2r2a)e8r/a]dr

=54a(e24πϵ0)=52E1=34 eV.

And hence,

H^=109 eV+34 eV=75 eV.

Given the experimental value is 79 eV, we’re already only 5% off from solving an unsolvable equation. Let’s keep going!

To beat this approximation, we have to find a better ansatz wave function. The closer the ansatz is to the intrinsic Helium ground state, the better.

Helium is a system wherein electrons (e) not only get pulled by the nuclear charge (Z=+2e), but also feel a repulsion between each other. This repulsion acts against the inward pull from the nucleus. In this manner, we can think of each electron partially “shielding” the nucleus from the other by, making the net effective nuclear charge Z slightly less than 2. This suggests a ansatz of the form

ψ1(r1,r2)Z3πa3eZ(r1+r2)/a,

where we treat Z as a variational parameter, rather than equating it to 2. This equation is an eigenstate of a hydrogenic Hamiltonian, only with Z this time instead of 2. Thus, H^ is evidently

H^=22m(12+22)e24πϵ0(Zr1+Zr2)

+e24πϵ0(Z2r1+Z2r2+1|r1r2|).

We’ll solve for H^ similarly which will give us an expression involving an arbitary Z.

H^=2Z2E1+2(Z2)(e24πϵ0)1r+Vee.

For hydrogenic wave functions with nuclear charge Z , the expectation value 1r=Za (a being the Bohr radius). The expectation value Vee is the exact same as before, just with Z instead of 2. Because the Bohr radius also scaled inversely with Z, we also adjust a2Za.

Vee=5Z8a(e24πϵ0)=5Z4E1.

Hence,

H^=[2Z24Z(Z2)54Z]E1=[2Z2+274Z]E1.

We’re getting close now. Because EgsH^, that is, the value above exceeds Egs for any value of Z, the closest value to Egs occurs when H^ is minimized.

ddZH^=[4Z+274]E1=0.

It directly follows that

Z=2716=1.6875.

So the electrons in Helium shield the nucleus by roughly 16%. Plugging this value of Z back into H^,

H^=12(32)6E1=77.5 eV.

This is within 2% of Helium’s actual ground state.

While this was a bit of math, it does showcase the power of the Variational Principle and how easy it is to use, albeit the complicated integrals. This procedure is the basis for quantum algorithms like VQE, which quantum chemists use to approximate the ground state energies for any system. VQE generates its ansatzs using parameterized quantum circuits. In this way, the Variational Principle turns into a machine learning problem. By continuously adjusting the parameters of the quantum circuit using gradient descent on the system’s Hilbert space, more accurate ansatzs are formed, which yield better ground state energy approximations.