Chapter 4: Spherical Harmonics
NOTE: Math will not display properly in Safari - please use another browser
The material covered in this chapter is partially presented in Boas Chapter 12, Section 10.
4.1 Associated Legendre Function
Chapter 3 was all about the Legendre polynomials Pℓ(x). Here we build on these and introduce the associated Legendre functions Pmℓ(x) in the first part of the chapter, and the spherical harmonics Ymℓ(θ,ϕ) in the second part. Spherical harmonics are used extremely widely in physics. You will see them soon enough in quantum mechanics, they are front and centre in advanced electromagnetism, and they will be among your best friends if you ever become a cosmologist. The presentation here will be fairly terse and dry: apologies! Applications will come in Chapter 10.
The associated Legendre functions are obtained by differentiating the Legendre polynomial m times and multiplying by (1−x2)m/2,
By convention the label m on the functions is indicated as a superscript. This label is not to be confused with a power: Pmℓ is not Pℓ raised to the power m. The definition of Eq.(4.1) implies that m≥0, and it is understood that P0ℓ≡Pℓ. Because Pℓ is a polynomial of degree ℓ, taking m derivatives reduces the largest power of x to ℓ−m, and there is nothing left to be differentiated once m has reached its limiting value m=ℓ. This implies that the associated Legendre functions are not defined for m>ℓ, and Eq.(4.1) therefore comes with the condition 0≤m≤ℓ.
While taking m derivatives of a Legendre polynomial still returns a polynomial, the factor of (1−x2)m/2 inserted in the definition of Pmℓ implies that in general, these functions are not polynomials. They are still polynomials when m is an even number, but they are not polynomials when m is odd. The factor also implies that in general, the associated Legendre functions become imaginary when x is smaller than −1 or larger than +1. To avoid this complication we shall restrict x to the interval (−1,1), and take the functions to be real.
It is a straightforward task to apply the definition of Eq.(4.1) and generate the first few associated Legendre functions. We find
P11=(1−x2)1/2,P12=3x(1−x2)1/2,P22=3(1−x2),P13=32(5x2−1)(1−x2)1/2,P23=15x(1−x2),P33=15(1−x2)3/2,P14=52(7x3−3x)(1−x2)1/2,P24=152(7x2−1)(1−x2),P34=105x(1−x2)3/2,P44=105(1−x2)2.
Exercise 4.1: Verify that the listing of Eq.(4.2) follows from the definition of Eq.(4.1).
It is useful to extend the definition of the associated Legendre functions to negative values of m. This is achieved by the relation
P−mℓ(x):=(−1)m(ℓ−m)!(ℓ+m)!Pmℓ(x),
in which m is taken to be positive. The definition indicates that except for an overall numerical factor, P−mℓ is the same function of x as Pmℓ. With this extended definition, the parameter m is now allowed to lie in the interval −ℓ≤m≤ℓ.
4.2 Associated Legendre Functions
We shall show that the associated Legendre functions satisfy the second-order differential equation
(1−x2)d2Pmℓdx2−2xdPmℓdx+[ℓ(ℓ+1)−m21−x2]Pmℓ=0,
known as the associated Legendre equation. You may note that as it should, the equation reduces to Legendre's equation (3.31) when m=0. You may also note that the equation depends on m2, and therefore does not care about the sign of m.
Exercise 4.2: Verify that the functions of Eq.(4.2) are solutions to the associated Legendre equation.
To establish Eq.(4.4) it is useful to introduce the auxiliary functions
so that Pmℓ=(1−x2)m/2Umℓ. In these equations and the manipulations below, it is understood that m≥0. We know that the functions U0ℓ≡Pℓ are solutions to Legendre's equation,
(1−x2)d2U0ℓdx2−2xdU0ℓdx+ℓ(ℓ+1)U0ℓ=0,
and differentiating this m times reveals that Umℓ satisfies
(1−x2)d2Umℓdx2−2(m+1)xdUmℓdx+[ℓ(ℓ+1)−m(m+1)]Umℓ=0.
You are encouraged to verify this statement, but be warned: it is not at all pleasant to differentiate Legendre's equation m times, because of the factor 1−x2 in front of the second derivative, and the factor 2x in front of the first derivative. Fortunately there is an easier (and more interesting) way to prove the validity of Eq.(4.7). We shall proceed by induction.
Let us assume that Eq.(4.7) is known to be valid for some value of m, denoted m∗. We will show that it must then also be valid for the next value of m, m∗+1. So if the equation is valid for m=m∗ it is also valid for m=m∗+1, and resetting m∗ to m∗+1, we also have validity for m=m∗+2, and so on, all the way to m=ℓ. The conclusion thus far is that if the equation is valid for m=m∗, it must also be valid for all values of m larger than m∗. But we know that Eq.(4.7) is valid for m=0, because it then reduces to Legendre's equation. We can therefore set m∗=0, and we have established the validity of Eq.(4.7) for all values of m.
The inductive argument rests on the essential fact that if Eq.(4.7) is valid for m=m∗, it must also be valid for m=m∗+1. This we can prove by differentiating the equation with respect to x and noting that by virtue of its definition,
Um∗+1ℓ=dUm∗ℓdx.
So set m=m∗ in Eq.(4.7) and differentiate to get
(1−x2)d3Um∗ℓdx3−2(m∗+2)xd2Um∗ℓdx2+[ℓ(ℓ+1)−(m∗+1)(m∗+2)]dUm∗ℓdx=0
after simplification. This can re-expressed as
(1−x2)d2Um∗+1ℓdx2−2(m∗+2)xdUm∗+1ℓdx+[ℓ(ℓ+1)−(m∗+1)(m∗+2)]Um∗+1ℓ=0,
which is just Eq.(4.7) again with m=m∗+1. This derivation, combined with the inductive argument detailed in the preceding paragraph, provides a complete proof of the validity of Eq.(4.7) for any value of m.
Exercise 4.3: Reproduce the steps that lead to Eq.(4.10). Take the time to think through the inductive proof of Eq.(4.7). Proof by induction is a wonderful mathematical method, but it can be elusive and difficult to understand at first.
With Eq.(4.7) established it is now straightforward to prove the validity of Eq.(4.4). It is simply a matter of turning Eq.(4.1) into
Umℓ(x)=(1−x2)−m/2Pmℓ(x)
by virtue of Eq.(4.5), and making the substitution in Eq.(4.7). A single differentiation yields
dUmℓdx=(1−x2)−m/2dPmℓdx+mx(1−x2)−m/2−1Pmℓ,
and two differentiations give
d2Umℓdx2=(1−x2)−m/2d2Pmℓdx2+2mx(1−x2)−m/2−1dPmℓdxtext+m(1−x2)−m/2−2[1+(m+1)x2]Pmℓ
after simplification. Inserting this within Eq.(4.7), multiplying through by (1−x2)m/2, and simplifying, we finally arrive at Eq.(4.4).
Exercise 4.4: Verify that Eq.(4.7) leads to the associated Legendre equation (4.4).
4.3 Recursion Relations
The associated Legendre functions satisfy a number of recursion relations, which can be found in the standard textbooks. A useful one is
Pmℓ=2(m−1)x(1−x2)−1/2Pm−1ℓ−(ℓ+m−1)(ℓ−m+2)Pm−2ℓ.
It can be used to compute the functions efficiently, once the functions with low values of m have been obtained from the definition.
Exercise 4.5: Knowing that P4=18(35x4−30x2+3) and P14=52(7x3−3x)(1−x2)1/2, compute P24, P34, and P44 by exploiting the recursion relation. Make sure your results agree with the listing of Eq.(4.2).
The recursion relation follows from Eq.(4.7), which we write in the equivalent form
(1−x2)Um+2ℓ−2(m+1)xUm+1ℓ+(ℓ+m+1)(ℓ−m)Umℓ=0.
We put m→m−2 in this equation, multiply through by (1−x2)m/2−1, recall the definition of Eq.(4.5), and arrive directly at Eq.(4.14).
Exercise 4.6: Reproduce the steps that take us from Eq.(4.7) to Eq.(4.14).
4.4 Orthogonality
Like the Legendre polynomials, the associated Legendre functions form a set of orthogonal functions on the interval (−1,1),
when ℓ≠ℓ′. Notice that m is the same for both functions; the statement would not be true if the integrand were replaced by PmℓPm′ℓ′ with m≠m′. The proof of Eq.(4.16) follows exactly the same steps as those involved in establishing the orthogonality of the Legendre polynomials. We shall not go over this again, but simply note that the proof begins with Eq.(4.4) multiplied by Pmℓ′, which is subtracted from a second copy of the equation in which ℓ is interchanged with ℓ′. With m the same in both copies of the equation, the terms involving m2/(1−x2) cancel out after subtraction, and the proof proceeds just as in Sec.3.7.
Exercise 4.7: Fill in the steps and provide a complete derivation of Eq.(4.16).
We define the norm of the associated Legendre functions by setting ℓ′ equal to ℓ inside the integral. We shall show that this is given by
Nmℓ:=∫1−1[Pmℓ(x)]2dx=22ℓ+1(ℓ+m)!(ℓ−m)!.
The norm evidently reduces to Eq.(3.47) when m=0. Equations (4.16) and (4.17) can be combined into the single statement
∫1−1Pmℓ(x)Pmℓ′(x)dx=22ℓ+1(ℓ+m)!(ℓ−m)!δℓℓ′,
which involves the Kronecker delta δℓℓ′.
To establish Eq.(4.17) we return\footnote{The following derivation cannot be found in the standard textbooks. It was extracted from Sec.~II.1.8 of Equations of Mathematical Physics by A.N.\ Tikhonov and A.A.\ Samarskii (Dover, 1990).} once more to Eq.(4.7), which we write in the alternative form
ddx[(1−x2)m+1Um+1ℓ]+(ℓ+m+1)(ℓ−m)(1−x2)mUmℓ=0.
In this we put m→m−1 and obtain
ddx[(1−x2)mUmℓ]=−(ℓ+m)(ℓ−m+1)(1−x2)m−1Um−1ℓ.
Now, Eq.(4.17) involves [Pmℓ]2=(1−x2)m[Umℓ]2, which we write as
[Pmℓ]2=(1−x2)mUmℓdUm−1ℓdx=ddx[(1−x2)mUmℓUm−1ℓ]−Um−1ℓddx[(1−x2)mUmℓ]=ddx[(1−x2)mUmℓUm−1ℓ]+(ℓ+m)(ℓ−m+1)(1−x2)m−1[Um−1ℓ]2=ddx[(1−x2)1/2PmℓPm−1ℓ]+(ℓ+m)(ℓ−m+1)[Pm−1ℓ]2;
in the last step we invoked the definition of Eq.(4.5). Integrating this from x=−1 to x=1, we see that the total derivative produces vanishing boundary terms, and we are left with
This is a recursion relation for the norm of the associated Legendre functions.
Exercise 4.8: Make sure that you can reproduce all the steps that lead to Eq.(4.22).
It is now a simple matter to solve Eq.(4.22) for Nmℓ. With m=1 the equation becomes N1ℓ=(ℓ+1)ℓN0ℓ. With m=2 we get N2ℓ=(ℓ+2)(ℓ−1)N1ℓ=(ℓ+2)(ℓ+1)ℓ(ℓ−1)N0ℓ, and with m=3 we obtain N3ℓ=(ℓ+3)(ℓ−2)N2ℓ=(ℓ+3)(ℓ+2)(ℓ+1)ℓ(ℓ−1)(ℓ−2)N0ℓ. A clear pattern emerges, and after m iterations of the recursion relation we get
Nmℓ=(ℓ+m)(ℓ+m−1)(ℓ+m−2)⋯(ℓ−m+1)N0ℓ.
The factor in front of N0ℓ can be written neatly as (ℓ+m)!/(ℓ−m)!, because
(ℓ+m)!(ℓ−m)!=(ℓ+m)(ℓ+m−1)⋯(ℓ−m+1)(ℓ−m)(ℓ−m−1)⋯(1)(ℓ−m)(ℓ−m−1)⋯(1)=(ℓ+m)(ℓ+m−1)⋯(ℓ−m+1).
And because we know that N0ℓ=2/(2ℓ+1) --- this is the norm of the Legendre polynomials --- we have arrived at Eq.(4.17).
Exercise 4.9: Make sure that you can reproduce all the steps that took us from the recursion relation to Eq.(4.17).
4.5 A Change of Variables
In many applications, the associated Legendre functions are expressed in terms of an angle θ instead of the original variable x. The transformation is
x=cosθ,
and the functions are now denoted Pmℓ(cosθ). The interval −1≤x≤1 is covered by letting θ range from θ=0 to θ=π; notice that x decreases as θ increases. The ubiquitous factor of (1−x2)1/2 in the associated Legendre functions becomes the more compelling sinθ when we make the change of variables; notice that sinθ is everywhere positive in the interval 0≤θ≤π.
It is easy to verify that the functions listed in Eq.(4.2) become
P0=1,P01=cosθ,P11=sinθ,P2=12(3cos2θ−1),P12=3sinθcosθ,P22=3sin2θ,P3=12(5cos3θ−3cosθ),P13=32sinθ(5cos2θ−1),P23=15sin2θcosθ,P33=15sin3θ,P4=18(35cos4θ−30cos2θ+3),P14=52sinθ(7cos3θ−3cosθ),P24=152sin2θ(7cos2θ−1),P34=105sin3θcosθ,P44=105sin4θ
when expressed in terms of θ. For completeness we have included the Legendre polynomials Pℓ≡P0ℓ in the list, also expressed in terms of the angular variable.
Exercise 4.10: Verify that the Legendre polynomials of Eq.(3.8) and the associated functions of Eq.(4.2) take the forms listed in Eq.(4.26) when the transformation x=cosθ is implemented.
The differential equation of Eq.(4.4) becomes
d2Pmℓdθ2+cosθsinθdPmℓdθ+[ℓ(ℓ+1)−m2sin2θ]Pmℓ=0
when expressed in terms of the new variable. To arrive at this is simply a matter of applying the chain rule to the derivatives. We have
dPmℓdx=dPmℓdθdθdx=−1sinθdPmℓdθ,d2Pmℓdx2=−1sinθddθ(−1sinθdPmℓdθ)=1sin2θd2Pmℓdθ2−cosθsin3θdPmℓdθ,
and inserting these expressions within Eq.(4.4) returns Eq.(4.27) after simplification. The equation can also be put in the alternative form
1sinθddθ(sinθdPmℓdθ)+[ℓ(ℓ+1)−m2sin2θ]Pmℓ=0.
Exercise 4.11: Verify Eqs.(4.27) and (4.30).
The orthonormality relation of Eq.(4.18) becomes
∫π0Pmℓ(cosθ)Pmℓ′(cosθ)sinθdθ=22ℓ+1(ℓ+m)!(ℓ−m)!δℓℓ′
in the new variable. To get this we simply implement the change of variables. With x=cosθ we have that dx=−sinθdθ, and we see that the lower bound x=−1 is mapped to θ=π, while the upper bound x=1 is mapped to θ=0. The integral is initially expressed as ∫0π⋯, but rewritten as −∫π0⋯ by reversing the direction of integration. The minus sign in front of the integral is finally combined with the minus sign in dx=−sinθdθ, and we arrive at Eq.(4.31).
Exercise 4.12: Go through the steps leading to Eq.(4.31).
4.6 Spherical Harmonics
The spherical harmonics Ymℓ(θ,ϕ) are functions of two angles, θ and ϕ. They are defined by taking the associated Legendre functions Pmℓ(cosθ), which depend on θ only, and multiplying them by
eimϕ=cos(mϕ)+isin(mϕ),
a complex function of the second angle. To make things official we insert some strange numerical factors, and define the spherical harmonics as
Ymℓ(θ,ϕ):=(−1)m√2ℓ+14π(ℓ−m)!(ℓ+m)!Pmℓ(cosθ)eimϕ.
It is understood that θ lies in the interval (0,π), that ϕ is confined to (0,2π), and that m is an integer between −ℓ and ℓ. The reason for the square-root factor will be made clear below. The factor (−1)m is conventional, and you should be warned that the convention can depend on the author. In applications, θ and ϕ are interpreted as the familiar angles in the spherical coordinates (r,θ,ϕ). These, we recall from Sec.1.8 are defined by the relations x=rsinθcosϕ, y=rsinθsinϕ, and z=rcosθ. The angles specify the position of a point on the surface of a sphere r=constant, and any function of θ and ϕ can therefore be thought of as a function on the surface of a sphere. This shall be our interpretation of the spherical harmonics: they constitute a set of functions on the surface of a sphere.
Equation (4.33) reduces to
Y0ℓ=√2ℓ+14πPℓ(cosθ)
when m=0, and we see that Y0ℓ is simply a rescaled version of the Legendre polynomials; these functions depend on θ only. The spherical harmonics with negative values of m can be obtained directly from those with positive values by exploiting the identity
with the asterisk indicating complex conjugation. This property follows directly from Eq.(4.33) after we make use of Eq.(4.3) and recall that e−imϕ=(eimϕ)∗.
Exercise 4.13: Prove the validity of Eq.(4.35).
The first few spherical harmonics are given by
Y00=1√4π,Y01=√34πcosθ,Y11=−√38πsinθeiϕ,Y02=√516π(3cos2θ−1),Y12=−√158πsinθcosθeiϕ,Y22=√1532πsin2θe2iϕ,Y03=√716π(5cos3θ−3cosθ),Y13=−√2164πsinθ(5cos2θ−1)eiϕ,Y23=√10532πsin2θcosθe2iϕ,Y33=−√3564πsin3θe3iϕ.
4.7 Orthonormality
We established in Sec.4.4 that the functions Pmℓ and Pmℓ′ are orthogonal to each other, in the sense provided by Eq.(4.31),
∫π0Pmℓ(cosθ)Pmℓ′(cosθ)sinθdθ=22ℓ+1(ℓ+m)!(ℓ−m)!δℓℓ′.
We have already noted that both functions must share the same value of m for this statement to be true. On the other hand, the functions eimϕ and eim′ϕ are also orthogonal to each other, in the sense provided by
Recall the notion of orthogonality of complex functions introduced back in Sec.3.6.
Exercise 4.14: Verify Eq.(4.38). Now may be a good time to refresh your understanding of the complex exponential. We will make heavy use of it in Chapters 7, 9, and 10.
Because the spherical harmonics result from the merger of the associated Legendre functions with the complex exponentials, we might consider merging the two statements of orthogonality. We express this as
∫2π0∫π0[Pmℓ(cosθ)eimϕ]∗[Pm′ℓ′eim′ϕ]sinθdθdϕ=4π2ℓ+1(ℓ+m)!(ℓ−m)!δℓℓ′δmm′,
and conclude that the double integral vanishes unless ℓ′=ℓ and m′=m. Both conditions must be met; equality between m′ and m is required by the ϕ-integral, and with this first condition satisfied, equality between ℓ′ and ℓ is required by the θ-integral. Notice that the θ-integral never has to be performed with m′≠m, because the ϕ-integral always vanishes under such circumstances.
The merged statement of orthogonality can be re-expressed in terms of the spherical harmonics by making use of the definition of Eq.(4.33). Because the double integral vanishes unless ℓ′=ℓ and m′=m, the numerical factor on the right-hand side of Eq.(4.39) can be factorized as
(−1)m√4π2ℓ+1(ℓ+m)!(ℓ−m)!⋅(−1)m′√4π2ℓ′+1(ℓ′+m′)!(ℓ′−m′)!
and transferred to the left-hand side. You may worry about the overall factor of (−1)m+m′, which was not present in Eq.(4.39). This factor is innocuous, because it becomes (−1)2m when m′=m, and since 2m is necessarily an even number, (−1)2m=1. The factor of (−1)m and the first square root are then naturally combined with Pmℓ(cosθ)eimϕ inside the integral, to give rise to the spherical harmonic Ymℓ(θ,ϕ). Absorbing (−1)m′ and the second square root in a similar way to get Ym′ℓ′(θ,ϕ), we arrive at
∫2π0∫π0[Ymℓ(θ,ϕ)]∗[Ym′ℓ′(θ,ϕ)]sinθdθdϕ=δℓℓ′δmm′,
the statement of orthogonality for the spherical harmonics. Because these functions depend on two variables, θ and ϕ, this is a two-dimensional notion of orthogonality, defined in terms of an integral over the surface of a sphere. We emphasize that the integral returns zero unless ℓ′=ℓ and m=m′. When both conditions are met, the integral becomes
∫2π0∫π0|Ymℓ(θ,ϕ)|2sinθdθdϕ=1,
and we say that the norm of the spherical harmonics is equal to one. Alternatively, we might say that the spherical harmonics are normalized functions of θ and ϕ. This is why we inserted the strange numerical factors in the definition of Eq.(4.33): we wanted to ensure that the spherical harmonics are normalized.
Exercise 4.15: Verify explicitly by performing the double integral that the functions Y22(θ,ϕ) and Y13(θ,ϕ) listed in Eq.(4.36) are normalized.
4.8 Differential Equations
The spherical harmonics are also proportional to eimϕ, and for fixed θ they must satisfy
∂2Ymℓ∂ϕ2=−m2Ymℓ.
This is the associated Legendre equation (4.30), which we re-expressed in terms of partial derivatives to make it clear that ϕ is held fixed while differentiating.
The spherical harmonics are also proportional to eimϕ, and for fixed θ they must satisfy
∂2Ymℓ∂ϕ2=−m2Ymℓ.
Again this was expressed in terms of partial derivatives, to emphasize that θ is now held fixed while differentiating.
The second equation reveals that the m2 term in the first equation can be equivalently written in terms of partial derivatives with respect to ϕ. This gives us
1sinθ∂∂θ(sinθ∂Ymℓ∂θ)+1sin2θ∂2Ymℓ∂ϕ2+ℓ(ℓ+1)Ymℓ=0.
This is a second-order, partial differential equation for the spherical harmonics.
The relevance of this equation comes from the fact that it is closely associated with the Laplacian operator in spherical coordinates. We saw back in Sec.1.8 --- refer to Eq.(1.86) --- that ∇2f is expressed as
∇2f=1r2[∂∂r(r2∂f∂r)+1sinθ∂∂θ(sinθ∂f∂θ)+1sin2θ∂2f∂ϕ2]
in spherical coordinates. You will recognize the last two terms if your compare this with Eq.(4.44). This similarity is not an accident, and in fact it provides the main motivation for defining and studying the spherical harmonics. We'll pick this thread up again in Chapter 10.
4.9 Practice Problems
-
Show by evaluating the integral explicitly that P12 and P13 are orthogonal on the interval (−1,1).
-
Calculate ∫1−1(P22)2dx explicitly and show that your result agrees with Eq.(4.18).
-
Using the definition of Eq.(4.1), show that Pmℓ(−x) is proportional to Pmℓ(x), with a numerical factor that is either +1 or −1. How does this numerical factor depend on ℓ and m? Verify that your result agrees with the listing of Eq.(4.2).
-
Complete the listing of Eq.(4.36) by including the spherical harmonics with negative values of m.
-
Express rY01, r2Y12, and r3Y33 in Cartesian coordinates.
4.10 Challenge Problems
- a) Derive the recursion relation
(2ℓ+1)(1−x2)1/2Pmℓ=Pm+1ℓ+1−Pm+1ℓ−1
satisfied by the associated Legendre functions.
b) Use this recursion relation to find an explicit expression for Pℓℓ(x). Verify your result with the special cases ℓ=1,2,3,4.
-
a)Prove that acting on the spherical harmonic Ymℓ with the differential operator
L+:=eiϕ(∂∂θ+icosθsinθ∂∂ϕ)
returns Ym+1ℓ up to a numerical factor. That is, prove that L+Ymℓ=αmℓYm+1ℓ and find an expression for the constant αmℓ. Simplify your expression as much as possible.
b) Recycle your result in part (a) and prove that acting on the spherical harmonic Ymℓ with the differential operator
L−:=e−iϕ(−∂∂θ+icosθsinθ∂∂ϕ)
returns Ym−1ℓ up to a numerical factor.