Blog

Morse Theory — An Introduction

Critical points, the Morse Lemma, and handle decompositions of surfaces.


§ 1.1 — Critical Points of Functions

Definition

A point x0x_0 satisfying f(x0)=0f'(x_0) = 0 is called a critical point of the function ff. Critical points fall into two categories according to the value of f(x0)f''(x_0):

  • x0x_0 is a non-degenerate critical point if f(x0)0f''(x_0) \neq 0.
  • x0x_0 is a degenerate critical point if f(x0)=0f''(x_0) = 0.

Observation: Non-degenerate critical points are “stable,” while degenerate critical points are “unstable.”


§ 1.2 — The Hessian Matrix

We say that a point p0=(x0,y0)p_0 = (x_0, y_0) in the xyxy-plane is a critical point of a function z=f(x,y)z = f(x,y) if:

fx(p0)=0,fy(p0)=0\frac{\partial f}{\partial x}(p_0) = 0, \qquad \frac{\partial f}{\partial y}(p_0) = 0

We assume f(x,y)f(x,y) is of class CC^\infty. We further assume 2fx2(p0)0\frac{\partial^2 f}{\partial x^2}(p_0) \neq 0 and 2fy2(p0)0\frac{\partial^2 f}{\partial y^2}(p_0) \neq 0, though after some coordinate changes this won’t necessarily hold.

Definition — Hessian

Suppose p0=(x0,y0)p_0 = (x_0, y_0) is a critical point of z=f(x,y)z = f(x,y). The Hessian of ff at p0p_0, denoted Hf(p0)H_f(p_0), is the matrix of second derivatives evaluated at p0p_0:

Hf(p0)=(2fx2(p0)2fxy(p0)2fyx(p0)2fy2(p0))H_f(p_0) = \begin{pmatrix} \dfrac{\partial^2 f}{\partial x^2}(p_0) & \dfrac{\partial^2 f}{\partial x \, \partial y}(p_0) \\[10pt] \dfrac{\partial^2 f}{\partial y \, \partial x}(p_0) & \dfrac{\partial^2 f}{\partial y^2}(p_0) \end{pmatrix}

A critical point p0p_0 is non-degenerate if the determinant of the Hessian is nonzero:

detHf(p0)=2fx2(p0)2fy2(p0)(2fxy(p0))20\det H_f(p_0) = \frac{\partial^2 f}{\partial x^2}(p_0) \cdot \frac{\partial^2 f}{\partial y^2}(p_0) - \left(\frac{\partial^2 f}{\partial x \, \partial y}(p_0)\right)^2 \neq 0

Note that Hf(p0)H_f(p_0) is symmetric, since 2fxy=2fyx\dfrac{\partial^2 f}{\partial x \, \partial y} = \dfrac{\partial^2 f}{\partial y \, \partial x}.

Hessian Under Coordinate Change

Lemma

Let p0p_0 be a critical point of z=f(x,y)z = f(x,y). Denote by Hf(p0)H_f(p_0) the Hessian in coordinates (x,y)(x,y) and by Hf(p0)\overline{H}_f(p_0) the Hessian in coordinates (X,Y)(X,Y). Then:

Hf(p0)=t ⁣J(p0)Hf(p0)J(p0)\overline{H}_f(p_0) = {}^t\!J(p_0) \cdot H_f(p_0) \cdot J(p_0)

where J(p0)J(p_0) is the Jacobian matrix:

J(p0)=(xX(p0)xY(p0)yX(p0)yY(p0))J(p_0) = \begin{pmatrix} \dfrac{\partial x}{\partial X}(p_0) & \dfrac{\partial x}{\partial Y}(p_0) \\[8pt] \dfrac{\partial y}{\partial X}(p_0) & \dfrac{\partial y}{\partial Y}(p_0) \end{pmatrix}

Corollary

The property that p0p_0 is a non-degenerate critical point does not depend on the choice of coordinates. The same is true for degenerate critical points.


§ 1.3 — The Morse Lemma

Morse Lemma

Let p0p_0 be a non-degenerate critical point of a function ff of two variables. Then we can choose appropriate local coordinates (X,Y)(X, Y) such that ff takes one of the following three standard forms:

\text{(i)} \quad & f = X^2 + Y^2 + C \\[4pt] \text{(ii)} \quad & f = X^2 - Y^2 + C \\[4pt] \text{(iii)} \quad & f = -X^2 - Y^2 + C \end{aligned}$$ where $C$ is a constant and $p_0$ is the origin.

The theorem states that a function looks extremely simple near a non-degenerate critical point — up to a coordinate change, it is purely quadratic.

Corollary

A non-degenerate critical point of a function of two variables is isolated.

Non-degenerate p₀ ISOLATED Degenerate NOT ISOLATED

Figure 1 — Non-degenerate critical points are isolated; degenerate ones need not be.


§ 1.4 — Index of a Non-Degenerate Critical Point

Definition

Let p0p_0 be a non-degenerate critical point of ff. Using the coordinate system given by the Morse Lemma, we define the index of p0p_0 as:

index(p0)={0if f=x2+y2+C(local minimum)1if f=x2y2+C(saddle point)2if f=x2y2+C(local maximum)\operatorname{index}(p_0) = \begin{cases} 0 & \text{if } f = x^2 + y^2 + C \quad \text{(local minimum)} \\ 1 & \text{if } f = x^2 - y^2 + C \quad \text{(saddle point)} \\ 2 & \text{if } f = -x^2 - y^2 + C \quad \text{(local maximum)} \end{cases}

In other words, the index is the number of minus signs in the standard form.

The index of a non-degenerate critical point p0p_0 is determined by the behaviour of ff near p0p_0.

Index 0 — Minimum x²+y²+C Index 1 — Saddle x²−y²+C Index 2 — Maximum −x²−y²+C

Figure 2 — The three types of non-degenerate critical points in two variables, classified by index.


§ 2.1 — Morse Functions on Surfaces

We now move from functions on R2\mathbb{R}^2 to functions on surfaces. Recall that a closed surface is a compact surface without boundary. The genus of a closed surface is the number of “holes” in it.

genus 0 genus 1 genus 2

Figure 3 — Closed surfaces of genus 0, 1, and 2.

Let MM be a surface. A function f ⁣:MRf\colon M \to \mathbb{R} assigns a real number to each point pMp \in M. We say ff is of class CC^\infty if it is smooth with respect to any smooth local coordinates at each point of MM.

Definition — Morse Function

Suppose that every critical point of f ⁣:MRf\colon M \to \mathbb{R} is non-degenerate. Then we say that ff is a Morse function.

Example

Consider the sphere S2R3S^2 \subset \mathbb{R}^3 defined by x2+y2+z2=1x^2 + y^2 + z^2 = 1. Let f ⁣:S2Rf\colon S^2 \to \mathbb{R} be the height function f(x,y,z)=zf(x,y,z) = z. Then ff has exactly two critical points — the north pole p0p_0 (a maximum) and the south pole q0q_0 (a minimum) — and both are non-degenerate. Hence ff is a Morse function.

z p₀ q₀ f max min

Figure 4 — The height function on $S^2$ as a Morse function with two critical points.

Lemma

A Morse function f ⁣:MRf\colon M \to \mathbb{R} defined on a closed surface MM has only a finite number of critical points.


§ 2.2 — Diffeomorphism and the Reeb Theorem

Definition — Homeomorphism

Suppose there is a one-to-one and onto map h ⁣:XYh\colon X \to Y between two topological spaces XX and YY with inverse h1 ⁣:YXh^{-1}\colon Y \to X. If both maps are continuous, we say XX and YY are homeomorphic and hh is a homeomorphism — intuitively, they have the same shape.

Definition — Diffeomorphism

A homeomorphism h ⁣:MNh\colon M \to N between surfaces is a diffeomorphism if both hh and h1h^{-1} are of class CC^\infty. Two surfaces are called diffeomorphic if there is a diffeomorphism between them.

Theorem — Maximum-Value Theorem

Let f ⁣:XRf\colon X \to \mathbb{R} be a continuous function on a compact space XX. Then ff takes its maximum value at some point p0p_0 and its minimum value at some point q0q_0.

Theorem (Reeb)

Let MM be a closed surface. Suppose there exists a Morse function f ⁣:MRf\colon M \to \mathbb{R} with exactly two non-degenerate critical points. Then MM is diffeomorphic to the sphere S2S^2.

p₀ q₀ M

Figure 5 — A closed surface admitting a Morse function with exactly two critical points is diffeomorphic to $S^2$.

Proof Sketch

By the maximum-value theorem, f ⁣:MRf\colon M \to \mathbb{R} takes its maximum at p0p_0 and its minimum at q0q_0. Since ff is a Morse function, both are non-degenerate critical points. By the Morse Lemma, ff takes standard form near each:

f={x2y2+Anear p0X2+Y2+anear q0f = \begin{cases} -x^2 - y^2 + A & \text{near } p_0 \\ X^2 + Y^2 + a & \text{near } q_0 \end{cases}

Here AA and aa are the maximum and minimum values. For small ε>0\varepsilon > 0, the set D(p0)={Aεf(p)A}D(p_0) = \{A - \varepsilon \leq f(p) \leq A\} satisfies x2+y2εx^2 + y^2 \leq \varepsilon, making it diffeomorphic to a 2-disk. Similarly for D(q0)D(q_0).

p₀ f near p₀ q₀ f near q₀

Figure 6 — Disk neighbourhoods around the maximum and minimum.

Removing the interiors of D(p0)D(p_0) and D(q0)D(q_0) from MM yields a surface M0M_0 with boundary M0=C(p0)C(q0)\partial M_0 = C(p_0) \cup C(q_0).

Lemma

Let f ⁣:M0Rf\colon M_0 \to \mathbb{R} be a CC^\infty function taking constant values on the boundary circles C(p0)C(p_0) and C(q0)C(q_0). If ff has no critical points on M0M_0, then M0M_0 is diffeomorphic to C(q0)×[0,1]C(q_0) \times [0,1].

Since C(q0)S1C(q_0) \cong S^1, we conclude M0S1×[0,1]M_0 \cong S^1 \times [0,1] — an annulus. Gluing D(p0)D(p_0) and D(q0)D(q_0) back along the boundary circles reconstructs MM and shows it is diffeomorphic to S2S^2.

C(p₀) C(q₀) M₀

Figure 7 — $M_0 \cong S^1 \times [0,1]$, the annulus between the two boundary circles.


§ 1.5 — Handle Decomposition

Start with a Morse function f ⁣:MRf\colon M \to \mathbb{R} on a closed, connected surface. Define the sublevel set:

Mt={pMf(p)t}M_t = \{p \in M \mid f(p) \leq t\}

Let LtL_t be the level curve {f=t}\{f = t\}. Then MtM_t is everything below LtL_t, and Lt=MtL_t = \partial M_t.

L_t M_t A a t f

Figure 8 — The sublevel set $M_t$ and level curve $L_t$ of a Morse function on a surface.

Since ff has maximum AA and minimum aa: Mt=M_t = \varnothing for t<at < a, and Mt=MM_t = M for tAt \geq A.

The fundamental idea of Morse Theory is to trace the change of shape of MtM_t as tt increases from below aa to above AA.

Definition — Critical Values

A real number c0c_0 is a critical value if f(p0)=c0f(p_0) = c_0 for some critical point p0p_0.

Lemma

Let b<cb < c be real numbers such that ff has no critical values in [b,c][b,c]. Then MbM_b and McM_c are diffeomorphic.

The topology of MtM_t only changes when tt passes through a critical value. The change depends on the index of the critical point.

Index 0 — Attaching a 0-handle

When the index of p0p_0 is zero, f=x2+y2+c0f = x^2 + y^2 + c_0 near p0p_0. The sublevel set gains a new connected component — a disk D2D^2:

Mc0+εMc0ε    D2M_{c_0 + \varepsilon} \cong M_{c_0 - \varepsilon} \;\sqcup\; D^2

M_{c₀−ε} = ∅ p₀ M_{c₀+ε} ≅ D² 0-HANDLE

Figure 9 — Passing the minimum (index 0): the sublevel set jumps from empty to a single disk.

Index 1 — Attaching a 1-handle

When the index of p0p_0 is one, f=x2y2+c0f = x^2 - y^2 + c_0 near p0p_0. The sublevel set gains a bridge D1×D1D^1 \times D^1:

Mc0+εMc0ε    (D1×D1)M_{c_0 + \varepsilon} \cong M_{c_0 - \varepsilon} \;\cup\; (D^1 \times D^1)

before gap after p₀ 1-HANDLE (BRIDGE)

Figure 10 — Passing a critical point of index 1 connects two components via a bridge (1-handle = $D^1 \times D^1$).

Index 2 — Attaching a 2-handle

When the index of p0p_0 is two, f=x2y2+c0f = -x^2 - y^2 + c_0 near p0p_0. The sublevel set is capped off by a disk D2D^2:

Mc0+εMc0ε    D2M_{c_0 + \varepsilon} \cong M_{c_0 - \varepsilon} \;\cup\; D^2

Theorem — Handle Decomposition

If a closed surface MM admits a Morse function f ⁣:MRf\colon M \to \mathbb{R}, then MM can be described as a union of finitely many 0-handles, 1-handles, and 2-handles.

This is the fundamental result of Morse theory on surfaces: any closed surface equipped with a Morse function can be built up, piece by piece, by attaching handles of index 0, 1, and 2 as we sweep through the critical values from the minimum to the maximum.


Notes on Morse Theory — transcribed from handwritten lecture notes.