I know the following facts. (Don't assume I know much more than the following facts.)

The Atiyah-Singer index theorem generalizes both the Riemann-Roch theorem and the Gauss-Bonnet theorem.

The Atiyah-Singer index theorem can be proven using heat kernels.

This implies that both Riemann-Roch and Gauss-Bonnet can be proven using heat kernels. Now, I don't think I have the background necessary to understand the details of the proofs, but I would really appreciate it if someone briefly outlined for me an extremely high-level summary of how the above two proofs might go. Mostly what I'm looking for is physical intuition: when does one know that heat kernel methods are relevant to a mathematical problem? Is the mathematical problem recast as a physical problem to do so, and how?

(Also, does one get Riemann-Roch for Riemann surfaces only or can we also prove the version for more general algebraic curves?)

Edit: Sorry, the original question was a little unclear. While I appreciate the answers so far concerning how one gets from heat kernels to the index theorem to the two theorems I mentioned, I'm wondering what one can say about going from heat kernels directly to the two theorems I mentioned. As Deane mentions in this comments, my hope is that this reduces the amount of formalism necessary to the point where the physical ideas are clear to someone without a lot of background.

Just so you know, the version you get out of Atiyah-Singer is actually for algebraic manifolds: it's Hirzebruch-Riemann-Roch. But I'm also pretty sure that the topology is necessary, so you're stuck with working over the complex numbers (though that might not be quite correct)
–
Charles SiegelFeb 9 '10 at 1:42

8

I really like this question. The usual presentation of the Atiyah-Singer index theorem, as well as the heat kernel proofs, use so much formalism (as shown in the answers below). Surely, most of this formalism simplifies if you are just trying to prove the Gauss-Bonnet theorem on a 2-dimensional surface with a Riemannian metric.
–
Deane YangFeb 9 '10 at 5:04

1

I also really like the answers given. +1 to everybody.
–
Ben Webster♦Feb 9 '10 at 6:45

3

But all of the answers so far are for the index theorem in general. Could someone provide a less jargon-laden explanation for how to use the heat kernel on a 2-d Riemannian manifold to prove Gauss-Bonnet?
–
Deane YangFeb 9 '10 at 13:38

8 Answers
8

Here is how the heat kernel proof of Atiyah-Singer goes at a high level. Let $(\partial_t - \Delta)u = 0$ and define the heat kernel (HK) or Green function via $\exp(-t\Delta):u(0,\cdot) \rightarrow u(t,\cdot)$. The HK derives from the solution of the heat equation on the circle:

(It's worth noting that what you labelled as the Atiyah-Singer Theorem is really the McKean-Singer formula. The evaluation of the supertrace as the integral of an explicit characteristic class is the Atiyah-Singer theorem.)
–
Paul SiegelMar 23 '13 at 14:03

1

I was a student of McKean, perhaps that explains my take.
–
Steve HuntsmanMar 23 '13 at 16:00

Since the summary below is already a bit long, I thought I'd add a few lines at the beginning as a guide. The proofs all proceed as follows:

Identify the quantity of interest (like the Euler characteristic) as the index of an operator going from an 'even' bundle to an 'odd' bundle.

Use Hodge theory to write the index in terms of the dimensions of harmonic sections, i.e., kernels of Laplacians.

Use the heat evolution operator for the Laplacians and 'supersymmetry' to rewrite this as a 'supertrace.'

Write the heat evolution operator in terms of the heat kernel to express the supertrace as the integral of a local density.

Use the eigenfunction expansion of the heat kernel to identify the constant (in time) part of the local density.

Most of this is general nonsense, and the difficult step is 5. By and large, the advances made after the seventies all had to do with finding interpretations of this last step that employed intuition arising from physics.

I suffered over this proof quite a bit in my pre-arithmetic youth and wrote up
a number of summaries. A condensed and extremely superficial version is given here, mostly for my own review.
If by chance someone finds it at all useful, of course I will be delighted. I apologize that I don't say anything about physical
intuition (because I have none), and for repeating parts of the previous nice answers.
It's been years since I've thought about these matters, so I will forgo
all attempts at even a semblance of analytic rigor. In fact, the main pedagogical reason for posting is that a basic outline of the proof is possible to understand with almost no analysis.

The usual setting has a compact Riemannian manifold $M$, two hermitian bundles $E^+$ and $E^-$, and a linear
operator
$$P:H^+\rightarrow H^-,$$
where $H^{\pm}:=L^2(E^{\pm})$.
With suitable assumptions (ellipticity), $ker(P)$ and $coker(P)$
have finite dimension, and the number of interest is the index:
$$Ind(P)=dim(ker(P))-dim(coker(P)).$$
This can also be expressed as
$$dim(ker(P))-dim(ker(P^{\*})),$$ where
$$P^{\*}:H^-\rightarrow H^+$$
is the Hilbert space adjoint. A straightforward generalization of the Hodge theorem allows us also to write this in terms of Laplacians
$\Delta^+=P^* P$ and $\Delta^-=PP^*$
as
$$dim(ker(\Delta^+))-dim(ker(\Delta^-)).$$
Things get a bit more tricky when we try to identify the index with the expression ('supertrace,' so-called)
$$Tr(e^{-t\Delta^+})-Tr(e^{-t\Delta^-}).$$
The operator
$$e^{-t\Delta^{\pm}}:H^{\pm}\rightarrow H^{\pm}$$
sends a section $f$ to the solution of the heat equation
$$\frac{\partial}{\partial t} F(t,x)+\Delta^{\pm}F(t,x)=0$$
($x$ denoting a point of $M$) at time $t$ with intial condition $F(0,x)=f(x).$
One important part of this is that there are discrete Hilbert direct sum decompositions
$$H^+=\oplus_{\lambda} H^+(\lambda)$$
and $$H^-=\oplus_{\mu} H^-(\mu)$$
in terms of finite-dimensional eigenspaces for the Laplacians with non-negative eigenvalues. And then, the identities
$$\Delta^-P=PP^{\*}P=P\Delta^+$$
and
$$\Delta^+P^{\*}=P^{\*}PP^{\*}=P^{\*}\Delta^-$$
show that the (supersymmetry) operators $P$ and $P^{\*}$ can be used to define isomorphisms between all non-zero eigenspaces of the two Laplacians with
a correspondence for eigenvalues as well.
Thus, once you believe that the exponential operators are trace class,
it's easy to see that the only contributions to the trace are from the kernels of the plus and minus Laplacians. This is the 'easy cancellation' that occurs in this proof.
But on the zero eigenspaces, the heat evolution operators are clearly the identity, allowing us to identify the supertrace with the index.
To summarize up to here, we have
$$Ind(P)=Tr(e^{-t\Delta^+})-Tr(e^{-t\Delta^-}).$$
This identity also makes it obvious that the supertrace is in fact independent of $t>0$.

The proofs under discussion all have to do with identifying this supertrace in terms of local expressions that
relate naturally to characteristic classes. The beginning of this process involves first writing the operator
$e^{-t\Delta^+}$ in terms of an integral kernel
$$K^+_t(x,y)=\sum_i e^{-t\lambda_i } \phi^+_i(x)\otimes \phi^+_i(y)$$
where the $\phi^+_i$ make up an orthonormal basis of eigenvectors for the Laplacian.
That is,
$$[e^{-t\Delta^+}f](x)=\int_M K^+_t(x,y)f(y)dvol(y)=\sum_i e^{-t\lambda_i } \int_M \phi^+_i(x) \langle \phi^+_i(y),f(y)\rangle dvol(y).$$
Formally, this identity is obvious, and the real work consists of the global analysis necessary to justify the formal computation.
Obviously, there is a parallel discussion for $\Delta^-$. Now, by an infinite-dimensional version of the formula
that expresses the trace of a matrix as a sum of diagonals, we get that
$$Tr(e^{-t\Delta^+})=\int_M Tr(K^+_t(x,x))dvol(x)=\int_M \sum_ie^{-t\lambda_i}||\phi^+_i(x)||^2 dvol(x),$$
an integral of local (point-wise) traces, and similarly for $Tr(e^{-t\Delta^-})$. One needs therefore, techniques to evaluate the density

More analysis gives an asymptotic expansion for the plus and minus densities of the form
$$ a^{ \pm }\_{-d/2}(x) t^{-d/2}+a^{ \pm }\_{d/2+1}(x) t^{-d/2+1}+\cdots $$
where $d$ is the dimension of $M$.

Up to here the discussion was completely general, but then the proof begins to involve special cases, or
at least, broad division into classes of cases. But note that even for the special cases mentioned in the original question,
one would essentially carry out the procedure outlined above for a specific operator $P$.

The breakthrough in this line of thinking
came from Patodi's incredibly complicated computations for the operator $d+d^*$
going from even to odd differential forms,
where one saw that the
$$a^{+}\_i(x)$$
and
$$a^{-}\_i(x)$$
canceled each other out locally, that is, for each point $x$, for all the
terms with negative $i$. I think it was fashionable to refer to this cancellation as 'miraculous,' which it is, compared to the easy cancellation above.
At this point, Patodi could take a limit
$$\lim_{t\rightarrow 0}[\sum_ie^{-t\lambda_i}||\phi^+_i(x)||^2 dvol(x)-\sum_ie^{-t\mu_i}||\phi^-_i(x)||^2 dvol(x)],$$
that he identified with the Euler form. This important calculation set a pattern that recurred in all other versions of
the heat kernel approach to index theorems. One proves the existence of an analogous
limit as $t\rightarrow 0$ and identifies it. The identification
as a precise differential form representative for a characteristic class is referred to sometimes as a local index theorem, a statement
more refined than the topological formula for the global index. There is even a beautiful version of a local families index theorem
that relates eventually to deep work in arithmetic intersection theory and Vojta's proof of the Mordell conjecture.

As I understand it, Gilkey's contribution was an invariant theory
argument that tremendously simplified the calculation and allowed a differential form representative for
the $\hat{A}$ genus to emerge naturally
in the case of the Dirac operator. And then, I believe there is a $K$-theory argument that deduces the index theorem for a general elliptic operator
from the one for the twisted Dirac operator.

Experts can correct me if I'm wrong, but
from a purely mathematical point of view, essentially all the work on the heat kernel proof was done at this point.
Subsequent interpretations of the proof (more precisely, the supertrace), in terms of supersymmetry, path integrals, loop spaces, etc., were tremendously
influential to many areas of mathematics and physics, but the mathematical core of the index theorem itself appears to have remained largely unchanged for almost forty years. In particular, the terminology I've used myself above, the super- things, didn't occur at all in the original papers of Patodi, Atiyah-Bott-Patodi, or Gilkey.

Added:

Here is just a little bit of geometric-physical intuition regarding the heat kernel in the Gauss-Bonnet case, which I'm sure is completely banal for most people. The density
$$\sum_ie^{-t\lambda_i}||\phi^+_i(x)||^2 dvol(x)-\sum_ie^{-t\mu_i}||\phi^-_i(x)||^2 dvol(x)$$
expresses the heat kernel in terms of orthonormal bases for the even and odd forms. When $t\rightarrow \infty$ all terms involving the positive eigenvalues decay to zero, leaving only contributions from orthonormal harmonic forms. This is one way to to see that the integral of this density, which is independent of $t$, must be the Euler characteristic. On the other hand, as $t\rightarrow 0$, the operator
$$K^\+_t(x,y)dvol(y)=[\sum_i e^{-t\lambda_i } \phi^+_i(x)\otimes \phi^+_i(y)]dvol(y)$$
literally approaches the identity operator on all even forms (except for the fact that it diverges). That is, the heat kernel interpolates between the identity and the projection to the harmonic forms, in some genuine sense expressing the diffusion of heat from a point distribution to a harmonic steady-state. A similar discussion holds for the odd forms as well. I can't justify this next point even vaguely at the moment, but one should therefore think of $$[K^+\_t(x,y)-K^-\_t(x,y)]dvol(y)$$ as regularizing the current on $M\times M$ given by the diagonal $M\subset M\times M$. Thus, the integral of $$[TrK^{+}\_t(x,x)-TrK^-\_t(x,x)]dvol(x)$$ ends up computing a deformed self-intersection number of the diagonal in $M\times M$. From this perspective, it shouldn't be too surprising that the Euler class, representing exactly this self-intersection, shows up.

Added:

I forgot to mention that the Riemann-Roch case is where $$P=\bar{\partial}+\bar{\partial}^*$$
going from the even to the odd part of the Dolbeault resolution associated to a holomorphic vector bundle. The limit of the local density is a differential form representing the top degree portion of the Chern character of the bundle multiplied by the Todd class of the tangent bundle. Perhaps it's worth stressing that these special cases all go through the general argument outlined above.

This sounds extremely interesting: are there any references you'd recommend for this sort of approach?
–
Akhil MathewJun 2 '10 at 20:02

My outdated knowledge still relies heavily on the paper of Atiyah, Bott, and Patodi: On the heat equation and the index theorem, Invent. Math. 19 (1973), 279–330. Many nice books seem to have come out in the meanwhile.
–
Minhyong KimJun 2 '10 at 23:49

In my opinion, the most "physical" recasting of the index theorem is the Witten index for supersymmetric quantum theories. It is superficially similar to the heat kernel, but much more general. The Witten index of a supersymmetric quantum mechanical system is the regularised supertrace
$$\mathrm{Tr} (-1)^F \exp(-\beta H)$$
where $(-1)^F$ is the grading operator which is $-1$ on fermionic states and $+1$ on bosonic states and $H$ is the hamiltonian. The trace is taken over the Hilbert space of states.

One can show that this does not depend on $\beta$ and hence can be evaluated both at small $\beta$ ("large temperature expansion") or large $\beta$ ("small temperature expansion"). In one expansion one sees that it computes the trace of $(-1)^F$ on zero modes of the hamiltonian, since for a supersymmetric system the dimensions of the bosonic and fermionic positive-energy eigenstates are equal. In the other expansion one writes the Witten index in terms of a functional integral, which (when formally manipulated) becomes a geometric integral.
The formal manipulations can be justified as in Getzler's proof of the local Atiyah-Singer index theorem.

The relation with the heat kernel comes from taking a particular supersymmetric model in which the hamiltonian is the laplacian. The relation with the Gauss-Bonnet theorem comes from considering a supersymmetric sigma model in which the hamiltonian is the Hodge laplacian acting on ($L^2$) differential forms. The Witten index then is computing the index of the elliptic operator $d + \delta$ from the odd to the even rank differential forms, which is the Euler characteristic of the manifold.

There are supersymmetric models for which the Witten index computes the index of a generalised Dirac operator as in the original work of Atiyah and Singer.

Witten introduced "his" index in order to study supersymmetry breaking. A nonzero value of the index shows that there exists a vacuum state (=a state of minimal energy) which is invariant under supersymmetry and hence supersymmetry is not (spontaneously) broken.

There is yet another relation between the heat kernel and the index theorem and it comes from a certain regularisation of the functional integral measure as in Fujikawa's celebrated derivation of the chiral anomaly.

The index is the number of anomalous "ghost" states in a chiral field theory. Anomalies occur when the classical/quantum symmetry correspondence breaks down under renormalization (but global anomalies are "good" compared to local ones, which in string theory constrain the dimension and fermion content). Atiyah-Singer thus helps to address questions about, e.g. why there are three generations of chiral fermions, why protons don't decay, why the electron mass is small, etc.
–
Steve HuntsmanFeb 9 '10 at 1:20

Global anomalies render the theory inconsistent, so I'm not sure in what sense of the word 'good' you mean that they are "good". Also the dimension in string theory is determined by a local anomaly, whereas if by "fermion content" you mean the GSO projection, this is the modular invariance of the partition function on the torus, which can be rephrased indeed as the absence of a global anomaly.
–
José Figueroa-O'FarrillFeb 9 '10 at 10:43

The statement about supersymmetry breaking in the last is actually wrong. If the Witten index is not zero, which means there is a supersymmetric ground state with zero energy, it does NOT occur that supersymmetry is spontaneously broken.
–
Satoshi NawataMay 15 '12 at 5:14

For a completely elementary description of the relation between the Euler characteristic of a domain in $\mathbb{R}^2$ and the heat kernel on the domain, you could start with Mark Kac's "Can you hear the shape of a drum". At the very end of the paper he observes that the constant term in the expansion of the trace of the heat kernel contains topological information, namely the Euler characteristic of the domain. He doesn't prove this, but indicates how you could get this from what he explained before in the paper. He left his formula as a conjecture (in 1966).

Kac, Mark (1966), "Can one hear the shape of a drum?", American Mathematical Monthly 73 (4, part 2): 1–23

Paul Loya's lectures are also interesting, by the way!
edit:
Gilkey writes inhttp://mmf.ruc.dk/~Booss/recoll.pdf (an interesting article, you should read it!)
"During the course, he said that there was this wonderful
invariant that Bob Seeley had constructed analytically and, ‘here is
Bott’s proof of the index theorem, and somebody should actually show
that this gives a heat equation proof of the index theorem.’ That
struck me as a fun problem, so I went home that night and gave the
heat equation proof to the Gauss-Bonnet-Theorem."

Patodi had published four or five papers on the subject before Atiyah-Bott-Patodi. Here is the opening line of the review (MR0290318) of one of the papers: The main purpose of the paper is to give an analytic proof of the Riemann-Roch-Hirzebruch theorem, in the case of a Kähler manifold, through use of the fundamental solution of the heat operator. Thus, the paper is a natural outcome of the author's previously developed method.
–
Chandan Singh DalawatJun 1 '10 at 14:02

For surfaces everything is quite simple: The starting point is of course the McKean Singer formula for the index. Next step is to compute an assymptotic expansion of the heat kernel, but this can be done for surfaces by hand (for the terms which are important for the index, i.e. up to order 1): it is more or less the curvature of the corresponing bundle-> integrating this up gives you an heat equation proof of RR or Gauss Bonnet. (Details can be found in Roe s Elliptic operators,... book)

This is a wonderful question, but I think people are having a hard time answering it because even in the simplest settings there is a lot going on and it is not clear what not a lot of background means. The simplest case would have to be proving GBC for a surface (nothing I say will be any easier for a surface, unfortunately!). First you have to know that heat kernel here means heat kernel for the Hodge Laplace operator, which is the Laplace operator on forms $(d+d^*)^2.$ Second it involves the supertrace of the heat kernel, which means the trace, except you add a minus sign for the parts taking odd forms to odd forms. A previous post mentions McKean and Singer, whose argument I will not repeat because it cannot be improved upon, but it argues that the eigenforms of this Laplacian with nonnegative eigenvalue cancel out, so only the $0$ eigenfunctions contribute, so the quantity is $t$-independent and in fact the Euler number. But if it is $t$-independent, we may as well look at this quantity as $t \to 0.$ Since the heat kernel is approaching a delta function on the diagonal, the small $t$ behavior is a local question, and can be addressed by comparing it to the flat space heat kernel. This reasonably straightforward calculation gets you the curvature, or in higher dimensions the Pfaffian of the of the curvature (the supertrace itself contained an integral which I have not mentioned explicitly). If you are looking for a principle or technique to bring with you to other problems, I would say it is this: Show some quantity you can compute from the time evolution kernel is somehow topological and therefore time independent, then equate the small time calculation of it to the large time calculation and see a connection between two things that look unrelated.

I am not able to say much regarding how to prove RR by Atiyah-Singer theorem, I know Gauss-Bonnet follows naturally from Atiyah-Singer theorem. Here is a heat kernel proof of Atiyah-Singer in case you are still looking for relevant material.

In the view of the downvote I should justify myself that this article gives a proof of RR via Atiyah-Singer index theorem, which is why I cannot say much in the answer I wrote.
–
KerryMay 16 '12 at 2:24

If that's what there is in the notes, then I also don't understand why the downvotes.
–
AminJun 15 '12 at 13:48