The sum of kth powers

Everyone loves the “baby Gauss story,” in which Gauss amazes his teacher by summing the first 100 positive integers in a flash of brilliance: he adds the first to the 100th, the second to the 99th, and so on, obtaining fifty 101s for a total of 5050. (Brian Hayes has a great article in which he puts this myth under the microscope.) Using this same trick it is easy to show that $1 + 2 + \cdots + n = \frac{n(n+1)}{2}$.

Unfortunately, Gauss’ trick doesn’t work for sums of higher powers, $1^k + 2^k + \cdots + n^k$.

Here is a neat way to compute such sums. (This may be widely known—I haven’t looked for it, but I don’t remember seeing it either.) [Update: OK, I feel silly now—I just looked in Stewart’s Calculus to see how he proves these formulas… and this is the proof he gives. Not a rare gem, I guess, but I still think it is a slick derivation.]

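First we’ll derive the formula for $\sum_{i=1}^{n} i$. We begin with the equality

$(i+1)^2 = i^2 + 2i + 1$, which we rearrange as

$2i = \left[(i+1)^2 - i^2\right] - 1$.

Next, sum both sides from $i = 1$ to $n$:

$2\sum_{i=1}^{n} i = \sum_{i=1}^{n} \left[(i+1)^2 - i^2\right] - \sum_{i=1}^{n} 1$.

The first sum on the right is telescoping, leaving $(n+1)^2 - 1$, and the second is simply $n$, so

$2\sum_{i=1}^{n} i = (n+1)^2 - 1 - n = n^2 + n$.

Dividing by 2, we obtain

$\sum_{i=1}^{n} i = \frac{n(n+1)}{2}$.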
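As a quick sanity check, here is a minimal Python sketch of the telescoping step, assuming the derivation starts from the identity $(i+1)^2 - i^2 = 2i + 1$. The function names are made up for illustration.

```python
def telescoping_sum(n: int) -> int:
    """Sum of (i+1)^2 - i^2 for i = 1..n, computed term by term."""
    return sum((i + 1) ** 2 - i ** 2 for i in range(1, n + 1))


def sum_of_integers(n: int) -> int:
    """Closed form n(n+1)/2 that the derivation arrives at."""
    return n * (n + 1) // 2


for n in (1, 10, 100):
    # Telescoping: every intermediate square cancels, leaving (n+1)^2 - 1^2.
    assert telescoping_sum(n) == (n + 1) ** 2 - 1
    # The summed identity: 2 * (1 + 2 + ... + n) = (n+1)^2 - 1 - n.
    assert 2 * sum(range(1, n + 1)) == (n + 1) ** 2 - 1 - n
    # The closed form agrees with the brute-force sum.
    assert sum_of_integers(n) == sum(range(1, n + 1))

print(sum_of_integers(100))  # 5050
```

The checks only cover a few values of $n$, so they illustrate the argument rather than prove it.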

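Now we can repeat this trick to find the sum $\sum_{i=1}^{n} i^2$.

As before, we begin with an equality:

$(i+1)^3 = i^3 + 3i^2 + 3i + 1$, or rearranged, $3i^2 = \left[(i+1)^3 - i^3\right] - 3i - 1$.

Then we sum from $i = 1$ to $n$:

$3\sum_{i=1}^{n} i^2 = \sum_{i=1}^{n} \left[(i+1)^3 - i^3\right] - 3\sum_{i=1}^{n} i - \sum_{i=1}^{n} 1$.

Again, the first sum is telescoping, we just found the second sum, and the third sum is $n$. So,

$3\sum_{i=1}^{n} i^2 = (n+1)^3 - 1 - \frac{3n(n+1)}{2} - n = \frac{n(n+1)(2n+1)}{2}$.

Dividing by 3, we obtain

$\sum_{i=1}^{n} i^2 = \frac{n(n+1)(2n+1)}{6}$.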

At this point the pattern is clear. We can find $\sum_{i=1}^{n} i^k$ by expanding $(i+1)^{k+1}$ and using the formulas for $\sum_{i=1}^{n} i^j$ ($j < k$), which we have already computed.
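The general pattern can be sketched in Python. This is one way to organize the recursion, not something taken from any of the textbooks. It assumes we write $S_j(n) = 1^j + 2^j + \cdots + n^j$, expand $(i+1)^{m+1}$ with the binomial theorem, and sum from $i = 1$ to $n$, which gives $(n+1)^{m+1} - 1 = \sum_{j=0}^{m} \binom{m+1}{j} S_j(n)$. Solving for $S_m(n)$ uses only the lower sums. The name `power_sums` is hypothetical.

```python
from math import comb


def power_sums(k: int, n: int) -> list[int]:
    """Return [S_0(n), ..., S_k(n)], where S_j(n) = 1^j + 2^j + ... + n^j.

    Each S_m is obtained from the telescoping identity
        (n+1)^(m+1) - 1 = sum_{j=0}^{m} C(m+1, j) * S_j(n),
    solved for S_m using the previously computed S_0, ..., S_{m-1}.
    """
    sums: list[int] = []
    for m in range(k + 1):
        # Left side: the telescoped sum of (i+1)^(m+1) - i^(m+1) for i = 1..n.
        telescoped = (n + 1) ** (m + 1) - 1
        # Contribution of the lower power sums, already known.
        lower = sum(comb(m + 1, j) * sums[j] for j in range(m))
        # The coefficient of S_m is C(m+1, m) = m + 1; the division is exact.
        sums.append((telescoped - lower) // (m + 1))
    return sums


print(power_sums(3, 100))  # [100, 5050, 338350, 25502500]
```

This version evaluates the sums for a specific $n$ instead of producing polynomial formulas. Getting the closed forms would require carrying polynomials in $n$ through the same recursion.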

Further update:
I decided to peruse the calculus books on my shelf and see which of them gave this derivation of the sum of the kth powers. Here’s what I found.
Stewart: yes (and he gives a proof by induction)
Smith/Minton: no (induction)
Larson/Hostetler/Edwards: no (induction)
Edwards/Penney: yes (as a homework problem)
Varberg/Purcell/Rigdon: yes
Hughes-Hallett: the formulas don’t appear in the book as far as I can tell
Rogawski: no (shockingly, he gives these formulas in a definition!)
(Yet another update: Rogawski’s text gives the derivation as a sequence of exercises earlier in the book and also presents the connections between this problem and the Bernoulli numbers. According to the author, the “definition” problem will be corrected in the next printing.)