How many ancestors?

How many ancestors do you have? If you go back some number of generations, how many genealogical ancestors are there in that generation?

Assuming you’re the product of normal human reproduction (clones can hit the “back” button right now), you have two parents. They had two parents, so you have four grandparents — four ancestors if you go two generations back. Going back three generations, you have eight ancestors; four generations, sixteen ancestors; and so on. In mathematical terms, if you go back n generations, you have 2n ancestors.

Not!

This line of argument becomes preposterous if you carry it back far enough. Go back ten generations and you have 210 = 1024 ancestors. Call it 1000 just to make things easier. If you go back another ten generations, each of those 1000 has 1000 ancestors, so twenty generations back from you there are 1,000,000 ancestors. Thirty generations back there’s another factor of 1000, so you have 1,000,000,000 ancestors — one (American) billion.

Which can’t be right. Thirty generations back is roughly 1000 years — and there weren’t a billion people on Earth then. (See “How Many People Have Ever Lived on Earth?”) Granted, not all the people in the thirtieth generation back from you lived at the same time, but that doesn’t help much.

And it gets worse and worse. Ten more generations back and you have a trillion ancestors. This is about ten times more than the estimated number of people who have ever lived. Ten more generations, a quadrillion ancestors. And so on.

Yet everyone has two parents. So the only way you can avoid this ridiculous conclusion is to realize that some of your ancestors must have had ancestors in common.

In a simple case, if your father’s grandparents on his father’s side are the same people as your mother’s grandparents on her father’s side, then you have only six ancestors three generations back, not eight: Your father’s maternal grandparents, your mother’s maternal grandparents, and two more people who are the paternal grandparents of both your parents. To put it another way, there are two different paths back from you to two of your great grandparents.

But if so, then your parents are first cousins. And generally, if you have ancestors who had ancestors in common — as you now know you must — then that means those ancestors were related. Somewhere in your ancestry, you have couples who were some sort of relatives of one another.

Well, we knew that, of course. We all must have common descent from some individual ancestor, whether one of the earliest modern Homo sapiens of 50,000 years ago, or an earlier species of Homo a million or more years ago, or some other primate even earlier. And that means all couples in your ancestry (and everyone else’s) were relatives of one another.

But the above argument proves you must have multiple paths back to the same ancestors in much more recent times. Your parents probably have common ancestors within the past 1000 years — and so, probably, do you and your spouse, or you and your best friend. I myself know of several cases of multiple paths back from me to ancestors in the 18th century — some a little more scary than others! Of course if one person’s ancestors all came from (say) Easter Island and another’s all came from Scandanavia, their common ancestor is likely to be further back. Maybe even then, not as far back as you’d think. Douglas LT Rohde has written a paper (PDF format) reporting on a series of computer simulations which suggest all people in the world today have a most recent common ancestor (MRCA) who lived about 2000 to 5000 years ago. (Go a few thousand years further back and everyone on earth at that time who has any descendants alive today is an ancestor of everyone alive today!)

Anyway, if you have any northern European (especially English or Danish) ancestry… then you and I probably have a common ancestor who lived after, say, 700 AD. Jack Lee argues that he knows who that ancestor is: Charlemagne. (Or, less flippantly, that everyone living at the time of Charlemagne who has any descendants alive today is an ancestor of everyone of European descent alive today.) His argument is more informal than Rohde’s and I would take it with a grain of salt, but on the other hand it’s hard to argue that the probability he estimates for not being a descendant of Charlemagne (10-15,000 (!)) can be wrong by 14,999 orders of magnitude.

The following table generalizes Lee’s argument. To sum up, the simplified model is that each generation is exactly 30 years, and that in each generation every male in Europe is equally likely to fill a slot in one’s ancestry. First column is generation number and second is number of male ancestor slots; third column is the year — I’m taking 2000 as the year corresponding to the first generation back (so the figures are roughly appropriate to my son, who was born in 1999 — though ignore figures for the last century or so; his ancestors were in America then). Fourth column is the estimated population of Europe for that year (not to be taken too seriously; round numbers are from Wikipedia, though may be assigned to a year 10 years too early or late, and other numbers are exponential extrapolations — note the discontinuity in the 14th century; that’s the bubonic plague); fifth column is estimated male population (just 0.5 times the total). The sixth column is the answer to the question: “if you pick an ancestral slot in that generation, and a random male person in Europe at that time, what is the probability he is the person who fills that slot?” The sixth column is the answer to the question “if you pick a random male person in Europe at the time of that generation, what is the probability he fills any slot in that generation?” In this model at generation 20 (around 1430 AD) there’s a 1% chance any male in Europe is one’s ancestor; by generation 24 (1310 AD) it’s over 15%, and by generation 29 (1160 AD) it’s a near certainty.

Gen

# ancestors

Year

Population (M+F)

Population (M)

Prob (any M anc)

Prob (all M anc)

1

1

2000

729,000,000

364,500,000

0.0000000027

0.00%

2

2

1970

676,388,304

338,194,152

0.0000000030

0.00%

3

4

1940

547,000,000

273,500,000

0.0000000037

0.00%

4

8

1910

408,000,000

204,000,000

0.0000000049

0.00%

5

16

1880

378,554,771

189,277,386

0.0000000053

0.00%

6

32

1850

276,000,000

138,000,000

0.0000000072

0.00%

7

64

1820

256,081,169

128,040,584

0.0000000078

0.00%

8

128

1790

203,000,000

101,500,000

0.0000000099

0.00%

9

256

1760

163,000,000

81,500,000

0.0000000123

0.00%

10

512

1730

151,236,342

75,618,171

0.0000000132

0.00%

11

1,024

1700

125,000,000

62,500,000

0.0000000160

0.00%

12

2,048

1670

115,978,790

57,989,395

0.0000000172

0.00%

13

4,096

1640

107,608,638

53,804,319

0.0000000186

0.01%

14

8,192

1610

111,000,000

55,500,000

0.0000000180

0.01%

15

16,384

1580

102,989,166

51,494,583

0.0000000194

0.03%

16

32,768

1550

95,556,471

47,778,235

0.0000000209

0.07%

17

65,536

1520

88,660,191

44,330,095

0.0000000226

0.15%

18

131,072

1490

84,000,000

42,000,000

0.0000000238

0.31%

19

262,144

1460

77,937,747

38,968,874

0.0000000257

0.67%

20

524,288

1430

72,313,005

36,156,502

0.0000000277

1.44%

21

1,048,576

1400

67,800,000

33,900,000

0.0000000295

3.05%

22

2,097,152

1370

62,906,896

31,453,448

0.0000000318

6.45%

23

4,194,304

1340

100,880,514

50,440,257

0.0000000198

7.98%

24

8,388,608

1310

93,600,000

46,800,000

0.0000000214

16.41%

25

16,777,216

1280

86,844,918

43,422,459

0.0000000230

32.05%

26

33,554,432

1250

80,577,348

40,288,674

0.0000000248

56.52%

27

67,108,864

1220

74,762,107

37,381,053

0.0000000268

83.39%

28

134,217,728

1190

69,366,550

34,683,275

0.0000000288

97.91%

29

268,435,456

1160

64,360,388

32,180,194

0.0000000311

99.98%

30

536,870,912

1130

59,715,520

29,857,760

0.0000000335

100.00%

31

1,073,741,824

1100

55,405,870

27,702,935

0.0000000361

100.00%

32

2,147,483,648

1070

51,407,246

25,703,623

0.0000000389

100.00%

33

4,294,967,296

1040

47,697,202

23,848,601

0.0000000419

100.00%

34

8,589,934,592

1010

44,254,910

22,127,455

0.0000000452

100.00%

35

17,179,869,184

980

41,061,047

20,530,524

0.0000000487

100.00%

36

34,359,738,368

950

38,097,685

19,048,842

0.0000000525

100.00%

37

68,719,476,736

920

35,348,187

17,674,094

0.0000000566

100.00%

38

1.37439E+11

890

32,797,120

16,398,560

0.0000000610

100.00%

39

2.74878E+11

860

30,430,162

15,215,081

0.0000000657

100.00%

40

5.49756E+11

830

28,234,027

14,117,014

0.0000000708

100.00%

(Recent genetic analysis [paper, FAQ] supports the claim that all Europeans who lived more than about 1000 years ago either have no living descendants or are genealogical ancestors of all living Europeans.)

Here’s another thought, based on reading Richard Dawkins’s The Ancestor’s Tale and having more to do with biology than genealogy. An obvious truth once you think about it (at least, obvious to anyone who understands evolution well enough to know it is proven fact), but perhaps surprising when first pointed out anyway. Start with the fact that not only are all humans related to one another, but we’re all related to all other organisms — our MRCA with clams may have lived hundreds of millions of years ago, but there was one.

So now think about this: Somewhere in the past several million years there lived an individual who was the MRCA of all living humans and chimpanzees. He (or she, I’ll use “he”) was my ancestor, your ancestor, and Washoe’s ancestor. In principle, a child (which I’ll call “she”) of this MRCA could have been ancestor to some but not all humans and some but not all chimps; but the above arguments suggest she was ancestor to all humans or none — and likewise ancestor to all chimps or none. In any case, if we shift our attention to the most recent individual who was an ancestor of one or more living humans and one or more living chimps — who may have been that same MRCA or may be a later descendant — then by definition, he must have had two or more children of whom at least one is ancestor to one or more living humans (very probably all living humans), but no chimps, and at least one is ancestor to one or more (probably all) living chimps, but no humans. Think about that. Once upon a time there were, without doubt, two siblings, probably growing up together, all the living descendants of one of whom are humans, and all the living descendants of the other of whom are chimpanzees!

This of course doesn’t mean one sibling was a human and one was a chimp; they both were members of the same ancestral species, very much like one another, and had their life histories varied a little, the human ancestor could have ended up being the chimp ancestor and vice versa. An intelligent observer would have had no way of knowing they were at the divergence point of future species. But they were.

Similarly, go back far enough and there were two siblings, all the living descendants of one of whom are mammals, and all the living descendants of the other of whom are birds or reptiles. Or siblings whose living descendants are vertibrates for one, invertibrates for the other. Or siblings whose living descendants are on the one hand, animals, and on the other hand, plants…

And just think, maybe a few hundred million years from now the descendants of you and your siblings will be just as different!

Genetic ancestors

The above discussion has to do with genealogical ancestors. Genetic ancestors are another story. You may have genealogical descent from millions of ancestors… but the number of ancestors you get your DNA from tops out at about 46,000. Here’s a discussion by Bob Jenkins (and he gives links to a couple other pages about it).

How many descendants?

Working forward is similar to working backward. The difference is that we all have two biological parents, but we have varying numbers of children. The world’s population generally increases with time, but fairly slowly, which means the average number of children (that survive to adulthood) per person (who survives to adulthood) is larger, but not much larger, than 2. Of course it varies from time to time and place to place, but overall it must be a little over 2.

In the following, let’s assume the average is exactly 2, for convenience. The number of descendants n generations removed is (on average) 2n, which is equal to the (exact) number of ancestors n generations removed — if there is no intermarriage between relatives, and of course there is.

But that’s an average. Some people have no children — or have children none of whom have children, or grandchildren none of whom have children, or whatever — and so have no descendants after some number of generations. Others have many.

mtDNA lines

Here’s a question prompted by thinking about mitochondrial DNA (mtDNA), which is inherited only along the female line: yours comes from your mother, which comes from her mother, which comes from her mother, and so on. Yours will be inherited by all your children if you’re female, or by no one if you’re male, and will (if you’re female) be passed to the children of your daughters, but not the children of your sons, and to the children of the daughters of your daughters, but not the children of the daughters of your sons nor the children of the sons of your daughters, and so on. What is the probability of your mtDNA being passed on to the (n+1)th generation after you? That is, what is the probability that there will be a female line extending n more generations after you? On the face of it, it might seem pretty unlikely after only a dozen or so generations: any female line will end if a woman has no daughters, and that’s not very improbable. But female lines of twenty or more generations are actually not very unlikely.

Of course female lines do get more and more unlikely as the number of generations increases. After all, whatever the probability of all female lines ending by a certain generation is, the probability of ending by the next generation has to be higher. But how rapidly does the likelihood of a female line shrink?

That depends on two things: the distribution of the number of children you and your descendants have, and the probability that a given child is female. (Again, by “children” I mean children who survive to adulthood.) That probability isn’t exactly 1/2 but is close, and we can take it as exactly 1/2 without much error, at least for most times and places.

(I think intermarriage probability enters into it too, but for now assume no intermarriage.)

The distribution of the number of children is another matter. It’s hard to know how to model that realistically. So let’s not. Let’s instead consider two models: (A) Exactly two children per person, that is, probability of 2 children P(2) is 1.0, and (B) Probability of 0 (1, 2, 3, 4) children is 0.1 (0.2, 0.4, 0.2, 0.1). In case B the average number of children is 2.

Then the probability of having no daughters is sum (P(n) * (1/2)n). In case A this is just 1 * (1/2)2 = 1/4. In case B, .1 * 1 + .2 * 1/2 + .4 * 1/4 + .2 * 1/8 + .1 * 1/16 = 0.33125, somewhat larger, even though the average number of children in both cases is the same as in case A. Of course the probability of having no sons is the same as the probability of having no daughters. The probability of having at least one daughter is 1 – 1/4 = 3/4 in case A, 1 – 0.33125 = 0.66875 in case B.

Now, if we know the probability Pnf(g) of having no female line descendants after g generations, then what is Pnf(g+1) of having no female line descendants after g+1 generations? It is the sum over m of (probability of having m daughters * probability of all m daughters having no female line descendants after n generations), that is, sum (Pd(m) * (Pnf(g))m). In case A,

Pnf(g+1) = 1/4 + 1/2 * Pnf(g) + 1/4 * Pnf(g)2

and in case B it’s something like that but more complicated. We know Pnf(1); it’s just Pd(0). So we can inductively get the probability for larger numbers of generations. It’s easiest in a spreadsheet.

Here is Pnf(g) for g from 1 to 20 in the two cases:

g

A

B

1

0.25

0.33

2

0.39

0.49

3

0.48

0.58

4

0.55

0.65

5

0.60

0.69

6

0.64

0.73

7

0.67

0.76

8

0.70

0.78

9

0.72

0.80

10

0.74

0.81

11

0.76

0.83

12

0.77

0.84

13

0.79

0.85

14

0.80

0.86

15

0.81

0.87

16

0.82

0.87

17

0.83

0.88

18

0.83

0.88

19

0.84

0.89

20

0.85

0.89

So even after twenty generations there’s about a 10% to 15% chance
the mtDNA is being passed along.

RSS Newsreader Links

Meta

I'm Rich Holmes. I do physics, knit, morris dance, play melodeon and clarinet and ukulele, but not simultaneously, have a son, like science and math and Macintoshes and Linux and Android, build and fly model rockets, dabble in electronic music, make bad jokes, and generally consume oxygen.