Abstract

Stark contrasts in clade species diversity are reported across the tree of life and are especially conspicuous when observed in closely related lineages. The explanation for such disparity has often been attributed to the evolution of key innovations that facilitate colonization of new ecological niches. The factors underlying diversification in bees remain poorly explored. Bees are thought to have originated from apoid wasps during the Mid-Cretaceous, a period that coincides with the appearance of angiosperm eudicot pollen grains in the fossil record. The reliance of bees on angiosperm pollen and their fundamental role as angiosperm pollinators have contributed to the idea that both groups may have undergone simultaneous radiations. We demonstrate that one key innovation—the inclusion of foreign material in nest construction—underlies both a massive range expansion and a significant increase in the rate of diversification within the second largest bee family, Megachilidae. Basal clades within the family are restricted to deserts and exhibit plesiomorphic features rarely observed among modern bees, but prevalent among apoid wasps. Our results suggest that early bees inherited a suite of behavioural traits that acted as powerful evolutionary constraints. While the transition to pollen as a larval food source opened an enormous ecological niche for the early bees, the exploitation of this niche and the subsequent diversification of bees only became possible after bees had evolved adaptations to overcome these constraints.

1. Introduction

Bees provide a mixture of pollen and nectar as food for their developing larvae. To protect these provisions from microbial infection or liquefaction that may result from exposure to moisture, most bees coat the inside of their brood cells with a hydrophobic lining secreted by Dufour's gland [1,2]. By contrast, megachilid bees use an eclectic array of foreign material to line their cells. The French naturalist Jean-Henri Fabre commented extensively on the nesting habits of megachilids and posed the following question: ‘… the Osmiae make their partitions with mud or with a paste of chewed leaves; the Mason-bees build with cement; … the Megachiles made disks cut from leaves into urns; the Anthidia felt cotton into purses; the Resin-bees cement together little bits of gravel with gum; … Why all these different trades …?’ (p. 333 in [3]).

It has been demonstrated that the foreign material used by megachilid bees is hydrophobic and shows antimicrobial activity [4,5], thus serving a similar function to the secreted cell lining in other bee groups. Not all megachilids, however, use foreign material in nest construction. Bees of the tribe Lithurgini do not line their nest cells at all; instead, they excavate burrows in wood or stems [6,7]. The absence of nest-lining in this group was originally attributed to a behavioural loss associated with above-ground nesting [8], but the phylogenetic position of Lithurgini at the base of Megachilinae [9] suggests that it represents an ancestral trait [10]. Bees of the subfamily Fideliinae build unlined nests that they excavate in sandy soil [11–14]. Two distinct tribes of fideliine bees are recognized: Fideliini and Pararhophitini, which are both entirely restricted to deserts; the absence of cell lining in these bees may be related to the arid conditions of their habitats, which may make nest-lining unnecessary [15]. It remains unclear, however, whether cell-lining behaviour, using either secretions or foreign material, has been secondarily lost in these lineages or whether the absence of cell lining represents an ancestral state. To answer these questions, we present a robust molecular phylogeny of Megachilidae and trace the evolution of nesting biology within the family. We demonstrate that the use of foreign material in nest construction was a key innovation that triggered both range expansion and diversification in megachilid bees, and also propose that the ancestral biology of this family, which is still reflected in several extant megachilid lineages, mirrors the ancestral behaviour of bees in general. Similarities in the biology of the early megachilid lineages pertaining to nesting and foraging behaviour are numerous, conspicuous, and challenge our understanding of the evolution and diversification of bees.

2. Material and methods

(a) Taxon sample

We selected 98 ingroup taxa representing all seven tribes of the family Megachilidae. Our ingroup includes 12 Fideliini, two Pararhophitini, eight Lithurgini, three Dioxyini, 23 Anthidiini, 17 Osmiini and 33 Megachilini. We chose 31 outgroup taxa to represent the diversity of the rest of the bees, comprising one Colletidae, one Halictidae, one Andrenidae, five Melittidae and 23 Apidae. Electronic supplementary material, table S1 lists the DNA voucher numbers and collection localities for each of the specimens used in this study. We sampled more densely in the families Melittidae and Apidae to accommodate the placement of fossil calibration points. Voucher specimens are deposited in the Cornell University Insect Collection.

(b) Datasets and alignment

We sequenced fragments from four protein-coding genes (CAD, 882 bp; NAK, 1489 bp; EF1-alpha, 1111 bp; LW rhodopsin, 673 bp) and one ribosomal gene (28S, 1306 bp), following the DNA extraction and sequencing protocols outlined by Danforth et al. [16]. All taxa and GenBank accession numbers are listed in electronic supplementary material, table S2. PCR primers and conditions are listed in electronic supplementary material, table S3. The four protein-coding genes were aligned using MAFFT [17] and then adjusted by eye in MacClade [18]; all introns were removed. The ribosomal gene (28S) was aligned via secondary structure according to the method described by Kjer [19]; all unalignable regions were excluded. The secondary structure alignment was based on the 28S map of Apis mellifera [20]. Details regarding data partitioning and model-testing are included in the electronic supplementary material.

(c) Phylogenetic analyses

Phylogenetic analyses were performed using both Bayesian and maximum-likelihood methods. Bayesian analyses were performed using MrBayes v. 3.1.2 [21,22]. A GTR + I + Γ model was used for all partitions except for the stem partition of 28S, which was analysed using the doublet model. All parameters were unlinked between partitions. Preliminary analyses resulted in poor mixing of chains, so the default temperature setting of 0.2 was adjusted to 0.03, which improved mixing and increased the chain swap acceptance rate to within the range recommended by the MrBayes users' manual. We ran six independent analyses, for a total of 180 000 000 generations. Sampling was performed every 2000 generations. An appropriate burn-in was discarded from each analysis using Tracer [23], leaving 96 956 000 post-burn-in generations; these were further sampled using LogCombiner v. 1.6.1 [24] to ensure independent sampling of trees. The final combined posterior distribution of 25 239 trees was used to build a maximum clade credibility tree using TreeAnnotator v. 1.6.1 [24] (electronic supplementary material, figure S1).

Maximum-likelihood analyses were performed using RAxML v. 7.0.4 (sequential version raxmlHPC [25]). We used the rapid bootstrapping algorithm with a GTR + CAT approximation to perform 1000 bootstrap replicates. The maximum-likelihood bootstrap tree is shown in electronic supplementary material, figure S2.

(d) Divergence dating analysis using BEAST

We used BEAST v. 1.6.1 to perform a Bayesian divergence dating analysis [24]. Each partition was analysed using a GTR + I + Γ model; substitution models were unlinked across partitions. We used an uncorrelated lognormal relaxed-clock model with a Yule tree prior. Trees were sampled every 2000 generations. We randomly chose a starting tree from the posterior distribution of trees from the MrBayes analysis; we used TreeEdit v. 1.0 [26] to scale the root height to 130 Myr in order to conform to the constraints imposed by prior distributions on divergence times. Ten independent analyses were run for a total of 300 000 000 generations. An appropriate burn-in was discarded from each analysis using Tracer [23], leaving 217 068 000 total post-burn-in generations. In order to ensure independent sampling of trees, we sampled every third tree from the post-burn-in posterior distribution of trees using LogCombiner v. 1.6.1 [24] and then used TreeAnnotator v. 1.6.1 [24] to build a maximum clade credibility tree from this posterior distribution of trees (electronic supplementary material, figure S3).

(e) Calibration of internal nodes and root node in BEAST

We used fossils to time-calibrate seven internal nodes on our tree. Five of these calibration points were assigned a lognormal prior distribution, while two were assigned a normal prior distribution. We present the details of these calibration points, as well as a discussion of fossils that were unusable for the purposes of calibrating our phylogeny, in the electronic supplementary material.

Bees are thought to be the sister group to the apoid wasps [27]. Apoids first appear in the fossil record during the Cretaceous [28]; Engel [28] proposes that bees originated some time after this and gives an uppermost boundary for their age of 125 Myr. There is no direct fossil evidence to suggest that bees arose at this time, however, and we believe that the age of the bees may be older than previously estimated. The Late Cretaceous (approx. 65 Ma) origin of Cretotrigona prisca, a highly derived eusocial meliponine bee, indicates that a significant amount of bee diversification had already taken place by the Late Cretaceous. Furthermore, it has been widely speculated that the origin of bees happened after the origin of the angiosperms [28–31]; recent molecular evidence [32] places the origin of the angiosperms in the Late Triassic, 30–80 Myr earlier than previously estimated. We find both of these arguments compelling reasons to explore the possibility that bees arose earlier than current estimates suggest.

We assign a uniform prior distribution to the root node. While other studies have favoured more informative root priors, such as the lognormal [33] or the normal [34], we feel that the only way to obtain an objective estimate for the origin of Megachilidae is to impose a relatively uninformative prior on the root. The lower bound of the root prior is assigned a value of 100 Myr and is based on an extremely conservative estimate for the origin of bees based on the fossil record [28]. The upper bound is assigned a value of 217 Myr and is based on a recent molecular estimate for the age of crown angiosperms [32]. Our use of a fairly broad uniform prior causes the 95 per cent HPD for divergence date estimates to be larger than those associated with other types of prior distributions. Our dating analyses, however, were run to stationarity, and age estimates from multiple independent runs converged to a single stable value; we accept the broad 95 per cent HPD as a necessary consequence of using a uniform prior distribution.

(f) Biogeographic reconstruction

Biogeographic reconstructions were performed using both S-Diva [35] and Lagrange [36]. Most of our terminal taxa represent genera; for this reason, the most plausible ancestral range for each terminal was coded based on the current distribution of the species represented by the terminal (based on [37]). In both S-Diva and Lagrange analyses, the following areas were considered: Afrotropic, Palaearctic, Southeast Asia, Australia, Nearctic and Neotropic; in case of ambiguity, polymorphism was allowed. Given our near-complete sampling of the basal-most branches, such polymorphisms only concerned the higher megachilid tribes Anthidiini, Osmiini and Megachilini, and did not affect inference at the base of the family. We present the details of both biogeographic analyses in the electronic supplementary material.

(g) Ancestral state reconstruction

We used BayesTraits [38] to reconstruct the ancestral nesting biology of Megachilidae. Cell-lining behaviour was coded for each terminal (including the outgroup) as: totally unlined (0), in Dasypoda, fideliine and lithurgine bees; lined with glandular secretion (1), in all members of the families Andrenidae, Halictidae and Colletidae, as well as in several lineages of Apidae and in the genus Melitta; lined with foreign material (2), in the oil-collecting bees, some Apidae and all higher Megachilidae; or as cleptoparasitic (3). We coded the corbiculate apidae, as well as all lineages for which no information was available, as (012). Meganomia was coded (02), as Rozen (p. 4 in [39]) states that cells of Meganomia contained ‘no built-in lining, i.e. consisting of soil mixed with secretions’, but have a waterproof lining, possibly consisting of nectar. Information on nesting biology was found in [37] and references therein. We present the details of our Bayesian ancestral state reconstructions in the electronic supplementary material.

(h) Correlated trait evolution

We used BayesTraits [38] to test for correlated evolution between the total geographical area occupied by a taxonomic group and diversification rate. We calculated diversification rate using the function lambda.stem.ms01 in the Laser package in R [40,41] and the total geographical range for each terminal taxon using the area calculator provided by the Free Map Tools website [42]. We present the details of this analysis, as well as specific information regarding species distribution, in the electronic supplementary material.

(i) Diversification rate analysis

We used MEDUSA (modelling evolutionary diversification using stepwise Akaike information criterion [43]) to test for changes in the tempo of diversification among the branches of the megachilid phylogeny. We used the final consensus tree from our BEAST analysis and removed the outgroup using Mesquite [44]. We collapsed several taxa into single terminals and calculated the total number of species represented by each terminal; terminals were collapsed in order to more easily quantify the number of species represented. The resulting phylogeny contained 82 taxa. We chose to use corrected Akaike information criterion (AICc) scores instead of AIC scores in order to account for the small sample size of our phylogeny. We used MEDUSA to fit a series of 20 models and used a strict cut-off value of 10 as our ΔAICc threshold. A model with two rate shifts (three sets of birth and death rates) was chosen as the best-fit model.

3. Results and discussion

The results of both maximum-likelihood and Bayesian analyses support a non-traditional interpretation of early megachilid phylogeny (figure 1a). According to our phylogenetic hypothesis, the small Palaearctic tribe Pararhophitini is not closely related to the largely austral tribe Fideliini, but appears more closely related to the subfamily Megachilinae; this result is strongly supported in all analyses (figure 1a; electronic supplementary material, figures S1, S2 and S3). Furthermore, the two lineages of Fideliini (the genera Fidelia and Neofidelia) constitute a weakly supported grade at the base of Megachilidae. Further tests using Bayes factors [46] strongly support the non-monophyly of both the subfamily Fideliinae (Bayes factor: hereafter BF = 260.36) and the tribe Fideliini (BF = 33.68).

The first two branches in our phylogeny are thus the South American genus Neofidelia and the primarily southern African genus Fidelia. The geographical distribution and phylogenetic placement of these lineages reveal an Austral disjunction between the Old and the New World, suggestive of a Gondwanan origin. We find the age of Megachilidae, and thus of the divergence between the South American and African fideliine bees, to be 126 Ma (95% HPD 100–154), pre-dating the separation of the African and South American continental plates (figure 1b). Our estimate of the age of Megachilidae is older than anticipated, given that bees are generally thought to have originated around 125 Ma [28]. Our results indicate an origin for the bees (the root height of our tree) of 149 Ma (95% HPD 119–182). We ran another analysis where the root was constrained to 120 Ma; even under this conservative estimate for the age of the bees [28], the age of Megachilidae is 104 Ma (95% HPD 95–113), which is still consistent with a Gondwanan origin, as the last connections between Africa and South America are thought to have disappeared around 100–110 Ma [47]. Both analyses indicate that the Megachilidae arose relatively rapidly after the origin of the bees.

A Gondwanan origin for Megachilidae is further supported by biogeographic reconstructions. S-Diva results favour a South American/African vicariance (75% of reconstructions) over scenarios involving either African (12.6%) or African/Palaearctic (12.4%) origins and subsequent dispersal to South America. Similarly, in biogeographic inferences using Lagrange [36], analyses where Africa and South America were allowed to be adjacent strongly supported Gondwanan vicariance at the root node (global maximum likelihood −250.4; electronic supplementary material). Analyses where Africa and South America were not adjacent (thus precluding vicariance as a possible outcome and implying Northern Hemisphere migrations) had significantly worse overall likelihood scores (global maximum likelihood −252.3). Dispersal from Africa to South America via Australia and Antarctica (achieved by allowing dispersal between Australia and South America) was even less likely (global maximum likelihood −295.9). However, we agree with Rozen [11] that the most convincing support for vicariance over migration comes from biological evidence. The brood cells of fideliine bees consist of unlined cavities in the sand (figure 1c); for this reason, these bees are entirely restricted to strongly seasonal deserts where annual rainfall is not only low but also extremely unlikely during their nesting season [11]. Alternative biogeographic scenarios to explain their present-day distribution necessarily involve migrations through the Northern Hemisphere or via Antarctica; both of these scenarios imply adaptations to temperate habitats, which we consider extremely unlikely. Indeed, ancestral state reconstructions performed using BayesTraits [38] reveal that the ancestor of Megachilidae built nests that were neither lined with foreign material nor with glandular secretions (average maximum-likelihood probability 0.99, average difference in likelihood 3.6 and 5.4, respectively; posterior probability 0.98, BF 6.0 and 14.4, respectively). All species using foreign material in nest construction form a monophyletic group. The use of foreign material in nest construction has a single origin at the base of the tribes Anthidiini, Dioxyini, Osmiini and Megachilini (average maximum-likelihood probability 0.99, average difference in likelihood 2.5 and 7.3; posterior probability 0.99, BF 4.4 and 10.3).

The use of foreign material in nest construction underlies the ability of megachilid bees to colonize temperate regions and appears to be associated with a dramatic increase in clade species diversity. The lineages Fidelia, Neofidelia and Pararhophites together number 17 species, while the tribes Anthidiini, Osmiini and Megachilini collectively include over 3900 species and exhibit a worldwide distribution. MEDUSA [43] results provide evidence for two significant increases in diversification rate in our phylogeny, the first at the base of the higher megachilids and the second nested within the genus Megachile (figure 1).

The larger of the two rate shifts increases from 0.0164 to 0.0867 and occurs approximately 7 Myr after the advent of nest construction using foreign material, a behaviour that is first observed in the enigmatic genus Aspidosmia [37], the first branch within the subfamily Megachilinae. The increase in diversification rate that occurs after the divergence between Aspidosmia and the rest of Megachilinae suggests that the use of foreign material in nesting may have driven diversification but was not the only factor underlying it.

The second shift in diversification rate occurs within the genus Megachile, from 0.087 to 0.315. The increase in diversification tempo happens approximately 8 Myr after the origin of the true leafcutting Megachile (Michener's group I) from the paraphyletic assemblage of the Chalicodoma group of subgenera (Michener's group II) [37]. Despite their relatively recent origin (22 Ma; 95% HPD 16–27), leafcutting Megachile are extremely diverse and abundant on all continents. The explanation for such species richness may be related to their high reproductive output [48] and their ability to colonize an extremely broad range of habitats, from moist tropics to extreme deserts.

In association with the ancestral state reconstructions of nesting biology, the diversification rate analysis reveals an intimate association between nesting biology, distribution and diversification. The single origin of nest-lining behaviour in Megachilidae makes it difficult to test for correlated evolution between nesting and other traits of interest. In contrast, the total geographical area occupied by the terminal taxa varies from lineage to lineage throughout the phylogeny, allowing us to test for an association between area and diversification rate. The results of BayesTraits analyses [38] indicate strongly correlated evolution between geographical area and diversification rate (BF = 25.8). Consistent with other studies where geographical area has been correlated with diversification [49], we envision a scenario where nest-lining behaviour promoted the widespread colonization of temperate habitats, which in turn drove the diversification seen in the higher megachilids.

Ancestral state reconstructions strongly indicate that the three fideliine lineages are restricted to deserts owing to their plesiomorphic nesting biology, rather than as a secondary adaptation. The use of foreign material in nest construction has a single origin at the base of the tribes Anthidiini, Dioxyini, Osmiini and Megachilini. It has enabled these bees to repeatedly colonize temperate habitats and catalysed a massive shift in diversification rate. Surprisingly, Lithurgini manage to survive in temperate and tropical conditions, although they do not line their brood cells. All Lithurgines dig burrows in wood or stems and their pollen provisions are protected from humidity in these above-ground substrates. In other respects, the pollen provisions and nest architecture of lithurgine bees are very similar to those of fideliine bees. The pollen mass is neither worked nor manipulated by the female; it does not form a spherical mass but rather occupies the entire rear portion of the nest cell. Their burrows are mostly branched and the cells are either not partitioned or partitioned using sawdust or wood particles obtained from the excavation of brood cells. These bees then fill their completed burrows with sawdust, in much the same way that fideliine bees do with sand [6,7].

The identification of nest-lining behaviour as a key innovation also offers an explanation for the behavioural conservatism seen in the early megachilids. The two basal lineages, Fidelia and Neofidelia, which emerged prior to the advent of this innovation, have retained highly similar and comparatively unusual behaviours on two different continents for more than 100 Myr, suggesting powerful evolutionary constraints on these behaviours. A comparison of their nesting biology and host-plant associations provides a unique glimpse into the biology of early megachilids over 120 Ma, early in bee evolution.

(a) Nesting

Unlined nests similar to those observed in fideliine bees are rare among bees. All members of the species-rich short-tongued bee families Andrenidae, Halictidae and Colletidae, which probably form a monophyletic group [9], apply secreted lining to their brood cells [1,2]. Curiously, some desert andrenids apply a secreted lining not to the walls of their nests but to the pollen provisions themselves [50]. In the family Apidae, the evolution of nest-lining behaviour is obscured by three probable origins of oil or resin collection, the unknown phylogenetic positions of lineages that apparently do not line their brood cells (e.g. Eremapis; [51]), four independent origins of cleptoparasitism and the evolution of social behaviour [34]. Lastly, unlined nests are known in several members of the melittid bees [37], a species-poor group that may represent the earliest lineages of extant bees [9]. Many melittids are restricted to xeric areas, especially several species-poor genera for which the nesting biology is not documented (e.g. Eremaphanta, Afrodasypoda, Promelitta). The few genera that are present in temperate regions either collect floral oil (Macropis and Rediviva), have evolved secreted cell lining (Melitta) or shape their pollen balls into peculiar, tripod-like structures that reduce contact between the provisions and the cell wall (Dasypoda). In fact, according to the most comprehensive phylogenetic hypothesis currently available for bees [9], the construction of unlined nests is a behaviour restricted to a few primitive lineages; among all bees, there is not a single documented instance of a reversion to building unlined nests after the evolution of nest-lining behaviour occurs. These observations strongly suggest that the ancestor of bees did not line its nest cells [52] and that cell lining, using either glandular secretions or foreign material, has multiple origins in bees.

By contrast, unlined nests are prevalent among apoid wasps [53,54], the paraphyletic group from which bees arose. In fact, the nesting biology of fideliine bees is reminiscent of that of many sand-nesting apoids [10] whose nests consist of unlined burrows in the sand. Apoid wasps store paralysed prey that may stay alive for several weeks before being consumed by their larvae. While stored provisions are always susceptible to spoilage [55], the transition from prey-hunting to pollen-collecting in the early bees may have dramatically exacerbated the problems associated with the storage of provisions, given the hygroscopic properties of pollen and its susceptibility to fungal infection, and driven selective pressure to protect provisions from moisture.

(b) Foraging behaviour and host-plant associations

Interactions with angiosperms have often been cited as important driving factors underlying diversification in phytophagous insects [56]. Our results, however, suggest that the shift to pollen collection in early bees did not simply open a vast new ecological niche. First, if the biology of the earliest extant megachilids indeed mirrors the biology of ancestral bees, early bees were constrained to xeric and strongly seasonal habitats, and highly limited in their phenology. Second, another aspect of the behaviour of early bees may have seriously hampered them from fully using all available angiosperm hosts: a pronounced floral specificity (oligolecty). Comparisons of the well-documented foraging behaviour of the basal members of Megachilidae (figure 1d; electronic supplementary material, table S4) provide unique insights into bee–flower relationships prevalent more than 100 Myr ago. Fideliine bees, both in South America and Africa, are notorious oligoleges. Rozen [12] states that on both continents, fideliine bees tend to forage on large flowers with well-exposed anthers (figure 1d); even the narrowly polylectic Neofidelia profuga appears to restrict pollen collection to a few hosts with similar flower architecture, namely large flowers with radial symmetry and well-exposed stamens. The same appears to be true for many lithurgine bees: distantly related species of the genera Lithurgus and Microthurge in Australia, Africa and South America forage exclusively or predominantly on Malvaceae with large flowers, such as Hibiscus, Sida and Turnera (electronic supplementary material, table S4); Asian species appear polylectic but restrict pollen collection to flowers of Malvaceae and Convolvulaceae; and two lineages, the subgenus Lithurgopsis and the genus Trichothurgus, have maintained a close association with the large flowers of Cacteaceae in both South and North America. Lastly, the two species of Pararhophites for which host-plant information is available restrict their foraging to morphologically similar but phylogenetically unrelated flowers that have exposed stamens and five white petals (electronic supplementary material, table S4). In summary, a narrow host range clearly appears to be the plesiomorphic condition in Megachilidae. Moreover, there is a striking lack of bilaterally symmetrical flowers among the hosts of the basal megachilid lineages. By contrast, bilaterally symmetrical flowers, such as Fabaceae and Lamiaceae, which are typical bee-pollinated flowers, are common hosts of a significant proportion of the higher megachilids.

These observations strongly support the view that host choices in bees are evolutionarily constrained [57], as well as the widely discussed assertion that oligolecty is a primitive (rather than a derived) state in bees (reviewed in [57]). Müller [58] suggested that oligolecty might be a behavioural constraint related to flower manipulation, pollen collecting or pollen digestion, rather than a secondary specialization. Interestingly, Müller [58] notes that most apoid wasps are specialized hunters. In fact, the foraging behaviour of apoid wasps is similar in many ways to that of primitive bees. It is evolutionarily conserved, with related species exhibiting similar behaviour on different continents. Most species restrict their host range to distantly related prey belonging to the same order (e.g. grasshoppers, spiders or leafhoppers), which are often similar in size and appearance [53,54], and co-occur in the same habitat. Evans [59] elegantly summarizes the foraging behaviour of the philanthine wasp tribe Cercerini as follows: ‘I suggest that these wasps are not necessarily “good taxonomists”, but that they are programmed to hunt in certain situations and to respond to prey of a certain size and behaviour’ (p. 521). We hypothesize that early bees inherited foraging specificity as a behavioural constraint from their apoid wasp ancestors.

4. Conclusion

Our work reveals that two extant lineages are ‘living fossils’ among the bees. The Mid-Cretaceous origin of Fidelia and Neofidelia, and their bizarre, plesiomorphic biology, strongly support the possibility that these bees reflect the biology of the earliest bees more closely than any other extant lineage. The evolutionary patterns we report in Megachilidae lay the initial framework for understanding patterns of nesting behaviour, distribution, host-plant preference and diversification in all bees.

Acknowledgements

This work was supported by a National Science Foundation grant to B.N.D. and Terry Griswold (DEB-0742998); a National Geographic Society grant to B.N.D. (8448-08); a grant from the Swiss National Science Foundation (PBEZP3-122970) to C.J.P.; grants from the Graduate School and the Mario Einaudi Center at Cornell University to J.R.L.; a grant through the Griswold Endowment (Department of Entomology, Cornell University) to J.R.L.; and a grant from the Linnean Society of London and the Systematics Association to J.R.L. We thank Andreas Müller, Robert Paxton and Antonella Soro for comments on the manuscript; Simon Ho for assistance with BEAST; Rick Ree and Stephen Smith for assistance with Lagrange; Anurag Agrawal for assistance with BayesTraits; Stuart Campbell for assistance with statistical analysis; Werner Arens, Sophie Cardinal, Victor Gonzalez, Terry Griswold, Michael Kuhlmann, Denis Michez, Laurence Packer, Sébastien Patiny, Jerry Rozen, Maximilian Schwarz and Claudio Sedivy for providing specimens; and Philip Perkins at the Harvard University Museum of Comparative Zoology for access to bee fossils.

1981Observations on the nesting behaviour of Megachile rubi and M. texana with a discussion of the significance of soil nesting in the evolution of megachilid bees (Hymenoptera: Megachilidae). J. Kansas Entomol. Soc.54, 557–570.

2008Patterns of host-plant choice in bees of the genus Chelostoma: the constraint hypothesis of host-range evolution in bees. Evolution62, 2487–2507.doi:10.1111/j.1558-5646.2008.00465.x (doi:10.1111/j.1558-5646.2008.00465.x)