Skip to main content
FreeE-Article

A Conceptual and Statistical Framework for Adaptive Radiations with a Key Role for Diversity Dependence

Abstract

In this article we propose a new framework for studying adaptive radiations in the context of diversity-dependent diversification. Diversity dependence causes diversification to decelerate at the end of an adaptive radiation but also plays a key role in the initial pulse of diversification. In particular, key innovations (which in our definition include novel traits as well as new environments) may cause decoupling of the diversity-dependent dynamics of the innovative clade from the diversity-dependent dynamics of its ancestral clade. We present a likelihood-based inference method to test for decoupling of diversity dependence using molecular phylogenies. The method, which can handle incomplete phylogenies, identifies when the decoupling took place and which diversification parameters are affected. We illustrate our approach by applying it to the molecular phylogeny of the North American clade of the legume tribe Psoraleeae (47 extant species, of which 4 are missing). Two diversification rate shifts were previously identified for this clade; our analysis shows that the first, positive shift can be associated with decoupling of two Pediomelum subgenera from the other Psoraleeae lineages, while we argue that the second, negative shift can be attributed to speciation being protracted. The latter explanation yields nonzero extinction rates, in contrast to previous findings. Our framework offers a new perspective on macroevolution: new environments and novel traits (ecological opportunity) and diversity dependence (ecological limits) cannot be considered separately.

Online enhancement   appendixes. Dryad data:http://dx.doi.org/10.5061/dryad.sr927n43.

Introduction

Adaptive radiations are expected to leave two distinct signatures on diversification (defined as origination/speciation minus extinction) patterns (Yoder et al. 2010): a large initial increase in diversification rate and a subsequent decrease in the diversification rate. The initial, sudden positive rate shift is generally thought to be triggered by ecological opportunity (Simpson 1949, 1953; Schluter 2000; Yoder et al. 2010). There are three types of ecological opportunity (Simpson 1949, 1953) through which species can enter the “adaptive zone”: (1) extinction of antagonists; (2) appearance of a new environment, either due to dispersal to a new area or due to external forces changing the environment; or (3) a key innovation, that is, the appearance of aspects of organismal phenotype that promote diversification (Hunter 1998) by enabling escape from competition for niche space, by reducing the probability of extinction by increasing population density via increased individual fitness or by favoring reproductive or ecological specialization (Heard and Hauser 1995). All types of ecological opportunity result in ecological release. Types 1 and 2 require (ecological) changes in the environment of the focal species, whereas type 3 requires an evolutionary change in the focal species itself. However, the three types of ecological opportunity are not independent: novel species traits might promote positive net diversification only in a new biotic (e.g., after antagonist extinction) or abiotic environment or in the presence of other traits (De Queiroz 2002). For this reason we believe that given a clade, it is more practical to consider two types of adaptive radiations: an increased diversification of all lineages of a clade and increased diversification of a single (or a few) lineages. We immediately admit that this dichotomy is a matter of scale, because a clade where all lineages are affected can be part of a larger clade where it is the only affected subclade, but the distinction is useful for a given clade.

While the initial, sudden positive shift in diversification rate is obviously evidence for adaptive radiations, the subsequent more gradual negative rate shift is just as indicative of adaptive radiation (Albertson et al. 1999; Lovette and Bermingham 1999; McPeek 2008; Phillimore and Price 2008). Because the negative rate shift is observed across clades of different ages and for different taxa, it seems unlikely to be caused solely by direct time dependence of diversification rates through external forcing (e.g., climate change reducing or increasing extinction rates), but it is the accumulation of species within the clade itself that limits further diversification. Therefore, this phenomenon is called diversity-dependent (or density-dependent) diversification (Valentine 1973; Sepkoski 1978, 1979, 1984; Walker and Valentine 1984; Schluter 2000; Rabosky 2009). The negative shift in diversification rate can, in principle, be due to a reduction in speciation rate or an increase in extinction rate as diversity increases (Sepkoski 1978), but an increase in extinction rate has been shown to be a very unlikely form of diversity dependence (Alroy 1996; Rabosky and Lovette 2008b; Burbrink and Pyron 2009; Quental and Marshall 2009). The reduction in speciation rate is considered to be a consequence of accessible niches becoming filled (Simpson 1953; Valentine 1980; Walker and Valentine 1984; Schluter 2000; Rabosky and Lovette 2008a; Gavrilets and Losos 2009) or a consequence of decreased range sizes following successive subdivisions of ancestral ranges (Sepkoski 1978; Rosenzweig 1996). Both mechanisms increasingly limit opportunity for ecological speciation and suggest that there is a clade-level carrying capacity for the number of species than can exist at any one time.

In this article we will present a framework for adaptive radiation that respects both the positive and negative rate shifts in a single coherent framework. In this framework, the negative shifts are due to diversity dependence and positive shifts are either due to (1) changes in the speciation and extinction rates or clade-wide shifts in carrying capacity or (2) decoupling of the diversity-dependent dynamics of a subclade from the main clade’s diversity-dependent dynamics, thereby affecting the subclade’s diversification rates. The decoupling occurs because of a key innovation or dispersal event that enables escape from diversity dependence regulated by competition for niche space. We emphasize that the key innovation could have occurred much earlier than the decoupling if the innovative trait required a new environment to cause the decoupling. We will compare the subclade-specific model of decoupling of diversity dependence with a model that assumes a clade-wide shift in speciation rate, extinction rate, or carrying capacity.

In the past decade molecular phylogenies have come to the fore as a valuable additional source of information on diversification patterns, especially where fossil evidence is lacking. Both temporal and topological methods have been developed to detect significant shifts in diversification rates (reviewed in Sanderson and Donoghue 1996; Mooers and Heard 1997; Barraclough and Nee 2001; Chan and Moore 2002; Ricklefs 2007). Temporal methods compare the observed distribution of speciation events through time, inferred from the lineage accumulation through time (the lineages-through-time plot; Paradis 1997), with the distribution expected by a null model of cladogenesis. These have the disadvantage that they identify only the timing of the change in the rate of lineage accumulation but not its location in the phylogenetic tree. Earlier, Farris (1976) had already argued that asymmetric phylogenies are more common than symmetric ones, and thus this asymmetry (or imbalance) conveys information. Topological methods build on this fact and focus on the relative diversity in the descendant subclades of a node (Vrba 1980). Single-node topological tests compare the observed difference in sister-group sizes to the expectation under a null model of cladogenesis (Slowinski and Guyer 1989a, 1989b, 1993; Goudet 1999). However, because these methods compare only present diversity to diversity at the clade’s origin, they cannot detect diversification rate increases and subsequent decreases (Hunter 1998). Whole-tree methods compute a single statistic of tree imbalance or asymmetry from the relative diversity of nodes throughout the entire tree (Shao and Sokal 1990; Kirkpatrick and Slatkin 1993; Chan and Moore 2002). These nonparametric approaches are robust in detecting rate shifts but do not lead us toward the underlying causes of these rate shifts. Alfaro et al. (2009) presented an algorithm to find significant rate shifts based on the likelihood of the phylogeny, but they used the constant-rate birth-death model and thus ignored the second phase of an adaptive radiation, despite the ample evidence for this second phase. This implies that this method may miss a shift in diversification due to a change in the carrying capacity or due to decoupling of diversity-dependent dynamics. Here we present not only the model of adaptive radiation incorporating its two phases in the light of diversity dependence as outlined above but also a likelihood-based inference method to detect decoupling of diversity dependence given the whole phylogenetic tree. This method accounts for asymmetry as well as the distribution of branching times and can handle incomplete phylogenies (if one knows the number of species that are not placed in the phylogeny but do have the same ancestral crown group).

We start by describing our general framework for studying adaptive radiations. We then describe our inference method for detecting decoupling of diversity-dependent dynamics, and we apply it to the radiation of the North American clade of the legume tribe Psoraleeae studied by Egan and Crandall (2008a). They found evidence for diversification rate shifts in this tribe, using both temporal and topological methods. We confirm this finding using our single unified approach. Moreover, not only do we estimate the diversification parameters, we also identify the timing of the decoupling.

A Coherent Framework for Adaptive Radiations

We view diversification as a birth-death process (Kendall 1948; Bailey 1964). This is the standard mathematical model of macroevolutionary diversification (Nee 2006) that has been frequently used in the study of adaptive radiations (e.g., Alfaro et al. 2009; Steeman et al. 2009). However, rather than assuming time-constant speciation and extinction rates or direct time dependence of these rates as has been done previously, we allow these rates to be diversity dependent. In general, it makes the most sense, as argued in the introduction, to assume that the diversity dependence is entirely in the speciation rate, but we note that our framework is easily amenable to account for diversity-dependent extinction. In the example we will study, we assume that the per-species speciation rate declines linearly with diversity (Etienne et al. 2012). The diversity at which the speciation rate is equal to the extinction rate is the carrying capacity of the clade, analogous to the definition of carrying capacity in population dynamics models. This diversity dependence describes phenomenologically the filling of accessible niches and hence the second phase of an adaptive radiation. Note that, at equilibrium, when speciation and extinction rates equal one another, not all niches are filled, unless the extinction rate equals 0.

In the first phase of an adaptive radiation, there are shifts in diversification parameters. These shifts can be clade-wide or subclade-specific (Table 1). We argue that diversity dependence again plays a key role, in both types of shift. The clade-wide rate shifts occur when external factors create new ecological opportunities for all lineages (McInnes et al. 2011), for instance, climate change (Ezard et al. 2011) or tectonic movement (e.g., cetaceans diversifying more rapidly due to restructuring of the oceans; Steeman et al. 2009). These external factors, in creating more niche space, most likely affect the clade-level carrying capacity rather than the intrinsic speciation rate or extinction rate.

Table 1. 

Proposed framework for adaptive radiations

Macroevolutionary modelPossible mechanismSpecific example
Clade-wide shift in diversification parameters: 
 Shift in intrinsic speciation rateClimate changeQuaternary glacial cycles affecting diversification of Psoraleeae (Egan and Crandall 2008a)
 Shift in extinction rateClimate change and impact eventsAsteroid shower producing the Chicxulub impactor that caused mass extinctions at the K-T boundary (Bottke et al. 2007)
 Shift in clade-level carrying capacityHabitat restructuringRestructuring of the oceans provides more niche space for cetaceans (Steeman et al. 2009)
Subclade-specific shift in diversification parameters: 
 Shift in subclade-specific intrinsic speciation rateKey innovation favoring specialization or reproductive isolation, thereby increasing speciation rateDifferent floral nectar spurs associated with different pollinators (Hodges and Arnold 1995)
 Shift in subclade-specific extinction rateKey innovation increasing individual fitness, thereby enlarging population size or allowing survival at lower population density and thereby reducing extinction rateGreater reproductive efficiency, when rare, of angiosperms than of gymnosperms (Haig and Westoby 1991)
 Decoupling of diversity dependenceEscape from niche competition due to antagonist extinction or due to the availability of or dispersal to a new environment. The new environment may require a (previously dormant) key innovation in species traits before it triggers an adaptive radiation.Galapagos finches (Lack 1947)
 Escape from niche competition due to key innovation. The innovative trait may require a change in (a)biotic environment before it results in adaptive radiation.Evolution of extended subdigital toe pads in anole lizards (Losos 2009); evolution of antifreeze glycoproteins in notothenioid fishes in Antarctic waters (Matschiner et al. 2011)

Subclade-specific shifts result from either shifts in intrinsic speciation rate and extinction rate for the subclade, or (partial) decoupling of diversity-dependent dynamics of the subclade from the main clade. Shifts in intrinsic speciation rate and extinction rate crucially interact with diversity dependence: because all species compete for the same limited number of niches, those with elevated speciation rates or reduced extinction rates will eventually exclude the other species. This may explain evolutionary succession, where radiation of one taxon is accompanied by decline in another (Valentine 1973), for example, the replacement of therapsids by archosauromorphs during the Triassic facilitated by the latter’s higher growth rates (Sookias et al. 2012) or the replacement of Diplograptina by Neograptina during the late Ordovician (Bapst et al. 2012). Decoupling of diversity-dependent dynamics, complete or partial, can occur after key innovations or dispersal to a new environment, where the focal species’ diversification is released from diversity dependence. The focal species starts its own subclade that does not “feel” the presence of the other species, simply because the key innovation or dispersal event has allowed it to escape competition for niche space as it has moved far away in trait space or in geographical space, respectively. Note that dispersal to a new environment can be interpreted as a key innovation (De Queiroz 1998). We adopt this broad interpretation of key innovation from hereon (if only for notational convenience). In this article, we concentrate on the decoupling of diversity-dependent dynamics, as we believe it to be a major process in causing adaptive radiations and it increases our understanding of the role of key innovations. For instance, it is likely that evolutionary succession also involves (partial) decoupling triggered by innovative traits.

Extinction of antagonists can be positioned at several places in our framework. When an antagonist becomes extinct, the surviving species do(es) not experience the presence of the antagonist, simply because it is no longer there, but this does not necessarily constitute a decoupling of the dynamics. If it is merely a random extinction opening up the niche of the antagonist, then the extinction event is already part of the usual speciation-extinction dynamics. If the antagonist extinction results in a lower extinction rate of the surviving species, it can be seen as time dependence of the extinction rate and should thus be viewed as an example of a (temporary) rate shift. If the extinction of the antagonist creates more niches than just leaving its own, it can be viewed as a shift in carrying capacity. Only if these new niches are not available to all species, it constitutes decoupling of diversity-dependent dynamics.

The idea of decoupling of diversity-dependent dynamics can apply to a single species or to a group of species. In the case of a group of species where the decoupling is caused by an innovative trait, it is not necessary that this trait evolves multiple times; the trait may evolve in a single species and stay dormant until a change in (a)biotic environment activates it, by which time the species may have diversified. In our mathematical implementation of the decoupling model, we assume for simplicity that only a single species undergoes the decoupling event. Furthermore, to keep our focus, we do not look at shifts in subclade-specific intrinsic speciation and extinction rates without enabling escape from diversity dependence, that is, without decoupling of diversity-dependent dynamics. Nor do we consider partial decoupling.

Our model of key innovations causing decoupling of diversity-dependent dynamics is depicted in figure 1. The decoupling occurs somewhere along the lineage that will form a new subclade. We denote the time of this occurrence by . Time is a parameter in our model, so it can be estimated together with the other parameters of the model rather than set beforehand at fixed points (Alfaro et al. 2009) or allowed only at branching points (Rabosky 2006). Up to the lineage and its descendants contribute to the diversity dependence of the main clade, which we will call M. After the lineage no longer has an effect on the diversification in the main clade, but it now affects only the diversification in the subclade it initiated, which we will call S. We thus have a model with seven parameters: the decoupling time , three parameters for the diversification of the main clade (speciation rate, extinction rate, and carrying capacity), and similarly, three for the subclade. The number of parameters may be reduced by assuming that the intrinsic speciation rate and/or the extinction rate are the same for main clade and subclade. A model comparison based on an information criterion (we will use the Akaike information criterion [AIC]) will single out the most parsimonious model for the data.

Figure 1. 
Figure 1. 

Example phylogeny with key innovation. Black lines: branches of main clade M; black dots: extant species of main clade M; red lines: branches of subclade S; red dots: extant species of subclade S; green line: branch on which key innovation occurs; green cross: key innovation event. The figure defines the branching times , , and used in the algorithm of box 2.

Likelihood of a Phylogeny under the Key Innovation Model

The birth-death process (Kendall 1948; Bailey 1964) can be mathematically described with the master equation for the probability of n species at time t which we denote by :

2222

where and are the per-species speciation and extinction probability rates (hereafter simply called rates), respectively. In the example we will study below we assume the following functional forms for the speciation and extinction rates (Etienne et al. 2012):

2222

where is the initial or intrinsic speciation rate during the radiation (more precisely, it is the virtual speciation rate when there are no species at all), μ is the extinction rate (we assume that ), and K is the clade-level carrying capacity, that is, the level of diversity where speciation and extinction exactly balance each other. The functional form for the speciation rate is a simple linear decline, extending the density-dependent (logarithmic) (DDL) model of Rabosky and Lovette (2008a) with extinction and therefore termed the DDL+E model (Etienne et al. 2012). It is similar to Sepkoski’s (1978) model of diversity dependence. Alternative models are also possible, for example, the extension of the density dependent (exponential) (DDX) model (Rabosky and Lovette 2008a) with extinction, which can also be formulated in terms of a carrying capacity where the net diversification rate vanishes, but for our illustrative purpose, the DDL+E model suffices.

We start with the algorithm to compute the likelihood of a phylogeny for the diversity-dependent diversification process (Etienne et al. 2012). It is based on the repeated integration of a system of ordinary differential equations. Consider a time t between two branching times where the phylogeny has k branches. Denote by the probability that a realization of the diversification process is consistent with the phylogeny up to time t and has n species at time t. The dynamical equation is (Etienne et al. 2012):

2222

At the branching times, we impose a branching event by multiplying the probabilities by the speciation rate , and increasing by 1 the number of branches k. This leads to the algorithm in box 1 to compute the probability of a phylogeny under the diversity-dependent birth-death model (Etienne et al. 2012). The resulting likelihood differs by a combinatorial factor , with q being the number of species, from the likelihood computed by Etienne et al. (2012). The latter allows direct comparison with the likelihood of Nee et al. (1994), but the combinatorial factor should be dropped if one wants to compute the likelihood of a phylogeny rather than the likelihood of a set of branching times.

Figure 2. 
Figure 2. 

Phylogeny of the North American Psoraleeae tribe. In this article we test whether the subclade in red has undergone a key innovation somewhere along the green branch of A or B. C, D, Lineages-through-time plots corresponding to A and B, respectively. The blue circles correspond to the lineage accumulation in the entire phylogeny, while the black and red dots correspond to the lineage accumulation in the main clade and subclade, respectively.

Here we extend the algorithm of box 1 to account for the key innovation (KI) causing decoupling of dynamics at time . We separate the likelihood computation in a part for the main clade M and a part for the subclade S. For the main clade M, we impose the key innovation event at time by decreasing by one the number of branches k. For the subclade S, we use the algorithm in box 1, starting from the key innovation time with . We further define the branching times in the main clade before the key innovation, , and after the key innovation, , and in the subclade, . See also figure 1. Finally, if there are missing species in the phylogeny, they can be specified as and . Because we often do not know whether the missing species are in the main clade or the subclade, we can sum over all possible combinations of and such that , where m is the total number of missing species. Box 2 shows the algorithm to compute the likelihood of a phylogeny with main clade M and subclade S. This algorithm gives the likelihood without requiring that the two crown lineages survive. To condition on their survival, as advocated since Nee et al. (1994), we need to follow the number of descendants of the two crown species from time to time and compute the probability that the number of descendants of both are positive. We refer to appendix A for the technical details of the conditioning and to appendix B for details on how missing species are handled. We have made the likelihood computation available in the R package DDD (http://cran.r-project.org/web/packages/DDD/index.html), from version 1.0 onward. We also have Matlab code available on request.1

Case Study

We illustrate our method by applying it to the North American clade of the legume tribe Psoraleeae (Egan and Crandall 2008a); see figure 2. The phylogenetic data for this clade are available in the Dryad Data Repository (Egan and Crandall 2008b). This clade contains 47 species (43 are in the phylogeny, 4 species are missing) within 5 genera. The most species-rich genus is Pediomelum (29 species) and contains 3 subgenera, Leucocraspedon, Disarticulatum, and Pediomelum, the first of which contains only 2 species. These asymmetries suggest an adaptive radiation in the latter two subgenera, or more precisely—assuming that antagonist extinction did not occur—a key innovation. Egan and Crandall (2008a) found evidence for significant rate shifts, using both temporal and topological approaches, but did not find support for the hypothesis that adaptation to xeric habitats (hot and dry environments) acted as key innovation. Egan and Crandall (2008a) dated the origin of this North American clade at 5.8 million years ago (mya), and of the genus Pediomelum at 3.2 mya. The major radiation of this genus did not start until around 2 mya, when the subclade containing subgenera Pediomelum and Disarticulatum diverged from the main clade (including subgenus Leucocraspedon). Egan and Crandall (2008a) detected a major shift in diversification rate around this time.

We tested whether our approach can indeed identify the seemingly evident key innovation around 2 mya or that the asymmetry occurs by chance and the increased branching is due only to a shift in diversification rates affecting the entire tree. We did this for two possible scenarios for the key innovation (see fig. 2). We determined the timing of this key innovation and whether there was only decoupling of diversity-dependent dynamics with main clade and subclade having their own carrying capacity or whether the intrinsic speciation rate and extinction rate also differed between the two. This resulted in a variety of models. The constant-rate (CR) models we tested were the standard (diversity-independent) birth-death model (CR0) and the diversity-dependent DDL+E model (CR1). The shifting rate (SR) models we tested include the Yule 2 rate model (SR0) that was also employed by Egan and Crandall (2008a) and models with a shift in the carrying capacity K (SR1), in K and the extinction rate μ (SR2), and in K and the intrinsic speciation rate (SR3). Our key innovation (KI) models include models where the subclade differs from the main clade only in K (KI1), in K and μ (KI2), in K and (KI3), and in K, , and μ (KI4). The models with a 0 denote models that we consider unrealistic because they either ignore extinction (SR0) or diversity dependence (CR0, SR0), but we analyzed them anyway as they are standard null models.

The key innovation models clearly stand out as the best description of the data regardless of the scenario (fig. 2) of the key innovation and regardless of whether the data set was truncated to compensate for lack of species recognition, as tables 2 and 3 show: the Akaike weights of the KI models were all greater than , while for the CR and SR models, they were all smaller than ). The scenario of figure 2B, where the key innovation occurs between the branching points at 2.02 and 1.73 mya seems more likely than the scenario of figure 2A, where it occurs between the branching points at 3.21 and 2.02 mya. Among the key innovation models, most support is lent to the model where the subclade not only differs in the carrying capacity K but also in the intrinsic speciation rate (KI3). This suggests that not only did the decoupled subclade experience a range of new niches, it was also able to colonize them more rapidly than would be expected from a reduction in the influence of diversity dependence alone. However, the Akaike weights of the KI models were similar and thus do not justify a strong preference for any particular model.

Table 2. 

Parameter estimates and model performance for the model shown in

figure 2A
Modelλ0, 1μ1K1λ0, 2μ2K2tiLLwA
Full data:         
 CR0.62.00−62.7.00
 CR1.93.0079.7−61.5.00
 SR0.35.702.02−61.3.00
 SR11.96.067.98λ0, 1μ151.32.02−54.5.00
 SR21.98.077.60λ0, 1.0051.22.02−53.7.00
 SR3.82.009.282.21μ150.02.02−53.0.00
 KI14.85.4616.8λ0, 1μ129.32.02−48.1.14
 KI23.54.5516.9λ0, 1.0030.22.02−46.2.35
 KI3.72.0122.03.56μ130.32.02−46.2.36
 KI4.70.0223.03.53.0030.22.02−46.2.14
Truncated data:         
 CR0.79.16−57.0.00
 CR11.07.37106.8−56.8.00
 SR0.36.862.02−54.6.00
 SR11.71.078.12λ0, 1μ160.72.02−51.7.00
 SR21.82.187.53λ0, 1.0058.82.02−50.7.00
 SR3.99.099.152.04μ155.22.02−51.0.00
 KI14.93.6116.8λ0, 1μ129.92.02−43.5.48
 KI24.32.6517.0λ0, 1.3730.22.02−43.4.20
 KI311.1.6317.04.91μ129.82.02−43.3.22
 KI411.4.6716.94.02.2930.92.02−43.1.10

Note. Parameter estimates and model performance in terms of log likelihood (LL) and Akaike weight (wA) for constant-rate (CR), shifting-rate (SR), and key innovation (KI) models for the full data set and reduced data where the last 0.22 million years is truncated. For the SR models, the subscripts correspond to the diversification before and after the rate shift. For the KI models, the parameters with subscript 1 correspond to the main clade and those with subscript 2 to the subclade. The key innovation is assumed to take place according to figure 2A. Explanation of the models: CR0 = constant-rate birth-death model, CR1 = constant-rate DDL+E model, SR0 = birth model with a shift in speciation rate, SR1 = DDL+E model with shift in carrying capacity, SR2 = DDL+E model with shift in carrying capacity and extinction rate, SR3 = DDL+E model with shift in carrying capacity and speciation rate, KI1 = DDL+E model for main clade and subclade that differ in their carrying capacities, KI2 = DDL+E model for main clade and subclade that differ in their carrying capacities and extinction rates, KI3 = DDL+E model for main clade and subclade that differ in their carrying capacities and speciation rates, and KI4 = DDL+E model for main clade and subclade that differ in all parameters.

View Table Image
Table 3. 

Parameter estimates and model performance for the model shown in

figure 2B
Modelλ0, 1μ1K1λ0, 2μ2K2tiLLwA
Full data:         
 CR0.62.00−62.8.00
 CR1.93.0079.7−61.5.00
 SR0.41.701.73−61.7.00
 SR13.41.3010.4λ0, 1μ146.71.73−54.4.00
 SR22.63.459.49λ0, 1.0048.71.73−51.3.00
 SR3.57.0020.62.56μ148.91.73−52.5.00
 KI15.75.5118.8λ0, 1μ127.41.73−47.8.14
 KI24.00.5818.7λ0, 1.0028.01.73−46.4.21
 KI3.75.0125.03.93μ127.91.73−45.6.47
 KI4.76.0225.13.91.0028.01.73−45.6.18
Truncated data:         
 CR0.79.16−57.0.00
 CR11.07.37106.8−56.8.00
 SR0.41.861.73−54.9.00
 SR13.85.5612.3λ0, 1μ147.31.73−50.4.00
 SR22.95.6611.1λ0, 1.2349.41.73−49.7.00
 SR34.89.5712.43.83μ147.41.73−50.3.00
 KI16.38.6919.1λ0, 1μ127.31.73−42.2.46
 KI26.21.7019.1λ0, 1.6527.31.73−42.3.17
 KI313.8.6819.06.08μ127.51.73−41.7.27
 KI414.8.7019.05.58.5527.51.73−41.7.10

Note. Parameter estimates and model performance in terms of log likelihood (LL) and Akaike weight (wA) for constant-rate (CR), shifting-rate (SR), and key innovation (KI) models for the full data set and reduced data where the last 0.22 million years is truncated. For the SR models the subscripts correspond to the diversification before and after the rate shift. For the KI models, the parameters with subscript 1 correspond to the main clade and those with subscript 2 to the subclade. The key innovation is assumed to take place according to figure 2B. The results for the CR models are identical to those in Table 2; the global likelihood optima for the SR models should also identical (because the data are identical), but we report the local likelihood optima that are found when starting the optimization close to the key innovation time found in the KI models. For explanation of the models, see Table 2.

View Table Image

Egan and Crandall (2008a) suggested that the final negative rate shift may be due to lack of species recognition. This has been termed protracted speciation (Rosindell et al. 2010; Etienne and Rosindell 2012): incipient species go unrecognized because they have not completed speciation. While the most elegant way to account for this is to explicitly model protracted speciation (Etienne and Rosindell 2012), an expression or algorithm for the likelihood of this model given the phylogeny is not available when diversity dependence is acting. Hence, we followed Phillimore and Price (2008) by chopping off the last 0.22 million years (during which no branching occurs), and we repeated the analysis on this truncated phylogeny. Even though this is a rather crude method, we believe the obtained parameter estimates to be more realistic than the ones obtained without accounting for protracted speciation. For instance, applying our method to this truncated data set, that is, the data set that compensates for unrecognized species clearly yields higher extinction rates than when applied to the full data set where the extinction estimates are often close to 0 (Table 1).

We have to base our model comparison on the parameter estimates, because we cannot use AIC to compare the full and truncated data sets, as such a comparison is allowed only for different models on the same data set. To make a proper comparison of models on the same data set, one could interpret the analysis of the truncated data as applying to the full data set a model that assumes no speciation or extinction events occur in the last 0.22 million years. This model then performs better but has quite a complex structure, because it assumes one set of free parameters before 0.22 mya and another set of fixed parameters after 0.22 mya. The AIC is not well suited for penalizing this complexity for lack of parsimony.

Discussion

Central to the study of adaptive radiations is distinguishing chance variation from that which requires a more deterministic explanation (Chan and Moore 2002). Equal-rates models such as the constant-rate (diversity independent and diversity dependent) and shifting-rate birth-death models (i.e., models where the speciation and extinction rates are independent of the lineage at a given time; Rabosky 2006) assign identical probability to all topologies, including imbalanced ones. Observing a single imbalanced tree therefore does not constitute evidence against such a model (Slowinski and Guyer 1989a; McConway and Sims 2004). Still, a model that assigns a higher likelihood to imbalanced topologies will likely be preferred as indicated by likelihood ratio tests or information criteria after accounting for the number of parameters (McConway and Sims 2004). In this article, we have presented a model that does assign a higher likelihood to imbalanced trees at the cost of at least one but possibly four parameters. In our model, imbalance is caused by a key innovation (which we have broadly defined to include innovative traits as well as dispersal to a new environment) that decouples the diversity-dependent dynamics of a subclade from those of the main clade.

While we focused on decoupling of diversity-dependent dynamics in this article, we have presented a more general framework of adaptive radiations (Table 1) where diversity dependence plays a key role. First, diversity-dependent diversification explains the slowdown of diversification after an initial burst. Second, the initial burst can best be understood in the light of diversity dependence. When the burst is clade-wide, then a likely cause is a shift in carrying capacity, hence a temporary alleviation of the pressure of competing species. When the burst is subclade-specific, then without decoupling of diversity-dependent dynamics, the limit to diversity will crucially determine the outcome of competition for niches: species with elevated speciation rates or reduced extinction rates will be expected to dominate. With decoupling of diversity-dependent dynamics, a subset of species will be temporarily alleviated from competitive pressure. Thus, diversity dependence is a pervasive factor in adaptive radiation.

We have illustrated our method for detecting decoupling of diversity-dependent dynamics assuming that only a single key innovation occurs (i.e., only one subclade is treated differently). The method can be readily extended to detect more than one key innovation, similar to the way Alfaro et al. (2009) detect the locations of shifts in diversification rates, using the standard diversity-independent birth-death model (see also Sanderson and Donoghue 1996). However, the power of any method to find multiple key innovation events will be limited, unless the phylogeny is large, because each new key innovation event involves at least one (the key innovation time), most likely two (the key innovation time and a different K value for the subclade, e.g., KI1), and perhaps even four (e.g., KI4) additional parameters. Also, our results suggest that the data are not informative enough (e.g., the phylogeny is not large enough) to allow distinguishing clearly between the various key innovation models (KI1-KI4). Because the parameter estimates may vary substantially between these models, one should also be careful in interpreting the actual estimated parameter values (this can be more rigorously checked by computing confidence intervals or posterior probability distributions, which are both possible in our likelihood framework). Similarly, we expect that it will be difficult to distinguish between different models of diversity dependence (e.g., a power-law vs. linear decline of speciation rate with diversity) and between decoupling of diversity-dependent dynamics and subclade-specific shifts in speciation and extinction rates without decoupling. For example, successive radiations where one group of species replaces another can be explained by decoupling of diversity dependence due to antagonist extinction or by higher diversification rates in the new group due to a key innovation allowing them to overtake the old one without decoupling. An additional, more technical, caveat is that with many parameters, one may easily get stuck in a local likelihood optimum. Although we used various starting points, we do not rule out that one or more of the likelihood optima are not the global optima. Tables 2 and 3 show such local likelihood optima for the shifting-rate model.

Egan and Crandall (2008a) found no support for the hypothesis that a shift from mesic to xeric habitat between 3.21 and 2.02 mya acted as a key innovation. Our method does reveal a key innovation, but we find more support for a somewhat later key innovation, between 2.02 and 1.73 mya. The key innovation is therefore probably not associated with this shift from mesic to xeric conditions. Note that we have looked only at the phylogeny rather than at traits that are associated with the subclade that underwent the key innovation (Heard and Hauser 1995). Incorporating character evolution (see, e.g., Ree 2005; Maddison et al. 2007) into our model or, conversely, incorporating (decoupling of) diversity dependence in models of character evolution, would be an obvious, but mathematically challenging, next step (see Mahler et al. 2010 for an interesting attempt).

Rather than finding evidence for a key innovation due to a shift from mesic to xeric conditions, Egan and Crandall (2008a) detected an overall shift in diversification rates and attributed this to Quaternary climate changes, spurring the diversification of the subclade containing subgenera Disarticulatum and Pediomelum. They suggest that the glacial cycles coupled with the varied topography of the southwestern United States created many unique habitats that were ripe for the colonization of the ancestors of these two subgenera. Our analysis supports this suggestion.

It is still possible that a (climate-induced) clade-wide rate shift also occurred during the same time. Indeed, the phylogeny suggests that around 2 mya there is increased diversification in all lineages. Our method can be readily extended to detect clade-wide rate shifts and subclade-specific key innovations simultaneously.

Our model of adaptive radiation through key innovations does not require the assumption that speciation is intrinsically more rapid during adaptive radiation as proposed by Schluter (2000). Glor (2010) scrutinizes this assumption, because it lacks an established mechanistic explanation. Our model provides such a mechanistic explanation. Under most realistic parameter combinations, decoupling of diversity dependence automatically leads to exceptional diversification of the subclade. But if, for example, the carrying capacity of the subclade is substantially lower than that of the main clade, the intrinsic speciation rate and the extinction rate are similar to those of the main clade, and the main clade is still far from equilibrium, then we might not expect a strong asymmetry to arise due to the key innovation, and we would fail to detect it. Yet, a low carrying capacity of the subclade means that there is not much ecological space for the key innovation, so one may wonder whether this would still be considered a key innovation. The justification sought by Glor (2010) for extraordinary diversification to be a necessary ingredient for adaptive radiation comes from the implicit assumption that innovations should create opportunities. If they do not, one can perhaps hardly speak of ecological release. Thus, our model offers a plausible explanation of the problem identified by Losos (2010) that clades may fail to radiate although seemingly in the presence of ecological opportunity.

The property that speciation takes time, that is, that it is “protracted” (Rosindell et al. 2010), can in principle be incorporated in our model along the lines of Etienne and Rosindell (2012). As we explained above, we simply chopped of the last 0.22 million years in our analysis to mimic protracted speciation (Phillimore and Price 2008), because deriving a full likelihood of a phylogeny under both diversity dependence and protracted speciation remains a challenge. The analysis yielded more realistic (nonzero) extinction rates, suggesting that protracted speciation is an important factor to consider in future models of diversification.

There is evidence that, overall, diversity increases over time (see Wiens 2011 for some examples). This seems to conflict with evidence for ecological limits to diversification. Our model resolves this apparent conflict as follows: diversity increases by key innovations, but when key innovations are absent, diversification rates decrease due to diversity dependence (i.e., ecological limits). The end result is still a diversity increase but an irregular one. Of course, this picture is a bit of a caricature. In reality, innovations occur all the time, but some are (much) more influential than others and can be detected in phylogenies because they create many new opportunities causing a rapid radiation. Thus, our model reconciles the contradictory findings on the correlation between clade size and clade age. Clade size and clade age are expected to be correlated for clades where major key innovations are rampant. On average, older clades will have more such key innovations and thus are expected to hold more species. In contrast, clade size and clade age are largely uncorrelated for clades with no major key innovations, because the dominant process is diversity dependence, where ecological limits set the maximum clade size irrespective of clade age. Only for clades that are too young to experience diversity dependence, there may still be a noticeable correlation, even without major key innovations. In phylogenetic studies, often a group of relatively similar species is selected, for example, a genus, or perhaps Vincent and Brown’s (2005) G function (i.e., species with the same Bauplan), which causes a bias toward clades with few, if any, major key innovations. We might hypothesize that major key innovations occur between G functions but not within them.

In an attack on Rabosky’s (2009) ecological limits hypothesis, Wiens (2011) argues that ecological limits to diversification, if they exist at all, should not be seen as an alternative paradigm explaining diversity patterns but that the ecological limits hypothesis is simply part of the traditional idea that ecology influences evolution, and hence, ecological limits should affect diversification rates. This is exactly what our model does (as in fact does Rabosky’s model): the speciation rate is made dependent on clade diversity, but occasionally this dependence is avoided by key innovations.

In summary, we propose a new perspective on adaptive radiations in general and key innovations in particular, in the light of diversity-dependent diversification. Clade-specific key innovations result mostly from a decoupling of the diversity dependence of a focal subclade from the main clade, which may or may not be accompanied by subclade-specific shifts in intrinsic speciation and extinction rates. Clade-wide shifts in speciation or extinction rates, or in the clade-level carrying capacity are usually driven by external factors and offer new opportunities for all species alike, resulting in an adaptive radiation in the whole clade. We have shown how these scenarios can be translated mathematically to compute the likelihood, which can be used for statistical detection of clade-wide and subclade-specific parameter shifts and estimation of the time at which they took place. Diversity dependence is thus key to understanding adaptive radiations.

Acknowledgments

We thank A. Egan for making the phylogenetic data available on Dryad (doi:10.5061/dryad.sr927n43) and for helpful insights in the diversification of the Psoraleeae. Furthermore, we thank A. Phillimore, A. Pigot, H. Sims, and S. Rice for comments on the manuscript; A. Hendry for the suggestion on a link with G functions; and S. Brusatte, N. Verzelen, and the COCON group at the University of Groningen for discussion. R.S.E. was supported by a VIDI grant from the Netherlands Organisation for Scientific Research (NWO) and R.S.E. and B.H. by an exchange grant from the Van Gogh Program.

Notes

1. Code that appears in the American Naturalist is provided as a convenience to the readers. It has not necessarily been tested as part of the peer review.

Appendix A Computing the Probability of Survival of the Crown Lineages

The algorithm in box 2 gives the likelihood without requiring that the two crown lineages survive. To condition on their survival, as advocated since Nee et al. 1994, we follow the number of descendants of the two crown species from time tc to time tp in order to compute the probability that the number of descendants of both are positive. The key innovation event at time ti occurs in one of the descendant species. If the subclade S becomes extinct before the present, we require that both crown lineages survive in the main clade. If the subclade S survives until the present, we require that the other crown lineage survives in the main clade (the crown species with descendants having the key innovation may or may not have descendants without the key innovation until the present).

The survival probability PS of subclade S initiated at time ti can be computed as in the algorithm of box 2 (but obviously ignoring branching events). The survival probability of the crown lineages requires a separate treatment. Denote by the probability that at time t the first crown species has descendants, and the second crown species has descendants. The dynamical equation is a 2-dimensional version of (1),

2222

First, we integrate equation (A1) from tc to ti. Immediately before the key innovation event, the distribution has the symmetry

2222

for . The key innovation event occurs in the first group of species with probability and in the second group of species with probability . The probability that the key innovation occurs in the first group and that immediately after the key innovation event, the first group has species and the second group has species (where we do not count the species with the key innovation) is

2222

Note that this transformation breaks the symmetry (A2). We integrate equation (A1) from ti to tp with initial condition (A3). We are interested in the probability that the key innovation occurs in the first group, but the first group does not survive while the second group survives,

2222

and in the probability that the key innovation occurs in the first group and that both groups survive,

2222

Finally, the conditioning probability Pcond is given by

2222

where we have used the fact that and . The likelihood obtained in box 2 must be divided by Pcond to yield the likelihood of the phylogeny conditional on survival of the two crown lineages.

Appendix B Computing the Likelihood When There Are Missing Species

The algorithm in box 1 gives the likelihood for the diversity-dependent model without decoupling even when there are species missing from the phylogeny. The likelihood is simply element , q being the number of extant species in the phylogeny and m being the number of extant species missing from the phylogeny, of the vector of probabilities resulting from the integration of the ordinary differential equation divided by a combinatorial factor:

2222

This combinatorial factor is explained by Etienne et al. (2012). Here we explain how missing species can be dealt with in the model with decoupling of diversity dependence. The total likelihood for clades M and S combined is given by

2222

where is the likelihood of the main clade M, with missing species (see box 1; eq. [B2]),

2222

and, similarly, is the likelihood of the subclade S with the remaining missing species,

2222

and is the probability of out of m missing species being assigned to clade M and the remainder, , being assigned to clade S. Because all topologies have equal probability (Etienne et al. 2012), the probability of selecting out of species and out of species, where and are the number of species in clades M and S, respectively, is simply given by the hypergeometric distribution

2222

Substituting equations (B3), (B4), and (B5) into (B2) we find

2222

which is the final result of box 2.

Enhancements

Appendix A: Computing the Probability of Survival of the Crown Lineages

Appendix B: Computing the Likelihood When There Are Missing Species

Dryad data: http://dx.doi.org/10.5061/dryad.sr927n43

Pediomelum megalanthum var. megalanthum on Wire Mesa, southern Utah. Photograph by Ashley N. Egan.