Impact statement
Whether extinction is random or selective is important for understanding the history of biodiversity and for better predicting the outcomes of anthropogenic extinction. Analysis of patterns of extinctions in morphospace can aid in understanding form and function interact with extinction processes in a selective way. Morphospaces are mathematical spaces constructed from variables that represent the form of organisms. If carefully constructed, the distribution of species in a morphospace summarizes their functional properties and ecological roles. The morphospace pattern of species that succumb to extinction can provide clues about the factors that make extinction more likely. This paper reviews strategies for analyzing extinctions in morphospace, explains some of the most common ways in which misinterpretations can arise from the mathematical properties of morphospace, and makes suggestions on how to avoid misinterpretations.
Introduction
Whether extinctions are random or selective remains an important question in ecology and evolutionary biology. The standing diversity of species at any time (t) and place (m) is a balance between the rate of extinction (μ) and the rate of origination (λ)such that mt = ae(μ-λ)t, where a is the standing diversity at an earlier time t = 0 (Raup, Reference Raup1985). Thus, the nature of extinction – constant or episodic, random or selective, ecologically intrinsic or driven by external processes – is key to understanding the processes that control biodiversity past and future (Yule, Reference Yule1925; MacArthur and Wilson, Reference MacArthur and Wilson1963; Raup and Sepkoski Jr, Reference Raup and Sepkoski1984; Raup, Reference Raup1994; McKinney, Reference McKinney1997; Droser et al., Reference Droser, Bottjer and Sheehan2000; Ciampaglio et al., Reference Ciampaglio, Kemp and McShea2001; Lyons et al., Reference Lyons, Smith and Brown2004; Koch and Barnosky, Reference Koch and Barnosky2006; Roy and Goldberg, Reference Roy and Goldberg2007; Jablonski, Reference Jablonski2008; Jackson, Reference Jackson2008; Lockwood, Reference Lockwood2008; Gill et al., Reference Gill, Williams, Jackson, Lininger and Robinson2009; Pereira et al., Reference Pereira, Leadley, Proença, Alkemade, Scharlemann, Fernandez-Manjarrés, Araújo, Balvanera, Biggs, Cheung and Chini2010; Alroy, Reference Alroy2015).
Morphospaces order species by their morphological traits in ways can be used to assess randomness or functional patterns by which taxa succumb to extinction. For example, morphospaces can help distinguish stochastic extinction from interspecific competition as in the Red Queen hypothesis (Van Valen, Reference Van Valen1973) from non-random extinction linked features associated with trophic level, body size, geographic range, dietary or locomotor specialization, phylogenetic relationship, or physiological tolerance (e.g., Buzas and Culver, Reference Buzas and Culver1984; McKinney, Reference McKinney1997; Jablonski, Reference Jablonski2008; Leighton and Schneider, Reference Leighton and Schneider2008; Lockwood, Reference Lockwood2008; Fritz and Purvis, Reference Fritz and Purvis2010; Payne et al., Reference Payne, Bush, Heim, Knope and McCauley2016). Indeed, analyses of morphospaces themselves can reveal evolutionary constraints, many-to-one functional mappings, and patterns of convergence that may themselves feed into extinction processes (e.g., Raup, Reference Raup1966; McGhee, Reference McGhee1999; Mitteroecker and Hetteger, Reference Mitteroecker and Hetteger2009; Hallgrímmson et al., Reference Hallgrímsson, Jamniczky, Young, Rolian, Schmidt-Ott and Marcucio2012; Chartier et al., Reference Chartier, Jabbour, Gerber, Mitteroecker, Sauquet, von Balthazar, Staedler, Crane and Schoenenberger2014; Gerber, Reference Gerber2014, Reference Gerber2017; Polly, Reference Polly2008, Reference Polly2017). The increasing ease of obtaining morphometric data has allowed the role of morphological specialization in extinction to be more widely studied (e.g., Johnson et al., Reference Johnson, Budd and Stemann1995; Hopkins, Reference Hopkins2013; Wilson, Reference Wilson2013; Grossnickle and Newham, Reference Grossnickle and Newham2016; Halliday and Goswami, Reference Halliday and Goswami2016; Hopkins and Gerber, Reference Hopkins, Gerber, Nuno de la Rosa and Müller2017; Sibert et al., Reference Sibert, Friedman, Hull, Hunt and Norris2018; Polly, Reference Polly2020; Bazzi et al., Reference Bazzi, Campione, Ahlberg, Blom and Kear2021; Ali et al., Reference Ali, Blonder, Pigot and Tobias2023).
Here I review the concept of morphospaces, ways of measuring patterns of extinction within morphospaces, pitfalls for interpreting patterns in high-dimensional morphospaces like those derived from geometric morphometrics, and remedies to avoid those pitfalls.
Morphospaces and the study of extinction
A morphospace is any mathematical space defined by morphological variables. The simplest morphospaces are univariate, but they can have any number of dimensions defined directly by variables like length, width, and height, by transformed variables like principal components axes, or by axes derived from pairwise distances as in principal coordinates spaces (Thompson, Reference Thompson1917; Blackith and Reyment, Reference Blackith and Reyment1971; Mardia et al., Reference Mardia, Kent and Bibby1979; Mitteroecker and Hetteger, Reference Mitteroecker and Hetteger2009; Chartier et al., Reference Chartier, Jabbour, Gerber, Mitteroecker, Sauquet, von Balthazar, Staedler, Crane and Schoenenberger2014). Geometric morphometric morphospaces can have dozens or even thousands of dimensions.
Morphospaces can be derived theoretically from principles of embryonic development, geometry, or functional properties, or they can be constructed empirically from a measured sample (McGhee, Reference McGhee1999). Raup’s logarithmic shell coiling equations are a classic example of theoretical morphospace that represents mantle-based ontogenetic shell accretion using four parameters (aperture shape, whorl expansion, aperture translation, and the distance of the aperture from the coiling axis) to define a space of all possible shell shapes (Raup and Michelson, Reference Raup and Michelson1965; Raup, Reference Raup1966). Most morphospaces, however, are derived from empirical data centered on the sample mean with unspecified limits of biologically plausible variation within their mathematically infinite bounds. Geometric morphometric morphospaces are empirical, as are multivariate spaces based on linear measurements or Fourier coefficients (e.g., Sokal, Reference Sokal1961; Rohlf, Reference Rohlf1986, Reference Rohlf1993). A simulated example of an empirical morphospace of brachiopods is shown in Figure 1. Rarely morphospace axes are based on categorical variables, such as Stebbins’ (Reference Stebbins1951) floral space or Thomas and Reif’s (Reference Thomas and Reif1993) skeleton space. As discussed below, the mathematical properties of these morphospaces are varied – not all have orthogonal axes, not all are Euclidean, and not all are linear transformations of one another even when they are constructed for the same objects. Perceived patterns of extinction can therefore depend in part on the choice of variables and ordination.
Regardless, morphospaces order – or ordinate – species such that their spatial positions indicate similarity and differences that can be used to detect patterns of randomness or selectivity. Generalized statistical models of morphospace occupation exist that balance trait evolution, speciation, and extinction (Slatkin, Reference Slatkin1981; Gavrilets, Reference Gavrilets1999; Pie and Weitz, Reference Pie and Weitz2005), as do studies of statistical issues associated with measuring morphospace occupation (Ciampaglio et al., Reference Ciampaglio, Kemp and McShea2001).
Extinction and morphospace distributions
By removing a subset of species, extinction transforms the distribution of taxa within morphospace, leaving a signature of the extinction process (Korn et al., Reference Korn, Hopkins and Walton2013; Ali et al., Reference Ali, Blonder, Pigot and Tobias2023). Random (non-selective extinction) will reduce the total number of taxa (Figures 1C, 2A) but should have a little statistical effect on the moments of their distribution within the space. Selective extinction, however, may change the moments of the distribution, especially the mean and variance (variance in morphospace is one measure of morphological disparity (Foote, Reference Foote1997)). If only certain kinds of specialized morphological outliers are more vulnerable (Figure 1D), both the disparity and the mean will be affected (Figure 2B), but if all morphologically distinctive species are likely to be culled, then the disparity will decrease but the mean will be unchanged (Figure 2C).
Non-selective extinctions in morphospace are expected not only under a truly random ‘field of bullets’ scenario (Raup, Reference Raup, Holland and Trendall1984), but also when the selective filter is unrelated to the variables that define the morphospace (Figure 1E), when the sample does not fully represent the range of selectivity of the extinction process (e.g., extinction differentially affects high trophic levels and only carnivores are included in the study), or under the Red Queen model in which all species are continually competing for limited resources and eventually lose (Van Valen, Reference Van Valen1973). Selective extinction can occur when highly derived and ecologically specialized morphologies at the peripheries of morphospace are susceptible, when one part of the morphospace represents adaptations to an environment that is hit by the extinction process, or similar scenarios. Contraction of niche space is an example cause of selective extinctions that would reduce morphological disparity (Valentine, Reference Valentine1995; Bush and Pruss, Reference Bush and Pruss2013). Geographic range size and niche breadth have been shown to be factors in selective extinction processes, but their connection to morphological traits (and thus morphospace) is indirect and varies from clade to clade (Jablonski, Reference Jablonski2008, Reference Jablonski2017; Harnik et al., Reference Harnik, Simpson and Payne2012; Huang et al., Reference Huang, Roy, Valentine and Jablonski2015; Saupe et al., Reference Saupe, Qiao, Hendricks, Portell, Hunter, Soberón and Lieberman2015).
As an example, Cole and Hopkins (Reference Cole and Hopkins2021) using morphospaces derived from discrete character data sets found that the Late Ordovician mass extinction of diplobathrid crinoids was random with respect to morphology and ecology, and that post-extinction recovery in this clade re-filled previously occupied regions of morphospace rather than exploring previously unrealized morphologies. The non-selectivity of the mass extinction was notably different from background extinctions in the same clade through the Paleozoic which selectively removed species with specific filtering strategies and habitats, especially those that were highly specialized (Baumiller, Reference Baumiller2003; Liow, Reference Liow2004). In contrast, Wilson (Reference Wilson2013) found using morphospaces derived from geometric morphometrics that the end-Cretaceous extinction was highly selective on the dietary specializations of mammals, preferentially removing larger-bodied taxa with carnivorous and specialized herbivorous diets suggesting that the extinctions were caused by depressed productivity in the aftermath of the asteroid.
Korn et al. (Reference Korn, Hopkins and Walton2013) used the expected changes in disparity and mean morphology to distinguish selectivity and asymmetry in extinction. They used standardized versions of the range of morphological disparity, its total variance, and the change in the position of the mean morphology (centroid) to define an “extinction space”. To take into account the correlation between range and variance metrics, they used principal components to reduce their space to a “selectivity” axis driven by change in disparity and an “asymmetry” axis driven by shifts in mean morphology (Figure 2A–C).
Ali et al. (Reference Ali, Blonder, Pigot and Tobias2023) pointed out that some extinction processes might selectively remove species that lie clustered within the overall distribution of taxa leaving a “hole” in the morphospace (Figure 2D). Disparity and mean morphology will usually be affected by such extinctions, but they will produce smaller changes on the selectivity and asymmetry axes of extinction space than when extinction differentially affects the periphery of morphospace. Nevertheless, selective extinction of morphologies on the interior of morphospace distributions could be just as easily produced as selective extinction at the edges by reductions of ecospace or culling of taxa at certain trophic levels.
Ali et al. (Reference Ali, Blonder, Pigot and Tobias2023) also argued that random extinction may reduce disparity when species are concentrated near the center of morphospace (e.g., if they are multivariate normal). They therefore argued that the density distribution of species in morphospace is also an important metric for assessing extinction selectivity. If measured as variance, disparity would drop in this scenario only when the risk of extinction is distributed uniformly across the morphospace (because more rare species at the periphery would be lost than in the dense central region), but not if each species had an equal probability of extinction (range-based disparity would decrease in either scenario). Note that expectations are contingent on the peculiarities of the mathematical properties of the morphospace, the complexity of the distribution of species within it, and sample size (e.g., Ciampaglio et al., Reference Ciampaglio, Kemp and McShea2001). Regardless, the density distribution of species within a multivariate morphospace is more complex than it might appear as discussed below. Clumpiness or evenness statistics can be used to determine whether extinctions are clustered within limited regions of morphospace, regardless of whether at the periphery or in the interior (e.g., Heip et al., Reference Heip, Herman and Soetaert1998; Tuomisto, Reference Tuomisto2012).
Potential pitfalls
Gerber (Reference Gerber2017) warned that the mapping between morphospaces derived from different quantifications of the same morphologies can be complex and nonintuitive. The topological relationships between alternative morphospace projections – the apparent relationships between taxa, the proportionality of their spacing, and their apparent location with respect to the center and periphery of the morphospace – is partly due to their morphology and partly due to the mathematical properties of the space. Careful attention should therefore be paid to how the morphospace is constructed and what can unambiguously be inferred from it.
Mitteroecker and Hetteger (Reference Mitteroecker and Hetteger2009) reviewed the mathematical and geometric properties of morphospaces constructed from several types of data and methods. Some are composed of many kinds of variables – linear caliper measurements, angles, volumes, areas, counts of structures, or matrices of meristic character states – that do not share a common unit of measurement. These spaces are referred to as non-metric because standard concepts of “direction” and “distance” are ambiguous: the scaling of one axis might be measured in radians and another in centimeters making the proportional relationship of the axes and the units of multivariate distance undefinable. Measures of evenness are likely to be quite ambiguous in nonmetric spaces.
The theoretical morphospace defined by Raup’s shell coiling parameters (Figure 3A; Raup, Reference Raup1966) is an example of a non-metric space whorl expansion, translation, and distance parameters are measured in different units making it difficult to equate a change in one direction relative to another. The morphospace defined by geometric morphometrics of the same shells, in contrast, is a metric space where a change in all directions is measured in the same Procrustes shape units (Figure 3B; Gerber, Reference Gerber2017; Polly, Reference Polly2017). In the non-metric space, the measured disparity is dependent on arbitrary scaling between the incommensurate axes, but in the metric space the measured disparity is more objective. Measuring the asymmetric component of an extinction in non-metric space is problematic because the magnitude of a shift in one direction cannot be compared to a shift in a different direction, but asymmetry in metric space is invariant to direction.
Some spaces are metric but non-Euclidean (i.e., curved, bounded, or non-parallel; Mitteroecker and Hetteger, Reference Mitteroecker and Hetteger2009). Constraints or covariances between variables can reduce the morphospace’s dimensionality causing it to be curved or otherwise non-linear analogous to the surface of a sphere. Geometric morphometric spaces, for example, are non-Euclidean Riemannian hyperspheres whose dimensionality is reduced because of the translation, rotation, and scaling steps of Procrustes superimposition (Kendall, Reference Kendall1984; Dryden and Mardia, Reference Dryden and Mardia1998). Distances and symmetry can be ambiguous in these spaces. In geometric morphometric morphospaces, shapes are identical at all of the spaces’ peripheral edges, which would cause an extreme asymmetrical extinction (Figure 2B) to look like a peripheral model (Figure 2C).
More than one morphospace can often be constructed for the same species, like the snail shells in Figure 2. In this example, the same shell shapes are described by non-metric Raup coiling parameters in Figure 3A and by metric semilandmarks in Figure 3B. Both morphospaces are valid representations, but the mapping between them is non-linear (because of the logarithmic component of Raup’s equations). The same pattern of extinction has different mathematical properties in each space: a uniformly random pattern in the Raup space would be nonuniform (i.e., selective) in the shape space and vice-versa; likewise, an extinction that culled the periphery of the Raup space would also cull the periphery in the shape space, but as a result the mean snail shell would change in shape space but not in Raup space. This seeming paradox is not a problem in the strict sense – it simply means that whatever the root cause of the extinction might be, it “sees” coiling geometry differently than overall shell shape – but the details are important for drawing interpretations about the extinction process from the observed pattern.
Multivariate morphospaces are common in studies of extinction, and they are usually constructed using either principal components analysis (PCA) from data consisting of metric variables or principal coordinates analysis (PCO) when the data are meristic or categorical. PCAs constructed from covariance matrices – appropriate when all of the variables have the same units like in geometric morphometrics – are simple rigid rotations of the data that maintain the original spacing and distance between the objects. In principle, PCA space is identical to the original variable space except for the coordinate system, but some consequences of the transformation may seem counter-intuitive when only two or three dimensions are visible. For more background on multivariate ordinations, including PCA, readers are referred to longer explanations in the literature (Tatsuoka, Reference Tatsuoka1988; Hammer and Harper, Reference Hammer and Harper2006; Legendre, Reference Legendre2012; Polly et al., Reference Polly, Lawing, Fabre and Goswami2013; Polly and Motz, Reference Polly and Motz2017). Properties like peripherality, position of the mean shape, and the distribution of species may differ markedly when the original variable space and PCA morphospace are viewed in typical two-dimensional projections.
Figure 4 shows the first two dimensions (X and Y) of a five-variable space, each with a flat distribution that creates a rectilinear morphospace for the 10,000 random points shown here (Figure 4A,B). Because these variables have the same mean and variance, each accounts for about one-fifth (20%) of the total variance. When rotated to principal component (PC) space, the distribution projected onto the first two PCs counterintuitively appears to be circular (Figure 4C) with a greater density of points near the center of the morphospace (Figure 4D). Because the original variables of this example are uncorrelated, each PC axis also explains about one-fifth (20%) of the total variance. Despite being a rigid rotation, the relative distribution of points in PC 1 and 2 space appears to have no correspondence to their relative positions in the original X and Y space (Figure 4E).
The apparent differences between the original and ordinated PCA morphospaces are optical illusions stemming from the way the multidimensional space is projected onto a plane. In the full dimensionality, the spacing between objects in the two spaces is identical (the PC space is a rigid rotation of the original variable space), but the axes of the first space are univariate, whereas the axes of the PC space are linear combinations of all five variables. By definition, the first PC axis is the axis of greatest variance in the original variable space, the second axis is orthogonal (at right angles) to the first and drawn through the next greatest axis of variance, and so on (Hotelling, Reference Hotelling1933). The scattered points on the periphery of the PC space are those that have consistently high or low values on each of the original variables. The greatest Euclidean distance between two points in the original five-dimensional space is 22.67 (e.g., between a point lying at −10.0 on all five axes from one at 10.0 on all five) and it is the chance sampling of points that lie at distances close to this that define the principal component structure of this data set. Only a small number of points are peripheral on all five variables, even though the frequency of points at the extreme (e.g., 10) is identical to the frequency of points precisely at the center (0) on any single variable. Most points that fall at an extreme on one variable fall between extremes on the other four. Consequently, the distribution of points projected onto the PC axes is denser at the center than at the periphery and points that appear to be on the periphery in a space defined by two of the original variables may lie in the center of the PC 1 and 2 space. The PC distribution appears circular simply by chance sampling; if 500,000 points had been used instead of 10,000 one would see that the edges of the PC distribution are actually straight, forming a polygon with four to ten sides depending on the orientation of the major axes of the sample, essentially the “shadow” of a five-dimensional hypercube cast in one direction or another.
The consequences of these transformational illusions are profound if an analysis of morphospace is conducted on only a subset of axes instead of the distribution in the fully multivariate space. First, a finite sample measured in any subspace of a principal component morphospace (e.g., on the first two PC axes) is quite likely to have a higher density in the center regardless of whether the data actually have a flat or multinormal distribution. Tests for density or evenness should therefore be performed on all axes, not just the first few PCs. Second, measures of disparity and asymmetry also need to be assessed with all the axes, not just a couple. Figure 5 shows what three models of selective extinction with respect to the original variables look like projected onto the first two PCs and vice versa. What is actually a selective extinction of extremes on two of the five variables that reduce disparity appears to be randomly distributed in the space defined by the first two PCs with no change in overall disparity, but one that is actually peripheral on the first two PCs also appears to be peripheral (but fuzzier) in the original variable space (Figure 5A,B). Extinctions that are actually localized in the central part of either morphospace may appear to be asymmetrically peripheral in its counterpart space (Figure 5E,F). In all cases, the seeming contradiction between the distribution of the same extinction in the two spaces is an illusion caused by viewing a multivariate distribution in just two dimensions.
Conclusion
Because of the growing ease and speed with which morphological data can be collected (e.g., Boyer et al., Reference Boyer, Puente, Gladman, Glynn, Mukherjee, Yapuncich and Daubechies2015; Riley et al., Reference Riley, Woodard, Hwang and Punyasena2015; Elder et al., Reference Elder, Hsiang, Nelson, Strotz, Kahanamoku and Hull2018; Goswami et al., Reference Goswami, Noirault, Coombs, Clavel, Fabre, Halliday, Churchill, Curtis, Watanabe, Simmons and Beatty2022), it is increasingly feasible to study processes of extinction using the lens of its selectivity within morphospace. From the distribution of extinctions in a morphospace it may be possible to infer the causal links between extinction processes, environments, and organismal function that will lead to a better understanding of what differentiates background extinctions from the escalating events that produce of mass extinctions.
Morphospaces can have mathematical ambiguities that may confound interpretations, however, including their metric properties, non-linear mappings between morphospaces represented by different sets of variables, and the distorting effects of dimension reduction techniques. Many of potential pitfalls can be easily circumvented. Several authors have made recommendations how to avoid the potentially misleading consequences of ignoring the full dimensionality of morphospace (Bookstein, Reference Bookstein2013, Reference Bookstein2016; Polly et al., Reference Polly, Lawing, Fabre and Goswami2013; Goolsby, Reference Goolsby2015; Uyeda et al., Reference Uyeda, Caetano and Pennell2015; Polly and Motz, Reference Polly and Motz2017; Adams and Collyer, Reference Adams and Collyer2018; Cardini et al., Reference Cardini, O’Higgins and Rohlf2019). Disparity, asymmetry, and evenness statistics performed multivariately on all five dimensions of the space should produce identical results if calculated on the original variables or the PC scores; fully multivariate tests will therefore get around most problems.
Arguably, interpretations about extinction processes are most effectively framed in terms of biological or ecological significance of the specific variables that define the space rather than on the general pattern – some variables may be relevant to the selective extinction process, others may be correlated with the relevant variables, and yet other variables may be random (uncorrelated) with respect to those that are. Focusing on the functional roles of the morphology will also circumvent many of the mathematical ambiguities between different projections of shape space described above. For example, Hebdon et al. (Reference Hebdon, Polly, Peterman and Ritterbush2022) recently used performance spaces, in which multivariate shape is regressed onto independent measures of functional performance to estimate functional gradients in the shape space (Polly et al., Reference Polly, Stayton, Dumont, Pierce, Rayfield and Angielczyk2016), to study the selectivity of the Triassic–Jurassic extinction relative to swimming performance and life strategies in ammonites. If carried out multivariately, such an approach would come to the same conclusion about whether the extinction was selective relative to morphological function regardless of which projection of morphospace was used, or even whether a Raup space or geometric morphometric had been used.
Open peer review
To view the open peer review materials for this article, please visit http://doi.org/10.1017/ext.2023.16.
Data availability statement
No original data are presented in this paper. Simulated data were generated with code that is publicly distributed (Polly, Reference Polly2022a, Reference Polly2022b, Reference Polly2023).
Acknowledgements
Thanks to John Alroy and Barry Brook for suggesting I write this paper, to Elizabeth Housworth for answers about mathematical spaces, to Alycia Stigall for advice about brachiopod morphology, and to two anonymous reviewers, Jonathan Payne, Wolfgang Kiessling, and Laetitia Black for editorial support.
Author contribution
P. David Polly is the sole author of this paper.
Competing interest
The author declares no competing interests exist.
Comments
Dear editors,
This review article is being submitted based on invitation from John Alroy and Barry Brook to contribute something on the subject of extinction and morphspace occupation. In it I review current approaches for using morphospace to study extinction, I discuss several possible complications to interpretations that may arise from the mathematical properties of morphospaces.
With best wishes,
David