I would like to present methodological, theoretical, and practical arguments which suggest that Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022, Reference Sackett, Zhang and Berry2023) calls for revisiting the role of GMA in personnel selection are premature.
Methodology: Too many subjective decisions
Methodological concerns related to Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022, Reference Sackett, Zhang and Berry2023) meta-analysis were already raised by Oh, Le, and Roth (Reference Oh, Le and Rothin press). I would like to concentrate here on the conceptual discussion of the two focal methodological decisions of Sackett et al., which, in my view, results in the lowering of the general mental ability (GMA) validity estimates. First, to not attempt a correction for range restriction and, second, to not control for job complexity.
Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022, Reference Sackett, Zhang and Berry2023) propose that range restriction is an issue mainly if the predictor in question was used in selecting the validation sample, and they argue that this would virtually never be the case in validation studies on GMA included in their meta-analysis, as this is unlikely that the same or similar GMA test scores used in validation procedures were also a part of the selection process. But, GMA was and still is considered one of the most important job performance predictors, as Kuncel, Ones, and Sackett (Reference Kuncel, Ones and Sackett2010, p. 333) notice: “Cognitive ability is the workhorse of employee selection.” So why it is unlikely that GMA tests of some sort were used as a basis of selection procedures in validation studies? This is unclear to me and not explained sufficiently. But even if we assume that GMA tests were never used as a part of selection procedures in studies analyzed by Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022, Reference Sackett, Zhang and Berry2023), this still does not mean that employees were not directly and indirectly selected based on GMA. It seems that Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) misinterpret the GMA test score with GMA as a construct itself, but it is not cognitive ability tests that predict performance but cognitive ability. It is important to notice that various GMA tests scores to a large extend reflect a common construct (Johnson et al., Reference Johnson, te Nijenhuis and Bouchard2008), which “involves the ability to reason, plan, solve problems, think abstractly, comprehend complex ideas, learn quickly and learn from experience” (Gottfredson, Reference Gottfredson1997a, p.13). This seems to be an ability developed in a time of the test, not the genetic potential or hereditary talent (see Schmidt, Reference Schmidt2002). Assuming that it is unlikely that employees are directly hired based on their GMA is as assuming that it is unlikely that during selection procedures the abilities to plan, solve problems, reason, and learn from experience are directly taken into account by employers, but if these are not the basis for selection in most jobs, then what is? I argue that because of GMA’s nature and importance in our lives (Brown et al., Reference Brown, Wai and Chabris2021), GMA as a construct is a vital criterion for selecting employees for jobs even if there is no formal GMA test in recruitment procedures. Jobs vary in cognitive complexity (https://www.onetonline.org/find/descriptor/browse/1.A), and job incumbents are not randomly assigned to jobs, but applicants take different jobs and occupy different positions based on the fit between their cognitive abilities and the complexity of the job (Gottfredson, Reference Gottfredson1997b). Individuals gravitate to jobs that are congruent with their cognitive ability (Judge et al., Reference Judge, Klinger and Simon2010; Wilk & Sackett, Reference Wilk and Sackett1996) thus if we test current employers in any occupation, this is a sample directly selected for the job based on their GMA, in such a way that GMA tends to fit tasks complexity level in a job (Gottfredson, Reference Gottfredson2002). Therefore GMA variation among given occupation incumbents is artificially lowered in comparison to a sample of applicants and even more in comparison to the general population because the selection on the job complexity directly reflected GMA.
Moreover, GMA is associated with numerous important occupational, educational, health, and social outcomes (Brown et al., Reference Brown, Wai and Chabris2021), so, there are many sources for indirect range restrictions. To mention three, GMA is reflected in education (Ritchie & Tucker-Drob, Reference Ritchie and Tucker-Drob2018), education might be a proxy for cognitive ability (Berry et al., Reference Berry, Gruys and Sackett2006), and education often is a job-selection criterion. Second, job interviews of various forms are another source of indirect range restriction on GMA, as these complex social interactions correlate with GMA (Roth & Huffcutt, Reference Roth and Huffcutt2013), thus selecting based on an interview we indirectly select on GMA, and in most selection procedures, some form of interview is present. Third, GMA is related to emotional intelligence as the ability to understand and then regulate emotions (Joseph & Newman, Reference Joseph and Newman2010), which might be vital in making a positive impression on recruiters and further employers in selection processes. Besides that, there might be even more sources of range restriction as GMA is related to many important life outcomes such as income, leadership, unemployment, and physical and mental health that all might play a role of criterion in personnel selection. Thus, on a conceptual level, I see the Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) decision to avoid corrections for range restriction as unjustified. It is good to remain a wise warning “Failure to take range restriction into account can dramatically distort research findings” (Sackett et al., Reference Sackett, Borneman and Connelly2008, p. 217).
Second, it is of note that Schmidt and Hunter’s (Reference Schmidt and Hunter1998) findings about GMA critiqued by Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) related to jobs with medium complexity, whereas Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) do not control for job complexity as a moderator. This is surprising, as job complexity has an important effect on GMA validity for job performance when GMA validity increases as job complexity increases (Gottfredson, Reference Gottfredson1997b, Reference Gottfredson2002). Thus merging jobs of different complexity might give a blurred picture of GMA’s practical validity.
One of the main suggestions of Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022), is the rejection of Schmidt and Hunter’s (Reference Schmidt and Hunter1998) conclusion about GMA role in predicting performance claiming that “Cognitive ability is no longer the stand-out predictor that it was in the prior work. Structured interviews emerged as the predictor with the highest mean validity” (Sackett et al., Reference Sackett, Zhang and Berry2023, p.5), and they asked: “So how did this happen?” My answer is this: They decide not to use correction for range restriction and do not control for job complexity. In my opinion, contrasting Schmidt and Hunter’s (Reference Schmidt and Hunter1998) meta-analytical estimate for GMA in medium complex jobs and with range restriction correction to Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) meta-analytical estimate without taking into account job complexity and with no range restrictions adjustments, is debatable as it is comparing apples and oranges.
Theory: Is meta-analysis a substitute for reasoning?
But even if we assume that the Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) meta-analysis is without any flaws, it is still important to note that Schmidt and Hunter (Reference Schmidt and Hunter1998, Reference Schmidt and Hunter2004) positioned cognitive abilities as the most important predictor of job performance based not solely on meta-analysis estimates but also on theoretical and practical reasoning. Interestingly in Schmidt and Hunter’s (Reference Schmidt and Hunter1998) meta-analysis the highest validity in predicting job performance was found for the work sample tests but not for GMA, and structured employment interviews have as high validity as the GMA test (this can be also seen in Sackett et al., Reference Sackett, Zhang, Berry and Lievens2022, table 3). Schmidt and Hunter (Reference Schmidt and Hunter1998) do not focus on GMA only due to high meta-analytical estimates but because all other personnel selection measures fall short in comparison to GMA when we consider a wider theoretical context. Thus the framing of Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) paper as an attempt to refute Schmidt and Hunter (Reference Schmidt and Hunter1998) based on comparing meta-analytical estimates seems to me like a straw man argument. Meta-analytical estimates are not a substitute for scientific reasoning and have no magical power to solve all controversies and pass a final verdict (Vrieze, Reference Vrieze2018). Schmidt and Hunter (Reference Schmidt and Hunter1998) suggest that GMA advantages come from many sources, not only from the fact that they have meta-analytical validity estimates higher by a few hundredths than other predictors. The GMA tests are valid, reliable, and have a clearly defined nomological network, whereas it is debatable even what is measured by some other personnel selection predictors. For example: What is measured by employment interviews: personality, GMA, integrity, knowledge, all of these or something else? Also, the evidence of GMA validity comes from many research but for other predictors, there is usually a relatively lower number of studies available, thus taking into account the crisis of reliability in psychological literature, their conclusions are less robust (e.g., total k = 884 and N = 88 894 for GMA vs k = 105 and N = 7864 for structured interviews, in Sackett et al., Reference Sackett, Zhang, Berry and Lievens2022, table 2). More importantly, Schmidt and Hunter (Reference Schmidt and Hunter1998, Reference Schmidt and Hunter2004) highlight that GMA predicts not only job performance but also job-related learning and training performance (discussion on this is missing in Sackett et al., Reference Sackett, Zhang, Berry and Lievens2022). Further, GMA not only predicts but has a causal impact on performance, people with higher GMA acquire more job knowledge and acquire it faster, thus doing their job better (Schmidt, Reference Schmidt2002; Schmidt, & Hunter, Reference Schmidt and Hunter2004). Whereas for many other predictors, mechanisms of its impact on performance are unknown or unclear; for example, what is the mechanism of which structural interviews influence job performance?
Practical and economical factors: A bird in the hand is worth two in the bush
Besides all the theoretical advantages that stem from more than 100 years of research on GMA that “amass a coherent body of empirical knowledge withstanding the test of time” (Lubinski, Reference Lubinski2004, p.96), the second line of cognitive abilities defences lies in practical and economical factors. In applied business settings, it is not only the validity of the predictor but the efficiency that counts—the validity of the procedure in relation to the costs of its application. The personnel selection method with perfect validity estimates is of no use to me if I cannot afford to use it, and for the GMA tests, the validity to cost ratio seems to be still very good and better than, for example, employment interviews. GMA measures are ready to use standardized psychometric solutions of common construct, with proof of validity and reliability, in contrast to interviews, which are an umbrella term encompassing many different types and procedures of interviews (not only structured and unstructured; Huffcutt et al., Reference Huffcutt, Culbertson and Weyhrauch2014). Sackett et al., highlight the practical implications of their findings, but as there is no one standard “structural interview,” then what practical advice can Maria, a manager of my neighborhood company, take from the conclusion, that structured interviews are valid predictors of job performance? Does this mean that simply arranging a set of random questions in a structuralized order is enough to predict performance? Possibly not. This means that Maria needs to invest time and money in developing valid, reliable, and fair employment interviews when on a shelf she has a ready-to-use GMA test. Then the GMA test might be applied to many jobs from entry to managerial level and from simple to complex, whereas structural interviews must be adapted to given positions and occupations and might be inappropriate in many selection contexts, for example when candidates lack relevant job knowledge. GMA tests might be also conducted even by inexperienced recruiters but interview often demands from interviewers not only substantial knowledge in the interview domain but also self-discipline and reflexivity about their prejudices to conduct them fairly, validly, and reliably. Despite all these efforts, it is not uncommon for two interviewers to come to different conclusions about the same applicant. In light of the many errors and cognitive biases (Kahneman, Reference Kahneman2012, in human judgment (including interviewers), the GMA test seems to be not only cheaper but also fairer compared to interviews. Let’s consider two applicants with the same abilities to reason, learn, and solve complex problems. During interviews structured or not, there is more room for subjective judgments and unfair evaluations of their abilities due to common cognitive biases (e.g., halo effect, attribution bias, confirmation bias) or bigotry (e.g., stereotyping, prejudices) compared to when we use objective, standardized tests. Furthermore, candidates' perception of their ability is often only loosely related to their actual ability level (Freund & Kasten, Reference Freund and Kasten2012). Thus, avoiding the GMA test might favor not those with higher ability but those with higher self-confidence. Moreover, despite the "bad press" that cognitive tests have received, there is still a lack of robust evidence that the GMA test underpredicts job performance for racial or ethnic subgroups (see e.g., Sackett, et al., Reference Sackett, Zhang and Berry2023), generally, applicants with similar GMA test scores have a similar level of job performance (Oh, Reference Oh2022). Structurization of the interview might do a good job of diminishing various biases but it does not reduce them all. In my view, the higher the probability of facing discrimination, the higher the probability that objective criteria created by GMA will provide more equal opportunities. This is important in the light of presumed adverse impacts mentioned by Sackett et al., because in this context we should also consider the costs and side effects of rejecting standardized GMA tests in personnel selection (Oh, Reference Oh2022).
Final thoughts
To sum it up, even in the absence of all methodological concerns to their meta-analysis, if Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) want to dethrone GMA and instead position structured interviews as the focal predictor in personnel selection they should provide a sound theoretical explanation and practical benefits for this move not only meta-analytical estimates, estimates that in my view, depend on too many researchers degrees of freedom. But as GMA is an important performance predictor, this does not mean that we should use only the GMA tests or that GMA is the only number that matters. It seems to me that on a practical level, Sackett et al. (Reference Sackett, Zhang, Berry and Lievens2022) results might be seen as confirming old truth, we should use composite measures consisting of cognitive and noncognitive predictors to increase validity and utility, and reduce adverse impact in personnel selection (Schmidt, Reference Schmidt2002). It is very difficult to predict performance in a job; thus, to meet the demands of real life, we need all hands on the deck, not academic fights whose predictor is the best.
Competing interest
The authors state that there is no direct financial conflict of interest. However potential readers of this paper must be aware that the author of this manuscript is an academic employee and his future career and therefore earnings depend on how much he will publish. Also, the author institution although it does not expect or encourage any particular findings, might provide authors with financial bonuses for publications in scientific journals.
Funding
There is no funding related to this research.