Hostname: page-component-cd9895bd7-fscjk Total loading time: 0 Render date: 2024-12-28T04:14:46.018Z Has data issue: false hasContentIssue false

The providential randomisation of genotypes

Published online by Cambridge University Press:  11 September 2023

Jean-Baptiste Pingault
Affiliation:
Department of Clinical, Educational and Health Psychology, University College London, London, UK j.pingault@ucl.ac.uk www.jeanbaptistepingault.com p.fearon@ucl.ac.uk https://www.cfr.cam.ac.uk/staff/professor-pasco-fearon e.viding@ucl.ac.uk https://www.ucl.ac.uk/pals/people/essi-viding
Pasco Fearon
Affiliation:
Department of Clinical, Educational and Health Psychology, University College London, London, UK j.pingault@ucl.ac.uk www.jeanbaptistepingault.com p.fearon@ucl.ac.uk https://www.cfr.cam.ac.uk/staff/professor-pasco-fearon e.viding@ucl.ac.uk https://www.ucl.ac.uk/pals/people/essi-viding
Essi Viding
Affiliation:
Department of Clinical, Educational and Health Psychology, University College London, London, UK j.pingault@ucl.ac.uk www.jeanbaptistepingault.com p.fearon@ucl.ac.uk https://www.cfr.cam.ac.uk/staff/professor-pasco-fearon e.viding@ucl.ac.uk https://www.ucl.ac.uk/pals/people/essi-viding
Neil Davies
Affiliation:
Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK Neil.Davies@bristol.ac.uk https://research-information.bris.ac.uk/en/persons/neil-m-davies marcus.munafo@bristol.ac.uk https://research-information.bris.ac.uk/en/persons/marcus-r-munafo KZ.Davey-Smith@bristol.ac.uk https://www.bristol.ac.uk/people/person/George-Davey%20Smith-285dce3f-4498-4e97-82de-250a865b4483/ Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Norwegian University of Science and Technology, Trondheim, Torgarden, Norway
Marcus R. Munafò
Affiliation:
Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK Neil.Davies@bristol.ac.uk https://research-information.bris.ac.uk/en/persons/neil-m-davies marcus.munafo@bristol.ac.uk https://research-information.bris.ac.uk/en/persons/marcus-r-munafo KZ.Davey-Smith@bristol.ac.uk https://www.bristol.ac.uk/people/person/George-Davey%20Smith-285dce3f-4498-4e97-82de-250a865b4483/
George Davey Smith
Affiliation:
Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK Neil.Davies@bristol.ac.uk https://research-information.bris.ac.uk/en/persons/neil-m-davies marcus.munafo@bristol.ac.uk https://research-information.bris.ac.uk/en/persons/marcus-r-munafo KZ.Davey-Smith@bristol.ac.uk https://www.bristol.ac.uk/people/person/George-Davey%20Smith-285dce3f-4498-4e97-82de-250a865b4483/

Abstract

When building causal knowledge in behavioural genetics, the natural randomisation of genotypes at conception (approximately analogous to the artificial randomisation occurring in randomised controlled trials) facilitates the discovery of genetic causes. More importantly, the randomisation of genetic material within families also enables a better identification of (environmental) risk factors and aetiological pathways to diseases and behaviours.

Type
Open Peer Commentary
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press

Madole & Harden draw parallels between randomised controlled trials (RCTs) and within-family genetic association designs to elaborate on the notion of genetic causation, that is, whether genes cause behaviours and how to interpret genetic causes. The article is a thoughtful introduction to these topics. Our comment focuses on the core feature shared by both designs, that is, randomisation. First, we discuss how to best capitalise on natural randomisation to help build causal knowledge. Second, despite randomisation being a core feature of both, we caution against drawing too literal parallels between the two designs.

The parallel drawn by the authors was made explicitly by Fisher who established a direct filiation between the (artificially) randomised design he theorised and the (natural) randomisation of genetic material at conception, in his words: “the factorial method of experimentation, now of lively concern so far afield as the psychologists, or the industrial chemists, derives its structure and its name, from the simultaneous inheritance of Mendelian factors.… Genetics is indeed in a peculiarly favoured condition in that Providence has shielded the geneticist from many of the difficulties of a reliably controlled comparison. The different genotypes possible from the same mating have been beautifully randomised by the meiotic process” (Fisher, Reference Fisher1952). As highlighted by the authors, this randomisation within families can help to establish genetic causation. Importantly, genetic causation is always indirect and happens entirely through (molecular and environmental) phenotypes. As such, rather than focusing on genetic causation, perhaps the greatest opportunity for building causal knowledge in behavioural genetics lies in leveraging the beautifully randomised process mentioned by Fisher to understand phenotypic causation in general. To that end, a method called Mendelian randomisation uses genetic variants as instrumental variables to assess phenotypic causation and identify (potentially modifiable) risk factors (Davey Smith & Ebrahim, Reference Davey Smith and Ebrahim2003; Richmond & Davey Smith, Reference Richmond and Davey Smith2022; Sanderson et al., Reference Sanderson, Glymour, Holmes, Kang, Morrison, Munafò and Davey Smith2022). Many Mendelian randomisation studies have focused on complex traits, including a within-family Mendelian randomisation study of the impact of educational attainment on physical health and mortality (Howe et al., Reference Howe, Rasheed, Jones, Boomsma, Evans, Giannelis and Davies2022b). In addition, the authors rightly state that identifying a causal genetic variant often does not, per se, provide insights into causal pathways leading to behaviours. However, novel methods are rapidly developing that leverage genetic variants to understand causal pathways and mechanisms underlying complex phenotypes. Many of these methods directly harness the randomisation of genetic material at conception. For example, recent methods extend Mendelian randomisation to systematically investigate the aetiological role played by gene expression or DNA methylation (Hannon et al., Reference Hannon, Gorrie-Stone, Smart, Burrage, Hughes, Bao and Mill2018; Porcu et al., Reference Porcu, Sjaarda, Lepik, Carmeli, Darrous, Sulc and Kutalik2021). Genetically informed methods for causal inference aiming to identify (environmental) risk factors and biological pathways have been reviewed extensively elsewhere (Davey Smith, Richmond, & Pingault, Reference Davey Smith, Richmond and Pingault2021; Pingault, Richmond, & Davey Smith, Reference Pingault, Richmond and Davey Smith2022).

Critically, the analogy drawn between within-family genetic association studies and RCTs needs to be used with care. In an RCT, the treatment should be well defined (e.g., a given dose of a drug) and can be administered to individuals. By contrast, generally, a genetic variant cannot be administered or modified during the life course. Thus, while RCTs can provide actionable evidence of a specific intervention's efficacy, a within-family genetic association only indicates the effect of inheriting one variant or another. The difference in timing is also essential as inheriting a genetic variant at conception leads to a lifelong exposure as opposed to a time-bound treatment.

Furthermore, the authors argue that genetic causes relevant to behavioural genetics are analogous to causes uncovered by RCTs in that they are shallow – non-unitary (no single isolable cause), non-uniform (people exposed have heterogeneous outcomes), and non-explanatory (not mechanistically informative). However, this analogy is strained for both monogenic and polygenic disorders. A gene implicated in a monogenic disorder such as cystic fibrosis is close to a well-defined and actionable treatment: Although the gene is not, in itself, a treatment, it can potentially be targeted by gene editing techniques such as CRISPR-Cas9 to correct deleterious variants (note that even for monogenic disorders, many deleterious variants can be involved and lead to different phenotypic manifestations) (Jinek et al., Reference Jinek, Chylinski, Fonfara, Hauer, Doudna and Charpentier2012). Successful human clinical trials using gene editing techniques for monogenic disorders are emerging (Frangoul et al., Reference Frangoul, Altshuler, Cappellini, Chen, Domm, Eustace and Corbacioglu2021). Many rare developmental disorders share a similar genetic architecture, some with relevance to behaviour. For example, adrenoleucodystrophy is a monogenic developmental disorder, which may first manifest with behavioural and cognitive difficulties and can be lethal (Zhu et al., Reference Zhu, Eichler, Biffi, Duncan, Williams and Majzoub2020). Even for polygenic disorders such as schizophrenia, some rare variants considerably increase the risk of disease, with odds ratios up to 50 (Singh et al., Reference Singh, Poterba, Curtis, Akil, Al Eissa, Barchas and Daly2022). As such, genetic causes relevant to behavioural genetics need not be shallow. Variants underlying monogenic disorders or high-risk rare variants may be better conceived as deep causes, that is, close to unitary, uniform, and explanatory.

Conversely, polygenic influences of small effects underlying complex behaviours can indeed be conceived as shallow causes. However, in this case, the “treatment” is not well defined (in content or timing), cannot be refined or changed to increase mechanistic insight and is not directly actionable. Even if extensions of techniques such as CRISPR-Cas9 could theoretically target hundreds of genetic variants at once, this could never be a treatment strategy given unknown and potentially devastating side effects (like a drug RCT consisting of the simultaneous administration of hundreds of compounds). In sum, causes established by RCTs and genetic causes derived from within-family association studies do not necessarily share many features beyond the core concept of randomisation. Further discussion of the notion of cause in genetics and the parallels between RCTs and genetically informed methods such as Mendelian randomisation are available elsewhere (Lynch, Reference Lynch2021; Nitsch et al., Reference Nitsch, Molokhia, Smeeth, DeStavola, Whittaker and Leon2006).

In conclusion, we agree that behavioural genetics should look to provide causal knowledge. To that end, perhaps the most useful will be exploiting genetic data to understand phenotypic causation and aetiological pathways. Genetically informed designs for causal inference and, in particular, within-family designs, can play a key role in improving aetiological understanding and, ultimately, prevention and treatment (Howe et al., Reference Howe, Nivard, Morris, Hansen, Rasheed, Cho and Davies2022a; Hwang, Davies, Warrington, & Evans, Reference Hwang, Davies, Warrington and Evans2021; Pingault et al., Reference Pingault, O'Reilly, Schoeler, Ploubidis, Rijsdijk and Dudbridge2018).

Financial support

This research received no specific grant from any funding agency, commercial, or not-for-profit sectors.

Competing interest

None.

References

Davey Smith, G., Richmond, R., & Pingault, J.-B. (Eds.) (2021). Combining human genetics and causal inference to understand human disease and development. Cold Spring Harbor Laboratory Press.Google Scholar
Davey Smith, G. D., & Ebrahim, S. (2003). “Mendelian randomization”: Can genetic epidemiology contribute to understanding environmental determinants of disease? International Journal of Epidemiology, 32(1), 122.CrossRefGoogle Scholar
Fisher, R. (1952). Statistical methods in genetics. Heredity, 6, 112 (reprinted in 2010, Int. J. Epidemiol., 39: 329–335). https://doi.org/10.1093/ije/dyp379CrossRefGoogle Scholar
Frangoul, H., Altshuler, D., Cappellini, M. D., Chen, Y.-S., Domm, J., Eustace, B. K., … Corbacioglu, S. (2021). CRISPR-Cas9 gene editing for sickle cell disease and β-thalassemia. New England Journal of Medicine, 384(3), 252260. https://doi.org/10.1056/NEJMoa2031054CrossRefGoogle ScholarPubMed
Hannon, E., Gorrie-Stone, T. J., Smart, M. C., Burrage, J., Hughes, A., Bao, Y., … Mill, J. (2018). Leveraging DNA-methylation quantitative-trait loci to characterize the relationship between methylomic variation, gene expression, and complex traits. American Journal of Human Genetics, 103(5), 654665. https://doi.org/10.1016/j.ajhg.2018.09.007CrossRefGoogle ScholarPubMed
Howe, L. J., Nivard, M. G., Morris, T. T., Hansen, A. F., Rasheed, H., Cho, Y., … Davies, N. M. (2022a). Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nature Genetics, 54(5), 581592. https://doi.org/10.1038/s41588-022-01062-7CrossRefGoogle ScholarPubMed
Howe, L. J., Rasheed, H., Jones, P. R., Boomsma, D. I., Evans, D. M., Giannelis, A., … Davies, N. M. (2022b). Educational attainment, health outcomes and mortality: A within-sibship Mendelian randomization study. medRxiv, 2022.01.11.22268884. https://doi.org/10.1101/2022.01.11.22268884Google Scholar
Hwang, L.-D., Davies, N. M., Warrington, N. M., & Evans, D. M. (2021). Integrating family-based and Mendelian randomization designs. Cold Spring Harbor Perspectives in Medicine, 11(3), a039503. https://doi.org/10.1101/cshperspect.a039503CrossRefGoogle ScholarPubMed
Jinek, M., Chylinski, K., Fonfara, I., Hauer, M., Doudna, J. A., & Charpentier, E. (2012). A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science (New York, N.Y.), 337(6096), 816821. https://doi.org/10.1126/science.1225829CrossRefGoogle ScholarPubMed
Lynch, K. E. (2021). The meaning of “cause” in genetics. Cold Spring Harbor Perspectives in Medicine, 11(9), a040519. https://doi.org/10.1101/cshperspect.a040519CrossRefGoogle Scholar
Nitsch, D., Molokhia, M., Smeeth, L., DeStavola, B. L., Whittaker, J. C., & Leon, D. A. (2006). Limits to causal inference based on Mendelian randomization: A comparison with randomized controlled trials. American Journal of Epidemiology, 163(5), 397403. https://doi.org/10.1093/aje/kwj062CrossRefGoogle ScholarPubMed
Pingault, J.-B., O'Reilly, P. F., Schoeler, T., Ploubidis, G. B., Rijsdijk, F., & Dudbridge, F. (2018). Using genetic data to strengthen causal inference in observational research. Nature Reviews Genetics, 19, 566580.CrossRefGoogle ScholarPubMed
Pingault, J.-B., Richmond, R., & Davey Smith, G. (2022). Causal inference with genetic data: Past, present, and future. Cold Spring Harbor Perspectives in Medicine, 12(3), a041271. https://doi.org/10.1101/cshperspect.a041271CrossRefGoogle ScholarPubMed
Porcu, E., Sjaarda, J., Lepik, K., Carmeli, C., Darrous, L., Sulc, J., … Kutalik, Z. (2021). Causal inference methods to integrate omics and complex traits. Cold Spring Harbor Perspectives in Medicine, 11(5), a040493. https://doi.org/10.1101/cshperspect.a040493CrossRefGoogle ScholarPubMed
Richmond, R. C., & Davey Smith, G. (2022). Mendelian randomization: Concepts and scope. Cold Spring Harbor Perspectives in Medicine, 12(1), a040501. https://doi.org/10.1101/cshperspect.a040501CrossRefGoogle ScholarPubMed
Sanderson, E., Glymour, M. M., Holmes, M. V., Kang, H., Morrison, J., Munafò, M. R., … Davey Smith, G. (2022). Mendelian randomization. Nature Reviews Methods Primers, 2(1), 121. https://doi.org/10.1038/s43586-021-00092-5CrossRefGoogle ScholarPubMed
Singh, T., Poterba, T., Curtis, D., Akil, H., Al Eissa, M., Barchas, J. D., … Daly, M. J. (2022). Rare coding variants in ten genes confer substantial risk for schizophrenia. Nature, 604(7906), 509516. https://doi.org/10.1038/s41586-022-04556-wCrossRefGoogle ScholarPubMed
Zhu, J., Eichler, F., Biffi, A., Duncan, C. N., Williams, D. A., & Majzoub, J. A. (2020). The changing face of adrenoleukodystrophy. Endocrine Reviews, 41(4), 577. https://doi.org/10.1210/endrev/bnaa013CrossRefGoogle ScholarPubMed