We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Various in-vitro (induced pluripotent stem cells–derived) and in-vivo genetic models (animal models like Caenorhabditis elegans, Drosophila melanogaster, zebrafish, rodents, and non-human primates) have been used to study movement disorders such as Parkinson’s disease, hereditary ataxia, Huntington’s disease, dystonia, and essential tremor. These genetic models have provided important clues on the underlying pathophysiologic mechanisms of these diseases and serve as useful platforms to unravel potential therapeutic targets. The next generation of genetic models is promising with the advancement of gene-editing techniques, such as CRISPR-Cas9, brain organoid technology, and identification of novel genes and loci from large-scale genetic studies will facilitate development of new genetic models.
This chapter describes techniques related to genetics, at both a molecular and an organismal level. The introduction explains the principles of base pairing and single nucleotide polymorphisms. The molecular techniques and figures explained include transgene construct schematics, Manhattan plots generated from genome-wide association studies, and different analysis methods for studying the microbiome. Organismal-level techniques described include family trees and dendrograms.
Edited by
Allan Young, Institute of Psychiatry, King's College London,Marsal Sanches, Baylor College of Medicine, Texas,Jair C. Soares, McGovern Medical School, The University of Texas,Mario Juruena, King's College London
Mood disorders, including bipolar disorder (BD) and major depressive disorder (MDD), are known to have a significant genetic component based on familial and twin studies. Tremendous efforts from the scientific community and technical advancements have led to the discovery of multiple genes associated with the heritability of these disorders over the last years. Nonetheless, our knowledge of the exact genetic basis of BD and MDD is still fairly limited. Recent genome-wide association studies with massive sample sizes have started to characterize the polygenicity of these disorders, although future studies have yet to explore how genetic variants may interact with the environment to modulate one’s risk of disease. As our understanding of the genetics of mood disorders increases (with increasing sample sizes, a more significant shift from candidate gene studies to microarray and sequencing strategies, and integration of findings with environmental measures), many clinical opportunities may arise. This may include the future use of polygenic risk scores for risk assessment, predicting response to medications based on genotype, among others.
Seed dormancy is an important trait associated with pre-sprouting and malting quality in barley. Genome-wide association studies (GWASs) have been used to detect quantitative trait loci (QTLs) underlying complex traits in major crops. In the present study, we collected 295 barley (Hordeum vulgare L.) accessions from Australia, Europe, Canada and China. A total of 25,179 single nucleotide polymorphism (SNP)/diversity arrays technology sequence markers were used for population structure, linkage disequilibrium and GWAS analysis. Candidate genes within QTL regions were investigated, and their expression levels were analysed using RNAseq data. Five QTLs for seed dormancy were identified. One QTL was mapped on chromosome 1H, and one QTL was mapped on chromosome 4H, while three QTLs were located on chromosome 5H. This is the first report of a QTL on the short arm of chromosome 5H in barley. Molecular markers linked to the QTL can be used for marker-assisted selection in barley breeding programmes.
Previous studies have revealed an association between dietary factors and atopic dermatitis (AD). To explore whether there was a causal relationship between diet and AD, we performed Mendelian randomisation (MR) analysis. The dataset of twenty-one dietary factors was obtained from UK Biobank. The dataset for AD was obtained from the publicly available FinnGen consortium. The main research method was the inverse-variance weighting method, which was supplemented by MR‒Egger, weighted median and weighted mode. In addition, sensitivity analysis was performed to ensure the accuracy of the results. The study revealed that beef intake (OR = 0·351; 95 % CI 0·145, 0·847; P = 0·020) and white bread intake (OR = 0·141; 95 % CI 0·030, 0·656; P = 0·012) may be protective factors against AD. There were no causal relationships between AD and any other dietary intake factors. Sensitivity analysis showed that our results were reliable, and no heterogeneity or pleiotropy was found. Therefore, we believe that beef intake may be associated with a reduced risk of AD. Although white bread was significant in the IVW analysis, there was large uncertainty in the results given the wide 95 % CI. Other factors were not associated with AD in this study.
We examine some of the genetic features of neuroticism (N) taking as an animal model the Maudsley Reactive (MR) and Maudsley Nonreactive (MNR) rat strains which were selectively bred, respectively, for high and low open-field defecation (OFD) starting in the late 1950s. To draw analogies with human genetic studies, we explore the genetic correlation of N with irritable bowel syndrome (IBS). We review progress with the rat model and developments in the field of human complex trait genetics, including genetic association studies that relate to current understanding of the genetics of N. The widespread differences in the tone of the peripheral sympathetic nervous system that have been found between the Maudsley strains, particularly those observed in the colon, may underly the differences in OFD (MNR, higher sympathetic tone and zero defecation). In humans, a large genome-wide association study (GWAS) reported six genes contributing to IBS, four of which were implicated in mood and anxiety disorders or were expressed in the brain, with three of the four also expressed in the nerve fibers and ganglia of the gut. Heritability of N is estimated at around 50% in twin and family studies, and GWASs identified hundreds of loci, enabling estimation of genome-wide correlations (rg) with other traits. Significantly, the estimate for rg between risk of IBS, anxiety, N, and depression was >0.5 and suggested genetic pleiotropy without evidence for causal mechanisms. Findings on the adrenergic pharmacology of the colon, coupled with new understanding of the role of the locus ceruleus in modifying afferent information from this organ, generate hypotheses that challenge traditional cause/effect notions about the relationship of the central nervous system to peripheral events in response to stress, suggest specific targets for gene action in the Maudsley model and emphasize the value of reciprocal evaluation of genetic architecture underlying N in rodents and humans.
Iron (Fe) is an essential element for all organisms. Fe deficiency can limit plant production and cause anaemia in humans. The improvement of Fe homoeostasis could resolve both problems. Fe homoeostasis in Aegilops tauschii, the D genome donor of bread wheat, is not fully understood. Here, we analysed physiological traits in 42 accessions of Ae. tauschii associated with Fe homoeostasis, i.e. mugineic acid family phytosiderophores (MAs), phenylamides, SPAD values and metal concentrations. All traits showed diversity, suggesting the presence of candidate genes in the Ae. tauschii accessions, which could improve Fe homoeostasis in bread wheat. All accessions mainly produced and secreted mainly 2′-deoxymugineic acid among MAs, but eight of them secreted unknown products from their roots under Fe deficiency. It was revealed that 15 kinds of phenylamides and 2 kinds of bread wheat phytoalexins were produced in Fe-deficient roots of Ae. tauschii. Multivariate and principal component analyses showed that chlorophyll content was correlated with shoot Fe concentration. Genome-wide association study analysis associated several genomic markers with the variations in each trait analysed. Our results suggest that Ae. tauschii has alleles that could improve Fe homoeostasis to generate Fe-deficiency-tolerant or Fe-biofortified bread wheat.
Over the years, numerous observational studies have substantiated that various dietary choices have opposing effects on CVD. However, the causal effect has not yet been established. Thus, we conducted a Mendelian randomisation (MR) analysis to reveal the causal impact of dietary habits on CVD. Genetic variants strongly associated with 20 dietary habits were selected from publicly available genome-wide association studies conducted on the UK Biobank cohort (n 449 210). Summary-level data on CVD were obtained from different consortia (n 159 836–977 323). The inverse-variance weighted method (IVW) was the primary outcome, while MR-Egger, weighted median and MR Pleiotropy RESidual Sum and Outlier were used to assess heterogeneity and pleiotropy. We found compelling evidence of a protective causal effect of genetic predisposition towards cheese consumption on myocardial infarction (IVW OR = 0·67; 95 % CI = 0·544, 0·826; P = 1·784 × 10−4) and heart failure (IVW OR = 0·646; 95 % CI = 0·513, 0·814; P = 2·135 × 10−4). Poultry intake was found to be a detrimental factor for hypertension (IVW OR = 4·306; 95 % CI = 2·158, 8·589; P = 3·416 × 10−5), while dried fruit intake was protective against hypertension (IVW OR = 0·473; 95 % CI = 0·348, 0·642; P = 1·683 × 10−6). Importantly, no evidence of pleiotropy was detected. MR estimates provide robust evidence for a causal relationship between genetic predisposition to 20 dietary habits and CVD risk, suggesting that well-planned diets may help prevent and reduce the risk of CVD.
Edited by
Deepak Cyril D'Souza, Staff Psychiatrist, VA Connecticut Healthcare System; Professor of Psychiatry, Yale University School of Medicine,David Castle, University of Tasmania, Australia,Sir Robin Murray, Honorary Consultant Psychiatrist, Psychosis Service at the South London and Maudsley NHS Trust; Professor of Psychiatric Research at the Institute of Psychiatry
Cannabis use and cannabis use disorder (CUD) are heritable (~50%) complex traits closely linked to multiple neuropsychiatric syndromes. The largest genome-wide association studies have begun to identify variants that contribute to this heritability. These discoveries have started yielding answers to longstanding questions in the fields of mental health and substance use related to comorbidity and causal influence of cannabis use and CUD on other neuropsychiatric syndromes. The genetics of cannabis use appears to relate positively to psychosocial outcomes while CUD genetics map more closely to lower educational and socio-economic characteristics. The genetics of cannabis use and CUD considerably overlap with that of other substance use disorders and are being elucidated through very large genome-wide association studies. Similarly, the relationships between cannabis use, psychosis, depression, anxiety, externalizing syndromes and neurodevelopment are also being uncovered using novel genetic methods. In this chapter, we review these exciting advances in the light of pre-existing evidence from twin and family studies.
Severe infections and psychiatric disorders have a large impact on both society and the individual. Studies investigating these conditions and the links between them are therefore important. Most past studies have focused on binary phenotypes of particular infections or overall infection, thereby losing some information regarding susceptibility to infection as reflected in the number of specific infection types, or sites, which we term infection load. In this study we found that infection load was associated with increased risk for attention-deficit/hyperactivity disorder, autism spectrum disorder, bipolar disorder, depression, schizophrenia and overall psychiatric diagnosis. We obtained a modest but significant heritability for infection load (h2 = 0.0221), and a high degree of genetic correlation between it and overall psychiatric diagnosis (rg = 0.4298). We also found evidence supporting a genetic causality for overall infection on overall psychiatric diagnosis. Our genome-wide association study for infection load identified 138 suggestive associations. Our study provides further evidence for genetic links between susceptibility to infection and psychiatric disorders, and suggests that a higher infection load may have a cumulative association with psychiatric disorders, beyond what has been described for individual infections.
To identify risk genes whose expression are regulated by the reported risk variants and to explore the potential regulatory mechanism in schizophrenia (SCZ).
Methods
We systematically integrated three independent brain expression quantitative traits (eQTLs) (CommonMind, GTEx, and BrainSeq Phase 2, a total of 1039 individuals) and GWAS data (56 418 cases and 78 818 controls), with the use of transcriptome-wide association study (TWAS). Diffusion magnetic resonance imaging was utilized to quantify the integrity of white matter bundles and determine whether polygenic risk of novel genes linked to brain structure was present in patients with first-episode antipsychotic SCZ.
Results
TWAS showed that eight risk genes (CORO7, DDAH2, DDHD2, ELAC2, GLT8D1, PCDHA8, THOC7, and TYW5) reached transcriptome-wide significance (TWS) level. These findings were confirmed by an independent integrative approach (i.e. Sherlock). We further conducted conditional analyses and identified the potential risk genes that driven the TWAS association signal in each locus. Gene expression analysis showed that several TWS genes (including CORO7, DDAH2, DDHD2, ELAC2, GLT8D1, THOC7 and TYW5) were dysregulated in the dorsolateral prefrontal cortex of SCZ cases compared with controls. TWS genes were mainly expressed on the surface of glutamatergic neurons, GABAergic neurons, and microglia. Finally, SCZ cases had a substantially greater TWS genes-based polygenic risk (PRS) compared to controls, and we showed that fractional anisotropy of the cingulum-hippocampus mediates the influence of TWS genes PRS on SCZ.
Conclusions
Our findings identified novel SCZ risk genes and highlighted the importance of the TWS genes in frontal-limbic dysfunctions in SCZ, indicating possible therapeutic targets.
Drug development is essential to the advancement of human health, however, the process is slow, costly, and at high risk of failure at all stages. A promising strategy for expediting and improving the probability of success in the drug development process is the use of naturally randomized human genetic variation for drug target identification and validation. These data can be harnessed using the Mendelian randomization (MR) analytic paradigm to proxy the lifelong consequences of genetic perturbations of drug targets. In this review, we discuss the myriad applications of the MR paradigm for human drug target identification and validation. We review the methodology and applications of MR, key limitations of MR, and potential future opportunities for research. Throughout the review, we refer to illustrative examples of MR analyses investigating the consequences of genetic inhibition of interleukin 6 signaling which, in some cases, have anticipated results from randomized controlled trials. As human genetic data become more widely available, we predict that MR will serve as a key pillar of support for drug development efforts.
Given psychotic illnesses' high heritability and associations with brain structure, numerous neuroimaging-genetics findings have been reported in the last two decades. However, few findings have been replicated. In the present independent sample we aimed to replicate any psychosis-implicated SNPs (single nucleotide polymorphisms), which had previously shown at least two main effects on brain volume.
Methods
A systematic review for SNPs showing a replicated effect on brain volume yielded 25 studies implicating seven SNPs in five genes. Their effect was then tested in 113 subjects with either schizophrenia, bipolar disorder, ‘at risk mental state’ or healthy state, for whole-brain and region-of-interest (ROI) associations with grey and white matter volume changes, using voxel-based morphometry.
Results
We found FWER-corrected (Family-wise error rate) (i.e. statistically significant) associations of: (1) CACNA1C-rs769087-A with larger bilateral hippocampus and thalamus white matter, across the whole brain; and (2) CACNA1C-rs769087-A with larger superior frontal gyrus, as ROI. Higher replication concordance with existing literature was found, in decreasing order, for: (1) CACNA1C-rs769087-A, with larger dorsolateral-prefrontal/superior frontal gyrus and hippocampi (both with anatomical and directional concordance); (2) ZNF804A-rs11681373-A, with smaller angular gyrus grey matter and rectus gyri white matter (both with anatomical and directional concordance); and (3) BDNF-rs6265-T with superior frontal and middle cingulate gyri volume change (with anatomical and allelic concordance).
Conclusions
Most literature findings were not herein replicated. Nevertheless, high degree/likelihood of replication was found for two genome-wide association studies- and one candidate-implicated SNPs, supporting their involvement in psychosis and brain structure.
One reason behind the failure to understand the neurobiological background of affective disorders and develop more effective treatments is their heterogeneity warranting identification of clinically meaningful endophenotypes. Affective temperaments, considered subclinical manifestations and pathoplastic contributors of affective illnesses may constitute such endophenotypes. 775 general population subjects were phenotyped for affective temperaments using TEMPS-A, and genotyped using Illumina’s CoreExom PsychChip yielding 573141 variants. Primary SNP-based association tests were calculated using linear regression models assuming an additive genetic effect with the first 10 calculated principal components, gender, age, and other affective temperaments as covariates; a Bonferroni-corrected genome-wide significance threshold set at p≤5.0×10−8, and suggestive significance threshold set at p≤1.0x10-5. SNP-level relevances were aggregated to gene-level with the PEGASUS method, with a Bonferroni-corrected significance level set at 2.0×10-6, and suggestive significance thrshold set at p≤4.0×10-4. Functional effects of most significant SNPs as reported in public open databases based on expression quantitative trait loci (eQTL) and 3D-chromatin interactions were explored using FUMA v1.3.5. In SNP-based tests a novel genome-wide significant variant, rs3798978 (p=4.44x10-8) and several other suggestively significant SNPs in ADGRB3 were found for anxious temperament along with suggestively significant SNPs for the other four affective temperaments. In gene-based tests suggestively significant findings emerged for all five temperaments. Functional analysis suggested that several of the identified variants influence gene-expression levels or participate in chromatin interactions in brain areas implicated in affective disorders. In the next step these findings should be investigated in patient samples, and in other models of affective disorders and related phenotypes.
Depression is a chronic, recurrent mental disorder with a moderate level of genetic impact. Modern GWAS of depression require extra-large sample sizes and new effective, clinically sensitive, objective and simple to fill online phenotyping tools for population studies are necessary today.
Objectives
Aim: to test online phenotyping tools based on clinical and psychometric instruments to evaluate depressive symptoms in population cohort for using in GWAS
Methods
Participants: 2610 Russian-speaking respondents- clients of Genotek Ltd., provider of genetic testing services in Russian Federation. The online survey included HADS-D (Hospital Anxiety and Depression Scale - depression subscale), original questionnaire adapted for self-report from major depression DSM-5 criteria, questions about sex and age. Three research phenotypes were defined: quantitative “HADS-D score” and two categorical “HADS-D depression” (cut-off 8 points) - 20.63 %, “DSM depression” 16.73 %. DNA samples obtained from saliva were genotyped on Illumina Infinium GSA v1.0/v2.0/v3.0 microarrays. GWAS analysis was performed independently for each of the research phenotypes.
Results
None of the signals reached genome-wide significance (p value 10-8), but some signals with subthreshold significance were identified including four signals in the genes encoding proteins: “DSM_Depression”: rs2131596 in GRIP1 (p=3.682e-06,β=0.6965), rs11158021 in SAMD4A (p=2.841e-06, β=0.6762), “HADS-D Depression”: rs2425793 in CDH22 (p=4.408e-07, β=1.539), rs36006890 in PDIA6 (p = 6.529e-07, β=1.549).
Conclusions
Preliminary results of the first GWAS of depression symptoms in the Russian population are acceptable and confirm the accuracy of the research strategy using online phenotyping tools based on clinical and psychometric instruments and provide basis for further studies.
Disclosure
The study was supported by Russian Science Foundation Grant # 20-15-00132
The sociogenomics revolution is upon us, we are told. Whether revolutionary or not, sociogenomics is poised to flourish given the ease of incorporating polygenic scores (or PGSs) as “genetic propensities” for complex traits into social science research. Pointing to evidence of ubiquitous heritability and the accessibility of genetic data, scholars have argued that social scientists not only have an opportunity but a duty to add PGSs to social science research. Social science research that ignores genetics is, some proponents argue, at best partial and likely scientifically flawed, misleading, and wasteful. Here, I challenge arguments about the value of genetics for social science and with it the claimed necessity of incorporating PGSs into social science models as measures of genetic influences. In so doing, I discuss the impracticability of distinguishing genetic influences from environmental influences because of non-causal gene–environment correlations, especially population stratification, familial confounding, and downward causation. I explain how environmental effects masquerade as genetic influences in PGSs, which undermines their raison d’être as measures of genetic propensity, especially for complex socially contingent behaviors that are the subject of sociogenomics. Additionally, I draw attention to the partial, unknown biology, while highlighting the persistence of an implicit, unavoidable reductionist genes versus environments approach. Leaving sociopolitical and ethical concerns aside, I argue that the potential scientific rewards of adding PGSs to social science are few and greatly overstated and the scientific costs, which include obscuring structural disadvantages and cultural influences, outweigh these meager benefits for most social science applications.
Genetic studies of complex traits often show disparities in estimated heritability depending on the method used, whether by genomic associations or twin and family studies. We present a simulation of individual genomes with dynamic environmental conditions to consider how linear and nonlinear effects, gene-by-environment interactions, and gene-by-environment correlations may work together to govern the long-term development of complex traits and affect estimates of heritability from common methods. Our simulation studies demonstrate that the genetic effects estimated by genome wide association studies in unrelated individuals are inadequate to characterize gene-by-environment interaction, while including related individuals in genome-wide complex trait analysis (GCTA) allows gene-by-environment interactions to be recovered in the heritability. These theoretical findings provide an explanation for the “missing heritability” problem and bridge the conceptual gap between the most common findings of GCTA and twin studies. Future studies may use the simulation model to test hypotheses about phenotypic complexity either in an exploratory way or by replicating well-established observations of specific phenotypes.
Women experience major depression and post-traumatic stress disorder (PTSD) approximately twice as often as men. Estrogen is thought to contribute to sex differences in these disorders, and reduced estrogen is also known to be a key driver of menopause symptoms such as hot flashes. Moreover, estrogen is used to treat menopause symptoms. In order to test for potential shared genetic influences between menopause symptoms and psychiatric disorders, we conducted a genome-wide association study (GWAS) of estrogen medication use (as a proxy for menopause symptoms) in the UK Biobank.
Methods
The analysis included 232 993 women aged 39–71 in the UK Biobank. The outcome variable for genetic analyses was estrogen medication use, excluding women using hormonal contraceptives. Trans-ancestry GWAS meta-analyses were conducted along with genetic correlation analyses on the European ancestry GWAS results. Hormone usage was also tested for association with depression and PTSD.
Results
GWAS of estrogen medication use (compared to non-use) identified a locus in the TACR3 gene, which was previously linked to hot flashes in menopause [top rs77322567, odds ratio (OR) = 0.78, p = 7.7 × 10−15]. Genetic correlation analyses revealed shared genetic influences on menopause symptoms and depression (rg = 0.231, s.e.= 0.055, p = 2.8 × 10−5). Non-genetic analyses revealed higher psychiatric symptoms scores among women using estrogen medications.
Conclusions
These results suggest that menopause symptoms have a complex genetic etiology which is partially shared with genetic influences on depression. Moreover, the TACR3 gene identified here has direct clinical relevance; antagonists for the neurokinin 3 receptor (coded for by TACR3) are effective treatments for hot flashes.
Intelligence is inversely associated with schizophrenia (SCZ) and bipolar disorder (BD); it remains unclear whether low intelligence is a cause or consequence. We investigated causal associations of intelligence with SCZ or BD risk and a shared risk between SCZ and BD and SCZ-specific risk.
Methods
To estimate putative causal associations, we performed multi-single nucleotide polymorphism (SNP) Mendelian randomization (MR) using generalized summary-data-based MR (GSMR). Summary-level datasets from five GWASs (intelligence, SCZ vs. control [CON], BD vs. CON, SCZ + BD vs. CON, and SCZ vs. BD; sample sizes of up to 269,867) were utilized.
Results
A strong bidirectional association between risks for SCZ and BD was observed (odds ratio; ORSCZ → BD = 1.47, p = 2.89 × 10−41, ORBD → SCZ = 1.44, p = 1.85 × 10−52). Low intelligence was bidirectionally associated with a high risk for SCZ, with a stronger effect of intelligence on SCZ risk (ORlower intelligence → SCZ = 1.62, p = 3.23 × 10−14) than the reverse (ORSCZ → lower intelligence = 1.06, p = 3.70 × 10−23). Furthermore, low intelligence affected a shared risk between SCZ and BD (OR lower intelligence → SCZ + BD = 1.23, p = 3.41 × 10−5) and SCZ-specific risk (ORlower intelligence → SCZvsBD = 1.64, p = 9.72 × 10−10); the shared risk (ORSCZ + BD → lower intelligence = 1.04, p = 3.09 × 10−14) but not SCZ-specific risk (ORSCZvsBD → lower intelligence = 1.00, p = 0.88) weakly affected low intelligence. Conversely, there was no significant causal association between intelligence and BD risk (p > 0.05).
Conclusions
These findings support observational studies showing that patients with SCZ display impairment in premorbid intelligence and intelligence decline. Moreover, a shared factor between SCZ and BD might contribute to impairment in premorbid intelligence and intelligence decline but SCZ-specific factors might be affected by impairment in premorbid intelligence. We suggest that patients with these genetic factors should be categorized as having a cognitive disorder SCZ or BD subtype.
Peanut shell plays key roles in protecting the seed from diseases and pest infestation but also in the processing of peanut and is an important byproduct of peanut production. Most studies on peanut shell have focused on the utilization of its chemical applications, but the genetic basis of shell-related traits is largely unknown. A panel of 320 peanut (Arachis hypogaea) accessions including var. hypogaea, var. vulgaris, var. fastigiata and var. hirsuta was used to study the genetic basis of two physical and five microstructure-related traits in peanut shell. Significant phenotypic differences were revealed among the accessions of var. hypogaea, var. hirsuta, var. vulgaris and var. fastigiata for mechanical strength, thickness, three sclerenchymatous layer projections and main cell shape of the sclerenchymatous layer. We identified 10 significant single nucleotide polymorphisms (SNPs) through genome-wide association study (P < 5.0 × 10−6) combining the shell-related traits and high-quality SNPs. In total, 192 genes were located in physical proximity to the significantly associated SNPs, and 11 candidate genes were predicted related to their potential contribution to the development and structure of the peanut shell. All SNPs were detected on the B genome demonstrating the biased contribution of the B genome for the phenotypical make-up of peanut. Exploring the newly identified candidate genes will provide insight into the molecular pathways that regulate peanut shell-related traits and provide valuable information for molecular marker-assisted breeding of an improved peanut shell.