
Individual Differences in Personality Masculinity-Femininity: Examining the Effects of Genes, Environment, and Prenatal Hormone Transfer

Published online by Cambridge University Press:  07 March 2016

Karin J. H. Verweij*
Affiliation:
Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
Miriam A. Mosing
Affiliation:
Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden; Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Fredrik Ullén
Affiliation:
Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
Guy Madison
Affiliation:
Department of Psychology, Umeå University, Umeå, Sweden
*
Address for correspondence: Karin Verweij, Department of Neuroscience, Karolinska Institutet, Retzius v 8, SE-171 77 Stockholm, Sweden. E-mail: karin.verweij@ki.se/karin.verweij@vu.nl

Abstract

Males and females score differently on some personality traits, but the underlying etiology of these differences is not well understood. This study examined genetic, environmental, and prenatal hormonal influences on individual differences in personality masculinity–femininity (M-F). We used Big-Five personality inventory data from 9,520 Swedish twins (aged 27 to 54) to create a bipolar M-F personality scale. Using biometrical twin modeling, we estimated the influence of genetic and environmental factors on individual differences in the M-F personality score. Furthermore, we tested whether prenatal hormone transfer may influence individuals’ M-F scores by comparing the scores of twins with a same-sex co-twin versus those with an opposite-sex co-twin. On average, males scored 1.09 standard deviations higher than females on the created M-F scale. Around a third of the variation in M-F personality score was attributable to genetic factors, while family environmental factors had no influence. Males and females from opposite-sex pairs scored significantly more masculine (both approximately 0.1 SD) than those from same-sex pairs. In conclusion, genetic influences explain part of the individual differences in personality M-F, and hormone transfer from the male to the female twin during pregnancy may increase the level of masculinization in females. Additional well-powered studies are needed to clarify this association and determine the underlying mechanisms in both sexes.

Type
Articles
Copyright
Copyright © The Author(s) 2016 

Personality has been defined as ‘those characteristics of the person that account for consistent patterns of feeling, thinking, and behaving’ (Pervin et al., 2005, p. 6). Human personality research focuses predominantly on dimensions of variation between individuals that predict behavior in various situations and domains of life and that are relatively stable over time. One of the most influential models used to describe human personality is the Five-Factor Model, which structures personality traits into five broad factors: Agreeableness, Conscientiousness, Extraversion, Openness to Experience, and Neuroticism (Costa & McCrae, 1985; Digman, 1990). Previous research has shown that males and females score differently on some of these and other personality scales, but sex differences are generally small to medium; in a cross-national meta-analysis of effect sizes in gender differences in the Big Five facets, the sex differences ranged between d = 0.02 and 0.56 (Feingold, 1994). Females tend to score higher on traits associated with anxiety, neuroticism, extraversion, agreeableness, and warmth, whereas males generally score higher on assertiveness and self-esteem, as summarized by various literature reviews and meta-analyses (Costa et al., 2001; Feingold, 1994; Gentile et al., 2009; Kling et al., 1999; Lippa, 2010).

Although there are well-established mean sex differences in these particular traits, there is no consensus on whether sex differences in personality arise from different environmental influences, such as social and cultural expectations regarding males' and females' behavior, or from underlying biological differences between the sexes. An important socio-cultural explanation is the social role model (Eagly, 1987; Eagly & Wood, 1991), which proposes that most differences between the sexes result from the adoption of gender roles that arise from different expectations for men's and women's social roles, particularly in relation to family and occupation. The social role model would therefore predict cultural variation in personality sex differences across countries with more or less traditional sex roles. However, the social role model is contradicted by the finding that sex differences are consistent across 53 nations (Lippa, 2010) and that the sex differences appear more pronounced in Western countries (Costa et al., 2001), in which differences in traditional sex roles are generally minimized and there is more formal and economic equality between the sexes.

Other socio-cultural explanations of sex differences are the expectancy model and the artifact model. The expectancy model (Deaux & Major, 1987) suggests that sex differences in behavior are partly the result of stereotype-based expectations of perceivers that result in self-fulfilling prophecies. However, self-fulfilling prophecy effects are very small, with meta-analytical estimates at around 0.10 (Jussim, 2012). The artifact model, finally, proposes that social desirability may cause males and females to endorse gender-appropriate personality scores (Feingold, 1990, 1991, 1992). According to this model, males and females place different values on the importance of certain personality traits and these differences differentially bias self-reports of personality characteristics. As such, according to this model, sex differences in personality scores reflect differences in social desirability responding rather than real differences in personality.

In contrast to socio-cultural models, biological and evolutionary psychology theories propose that sex differences in personality are partly due to innate temperamental differences that can be expected for all behaviors for which males and females have faced different adaptive pressures over the evolutionary past (Buss, 1995; Buss & Hawley, 2010; Geary, 2010; Trivers, 1972). Sex differences in personality traits could be explained by the parental investment theory (Trivers, 1972). This theory predicts that women have been under selection to invest more time and resources in their offspring, which makes them choosier in mate selection and more cautious and careful in social relations than males. In turn, males have faced greater pressure to compete for and attract mates, which favors assertiveness, aggressiveness, and competitiveness. Thus, personality traits linked to these behaviors have come under sex-specific selection, and this may explain why females are more agreeable, warm and nurturing, and less assertive and competitive than males. Such a common underlying selection pressure for male-female sex differences would suggest a common mechanism for their implementation, even if they manifest themselves in several personality dimensions.

In this perspective, sex differences in personality could be due to hormonal influences. Testosterone levels are 2–4 times higher in the amniotic fluid of male versus female fetuses (Auyeung et al., 2009) and during puberty levels are 20- to 30-fold higher in males than females (Fechner, 2003). Prenatal exposure to high levels of testosterone has been found to cause masculinization of activity and occupational interests (Berenbaum & Beltz, 2011), and circulating testosterone levels during adolescence seem to play a role in aggression (e.g., Pajer et al., 2006).

One approach by which prenatal hormonal exposure can be examined is by studying twin pairs. It has been hypothesized that hormone transfer from one twin to the other during pregnancy may influence the level of masculinization (Miller, 1994); that is, individuals with a male co-twin may develop more masculine behaviors than individuals with a female co-twin and the other way around. Two mechanisms of hormone transfer have been proposed: hormones from one twin could be transferred to the other twin either through maternal circulation (Miller, 1994) or directly from one twin to the other through diffusion across fetal membranes (Even & vom Saal, 1992). Note that research has predominantly focused on testosterone transfer, because this is believed to be the most potent androgen and has shown strong effects in non-human species (Ryan & Vandenbergh, 2002), while other hormones remain understudied in this context. Hence, studies examining the effect of hormone transfer on males are limited.

While in humans relatively consistent evidence has been found supporting the hormone transfer hypothesis for perception and cognition (especially for females), support for the influence of hormone transfer on personality is inconsistent (for a review, see Tapp et al., 2011). For example, Resnick et al. (1993) and Slutske et al. (2011) found that females with a male co-twin scored on average higher on sensation seeking than females with a female co-twin. Other studies, however, did not find evidence for prenatal hormone transfer effects on behavioral traits, including personality/temperament (Cohen-Bendahan et al., 2005; Loehlin & Martin, 2000) and sex-typed childhood play (Henderson & Berenbaum, 1997; Rodgers et al., 1998).

Berenbaum and Beltz (2011) reviewed the literature of different types of studies on the influence of prenatal and postnatal sex-hormone exposure on behavior. They concluded that prenatal exposure to high levels of androgens is associated with masculinization of occupational and activity interests, sexual orientation, and spatial abilities, whereas evidence for an influence of postnatal hormone exposure is not as strong. However, they did not find much evidence of hormone influences on sex differences in personality traits and social behaviors (Berenbaum & Beltz, 2011).

Because of the confounding effects of genetic and societal influences, it is hard to identify the source of the between-sex differences in personality. However, we can determine the source(s) of within-sex variation in the sex-differentiating dimension of personality by using a genetically informative sample, and we can determine whether the same or different genes or environmental factors influence level of masculinization in males versus females. Accordingly, the aim of the current study was to examine the etiology of individual differences in personality M-F. To this end, we created an M-F personality scale by performing a discriminant analysis on the 44-item Big-Five personality inventory on 9,520 Swedish twin individuals. Because of our large sample we were able to use all single personality items in one discriminant analysis without the risk of overfitting, which can result from a high ratio of predictors to participants. Using single item scores instead of the overarching dimensions is preferable, as aggregating at the level of the dimensions can mute or even annihilate sex differences as an effect of component traits cancelling each other out (Del Giudice et al., 2012). For example, extraversion loads on ‘warmth’, which is higher in females, as well as ‘dominance’, which is higher in males. Accurate assessment of sex differences therefore requires analysis at the primary trait level or lower (Del Giudice et al., 2012).

We applied biometrical twin modeling to estimate the influence of genetic and environmental factors on individual differences in the derived M-F score. Based on previous twin studies using various personality inventories and indices of M-F, we expected the M-F personality score to be moderately heritable (previous heritability estimates were in the range of 30–60%; Lippa & Hershberger, 1999; Loehlin & Martin, 2000; Loehlin et al., 2005), similar to the heritability of personality itself (Johnson et al., 2008). However, previous studies had substantially smaller sample sizes (Lippa & Hershberger, 1999; Mitchell et al., 1989) or used less ideal methods to create the personality M-F scales (Loehlin & Martin, 2000; Loehlin et al., 2005). The large sample size enabled us to estimate the variance components separately for males and females, and provided power to detect non-additive genetic effects.

Second, we examined the potential influence of prenatal hormone transfer on individuals’ M-F personality scores by comparing the scores of twins with a same-sex co-twin with the scores of twins with an opposite-sex co-twin. If hormone transfer plays a role, we would expect twins with a male co-twin to score more masculine than twins with a female co-twin.

Methods

Participants

Between 2012 and 2013 a large cohort of approximately 32,000 Swedish twins born between 1959 and 1985 (the STAGE Cohort, see Lichtenstein et al., 2006) was invited to complete a web-based survey designed to collect data on music-related traits. In total, 11,543 twins participated in the web survey, and their age was between 27 and 54 (mean = 40.7; SD = 7.8). However, due to missing data and unknown zygosity, the effective sample used for this study is smaller, as described in the Results section. Zygosity determination was based on a questionnaire about intra-pair resemblance. In the Swedish Twin Registry (STR), this method has been confirmed in 27% of the twins using genotyping and showed an accuracy of more than 98%. For further information on this survey, the STAGE cohort, or zygosity determination in the STR, see Lichtenstein et al. (2002), Lichtenstein et al. (2006) and Mosing et al. (2014). All participants gave informed consent and the study received approval from the Regional Ethics Review Board in Stockholm (Diary Numbers 2011/570-31/5, 2011/1425-31, and 2012/1107/32).

Measures

As part of the web survey, participants filled out the Swedish translation of the Big Five Inventory, a 44-item self-report inventory measuring the Big-Five dimensions of personality, that is, Openness, Conscientiousness, Extraversion, Agreeableness, and Neuroticism (BFI; John et al., 1991; John et al., 2008). The participants were asked to indicate on a 5-point Likert scale (ranging from disagree strongly to agree strongly) to what extent certain characteristics applied to them. The BFI is a commonly used personality questionnaire and previous research has shown that the 44-item version has a test-retest reliability of more than 0.80 and an acceptable external validity of 0.56 (Rammstedt & John, 2007). Comparisons between paper-and-pencil personality questionnaires and online surveys like the present one showed that both types of assessment are highly comparable (Lang et al., 2011; Pettit, 2002; Rammstedt et al., 2004). For the current study, single item scores for all items were used to obtain a measure of M-F personality for each individual, as described below.

Statistical Analysis

Discriminant-Function Analysis

To compute a data-driven single measure of M-F personality, we conducted a discriminant-function analysis (DFA) in SPSS (IBM SPSS Statistics for Windows, Version 22.0. Armonk, NY: IBM Corp.) with sex as the grouping variable (male = 0, female = 1). Based on the responses to the personality questionnaire, this analysis generated a discriminant function that was the linear weighted combination of personality items that optimally classified individuals as male or female. The discriminant function from this analysis represents the bipolar M-F dimension. Accordingly, for each individual, an M-F personality score was derived by multiplying the standardized coefficients by the standardized item scores and adding across all items. The positive pole of the scale represents the masculine end and the negative pole the feminine end. To determine whether hormone transfer may play a role in individuals’ M-F personality score, we tested for mean differences between twins with an opposite-sex co-twin versus those with a same-sex co-twin (correcting for age effects and relatedness of the sample).
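The scoring step described above (standardized coefficients multiplied by standardized item scores, summed across items) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the SPSS procedure used in the study: the three coefficients and the item responses are invented, the helper names `zscore` and `mf_scores` are hypothetical, and items are standardized with the overall population SD, which may differ from how SPSS standardizes internally.

```python
from statistics import mean, pstdev

def zscore(values):
    """Standardize one item's raw responses to mean 0 and SD 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]

def mf_scores(items, coefficients):
    """Weighted sum of standardized item scores for each respondent.

    items: one list of raw responses per item (all the same length).
    coefficients: one standardized discriminant coefficient per item.
    """
    z_columns = [zscore(column) for column in items]
    n_respondents = len(items[0])
    return [
        sum(c * z[i] for c, z in zip(coefficients, z_columns))
        for i in range(n_respondents)
    ]

# Invented 5-point Likert responses: three items, four respondents.
items = [
    [1, 2, 4, 5],
    [5, 4, 2, 1],
    [3, 3, 3, 4],
]
# Invented coefficients; the study's actual values are those in Table 1.
coefficients = [0.30, -0.56, 0.10]

scores = mf_scores(items, coefficients)
```

With these toy numbers the first respondent lands on the negative (feminine) side of the scale and the last on the positive (masculine) side.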

Genetic Analysis

With the classical twin design we determined the extent to which individual differences in the M-F personality scores were due to genetic and environmental influences, by decomposing the variance into additive genetic (A), non-additive genetic (D), shared environmental (C), and residual (E) influences. Additive genetic variance is the influence of the summed allelic effects, while non-additive genetic effects include allelic interactions within and across genes (dominance and epistasis). Shared environmental variance results from environmental influences shared within pairs, which make them more similar to each other, such as family environment. Residual variance results from influences not shared by twin pairs, including environmental influences not shared between twins, stochastic biological effects, as well as measurement error.

The classical twin design makes use of the fact that identical (monozygotic, MZ) twins share 100% of their genes, whereas non-identical (dizygotic, DZ) twins on average only share 50% of their segregating genes. If A were the only source of variance in a trait we would expect a twin pair correlation of 1 for MZ pairs, while for DZ pairs the twin correlation would be 0.5. If non-additive genetic influences were the sole source of variance in a trait, we would expect a twin pair correlation of 1.0 for MZ pairs and, at most, 0.25 for DZ pairs (for an explanation, see Posthuma et al., 2003). In contrast, if C were the only source of variance in a trait, by definition we would expect a twin pair correlation of 1 for both MZ and DZ twin pairs. Finally, if all variance were due to E we would expect a twin pair correlation of 0 for both MZ and DZ twin pairs. Hence, A, C, D, and E influences predict different patterns of MZ and DZ twin pair correlations, and we used structural equation modeling to determine which combination best matched our observed data.
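The four limiting cases above follow from one pair of expectations. The sketch below encodes them for standardized variance components; the function name is hypothetical, and the coefficient of 0.25 for D is the dominance case, which the text describes as the upper bound for DZ pairs.

```python
def expected_twin_correlations(a2, c2, d2, e2):
    """Return (r_MZ, r_DZ) implied by standardized variance components.

    a2, c2, d2, e2 are proportions of variance due to A, C, D and E.
    """
    total = a2 + c2 + d2 + e2
    if abs(total - 1.0) > 1e-9:
        raise ValueError("variance components must sum to 1")
    # MZ pairs share all additive, non-additive and shared-environment effects.
    r_mz = a2 + d2 + c2
    # DZ pairs share half of A, a quarter of D (dominance case) and all of C.
    r_dz = 0.5 * a2 + 0.25 * d2 + c2
    return r_mz, r_dz

only_a = expected_twin_correlations(1, 0, 0, 0)
only_d = expected_twin_correlations(0, 0, 1, 0)
only_c = expected_twin_correlations(0, 1, 0, 0)
only_e = expected_twin_correlations(0, 0, 0, 1)
```

E contributes to neither correlation, which is why a trait driven entirely by E yields correlations of 0 in both zygosity groups.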

It is important to note that it is not possible to estimate C and D simultaneously when including only twins reared together, as C and D are negatively confounded: C decreases the MZ-DZ correlation ratio, while D increases it. Only three of the four sources of variance can therefore be estimated at a time, the choice of which (i.e., an ACE or ADE model) depends on the pattern of MZ and DZ correlations. When DZ twin correlations are at least half the MZ correlation, shared environmental influences are implied and so an ACE model is fitted. If DZ twin correlations are less than half the MZ correlations, non-additive genetic influences are implied and an ADE model is more suitable. A second limitation is that the classical twin design provides little statistical power to disentangle non-additive from additive genetic effects, because they are partly confounded as they predict similar but not identical patterns of MZ versus DZ twin pair correlations. However, as shown by Keller et al. (2010), the broad-sense heritability of a trait (H²; i.e., the total proportion of variance accounted for by genetic factors, A + D) is quite robustly estimated using only twins reared together. Additional information on the classical twin design can be found in Neale and Cardon (1992) and Posthuma et al. (2003).
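The model-choice rule above, together with rough method-of-moments estimates, can be sketched as follows. This is only an illustration: the study estimated the components by maximum likelihood, not with these closed-form expressions, the function names are hypothetical, and the example correlations are invented rather than taken from Table 2.

```python
def choose_model(r_mz, r_dz):
    """ACE if the DZ correlation is at least half the MZ correlation, else ADE."""
    return "ACE" if r_dz >= 0.5 * r_mz else "ADE"

def rough_components(r_mz, r_dz):
    """Solve the expected-correlation equations for the chosen model."""
    model = choose_model(r_mz, r_dz)
    if model == "ACE":
        # r_MZ = A + C and r_DZ = A/2 + C
        a2 = 2 * (r_mz - r_dz)
        third = 2 * r_dz - r_mz
    else:
        # r_MZ = A + D and r_DZ = A/2 + D/4
        a2 = 4 * r_dz - r_mz
        third = 2 * r_mz - 4 * r_dz
    e2 = 1 - r_mz
    # model[1] is "C" for ACE and "D" for ADE.
    return model, {"A": a2, model[1]: third, "E": e2}

# Invented correlations in which r_DZ is less than half of r_MZ.
model, parts = rough_components(0.34, 0.10)
broad_heritability = parts["A"] + parts.get("D", 0.0)
```

In the ADE case the broad-sense heritability A + D always equals the MZ correlation, which is one way to see why it is estimated more robustly than A and D separately. Note that the ADE expression for A turns negative when the DZ correlation falls below a quarter of the MZ correlation; a fitted model would bound the estimate instead.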

Twin analyses were conducted using maximum likelihood procedures in the statistical package Mx (Neale et al., 2006). In maximum-likelihood modeling, the goodness-of-fit of a model to the observed data is distributed as chi-square (χ²). To test whether dropping model parameters or constraining parameters to be equal significantly worsened the model fit, we tested the change in chi-square (Δχ²) against the change in degrees of freedom (Δdf). Variance components were estimated separately for males and females, and for all analyses age effects on the means were accounted for by including age as a covariate.
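The chi-square difference test described above reduces to an upper-tail probability of the chi-square distribution. The sketch below is a standard-library illustration, not the Mx implementation; the function names are hypothetical, and the closed forms used here only cover integer degrees of freedom.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate for integer df >= 1."""
    if x < 0 or df < 1 or df != int(df):
        raise ValueError("x must be non-negative and df a positive integer")
    y = x / 2.0
    if df % 2 == 0:
        # Even df: exp(-y) * sum_{k=0}^{df/2 - 1} y^k / k!
        return math.exp(-y) * sum(
            y ** k / math.factorial(k) for k in range(df // 2)
        )
    # Odd df: start from the df = 1 tail and add one term per extra 2 df.
    tail = math.erfc(math.sqrt(y))
    for k in range(1, (df - 1) // 2 + 1):
        tail += math.exp(-y) * y ** (k - 0.5) / math.gamma(k + 0.5)
    return tail

def lrt_p_value(delta_chi2, delta_df):
    """p-value for a change in chi-square against a change in df."""
    return chi2_sf(delta_chi2, delta_df)

# Example: a change in chi-square of 8.56 on 1 degree of freedom.
p_example = lrt_p_value(8.56, 1)
```

A small p-value means the constrained model fits significantly worse, so the dropped or equated parameter is retained.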

Results

Of the 11,543 twins who participated in the survey, those who had an unknown zygosity (368), did not fill out the personality questionnaire (1,555), or only partly filled out the personality questionnaire (100) were excluded from further analyses. Accordingly, the final sample consisted of 9,520 participants, including 2,245 complete twin pairs (695 MZ female, 374 MZ male, 392 DZ female, 248 DZ male, and 536 DZ opposite-sex pairs) and 5,030 single twins without a participating co-twin. Single twins were included as they contribute to the estimation of mean and variance effects. Participants were aged between 27 and 54 (mean 40.8, SD 7.8).
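The sample accounting above can be checked with simple arithmetic. The snippet below only restates the counts given in the text; the variable names are illustrative.

```python
participants = 11_543
excluded = {
    "unknown zygosity": 368,
    "personality questionnaire not filled out": 1_555,
    "personality questionnaire partly filled out": 100,
}
final_sample = participants - sum(excluded.values())

complete_pairs = {
    "MZ female": 695,
    "MZ male": 374,
    "DZ female": 392,
    "DZ male": 248,
    "DZ opposite-sex": 536,
}
n_pairs = sum(complete_pairs.values())
single_twins = 5_030

# Each complete pair contributes two individuals to the final sample.
individuals_accounted_for = 2 * n_pairs + single_twins
```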

Discriminant Analysis

Table 1 shows the item loadings for each of the Big Five Inventory items as obtained from the Discriminant Function Analysis; the standardized coefficients (ranging from -0.56 to 0.30) indicate which items best predict male versus female group membership. Multiplying the standardized coefficients by the standardized variables and adding across all items results in the discriminant score for each participant, with positive values indicating the masculine side of the scale and negative values the feminine side. As shown in Table 1, items that differentiate relatively strongly between males and females include ‘I am someone who can be cold and aloof’, ‘I am someone who values artistic, aesthetic experiences’, ‘I am someone who is emotionally stable, not easily upset’, and ‘I am someone who is inventive’.

TABLE 1 Standardized Canonical Discriminant Function Coefficients for Each Item of the Big-Five Inventory as Obtained from the Discriminant Function Analysis

On the overall discriminant function score males score higher than females.

Figure 1 shows the distribution of the discriminant scores for males and females. On average, males scored 1.09 standard deviations higher on the discriminant score than females, indicating a large effect size (males: mean = 0.64 [SD = 1.01]; females: mean = -0.45 [SD = 0.99]; Cohen's d = 1.09, 95% confidence interval: 1.07–1.11). The canonical correlation between participants’ sex and the discriminant score was 0.47, indicating that the personality items combined can explain 22% of the variance in group membership (Wilks’ Lambda = 0.78, p < .001). Based on the discriminant function, the sex of 72% of participants could be correctly classified.
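The effect-size figures in this paragraph are linked by short formulas, sketched below. Because the group sizes are not given here, the standard deviations are pooled with equal weights, which is an assumption; the function name is hypothetical. With a single discriminant function, Wilks' Lambda equals one minus the squared canonical correlation.

```python
import math

def cohens_d(mean_1, sd_1, mean_2, sd_2):
    """Cohen's d with an equal-weight pooled SD (assumes similar group sizes)."""
    pooled_sd = math.sqrt((sd_1 ** 2 + sd_2 ** 2) / 2)
    return (mean_1 - mean_2) / pooled_sd

# Reported group means and SDs on the discriminant score.
d = cohens_d(0.64, 1.01, -0.45, 0.99)

canonical_r = 0.47
variance_explained = canonical_r ** 2   # share of group membership explained
wilks_lambda = 1 - variance_explained   # valid for one discriminant function
```

Rounded to two decimals, these reproduce the reported d, the 22% explained, and the Wilks' Lambda.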

FIGURE 1 Frequency of masculinity-femininity (M-F) personality scores as obtained from the discriminant-function analysis. For males (blue) M = 0.64 (SD = 1.01) and for females (pink) M = -0.45 (SD = 0.99). The purple portions of the bars represent overlapping distributions between sexes.

To cross-validate our methods, we also ran a Discriminant Function Analysis on half of our sample and used the obtained standardized coefficients to estimate the sex of the other half of the sample. This yielded a canonical correlation between participants’ sex and the discriminant score of 0.46 and a correct classification rate of 70%.

Prenatal Hormone Influences on M-F Personality Scores

Males with a male co-twin scored on average 0.61 (SD = 1.00) and males with a female co-twin 0.70 (SD = 1.01) on the derived M-F personality scale. Females with a female co-twin scored on average -0.49 (SD = 0.99) and females with a male co-twin -0.38 (SD = 1.01). So, both male and female participants from opposite-sex pairs scored significantly higher on the M-F personality scale (both approximately 0.1 SD more masculine) than did males and females from same-sex pairs (Δχ²(1) = 8.56, p = .003 and Δχ²(1) = 9.32, p = .002 for males and females, respectively).

Genetic Analysis

Before modeling the variance components, we tested the effects of age and zygosity on the derived M-F personality scale using an α level of 0.01. The mean scores on the M-F personality scale did not differ significantly between MZ and DZ twins of the same sex (Δχ²(2) = 6.40, p = .04), whereas, as mentioned above, opposite-sex twins scored significantly higher than same-sex twins. Variances in the M-F scale did not differ significantly between MZ versus DZ twins (Δχ²(2) = 0.49, p = .78), or between twins of opposite-sex versus same-sex pairs (Δχ²(2) = 1.10, p = .58). We found a significant effect of age for both sexes, such that older males scored more feminine than younger males (Δχ²(1) = 49.84, p < .001) and older females scored slightly more masculine than younger females (Δχ²(1) = 6.93, p = .008). Note that age and sex were included as covariates in subsequent modeling, and means were estimated separately for same-sex versus opposite-sex twins.

Table 2 shows the twin pair correlations for each zygosity group, taking age and sex effects into account, as obtained by maximum likelihood procedures in Mx. MZ twin pair correlations were higher than DZ twin pair correlations in both sexes, suggesting the influence of genetic factors; this is formally tested below. The DZ opposite-sex twin pair correlation is not significantly lower than the DZ same-sex twin pair correlations (Δχ²(1) = 0.07, p = .79), indicating there are no qualitative sex differences in sources of familial aggregation between males and females. This means there is no evidence that different genes or different environmental factors influence the M-F scores in males versus females.

TABLE 2 Twin Pair Correlations (and 95% Confidence Intervals) for the Obtained M-F Personality Score by Zygosity

Because the DZ twin pair correlation is less than half the MZ twin pair correlation, we fitted a non-additive genetic latent variable instead of a shared environmental component. Table 3 shows the A, D, and E parameter estimates, as well as an estimate of the broad-sense heritability (H²). Results are shown for males and females separately as well as for both sexes combined. Equating the male and female parameter estimates did not result in a significant deterioration of model fit (Δχ²(2) = 1.81, p = .41), indicating that the relative influence of genes and environment does not differ significantly between sexes. Estimates indicate that individual differences in M-F personality scores are moderately heritable for both sexes; broad-sense heritability estimates are 35% for males and 33% for females. Also, we found some evidence for non-additive genetic influences, especially for females. The majority of the variance in the derived personality score can be explained by residual influences, including non-shared environmental influences and measurement error.

TABLE 3 Estimates of the Proportions of Variance (95% Confidence Intervals in Brackets) in Masculinity-Femininity (M-F) Personality Scores Explained by A (Additive Genetic), D (Non-Additive Genetic), and E (Residual) Influences

Note: H² represents the broad-sense heritability.

Discussion

In the current study, we used data on 9,520 twins to examine the etiology of individual differences in personality M-F. We computed a bipolar M-F personality scale, and examined the extent to which individual differences in this scale could be attributed to genetic and environmental influences. We also tested whether prenatal hormone transfer may influence individuals’ M-F personality score.

Males scored on average 1.09 standard deviations higher than females on the derived bipolar M-F personality score. This is a large effect size and indicates an overlap in distributions of approximately 41% between the sexes and means that approximately 86% of males score higher than the average female. This derived sex difference is larger than those reported in other studies (using different methodologies; Costa et al., 2001; Lippa, 2010; Loehlin et al., 1999), but substantially smaller than the multivariate effect size of D = 2.71 found by Del Giudice et al. (2012), who used multigroup latent variable modeling to estimate sex differences on 16 individual personality dimensions, which were then aggregated to yield a multivariate effect size taking intercorrelations between the dimensions into account.
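The two percentages quoted above can be recovered from d = 1.09 under the assumption of normal distributions with equal variances. In the sketch below, the 86% corresponds to Cohen's U3, and the 41% is reproduced by one minus Cohen's U1; that correspondence is an inference from the numbers, not something stated in the text. The overlapping coefficient, another common definition of overlap, gives a larger value and is shown for comparison.

```python
import math

def normal_cdf(z):
    """Cumulative distribution function of the standard normal."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

d = 1.09

# Cohen's U3: share of the higher-scoring group above the other group's mean.
share_above_other_mean = normal_cdf(d)

# Cohen's U1: share of the combined area that is not shared by both groups.
u2 = normal_cdf(d / 2)
u1 = (2 * u2 - 1) / u2
overlap_from_u1 = 1 - u1

# Overlapping coefficient: area common to both density curves.
overlapping_coefficient = 2 * normal_cdf(-d / 2)
```

Which overlap statistic is appropriate depends on the question being asked, so the definition is worth stating alongside any reported percentage.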

We found that individual differences in the M-F score were moderately heritable; broad-sense heritability estimates were 35% for males and 33% for females, with some evidence for a role of non-additive genetic effects, especially for females. Testing for sex differences in the genetic architecture showed no evidence that different genes or environmental factors influence the M-F scores in males versus females, and the relative influences of genes and environment also do not differ significantly between sexes. Shared environmental factors do not seem to influence individual differences in M-F score, implying that factors such as parenting style, socioeconomic status, familial attitudes and values, home environment, and other family environmental factors have very little influence on M-F personality development. Residual influences have the strongest impact on individual differences in M-F score (E = 65% for males and 67% for females); while part of E will be due to measurement error, this finding also suggests that unique experiences and unique social interactions may play a role.

The broad-sense heritability estimates for M-F scores are similar to those found previously for various indices of masculinity/femininity, which ranged approximately between 30% and 60% (Lippa & Hershberger, 1999; Loehlin & Martin, 2000; Loehlin et al., 2005). These studies generally also found zero or very low influences of shared environment, and Loehlin and Martin (2000), but not Lippa and Hershberger (1999) or Loehlin et al. (2005), also found some evidence for non-additive genetic influences. Our estimates of the genetic and environmental influences are also comparable to those of the various Big-Five and other personality traits, for which heritability estimates are in the range of 30–60% and do not indicate much shared environmental influence (Johnson et al., 2008). Moreover, non-additive genetic influences were also detected for several personality traits (Keller et al., 2005).

The presence of heritable variation shows that it may be of interest to identify the specific genes that are involved in individual differences in personality M-F, to gain insight into the underlying biological mechanisms. However, the identification of genes for the typical personality scales has proven difficult (de Moor et al., 2015; Service et al., 2012; van den Berg et al., 2016; Verweij et al., 2010), so very large sample sizes are expected to be required to identify specific variants.

The prenatal hormone transfer hypothesis was addressed by comparing personality scores of twins with a same-sex co-twin with scores of twins with an opposite-sex co-twin. Both male and female twins from opposite-sex pairs scored significantly more masculine (approximately 0.1 SD) than males and females from same-sex pairs. For females, this finding is consistent with the hypothesis that testosterone transfer from the male co-twin to the female twin during pregnancy increases the female's level of masculinization. Several previous studies also found that females with a male co-twin exhibited more masculine behavior for disinhibition, experience seeking and overall sensation seeking (Resnick et al., 1993), experience-seeking and thrill-and-adventure-seeking (Slutske et al., 2011), rule-breaking behavior (for one of the two subsamples; Loehlin & Martin, 2000), social conservatism (Miller & Martin, 1995), and aggression (Cohen-Bendahan et al., 2005). The effect size we found is comparable with the effect sizes found by Slutske et al. (2011) and Loehlin and Martin (2000), while the effect sizes reported by Resnick et al. (1993; Cohen's d ranging between 0.18 and 0.38) and Cohen-Bendahan et al. (2005; Cohen's d of 0.34 and 0.49) were substantially larger.
There are also studies that did not find support for the hormone transfer hypothesis in females; for instance, for sensation seeking or various temperament subscales (Cohen-Bendahan et al., 2005), toy preference (Henderson & Berenbaum, 1997; Rodgers et al., 1998), feminine interest (Rose et al., 2002), and for Worried and Reserved subscales (Loehlin & Martin, 2000). In some cases, these null results may be due to the much smaller sample sizes; the samples used by Cohen-Bendahan et al. (2005; N = 129 twins), Henderson and Berenbaum (1997; N = 71 twins), and Rodgers et al. (1998; N = 70 female twins) provided low statistical power to detect an effect. Moreover, contrary to expectations, Koopmans et al. (1995) found that female opposite-sex twins scored lower on Experience Seeking than female same-sex twins.
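To give a sense of why samples of this size provide little power for an effect of about 0.1 SD, the following sketch approximates the power of a two-sided, two-group comparison using a normal approximation. The function name, the equal split into two groups, and the per-group sizes are assumptions made for illustration; the calculation also treats observations as independent, which twin data are not, so the numbers should be read as rough orientation only.

```python
from math import sqrt
from statistics import NormalDist


def approx_power(d: float, n_per_group: int, alpha: float = 0.05) -> float:
    """Approximate power of a two-sided two-sample test (normal approximation).

    d is the standardized mean difference; both groups are assumed to have
    n_per_group independent observations (an illustrative simplification).
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)   # critical value of the two-sided test
    ncp = d * sqrt(n_per_group / 2)     # expected value of the test statistic
    return z.cdf(ncp - z_crit) + z.cdf(-ncp - z_crit)


# Hypothetical per-group sizes, roughly half of the total samples cited above
for n in (35, 64, 2000):
    print(n, round(approx_power(0.1, n), 3))
```

Under these assumptions, groups of a few dozen participants leave power far below the conventional 80% for an effect of 0.1 SD, whereas groups in the thousands exceed it.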

The finding that males with a female co-twin scored more masculine than males with a male co-twin is puzzling and not accounted for by the hormone transfer theory. There are few studies on prenatal hormone transfer in males and findings are inconsistent. Similar to our results, Koopmans et al. (1995) found that male opposite-sex twins scored higher on Experience Seeking than male same-sex twins. Loehlin and Martin (2000) found that males with a female co-twin scored more feminine than males from same-sex pairs on the Worried subscale (in one of two subsamples), but that they also scored more masculine on the Breaks Rules subscale (in both samples), with a trend in this direction for the Worried subscale in the second of the two samples. No differences were found between males with a female versus male co-twin for the Reserved subscale (Loehlin & Martin, 2000), various sensation seeking scores (Resnick et al., 1993), toy preference (Rodgers et al., 1998), feminine interest (Rose et al., 2002), and social conservatism (Miller & Martin, 1995); again, in some cases, these null results may be due to the small sample sizes. Overall, the evidence for prenatal hormone transfer influences on personality is not strong, with study findings especially inconsistent for males.

It is important to note that with the present design it is impossible to differentiate between prenatal hormone and postnatal socialization influences of having a sibling of a certain sex. The higher scores on the M-F scale for males and females with an opposite-sex co-twin might therefore be caused by some kind of social interaction after birth or by a comparison effect (albeit differing for males and females). Slutske et al. (2011) included comparisons between female same-sex twins with and without a non-twin brother (close in age) to distinguish between prenatal hormone transfer and postnatal socialization influences. Their findings suggested that the masculinization effect of having a male co-twin on females' level of experience seeking and thrill and adventure seeking could not be explained by postnatal socialization effects of having a brother, and that it therefore had to be attributed to prenatal hormone transfer. Unfortunately, Slutske et al. (2011) did not perform the same analyses for males, so there is no indication of whether higher masculinization for male twins might be an effect of interacting with a female co-twin.

Our study showed that within-sex differences can be explained by genetic and unshared environmental influences and, potentially, male-to-female hormonal transfer. While the sources of within-sex individual differences are not necessarily the same as the sources of between-sex differences in personality, our findings may inform future work to that end. In that respect, it is of interest that we did not find evidence for qualitative or quantitative sex differences in the sources of variation, indicating that the same genes and environmental factors influence masculinity of personality in both sexes to the same extent. An important strength of the present study is the very large sample size, which made this the first twin study to perform a discriminant analysis using all single personality items without the risk of overfitting. The large sample also provided power to detect subtle hormone-transfer effects and non-additive genetic influences. Another strength of this study is that we also looked for prenatal hormone effects in males, whereas prior studies often focused on prenatal effects in females only.

The main limitation of the current study is that we relied on self-report data, which are subject to response biases such as socially desirable responding. To reduce influences of social desirability and measurement error, as well as to differentiate between actual sexual dimorphism in personality and the artefact or expectation theories, future studies should employ different types of personality measurement, such as observational studies or parental, peer, or teacher ratings.

Overall, by means of a discriminant analysis on the single items of the Big Five personality inventory, we created an M-F personality scale with large sex differences, and showed that around one third of the variance on this scale was attributable to genetic factors, while shared environmental factors had no influence. Prenatal hormone transfer may also play a role, but additional well-powered studies are needed to clarify the association and determine the underlying mechanisms in both sexes. By including non-twin siblings, these studies should aim to differentiate between prenatal influences and postnatal socialization influences.

Acknowledgments

This work was supported by the Bank of Sweden Tercentenary Foundation (M11-0451:1), the Swedish Scientific Council (521-2010-3195), and the Sven and Dagmar Salén Foundation.

References

Auyeung, B., Baron-Cohen, S., Ashwin, E., Knickmeyer, R., Taylor, K., Hackett, G., & Hines, M. (2009). Fetal testosterone predicts sexually differentiated childhood behavior in girls and in boys. Psychological Science, 20, 144–148.
Berenbaum, S. A., & Beltz, A. M. (2011). Sexual differentiation of human behavior: Effects of prenatal and pubertal organizational hormones. Frontiers in Neuroendocrinology, 32, 183–200.
Buss, D. M. (1995). Psychological sex differences. Origins through sexual selection. American Psychologist, 50, 164–168.
Buss, D. M., & Hawley, P. H. (2010). The evolution of personality and individual differences. Oxford: Oxford University Press.
Cohen-Bendahan, C. C., Buitelaar, J. K., van Goozen, S. H., Orlebeke, J. F., & Cohen-Kettenis, P. T. (2005). Is there an effect of prenatal testosterone on aggression and other behavioral traits? A study comparing same-sex and opposite-sex twin girls. Hormones and Behavior, 47, 230–237.
Costa, P. T., & McCrae, R. R. (1985). The NEO personality inventory manual. Odessa, FL: Psychological Assessment Resources.
Costa, P. T., Terracciano, A., & McCrae, R. R. (2001). Gender differences in personality traits across cultures: Robust and surprising findings. Journal of Personality and Social Psychology, 81, 322–331.
de Moor, M. H., van den Berg, S. M., Verweij, K. J., Krueger, R. F., Luciano, M., Arias Vasquez, A., . . . Boomsma, D. I. (2015). Meta-analysis of genome-wide association studies for neuroticism, and the polygenic association with major depressive disorder. JAMA Psychiatry, 72, 642–650.
Deaux, K., & Major, B. (1987). Putting gender into context: An interactive model of gender-related behaviour. Psychological Review, 94, 369–389.
Del Giudice, M., Booth, T., & Irwing, P. (2012). The distance between Mars and Venus: Measuring global sex differences in personality. PLoS One, 7, e29265.
Digman, J. M. (1990). Personality structure: Emergence of the 5-factor model. Annual Review of Psychology, 41, 417–440.
Eagly, A. H. (1987). Sex differences in social behavior: A social-role interpretation. Hillsdale, NJ: Lawrence Erlbaum.
Eagly, A. H., & Wood, W. (1991). Explaining sex differences in social behavior: A meta-analytic study. Personality and Social Psychology Bulletin, 17, 306–315.
Even, M. D., & vom Saal, F. S. (1992). Seminal vesicle and preputial gland response to steroids in adult male mice is influenced by prior intrauterine position. Physiology & Behavior, 51, 11–16.
Fechner, P. Y. (2003). The biology of puberty: New developments in sex differences. In Hayward, C. (Ed.), Gender differences at puberty (pp. 17–28). Cambridge, UK: Cambridge University Press.
Feingold, A. (1990). Gender differences in effects of physical attractiveness on romantic attraction: A comparison across five research paradigms. Journal of Personality and Social Psychology, 59, 981–993.
Feingold, A. (1991). Sex differences in the effects of similarity and physical attractiveness on opposite-sex attraction. Basic and Applied Social Psychology, 12, 357–367.
Feingold, A. (1992). Gender differences in mate selection preferences: A test of the parental investment model. Psychological Bulletin, 112, 125–139.
Feingold, A. (1994). Gender differences in personality: A meta-analysis. Psychological Bulletin, 116, 429–456.
Geary, D. C. (2010). Male, female: The evolution of human sex differences (2nd ed.). Washington, DC: American Psychological Association.
Gentile, B., Grabe, S., Dolan-Pascoe, B., Twenge, J. M., Wells, B. E., & Maitino, A. (2009). Gender differences in domain-specific self-esteem: A meta-analysis. Review of General Psychology, 13, 34–45.
Henderson, B. A., & Berenbaum, S. A. (1997). Sex-typed play in opposite-sex twins. Developmental Psychobiology, 31, 115–123.
John, O. P., Donahue, E. M., & Kentle, R. L. (1991). The big five inventory: Versions 4a and 54. Berkeley, CA: University of California, Berkeley, Institute of Personality and Social Research.
John, O. P., Naumann, L. P., & Soto, C. J. (2008). Paradigm shift to the integrative Big Five trait taxonomy: History, measurement, and conceptual issues. In John, O. P., Robins, R. W., & Pervin, L. A. (Eds.), Handbook of personality: Theory and research (pp. 114–158). New York, NY: Guilford Press.
Johnson, A. M., Vernon, P. A., & Feiler, A. R. (2008). Behavioral genetic studies of personality: An introduction and review of the results of 50+ years of research. In Boyle, G., Matthews, G., & Saklofske, D. (Eds.), Handbook of personality theory and assessment (vol. 1, pp. 145–173). London: Sage.
Jussim, L. (2012). Social perception and social reality: Why accuracy dominates bias and self-fulfilling prophecy. Oxford: Oxford University Press.
Keller, M. C., Coventry, W. L., Heath, A. C., & Martin, N. G. (2005). Widespread evidence for non-additive genetic variation in Cloninger's and Eysenck's personality dimensions using a twin plus sibling design. Behavior Genetics, 35, 707–721.
Keller, M. C., Medland, S. E., & Duncan, L. E. (2010). Are extended twin family designs worth the trouble? A comparison of the bias, precision, and accuracy of parameters estimated in four twin family models. Behavior Genetics, 40, 377–393.
Kling, K. C., Hyde, J. S., Showers, C. J., & Buswell, B. N. (1999). Gender differences in self-esteem: A meta-analysis. Psychological Bulletin, 125, 470–500.
Koopmans, J. R., Boomsma, D. I., Heath, A. C., & van Doornen, L. J. (1995). A multivariate genetic analysis of sensation seeking. Behavior Genetics, 25, 349–356.
Lang, F. R., John, D., Ludtke, O., Schupp, J., & Wagner, G. G. (2011). Short assessment of the big five: Robust across survey methods except telephone interviewing. Behavior Research Methods, 43, 548–567.
Lichtenstein, P., De Faire, U., Floderus, B., Svartengren, M., Svedberg, P., & Pedersen, N. L. (2002). The Swedish Twin Registry: A unique resource for clinical, epidemiological and genetic studies. Journal of Internal Medicine, 252, 184–205.
Lichtenstein, P., Sullivan, P. F., Cnattingius, S., Gatz, M., Johansson, S., Carlstrom, E., . . . Pedersen, N. L. (2006). The Swedish Twin Registry in the third millennium: An update. Twin Research and Human Genetics, 9, 875–882.
Lippa, R. A. (2010). Sex differences in personality traits and gender-related occupational preferences across 53 nations: Testing evolutionary and social-environmental theories. Archives of Sexual Behavior, 39, 619–636.
Lippa, R. A., & Hershberger, S. (1999). Genetic and environmental influences on individual differences in masculinity, femininity, and gender diagnosticity: Analyzing data from a classic twin study. Journal of Personality, 67, 127–155.
Loehlin, J. C., Jonsson, E. G., Gustavsson, J. P., Stallings, M. C., Gillespie, N. A., Wright, M. J., & Martin, N. G. (2005). Psychological masculinity-femininity via the gender diagnosticity approach: Heritability and consistency across ages and populations. Journal of Personality, 73, 1295–1319.
Loehlin, J. C., & Martin, N. G. (2000). Dimensions of psychological masculinity-femininity in adult twins from opposite-sex and same-sex pairs. Behavior Genetics, 30, 19–28.
Loehlin, J. C., Spurdle, A., Treloar, S. A., & Martin, N. G. (1999). Number of X-linked androgen receptor gene CAG repeats and femininity in women. Personality and Individual Differences, 27, 887–899.
Miller, E. M. (1994). Prenatal sex hormone transfer: A reason to study opposite-sex twins. Personality and Individual Differences, 17, 511–529.
Miller, E. M., & Martin, N. (1995). Analysis of the effect of hormones on opposite-sex twin attitudes. Acta Geneticae Medicae et Gemellologiae (Roma), 44, 41–52.
Mitchell, J. E., Baker, L. A., & Jacklin, C. N. (1989). Masculinity and femininity in twin children: Genetic and environmental factors. Child Development, 60, 1475–1485.
Mosing, M. A., Madison, G., Pedersen, N. L., Kuja-Halkola, R., & Ullén, F. (2014). Practice does not make perfect: No causal effect of musical practice on musical ability. Psychological Science, 25, 1795–1803.
Neale, M. C., Boker, S. M., Xie, G., & Maes, H. H. (2006). Mx: Statistical modeling (7th ed.). Richmond, VA: Department of Psychiatry, Virginia Commonwealth University.
Neale, M. C., & Cardon, L. R. (1992). Methodology for genetic studies of twins and families. Dordrecht: Kluwer.
Pajer, K., Tabbah, R., Gardner, W., Rubin, R. T., Czambel, R. K., & Wang, Y. (2006). Adrenal androgen and gonadal hormone levels in adolescent girls with conduct disorder. Psychoneuroendocrinology, 31, 1245–1256.
Pervin, L. A., Cervone, D., & John, O. P. (2005). Personality: Theory and research (9th ed.). Hoboken, NJ: Wiley.
Pettit, F. A. (2002). A comparison of World-Wide Web and paper-and-pencil personality questionnaires. Behavior Research Methods, Instruments, & Computers, 34, 50–54.
Posthuma, D., Beem, A. L., de Geus, E. J. C., van Baal, G. C. M., von Hjelmborg, J. B., Lachine, I., & Boomsma, D. I. (2003). Theory and practice in quantitative genetics. Twin Research, 6, 361–376.
Rammstedt, B., Holzinger, B., & Rammsayer, T. (2004). Comparison of paper-and-pencil vs. computerized administration of the NEO-Five-Factor-Inventory (NEO-FFI). [Zur Äquivalenz der Papier-Bleistift- und einer computergestützten Version des NEO-Fünf-Faktoren-Inventars (NEO-FFI)]. Diagnostica, 50, 88–97.
Rammstedt, B., & John, O. P. (2007). Measuring personality in one minute or less: A 10-item short version of the big five inventory in English and German. Journal of Research in Personality, 41, 203–212.
Resnick, S. M., Gottesman, I. I., & McGue, M. (1993). Sensation seeking in opposite-sex twins: An effect of prenatal hormones? Behavior Genetics, 23, 323–329.
Rodgers, C. S., Fagot, B. I., & Winebarger, A. (1998). Gender-typed toy play in dizygotic twins: A test of hormone transfer theory. Sex Roles, 39, 173–184.
Rose, R. J., Kaprio, J., Winter, T., Dick, D. M., Viken, R. J., Pulkkinen, L., & Koskenvuo, M. (2002). Femininity and fertility in sisters with twin brothers: Prenatal androgenization? Cross-sex socialization? Psychological Science, 13, 263–267.
Ryan, B. C., & Vandenbergh, J. G. (2002). Intrauterine position effects. Neuroscience & Biobehavioral Reviews, 26, 665–678.
Service, S. K., Verweij, K. J., Lahti, J., Congdon, E., Ekelund, J., Hintsanen, M., . . . Freimer, N. B. (2012). A genome-wide meta-analysis of association studies of Cloninger's Temperament Scales. Translational Psychiatry, 2, e116.
Slutske, W. S., Bascom, E. N., Meier, M. H., Medland, S. E., & Martin, N. G. (2011). Sensation seeking in females from opposite- versus same-sex twin pairs: Hormone transfer or sibling imitation? Behavior Genetics, 41, 533–542.
Tapp, A. L., Maybery, M. T., & Whitehouse, A. J. (2011). Evaluating the twin testosterone transfer hypothesis: A review of the empirical evidence. Hormones and Behavior, 60, 713–722.
Trivers, R. L. (1972). Parental investment and sexual selection. In Campbell, B. (Ed.), Sexual selection and the descent of man 1871–1971 (pp. 136–179). Chicago, IL: Aldine.
van den Berg, S. M., de Moor, M. H., Verweij, K. J., Krueger, R. F., Luciano, M., Arias Vasquez, A., . . . Boomsma, D. I. (2016). Meta-analysis of genome-wide association studies for extraversion: Findings from the genetics of personality consortium. Behavior Genetics, 46(2), 170–182.
Verweij, K. J. H., Zietsch, B. P., Medland, S. E., Gordon, S. D., Benyamin, B., Nyholt, D. R., . . . Wray, N. R. (2010). A genome-wide association study of Cloninger's temperament scales: Implications for the evolutionary genetics of personality. Biological Psychology, 85, 306–317.

TABLE 1 Standardized Canonical Discriminant Function Coefficients for Each Item of the Big-Five Inventory as Obtained from the Discriminant Function Analysis

FIGURE 1 Frequency of masculinity-femininity (M-F) personality scores as obtained from the discriminant-function analysis. For males (blue) M = 0.64 (SD = 1.01) and for females (pink) M = -0.45 (SD = 0.99). The purple portions of the bars represent overlapping distributions between sexes.
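The group means and standard deviations in this caption can be turned into a standardized mean difference. The sketch below computes Cohen's d with a pooled standard deviation; the equal weighting of the two groups is an assumption made here, since the group sizes are not given in the caption, and the function name is only illustrative.

```python
from math import sqrt


def cohens_d(mean_1: float, sd_1: float, mean_2: float, sd_2: float) -> float:
    """Cohen's d using an equally weighted pooled SD (an assumption here)."""
    pooled_sd = sqrt((sd_1 ** 2 + sd_2 ** 2) / 2)
    return (mean_1 - mean_2) / pooled_sd


# Values from the Figure 1 caption: males M = 0.64 (SD = 1.01), females M = -0.45 (SD = 0.99)
d = cohens_d(0.64, 1.01, -0.45, 0.99)
print(round(d, 2))  # 1.09
```

The result agrees with the sex difference of 1.09 standard deviations reported in the abstract.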

TABLE 2 Twin Pair Correlations (and 95% Confidence Intervals) for the Obtained M-F Personality Score by Zygosity

TABLE 3 Estimates of the Proportions of Variance (95% Confidence Intervals Between Brackets) in Masculinity-Femininity (M-F) Personality Scores Explained by A (Additive Genetic), D (Non-Additive Genetic), and E (Residual) Influences