The Norwegian Twin Registry
The Norwegian Twin Registry (NTR) was established in 2009 as a merger of three major population-based Norwegian twin panels (Bergem, 2002; Harris et al., 2002, 2006). The NTR is housed at the Norwegian Institute of Public Health (NIPH), which currently runs 10 of Norway's national health registries and several large population-based cohort studies. The principal reasons for establishing the NTR are to promote research that exploits the value inherent in twin designs and to create a resource for studies in genetic epidemiology. Therefore, the registry makes data accessible to researchers for a wide range of studies. The registry infrastructure easily accommodates future expansions and enhancements of the twin data as described below. The purpose of this article is to describe the NTR today, including the potential for enriching the existing data through matches to other national registries and existing biological sample resources.
Since the 1960s, when twin research was initiated in Norway, substantial investments have been made in collecting population-based twin data for specific studies, but there were neither resources nor formal plans for developing a national twin data resource that would be made available to researchers. Only data from the NIPH panel had been fairly accessible (Harris et al., 2002, 2006). Establishment of the NTR was jointly funded by the University of Oslo, Oslo University Hospital, and NIPH. It does not have dedicated governmental funding and is currently supported through the Division for Epidemiology (NIPH) and through research studies using NTR data.
Panel I of the NTR covers cohorts born 1895–1945. A total of 37,000 pairs were identified for these birth cohorts through Statistics Norway, which also provided information about the twins’ full names, and the date and place of birth of the twins and their parents. This information was then used to conduct linkages with national population registry files so that the national identity (NI) number of the twins could be assigned. This process identified a subset of 18,972 individuals (including 5,250 complete pairs) for whom the population registry files could provide validated NI numbers. The NI numbers make it possible to conduct linkages to other national registries. Importantly, a majority (10,000) of the same sex pairs from Panel I are also part of Panel II, and for these twins the NTR also contains information on zygosity plus other data as described below.
Panel II covers same sex twin births from 1915 through 1960 (and overlaps with Panel I for the years 1915–1945). Pairs where at least one twin had died before the age of 20 at the time of recruitment were excluded. Today, this panel contains 21,963 consenting twins, from an estimated 20,173 same sex pairs, comprising 9,183 complete pairs.
National identity numbers were introduced in Norway in 1964, based on the census of 1960; thus, twins deceased before 1960 were mostly lost to both Panel I and Panel II (Bergem, 2002; Iversen et al., 2001).
Panel III covers birth cohorts from 1967 through 1979 (Harris et al., 2002). Twins were identified through the national Medical Birth Registry, which is complete for all births in Norway since the beginning of mandatory registration of all pregnancies in 1967 (Irgens, 2000). Approximately 15,000 twins were born in this period, of whom 9,477 consenting twins (4,242 complete pairs) are now part of the NTR. Consent was granted via the return of a completed questionnaire or specific consent form. Today, studies will invariably need to obtain specific informed consent for the details of the data collection, storage and use.
Collectively, the three panels comprising the NTR span cohorts 1895–1960 and 1967–1979, and include a total of 14,742 complete pairs. Birth cohorts from 1961–1966 are missing because Panel II recruited only twins who were of consenting age (18 years), which meant birth cohorts up to and including 1960, identified through Statistics Norway, whereas Panel III relied on the Medical Birth Registry, which started in 1967, to identify twins nationwide.
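The cohort coverage described above can be summarized in a small lookup. The following is a minimal sketch, not part of the NTR infrastructure: the function and variable names are hypothetical, and the absence of a same-sex restriction for Panel III is an assumption, since the text states none.

```python
# Birth-cohort coverage of the three NTR panels, as stated in the text.
# Names are illustrative only; nothing here reflects an actual NTR system.
PANELS = {
    "I": {"years": (1895, 1945), "same_sex_only": False},
    "II": {"years": (1915, 1960), "same_sex_only": True},
    # Assumption: no same-sex restriction is stated for Panel III.
    "III": {"years": (1967, 1979), "same_sex_only": False},
}


def panels_covering(birth_year, same_sex):
    """Return the panels whose recruitment frame includes this birth cohort."""
    covered = []
    for name, spec in PANELS.items():
        first, last = spec["years"]
        in_range = first <= birth_year <= last
        sex_ok = same_sex or not spec["same_sex_only"]
        if in_range and sex_ok:
            covered.append(name)
    return covered


if __name__ == "__main__":
    for year in (1930, 1963, 1970):
        print(year, panels_covering(year, same_sex=True))
```

An empty result for a birth year between 1961 and 1966 reflects the gap between the Statistics Norway sampling frame and the start of the Medical Birth Registry.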
Zygosity Classification
Panel I holds no information on zygosity except for opposite-sex pairs. In Panels II and III, zygosity was determined by questionnaire and subsequently verified by genetic marker analysis and DNA-based testing in a sub-sample of twins. These tests revealed high validity of the questionnaire-based method (Harris et al., 2006; Magnus et al., 1983). More specifically, for Panel II zygosity classification was determined by concordance of genetic markers from nine blood type systems, five serum group systems, and four red cell enzyme systems, and is reported in detail elsewhere (Berg, 1973). Tests of zygosity classification in Panel III were based on two multiplex panels of 12 micro-satellite markers each (for a total of 24 micro-satellite markers covering various autosomes). The alleles were scored by two individuals independently. Once discrepancies were resolved, the consensus data were manually reviewed for homozygosity between the two twins of each pair. Table 1 provides an overview of the number of twin individuals and the number of complete pairs across the three panels by zygosity.
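The marker-based check described for Panel III (two independent scorers, followed by a within-pair comparison) can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used: the decision rule (any discordant marker implies dizygotic; full concordance over a minimum number of markers implies monozygotic), the `min_markers` threshold, and all function names are hypothetical.

```python
def normalise(genotype):
    """Order the two alleles so that (120, 124) and (124, 120) compare equal."""
    return tuple(sorted(genotype))


def consensus(scorer_a, scorer_b):
    """Keep markers on which two independent scorers agree; list the rest."""
    agreed, discrepant = {}, []
    for marker in sorted(set(scorer_a) & set(scorer_b)):
        if normalise(scorer_a[marker]) == normalise(scorer_b[marker]):
            agreed[marker] = normalise(scorer_a[marker])
        else:
            discrepant.append(marker)  # to be resolved before classification
    return agreed, discrepant


def classify_pair(twin1, twin2, min_markers=10):
    """Classify a pair from consensus genotypes (assumed rule, see lead-in)."""
    shared = set(twin1) & set(twin2)
    discordant = sorted(
        m for m in shared if normalise(twin1[m]) != normalise(twin2[m])
    )
    if discordant:
        return "DZ", discordant
    if len(shared) >= min_markers:  # hypothetical threshold
        return "MZ", []
    return "undetermined", []
```

Genotypes are represented here as pairs of allele sizes keyed by marker name; a pair with too few shared markers is left undetermined rather than forced into a class.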
*Unique to Panel I; otherwise included in the Panel II count.
The number of twins in the NTR is not equal to the sum of previously published numbers of twins from the respective twin panels. This is due to three factors: (1) files received from two of the three panels deviate from published numbers, (2) there is considerable overlap between two of the panels, and (3) legal issues concerning consent affect the number of twins available for research.
The twin panels were established under different legal circumstances and with different purposes, which makes legal integration of all our twin data challenging; certain issues are still pending a decision from the Norwegian Data Protection Agency. Thus the number of twins available for research may vary.
Complete pairs are pairs where both twins have consented, although zygosity can be predicted with great accuracy using questionnaire data from one twin only. Twins with no consent and only an NI number are included in the base registry, but cannot be included in studies nor have their NI number linked to other registry data without prior consent. The exception to this rule applies to deceased twins, who can be included in studies, as the Personal Data Act in Norway generally only applies to living persons. Hence, the number of twins available for linkage is greater than the number of consenting twins. As of today, data on more than 8,000 non-consenting, deceased twins are also available for linkage pending prior approval by the Regional Committees for Medical and Health Research Ethics and the registries being linked. Additionally, there are approximately 1,500 twins from Panel II who have not been approached for consent and, therefore, linkage is not currently an option. Figure 1 depicts the structure of the cohort data in the NTR.
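The consent rules above amount to two simple predicates: which twins may be linked, and which pairs count as complete. The sketch below is illustrative only. The `Twin` record and its field names are hypothetical, and linkage of deceased twins remains subject to the ethics-committee and registry approvals described in the text, which the code does not model.

```python
from dataclasses import dataclass


@dataclass
class Twin:
    pair_id: int          # hypothetical pair identifier
    consented: bool
    deceased: bool = False


def linkable(twin):
    """Consenting twins, or deceased twins (approval still required)."""
    return twin.consented or twin.deceased


def complete_pairs(twins):
    """Pair IDs for which both twins are present and both have consented."""
    by_pair = {}
    for twin in twins:
        by_pair.setdefault(twin.pair_id, []).append(twin)
    return sorted(
        pair_id
        for pair_id, members in by_pair.items()
        if len(members) == 2 and all(m.consented for m in members)
    )


if __name__ == "__main__":
    sample = [
        Twin(1, consented=True), Twin(1, consented=True),
        Twin(2, consented=True), Twin(2, consented=False, deceased=True),
        Twin(3, consented=False), Twin(3, consented=False),
    ]
    print(complete_pairs(sample))
    print(sum(linkable(t) for t in sample))
```

In this made-up sample, pair 2 is not a complete pair, yet both of its members can in principle be linked, which is why the number of linkable twins exceeds the number of consenting twins.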
Phenotypes in NTR
The NTR contains a wide range of phenotypes, obtained by questionnaires, clinical assessments, and interviews. These have been detailed for Panel III elsewhere (Harris et al., 2002, 2006) and described to some extent for Panel II (Bergem, 2002). Panel II conducted two large questionnaire surveys, one in 1978–1982 and another in 1990–1998. Twins who responded to a previous general zygosity questionnaire and agreed to further contact were invited to participate (Magnus et al., 1983). Two questionnaires were sent to each twin: the first (1978–1982) contained items on general health, including current weight and height plus basic demographics and education, lifestyle, tobacco and alcohol use, physical health history, and degree of contact with co-twin. For all health history questions (diseases and medical conditions), the twin was also asked to report about their first-degree relatives, thus constituting twin family data. Data on the first-degree relatives indicate only whether said disease or condition afflicts any member of the family, including the twin. For these family members neither national identity number nor name is available. The second, companion questionnaire focused on reproductive health, maternal health, and lifestyle (including alcohol and tobacco use) during pregnancy, complications during pregnancy or birth, congenital disease, health history, and medical conditions of the children from infancy onwards. These two questionnaires were repeated in the second survey (1990–1998), to which those who completed the first survey were invited. The general health questionnaire was expanded to include more items on health history, seizures, diet, anxiety and depression, whereas the reproductive health questionnaire was reduced, excluding many items regarding the children's health history, but including items on menstruation and fertility.
Responses were received from 14,000 of the 21,885 consenting twins in Panel II. In total, 6,800 twins (including 1,900 complete pairs) responded to both of the general health questionnaires and 2,700 twins (including 700 complete pairs) responded to the two reproduction questionnaires, thus constituting a longitudinal sample (1978–1982 and 1990–1998). The discrepancy in response rates is primarily due to the reproduction questionnaire being targeted just to women whereas the general health questionnaire was targeted to the full twin sample.
Registry Linkage Opportunities
Norway has a comparative advantage in registry-based research. The first national patient registry in the world, for leprosy, was established in Bergen in 1856 (Irgens, 2002). Since the 19th century, in particular with the creation of The Central Bureau of Statistics (now Statistics Norway) in 1876, Norway has built an extensive registry infrastructure covering health and general population data. A large effort is underway, funded by the Norwegian government, to modernize all major health registries, and improve quality, scope, and access (www.nhrp.no). Because Norway is a welfare state, with free healthcare and education (primary school through university) and mandatory national insurance systems for unemployment, pension, sickness, and other benefits, these registries cover the whole population, so selection and attrition bias are virtually eliminated. Through the national identity number, which is assigned at birth or registration with the National Population Registry, linkage between various registries and the NTR can be conducted. The data available on each individual are potentially very extensive and detailed, both cross-sectionally and longitudinally. Table 2 provides an overview of some of the principal health and population registries available for linkage to the NTR (Dahl et al., 2009; Statistics Norway, 2011).
*These are large and complex registries; quality and completeness may vary depending on the variables, population group, and years of interest.
Many more disease-specific health registries also exist, and several others are being created; for example, the Registry for Cardiovascular Diseases, which will be a national health registry. The registry creation and infrastructure are parts of a national strategy for improving healthcare, medical research, and disease surveillance. If linkages are not specifically mentioned in the consent, the Regional Committees for Medical and Health Research Ethics decide on a study-by-study basis whether the twins have to be individually notified or provide consent about the linkage study. However, through the NTR web pages and newsletters all twins are informed about current research, as required by law.
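Linkage through the national identity number is, in essence, a keyed join restricted to twins who are eligible for linkage. The following is a minimal sketch using only the Python standard library. The record layout, the field names (`ni`, `zygosity`, `diagnosis_year`), and the placeholder identifiers are all hypothetical and do not represent the format of any actual registry.

```python
def link_by_ni(ntr_records, registry_records, eligible_ni):
    """Join NTR records to registry records on the NI number.

    Only twins whose NI number is in `eligible_ni` are linked; a twin
    with several registry entries yields one linked row per entry.
    """
    registry_index = {}
    for record in registry_records:
        registry_index.setdefault(record["ni"], []).append(record)

    linked = []
    for twin in ntr_records:
        if twin["ni"] not in eligible_ni:
            continue  # no consent (and not deceased): must not be linked
        for record in registry_index.get(twin["ni"], []):
            extra = {k: v for k, v in record.items() if k != "ni"}
            linked.append({**twin, **extra})
    return linked


if __name__ == "__main__":
    # Placeholder identifiers, not real NI numbers.
    ntr = [
        {"ni": "NI-A", "pair_id": 1, "zygosity": "MZ"},
        {"ni": "NI-B", "pair_id": 1, "zygosity": "MZ"},
        {"ni": "NI-C", "pair_id": 2, "zygosity": "DZ"},
    ]
    registry = [
        {"ni": "NI-A", "diagnosis_year": 1999},
        {"ni": "NI-C", "diagnosis_year": 2004},
    ]
    for row in link_by_ni(ntr, registry, eligible_ni={"NI-A", "NI-B"}):
        print(row)
```

Building the index once keeps the join linear in the number of records; in practice the linkage itself is carried out under the approvals described above rather than by the researcher directly.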
Access to NTR
Access to data from the NTR is requested through an application (www.fhi.no/english) and granted by a steering committee that reviews applications according to a set of NIPH guidelines. These guidelines are to ensure that projects have a sound scientific basis, fall within the scope of the NTR mandate, have the requisite permits and meet legal obligations pertaining to the registry and project host country. Within the European Economic Area (EEA), this is regulated through the EU Data Protection Directive 95/46/EC. For countries outside the EEA, compliance with 95/46/EC is required. The NTR also has responsibility to evaluate ethical aspects of all projects (even though they have been approved by the applicant's ethical review board) to ensure that they follow the proper consent procedures. Access to biological samples is more restricted, as they are a limited resource, and results from analyses must be returned to the NTR for general use. Access fees are charged to cover administrative and data management costs incurred by the project.
NTR Research Projects
Table 3 provides an overview of the main ongoing NTR-based projects. As seen from this table, new data are continually added to the registry from new projects collecting study-specific data (data obtained by linkage cannot be transferred to the NTR). During the project period, exclusive rights to the data are given to the research project, and after the project ends the data are released to the NTR and made available for research use through application.
Biological Samples
Biological samples have been collected in several studies, primarily from twins in Panel III, and are, therefore, limited to cohorts 1967–1979. DNA was collected by mail-out buccal smear kits sent to all questionnaire participants. Four buccal samples were collected from each twin using small brushes (cytobrush, cell collector) with soft bristles used for swabbing the inside of the cheek. The twins were instructed to collect cheek swab samples at least 1 hour after their last meal and with at least 8 hours between the first two and last two swabs. Altogether, 4,800 individual samples were collected. In addition, blood samples were collected and DNA and plasma were extracted from 1,850 of these twins, who also participated in the later Genetics and Personality Study. These samples are stored at NIPH biobank at −20°C (plasma −80°C). Biological samples collected by NTR cover only a fraction of twins in the registry, but other national sources of biological samples are or will become available. Millions of blood samples, genetic samples, and other biological materials from the Norwegian populace have been collected over many years. Norway's many biobanks are now to be reorganized into a single national research infrastructure. One aim is to develop databases and improve information and management systems that make it easier to link the large amounts of existing health data in Norwegian registries to the biobanks’ biological materials. This endeavor, called Biobank Norway, is a national hub aiming to harmonize with Europe's large-scale biobank infrastructure, the Biobanking and Biomolecular Resources Research Infrastructure (BBMRI). The NTR will be integrated into Biobank Norway.
Future Plans
Of primary importance is recruitment of new twins from birth cohorts not already included in the NTR, starting from 1980 and onwards. This will encompass all twins aged 18 or older at the time of recruitment, a target population of approximately 10,000 new twin pairs. This is vital, not only from the perspective of statistical power, but perhaps even more so because of the linkage opportunities offered. As mentioned above, Norway has one of the most comprehensive population-based registry systems in the world; many of these registries have been established in the last decades, and older registries are becoming increasingly informative as detailed information is collected. Thus, recruiting new twins will provide important new linkage opportunities, especially as older cohorts experience natural attrition.
Acknowledgments
Development and use of the NTR has been supported from research funds granted through the European Union's Seventh Framework Programme (FP7/2007-2013), ENGAGE Consortium, grant agreement HEALTH-F4-2007-201413; BioSHaRE-EU, grant agreement HEALTH-F4-2010-261433, the Ellison Foundation, USA, and Norwegian Research Council funds through Biobank Norway (NFR 197443/F50) and grants Axis I and Axis II Psychiatric Disorders in Norwegian Twins: A Follow-Up Study (NFR 196148) and Consequences of Common Mental Disorders and Personality Disorders (NFR 193615).