INTRODUCTION
Approximately 10–15% of human cancers are caused by infection with high-risk human papillomaviruses (HPVs), Epstein–Barr virus, hepatitis B virus, human T-lymphotropic virus-I, hepatitis C virus, Kaposi's sarcoma herpesvirus, and Merkel cell polyomavirus [Reference Moore and Chang1]. HPV is a double-stranded DNA virus belonging to the papillomavirus family that is capable of infecting the basal cells of the stratified epithelium in humans [Reference Bian2]. HPVs are highly prevalent worldwide, and high-risk HPV infections cause various cancers, including vaginal, vulvar, penile, oropharyngeal, anal, and cervical [Reference Dillner3–Reference Saraiya6]. Cervical cancer, which is mainly categorized into squamous cell carcinoma and adenocarcinoma, occurs when abnormal cells of the cervix grow out of control, and is a leading cause of death in women with 0·5 million new cases and 275 000 deaths worldwide each year [7]. Significantly, almost all cases of cervical cancer are caused by long-lasting infections with certain types of human HPV [Reference Steenbergen8].
The genome of HPVs consists of a circular DNA molecule of ~8000 base pairs subdivided into the early (E) region, upstream regulatory region (URR), and late (L) region [Reference Johansson and Schwartz9, Reference Abbate, Voitenleitner and Botchan10]. The E region encodes at least eight early viral proteins: E1, responsible for viral DNA replication; E2, responsible for activating the replication of viral DNA in synergy with E1 and transcriptional repression of E6 and E7 in the nucleus, and responsible for induction of apoptosis and genomic instability in the cytoplasm; E4, responsible for virion maturation; E5, responsible for stimulation of cell proliferation; E6, responsible for oncogenic protein promoting degradation of p53 by the proteasome; E7, responsible for oncogenic protein promoting degradation of retinoblastoma susceptibility protein; and E3 and E8, of no known functions [Reference Johansson and Schwartz9, Reference Abbate, Voitenleitner and Botchan10]. The L region encodes the major protein capsid protein L1 and the minor protein L2 to form the viral particle [Reference Johansson and Schwartz9, Reference Abbate, Voitenleitner and Botchan10]. To date, over 170 HPV genotypes have been identified [Reference Ghittoni11], with types 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 68, 73, and 82 defined as high-risk, carcinogenic, sexually transmitted HPVs that may lead to the development of cervical cancer [Reference Muñoz12]. The diagnosis of HPV infection is based primarily on detection of viral DNA [Reference Muñoz12, Reference Szarewski13]. Genotyping of HPV is mainly based on the unique sequence in the L1 region of the HPV genome [Reference Eide and Debaque14]. HPV genotypes are not only disease-specific but also strain-specific within a certain disease or cancer [Reference Lacey, Lowndes and Shah15–Reference Muñoz18]. HPV6, 11, 42, and 44 infections are correlated with anogenital warts [Reference Lacey, Lowndes and Shah15]. HPV16 infection causes oropharyngeal cancer [Reference Elrefaey16]. HPV16 and 18 are most prevalent in head and neck squamous cell carcinomas [Reference Kreimer17]. HPV16, 18, 26, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, 68, 73, and 82 infections cause genital cancers [Reference Muñoz12, Reference Muñoz18].
As HPVs are highly prevalent worldwide and genotypes differ among regions, molecular epidemiological survey of HPVs could provide background data regarding characteristic HPV genotype distribution and composition in a certain area and therefore support local vaccination strategies. For this purpose, the distribution characteristics of high-risk HPV genotypes were studied in women living in Shanghai.
METHODS
Study population
The Shanghai 7th Hospital is a general hospital located in northeastern Shanghai. Outpatients visiting our Department of Gynecology between January 2014 and May 2015 for cervical screening were included in this study. All participants were screened by clinical gynecological examination, cytology examination, and colposcopy examination. Age, clinical data, and HPV testing results were recorded. Inclusion criteria were a sexual history, not currently pregnant, no history of total uterus or cervix resection, and agreement to undergo HPV testing. Patients that had visited a doctor 2 months previously and received cervical physical therapy and hormone treatment were excluded from the study. A total of 10 501 subjects (age: median 48; quartile range 39–55 years) were included. For Pap smears, cells were collected at the outer opening of the cervix at the transformation zone and then examined under a microscope to detect any abnormalities in the exfoliated cervical cells. Pap smears were interpreted by experienced pathologists in our hospital. Cytological diagnoses were evaluated according to the Bethesda System criteria [Reference Barcelos19]. At the same time, the cell samples were analysed to detect HPV DNA.
Ethical standards
This study was conducted in accordance with the World Medical Association Declaration of Helsinki. The Review Board of the Ethics Committee of Medical Research at Shanghai Seventh People's Hospital (Shanghai, China) approved the study protocols. Written informed consent was obtained from all patients according to the guidelines of the Chinese National Ethics Regulation Committee. All patients were informed of their rights to withdraw consent personally or via relatives, caregivers, or guardians.
The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.
HPV genotyping
HPV genotyping was performed using a commercial detection kit purchased from Hybribio (Chaozhou, China). The kit could detect and distinguish 15 high-risk HPV strains (HPV16, 18, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, 68) and six low-risk HPV strains (HPV6, 11, 42, 43, 44, CP8304) (81). The kit is based on polymerase chain reaction followed by HPV DNA microarray analyses. All detection procedures were guided by the protocol provided by manufacturers. Detected HPV strains were defined by genotype not by sub-genotype or quasi-species [20].
Statistical analysis
Statistical analysis was performed using SPSS v. 13.0 (SPSS Inc., USA). Continuous variables are presented as means ± standard deviation (s.d.) and categorical data are presented as numbers (percentages). Differences between groups were examined using the t test, one-way ANOVA, c 2 test, or Fisher's exact probability test according to the characteristics of data distribution. The significance level α was set at 0·05.
RESULTS
Overall prevalence of HPV
Data from a total of 10 501 outpatients were collected; 8430 (80·3%) were negative for HPV. The overall prevalence of HPV was 19·7% (2071 patients), 14·5% (1525 patients) of whom were infected with high-risk HPVs, 2·8% (289 patients) were infected with low-risk HPVs, and 2·4% (257 patients) were co-infected with both (Fig. 1 a). The overall HPV prevalence rates were calculated for the following age groups: ⩽24 years, 20·6%; 25–34 years, 14·9%; 35–44 years, 15·9%; 45–55 years, 17·4%; and >55 years, 21·2% (Fig. 1 b). The prevalence ratio of HPV infection decreased from 20·6% in subjects aged <24 years to the lowest ratio of 14·9% in those aged 25–34 years, and then increased to 21·2% in subjects aged >55 years (Fig. 1 b).
Prevalence characteristic of high-risk HPVs
A total of 1782 subjects were infected with high-risk HPVs. Of these, 1346 (75·5%) subjects were infected with a single high-risk HPV strain, 322 (18·1%) subjects were infected with two high-risk strains, and 114 (6·4%) subjects were infected with ⩾3 high-risk strains (Fig. 2 a). The average ages of subjects infected with one, two, and ⩾3 HPV strains were 39·5 ± 12·4, 40·2 ± 13·6, and 36·6 ± 15·8 years, respectively (Fig. 2 b). The average age of subjects infected with ⩾3 HPV strains was significantly younger than that of the other two groups (Fig. 2 b).
Distribution characteristics of high-risk HPV genotypes
As shown in Table 1, HPV genotypes 16, 18, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, and 68 were detected in 17·7%, 6·3%, 7·5%, 10·4%, 2·2%, 9·3%, 2·1%, 9·3%, 23·0%, 10·8%, 3·1%, 16·9%, 2·7%, 4·9%, and 7·0% of the 1782 high-risk HPV-infected subjects, respectively. The top three most dominant HPV genotypes prevalent in our population were HPV52, HPV16, and HPV58.
HPV, Human papillomavirus.
Percentages for individuals co-infected by ⩾2 high-risk HPV strains were calculated respectively.
Characteristics of HPV co-infection
As described above, multiple HPV infection was common in this population. To determine the co-infection status, the co-infection rates of the three most dominant high-risk HPVs were calculated. As shown in Table 2, only 60·0% of cases positive for HPV genotype 52 were mono-infections, and the rates of HPV52 co-infection with HPV16, 18, 31, 33, 35, 39, 45, 51, 53, 56, 58, 59, 66, and 68 were 5·4%, 4·9%, 4·4%, 4·1%, 0·5%, 6·8%, 1·0%, 4·4%, 6·6%, 2·7%, 8·3%, 1·7%, 2·7%, and 4·4%, respectively. Of HPV16 cases, only 67·7% were mono-infections, and the rates of HPV16 co-infection with HPV18, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, and 68 were 3·8%, 4·7%, 3·8%, 0%, 4·1%, 0·6%, 5·1%, 7·0%, 3·2%, 1·6%, 7·0%, 2·2%, 2·8%, and 1·6%, respectively (Table 2). Only 53·6% of cases positive for HPV58 were mono-infections, and the rates of HPV58 co-infected with HPV16, 18, 31, 33, 35, 39, 45, 51, 52, 53, 56, 59, 66, and 68 were 7·0%, 3·6%, 4·6%, 12·9%, 1·6%, 6·3%, 1·3%, 4·0%, 11·3%, 4·6%, 2·3%, 2·6%, 4·0%, and 2·0%, respectively (Table 2). The greatest number of high-risk HPV strains detected in a single subject was seven; a 55-year-old woman was simultaneously co-infected with HPV31, 35, 52, 53, 39, 66, and 68. HPV58/HPV33 and HPV58/HPV52 showed significantly higher co-infection rates, suggesting that HPV58 is the most frequent co-infection high-risk HPV in this population (Table 2). HPV58/HPV68 and HPV16/HPV68 showed significantly lower co-infection rates, suggesting that HPV68 may be the rarest co-infection strain in our population (Table 2).
HPV, Human papillomavirus.
Differences in co-infection were examined by c 2 test or Fisher's exact probability test according to the characteristics of the data distribution.
High-risk HPV genotype distribution differs regionally
To determine the differences in HPV genotype distributions, we reviewed recent representative reports from the National Center for Biotechnology Information. In high-risk HPV-infected subjects, the three most dominant high-risk HPV strains (HPV16, 52, 58) were the same in six studies performed in Chinese populations, although the ratios varied between studies, while HPV53 was among the three most dominant strains instead of HPV52 in one study performed in Harbin (Table 3). However, the order from high to low varied widely. In the present study, the order was HPV52, HPV16, and HPV58 in order of decreasing prevalence (Table 3). Furthermore, other genotype distributions also differed regionally in China (Table 3). Clifford et al. [Reference Clifford26] reported a distribution tendency worldwide, with HPV16, 31, and 51 representing the three most dominant HPV genotypes (Table 3). Of these, only HPV16 is among the three most dominant strains in China. In the general population included in our study, the prevalence rates of HPV52, 16, and 58 were 3·9%, 3·0%, and 2·9%, respectively (Table 3). Data from India, Northern Ireland and the USA varied widely, except for HPV16 which remained a dominant strain worldwide [Reference Bhatla27–Reference de Sanjosé30]. The genotype distribution differed regionally worldwide.
HPV, Human papillomavirus.
Data are presented as percentage (%). The three most dominant genotypes are shown in bold.
DISCUSSION
Cervical cancer is estimated to affect about half a million women each year, of whom 80% live in developing countries [Reference Chew31]. Virtually all cases of cervical cancer are caused by HPV infection [Reference Moore and Chang1–Reference Dillner3]. Well-organized programmes of regular gynaecological screening and treatment of precancerous lesions have been very effective for preventing squamous cervical cancer but have had less impact on adenocarcinoma [Reference Chew31]. Fortunately, HPV vaccines were licensed for use in several countries in 2006, and adenocarcinoma is now expected to be controlled by HPV vaccines [Reference Stokley32]. However, currently available HPV vaccines are not region-specific. That is, current vaccination strategies do not adequately take into consideration the distribution and prevalence characteristics of HPV genotypes. Both the quadrivalent and bivalent vaccines protect against HPV types 16 and 18, which cause 70% of cervical cancers; the quadrivalent vaccine also protects against HPV types 6 and 11, which cause 90% of genital wart cases [33]. Although it is difficult to map the HPV genotype distribution and prevalence, large amounts of data indicate that HPV genotype distribution varies largely across regions [Reference Sun21–Reference de Sanjosé30]. Thus, regional HPV surveillance is essential for effective vaccination and HPV-related cancer control and prevention. In this study, we found that 17·3% of outpatients undergoing cervical screening were infected with high-risk HPVs. Of these subjects, 24·5% were infected with ⩾2 HPV strains, and the average age of subjects infected with ⩾3 HPV strains was significantly younger than those of other groups. The three most dominant HPV genotypes prevalent in our population were HPV52, HPV16, and HPV58. Of the three most dominant HPV strains, only 60·0% of subjects infected with HPV52, 67·7% of subjects infected with HPV16, and 53·6% of subjects infected with HPV58 were mono-infections, with the remainder being co-infected with other high-risk HPVs. HPV58/HPV33 (12·9%) and HPV58/HPV52 (11·3%) showed significantly higher co-infection rates than other combinations. HPV genotype distribution varied markedly between domestic and international regions, while HPV16 remained among the three most dominant strains worldwide. Our data emphasize the complexity of HPV genotype distribution and the importance of local HPV surveys for vaccination strategies.
The prevalence rates of high-risk HPVs according to age are somewhat confusing, with 20·6%, 14·9%, 15·9%, 17·4%, and 21·2% of subjects in the ⩽24, 25–34, 35–44, 45–55, and >55 years age groups, respectively. The majority of women who attended for cervical screening were aged 25–34 years. In contrast, women with cervical diseases who visited our department spontaneously, belonged primarily to the other age groups. This might explain why the lowest HPV positivity rate was seen in the population aged 25–34 years. Although we could not give a scientific explanation for the age-specific prevalence rates, our data are similar to the distribution tendencies in central and South America and Europe reported in a meta-analysis [Reference de Sanjosé30]. Thus, our data may represent the real age-specific distribution characteristics that may be determined by bearing, female hormone dynamics, sex, and age of the host. On the other hand, our data also indicate that the average ages of subjects infected by one, two, and ⩾3 HPV strains were 39·5 ± 12·4, 40·2 ± 13·6, and 36·6 ± 15·8 years, respectively. The average age of subjects infected by ⩾3 HPV strains was significantly lower than that of the other two groups. As HPV is a sexually transmitted disease, the relatively high frequency of sexual intercourse and greater number of sexual partners might explain why multiple infections are more frequent in younger women. With regard to multiple infections, few studies have investigated the natural history of multiple HPV infection, how and when these HPV strains infect the same host, the impact of multiple infections on cervical cancer, and any tendency for an affinity between HPVs in co-infecting the same host. These questions are also important for HPV virological studies. However, these issues are outside the scope of this epidemiological study.
Further comparative analysis showed that HPV16, 52, and 58 are the three most dominant high-risk HPV strains in five Chinese reports, although the order of prevalence rates from high to low varied widely and other genotype distributions also differed regionally in China [Reference Sun21–Reference Bi25]. Other data from India, Northern Ireland, the USA and integrated data varied except for HPV16, which remained a dominant strain worldwide [Reference Clifford26–Reference de Sanjosé30]. Although all data showed that HPV16 is a dominant strain worldwide, the highest percentage of HPV16 infection was only 26·2% in high-risk HPV-infected subjects, and therefore the genotype distribution differed regionally worldwide. Again, our report emphasizes the complexity of HPV genotype distribution and the importance of local HPV surveys.
In this study, we detected only 15 high-risk HPV strains (HPV16, 18, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, 68) and six low-risk HPV strains (HPV6, 11, 42, 43, 44, CP8304), and we could not determine the distribution characteristics of other HPVs in women living in Shanghai. HPV genotype distribution is known to differ with pathological grade. We are in the process of analysing these data regarding HPV infection of different pathological grades and will report the results in the near future.
ACKNOWLEDGEMENTS
This work was supported by a grant as a Leading Academic Discipline Project of the Shanghai 7th People's Hospital (xk2013-01).
DECLARATION OF INTEREST
None.