Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes coronavirus disease (COVID-19), remains a global threat despite massive diagnostic testing, isolation, therapies, and vaccines. Emerging SARS-CoV-2 variants of concern (VOCs) continue to challenge these measures with increased transmissibility or virulence, escape of host antibody neutralization, and decreased efficacy of detection, therapeutics, and vaccination [Reference Aleem, Akbar Samad and Slenker1]. Viral genomic sequencing can improve the understanding of VOCs while providing insight into the dynamics of all circulating SARS-CoV-2 lineages.
Of the over 650 million global cases of COVID-19 reported by the end of December 2022, 36 million cases were reported in Brazil, with deaths approaching 700,000. Almost 180,000 of these cases were reported in Ribeirão Preto, located in São Paulo state [2]. Ribeirão Preto is home to a prominent national medical centre, making it a unique setting for the study of COVID-19 pandemic dynamics with multiple ongoing studies [Reference Bonifácio, Csizmar, Barbosa-Júnior, Pereira, Koenigkam-Santos, Wada, Gaspar, Carvalho, Bollela, Santana, Souza and Bellissimo-Rodrigues3]. This ongoing study offers insight into the role of different SARS-CoV-2 lineages in the epidemiology of multiple waves of the COVID-19 pandemic in Ribeirão Preto, with comparison within Brazil and globally.
All patients cared for at the Hospital das Clínicas, Faculdade da Medicina de Ribeirão Preto, Universidade de São Paulo (HCFMRP-USP), were eligible to be included in the Uncovering new CoronaVirus Encoded Ramifications in Brazil (UnCoVER-Brazil) study. Nasopharyngeal and/or oropharyngeal samples were collected for over 6,000 patients in outpatient and inpatient settings at HCFMRP-USP between 4 April 2020 and 31 January 2022. The detection of SARS-CoV-2 was performed by multiplex real-time reverse transcription PCR (RT-PCR) targeting N, E, and RDRP genes (Gene Finder™ COVID-19 Plus Real Amp Kit; OSANG Healthcare). Subjects were included in the UnCoVER study with a cycle threshold for SARS-CoV-2 less than or equal to 30. From 2,637 viral samples meeting this criterion, we generated complete SARS-CoV-2 genomes from 2,395 distinct infections of 2,379 patients. Multiple distinct infections from a single individual were required to have a minimum sample collection interval of 60 days. Sequencing was performed at the Laboratory of Translational Oncology at the Hemocentro of Ribeirão Preto, with the Illumina COVIDSeq Test (RUO Version) kit and the included ARTIC Network nCoV-2019 Amplicon Panel (V3). Most samples were sequenced on HiSeq 4000 (Illumina), with some sequenced on MiSeq (Illumina). FASTQ generation was completed in BaseSpace (Illumina) using the FASTQ Generation app (1.0.0). DRAGEN COVID Lineage (version 3.5.10) was used for alignment to SARS-CoV-2 reference genome NC_045512 and for Pango lineage classification (Pangolin software 4.1.2, Pangolin data 1.14). Only genomes that passed Pangolin quality control and had at least 90% non-ambiguous content were included in analysis [4]. All included SARS-CoV-2 genomes are available on GISAID under the identifier EPI_SET_230119wh (doi: 10.55876/gis8.230119wh). Sequencing results were merged with sociodemographic and clinical data in RStudio (2022.07.01), which was also used for analysis and figure generation. REDCap (7.6.3) was used for data storage.
Sociodemographic information and COVID-19 case details were extracted from the HCFMRP-USP COVID-19 surveillance reporting system, with some entries supplemented with information from electronic medical records. Of the 2,395 distinct SARS-CoV-2 infections captured by sequencing, the median age of subjects was 41.0 (IQR 31–56) years, and 1,469 (61.3%) subjects were female. A total of 2,015 cases were symptomatic (84.1%, missing data 15.2%). Less than a third of patients were hospitalized (666 patients, 27.8%), and 176 patients died (7.3%, missing data 17.5%).
We identified four World Health Organization (WHO) VOCs (Gamma, Alpha, Delta, Omicron) and one WHO variant of interest (VOI) (Zeta), four of which (Zeta, Gamma, Delta, Omicron) caused distinct waves in this cohort. Within these waves, we identified 47 distinct Pango lineages, including multiple Brazilian lineages (Figure 1, Table 1).
Throughout 2020, the SARS-CoV-2 B lineage and descendants made up nearly all cases sequenced, with Zeta rising at the end of 2020. Gamma spiked early in 2021 and, within 2 months, dominated most sequenced genomes. Gamma continued as the dominant variant until Delta emerged in the last third of 2021. Delta was completely replaced by the Omicron spike in early 2022. Each new dominant variant group quickly and nearly completely replaced the dominant group preceding it (Figure 1). This pattern, which continued despite vaccination increasing in Brazil starting in 2021, is consistent with the hypothesis that herd immunity is SARS-CoV-2 variant-specific and does not broadly cover COVID-19 [Reference Barber7].
We focused our analysis on comparative outcomes between variant groups. Relative to the overall median age of 41.0 years and the largely similar variant group median ages (Table 1), a decrease to 35.0 and 33.0 was observed for N lineages and the non-VOC P lineage, respectively. However, interpretation is limited by the low number of cases sequenced in these groups. Hospitalization was similarly consistent among variant groups, with an overall average rate of 27.8% and range from 20% to 45% for all groups except Omicron. Less than 10% of individuals with Omicron were hospitalized (Table 1); however, we only captured patients early in its wave, and increased vaccination is a likely confounder for decreased severity.
The Pango lineage composition of variant groups offers additional insight into transmission dynamics unique to HCFMRP-USP. Combined, B.1.1.28 and B.1.1.33 made up 80% of the sequences observed in the B & B descendant group that dominated throughout 2020. Of global sequences reported to date, over 75% of all B.1.1.28 lineages and 85% of all B.1.1.33 were reported in Brazil [Reference O’Toole8]. Both descended from European lineage B.1.1 and were considered primary drivers of the first wave of the pandemic in Brazil. VOI Zeta (P.2) and VOC Gamma (P.1) descended from B.1.1.28, while N.9 – which was present at the same time at HCFMRP-USP but dominated by Zeta – descended from B.1.1.33 [Reference Resende9]. More than half of global lineages attributed to Gamma were observed in Brazil [Reference O’Toole8]. The long dominance of Gamma at HCFMRP-USP is clear in Figure 1, with over 80% observed being P.1 (Table 1).
All Delta lineages observed in this study were AY lineages, and no B.1.617.2 – the lineage first attributed to the global surge of Delta – was observed. Over 50% of Delta lineages were AY.99.2, a Brazilian lineage with over 95% of all global sequences reported in Brazil [Reference O’Toole8]. AY.99.2 descended from AY.99, which in turn descended from B.1.617.2. In Brazil, Delta was responsible for a more modest surge of cases than was observed in other countries in the second half of 2021. A recent study from Minas Gerais suggested that despite Delta’s increased transmissibility compared to Gamma – which dominated infections in Brazil when Delta arrived – the over 80% vaccination rate in Brazil at the time may have protected against a surge [Reference Fonseca10]. Unfortunately, no conclusions can be drawn from this study about case surges, as sequencing did not have any fixed correlation with COVID-19 case count at HCFMRP-USP. Although our data suggest that herd immunity is variant-specific, we speculate that Gamma – which was responsible for a larger outbreak in Brazil than in many countries – may have produced an immune response that was more cross-reactive with Delta than other variants.
In this dispatch, we describe the preliminary insights gained from the large COVID-19 study based at HCFMRP-USP, one of Brazil’s largest public academic hospitals located in Ribeirão Preto, São Paulo state. Here we observed trends in VOC prevalence consistent with those observed in general in Brazil. We also identified the contribution of key Brazilian lineages to pandemic waves. Our results are consistent with the hypothesis that herd immunity to SARS-CoV-2 tends to be variant-specific.
Data availability statement
Data that support the findings of this study are available upon a reasonable request addressed to the corresponding author.
Acknowledgements
We thank all individuals who participated in this study by donating specimen. We also thank all members of the UnCoVER-Brazil project. We acknowledge the Virology Laboratory at HCFMRP-USP for completing SARS-CoV-2 RT-PCR testing; Vivian Neves Dias Arantes for selecting and preparing samples for SARS-CoV-2 genomic sequencing; and the sequencing team in the Laboratory of Translational Oncology at the Hemocentro of Ribeirão Preto for completing sequencing. We also acknowledge Núcleo de Vigilância Epidemiológica Hospitalar (NVEH) at HCFMRP-USP for clinical data collection.
Author contribution
Conceptualization: F.B., B.A.L.F., J.M.S., F.B.J., L.M.C., F.S.D.C.; Data curation: F.B., S.D.S., F.B.C., G.S.C., F.B.J., L.M.C., L.M.T.B., M.R.C., S.B., A.Y.Y., F.S.D.C.; Funding acquisition: F.B., L.M.C., R.d.T.C.; Investigation: F.B., S.D.S., F.B.C., G.S.C., L.M.T.B., F.S.D.C.; Methodology: F.B., B.A.L.F., F.B.C., G.S.C., J.M.S., F.B.J., L.M.C., L.M.T.B., M.R.C., S.B., A.Y.Y., F.S.D.C.; Project administration: F.B., B.A.L.F., F.B.J., L.M.C., M.R.C., F.S.D.C.; Supervision: F.B., B.A.L.F., L.M.C., R.d.T.C.; Validation: F.B., F.B.J., M.R.C.; Visualization: F.B.; Writing – review & editing: F.B.; Formal analysis: S.D.S., F.B.J., F.S.D.C.; Writing – original draft: S.D.S., F.S.D.C.; Resources: M.R.C.
Financial support
This work was funded by the Department of Social Medicine at HCFMRP-USP, the Hemocentro of Ribeirão Preto, and the Provost of Graduate Studies of the University of São Paulo. We acknowledge financial support for S.D.S. by the Fulbright US Student Program sponsored by the US Department of State and the Fulbright Commission in Brazil. The content of this publication is solely the responsibility of the authors and does not necessarily represent the official views of those funding the work.
Competing interest
The authors declare none.
Ethical standard
This work was approved by the Research Ethics Committee at HCFMRP-USP and complies with the Brazilian General Data Protection Law.