Hostname: page-component-745bb68f8f-b95js Total loading time: 0 Render date: 2025-01-28T05:15:19.817Z Has data issue: false hasContentIssue false

Fundamentals of genomic epidemiology, lessons learned from the coronavirus disease 2019 (COVID-19) pandemic, and new directions

Published online by Cambridge University Press:  07 December 2021

Denis Jacob Machado*
Affiliation:
University of North Carolina at Charlotte, College of Computing and Informatics, Department of Bioinformatics and Genomics, Charlotte, North Carolina
Richard Allen White III
Affiliation:
University of North Carolina at Charlotte, College of Computing and Informatics, Department of Bioinformatics and Genomics, Charlotte, North Carolina University of North Carolina at Charlotte, North Carolina Research Campus (NCRC), Kannapolis, North Carolina
Janice Kofsky
Affiliation:
University of North Carolina at Charlotte, College of Computing and Informatics, Department of Bioinformatics and Genomics, Charlotte, North Carolina
Daniel A. Janies
Affiliation:
University of North Carolina at Charlotte, College of Computing and Informatics, Department of Bioinformatics and Genomics, Charlotte, North Carolina
*
Author for correspondence: Denis Jacob Machado, PhD, Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, 9331 Robert D. Snyder Rd, BINF 224, Charlotte, NC28223. E-mail: dmachado@uncc.edu

Abstract

The coronavirus disease 2019 (COVID-19) pandemic was one of the significant causes of death worldwide in 2020. The disease is caused by severe acute coronavirus syndrome (SARS) coronavirus 2 (SARS-CoV-2), an RNA virus of the subfamily Orthocoronavirinae related to 2 other clinically relevant coronaviruses, SARS-CoV and MERS-CoV. Like other coronaviruses and several other viruses, SARS-CoV-2 originated in bats. However, unlike other coronaviruses, SARS-CoV-2 resulted in a devastating pandemic. The SARS-CoV-2 pandemic rages on due to viral evolution that leads to more transmissible and immune evasive variants. Technology such as genomic sequencing has driven the shift from syndromic to molecular epidemiology and promises better understanding of variants. The COVID-19 pandemic has exposed critical impediments that must be addressed to develop the science of pandemics. Much of the progress is being applied in the developed world. However, barriers to the use of molecular epidemiology in low- and middle-income countries (LMICs) remain, including lack of logistics for equipment and reagents and lack of training in analysis. We review the molecular epidemiology literature to understand its origins from the SARS epidemic (2002–2003) through influenza events and the current COVID-19 pandemic. We advocate for improved genomic surveillance of SARS-CoV and understanding the pathogen diversity in potential zoonotic hosts. This work will require training in phylogenetic and high-performance computing to improve analyses of the origin and spread of pathogens. The overarching goals are to understand and abate zoonosis risk through interdisciplinary collaboration and lowering logistical barriers.

Type
Review
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2021. Published by Cambridge University Press on behalf of The Society for Healthcare Epidemiology of America

How did genomic epidemiology become what it is?

Genomic epidemiology stems from molecular epidemiology, which uses evidence ranging from gel electrophoresis to multilocus sequence typing to study the origins and spread of pathogenic microorganisms. Janies et al Reference Janies1 reviewed the history of molecular epidemiology and compared it with syndromic epidemiology. Here, we focus on recent advances toward genomic epidemiology (Fig. 1), which includes genomic sequencing combined with rapid data sharing as enabled by the Internet. In 2002–2003, the severe acute respiratory syndrome coronavirus (SARS-CoV) was the first infectious disease for which scientists shared software and pathogen genetic data over the Internet to rapidly respond to the disease. Thereafter, genomic epidemiology was solidified by responses to H5N1, H1N1-2009, and other strains of influenza such as H7N9Reference Janies, Pomeroy and Aaronson 2 and expanded to respond to foodborne and sexually transmitted diseases. Reference Hoffmann, Luo and Monday3Reference Allard, Strain and Melka5

Fig. 1. Timeline of major events in sequencing technology (green) and genomic epidemiology (purple) alongside the first recorded occurrence of SARS-CoV, H1N1-2009, MERS-CoV, and SARS-CoV-2 in humans. Associated references can be found in Supplementary Table 1.

The first SARS-CoV genome was shared after publication Reference Marra, Jones and Astell6,Reference Rota, Oberste and Monroe7 on National Center for Biotechnology Information’s (NCBI) GenBank website, which was customary. Meanwhile, dashboards, graphs, and maps emerged to track cases over time and space. Reference Boulos8 Janies et al Reference Janies, Habib, Alexandrov, Hill and Pol9,Reference Janies, Hill, Guralnick, Habib, Waltari and Wheeler10 combined genomic and geographic data for SARS-CoV and H5N1 influenza, respectively, being the first to project phylogenies onto a virtual globe. Janies et al Reference Janies, Treseder and Alexandrov11 used Keyhole Markup Language (KML) to develop Supramap, which facilitates geographic mapping of phylogenies. Supramap allowed hypothesis testing ranging from the host and geographic origins of pathogens Reference Studer and Janies12 to tracing mutations that conferred drug resistance or host switching. Reference Janies, Voronkin and Studer13,Reference Hill, Guralnick, Wilson, Habib and Janies14 Limitations of computing large data sets, coupled with a preference for sharing data after publication, resulted in a greater turnaround between data acquisition and results than occurs today. However, these conditions did not impede a hypothesis-driven field with value to decision makers, as demonstrated in a 2007 congressional hearing. 15

In the 2000s, some genomes were sequenced for respiratory pathogens such as H1N1-2009. However, even SARS-CoV genomes were not always sequenced completely, and sequences were released gradually. Reference Janies, Habib, Alexandrov, Hill and Pol9 This changed due to factors such as new DNA sequencing technologies.

How did advances in sequencing technology reshape genomic epidemiology?

Current genomic epidemiology of infectious diseases originated in response to the SARS-CoV epidemic. Reference Janies, Voronkin, Das, Hardman, Treseder and Studer16 Sequencing the SARS-CoV genome was instrumental in recognizing it as a novel coronavirus associated with HCoV-OC43 and HCoV-229E. Reference Marra, Jones and Astell6,Reference Rota, Oberste and Monroe7 Researchers combined genomic and epidemiological data to trace the genotypic variation of the viral transmission paths between 2002 and 2003. Reference Ruan, Wei and Ee17,Reference Zhao18 However, today’s genomic surveillance evolved with the advance of high-throughput sequencing (HTS) (Fig. 1).

Reuter et al Reference Reuter, Spacek and Snyder19 summarized HTS history until 2015 and Pérez-Losada Reference Pérez-Losada, Arenas and Galán20 reviewed recent HTS advances. We focus on the sequence cost variation per raw megabase between 2001 and 2020 21 (Fig. 2a) to illustrate the increasing feasibility of sequencing coronavirus genomes (Fig. 2b). Considering raw nucleotide sequencing cost, US$100 was not sufficient to sequence one coronavirus genome in 2020, but $100 it would cover >400,000 genomes in 2020.

Fig. 2. The increasing feasibility of sequencing complete coronavirus genomes. (a) Sequencing cost per raw megabase of DNA sequence from September 2001 until August 2020 (data source: genome.gov/sequencingcosts, access date: September 2021). (b) Number of complete coronavirus genomes that can be sequenced with USD 100, assuming a genome size of 32 Kbp. These cost estimates do not consider sampling, storage, consumables, equipment, and staff costs. These plots use a logarithmic scale.

What are coronaviruses?

Coronaviruses correspond to the four genera of the subfamily Orthocoronavirinae. Gammacoronavirus (GammaCoVs) and Deltacoronavirus (DeltaCoVs) mainly infect birds and rarely infect mammals. Reference Woo, Lau and Lam22,Reference Durães-Carvalho, Caserta and Barnabé23 Alphacoronavirus (AlphaCoVs) and Betacoronavirus (BetaCoVs) originated from Chiroptera (bats) and are often found in other mammals, including humans. Reference Woo, Lau and Lam24

The coronavirus virion encapsulates one of the longest RNA virus genomes (27–32 kb), Reference Woo, Huang, Lau and Yuen25 which has complex gene expression Reference Irigoyen, Firth, Jones, Chung, Siddell and Brierley26 and variable gene content among genera (Fig. 3a). 27

Fig. 3. Fundamental evolution of coronaviruses based on Machado et al.Reference Machado, Scott, Guirales and Janies 49 (a) Virion and genome structure. The genomic regions indicated in the figure do not represent all the genes in the coronavirus genome, but the genes that are shared among the different genera of Orthocoronavirinae and that were analyzed by Machado et al.Reference Machado, Scott, Guirales and Janies 49 Note. E, envelope small membrane protein; M, membrane protein; N, nucleoprotein; S, spike glycoprotein. (b) Summarized cladogram from Machado et al.Reference Machado, Scott, Guirales and Janies 49 The original cladogram contained 2,006 terminals corresponding to unique coronavirus genomes. Terminals indicating the eight species of human coronaviruses (HCoVs) are in bold. (c) Hosts involved in the emergence of all human coronaviruses, including SARS-CoV-2. The HCoVs of special concern to human health (SARS-CoV, MERS-CoV, and SARS-CoV-2) are shown in red. The flow chart indicates that HCoV-NL63, SARS-CoV, and SARS-CoV-2 originated from bat-hosted coronaviruses. Bats were also key to the emergence of MERS-CoV in camels and humans. HCoV-229E, HCoV-HKU1, and HCoV-OC43 originated from viruses hosted in artiodactyls, rodents, and bovids, respectively. All silhouettes were downloaded from PhyloPic (http://phylopic.org). The coronavirus vision structure was modified from https://commons.wikimedia.org/wiki/File:Coronavirus_virion_structure.svg. See Supplementary File 1 for detailed copyright and license information.

Coronavirus infections in domestic animals are economically significant. Reference Li, Ge and Li28Reference Mandelik, Sarvas, Jackova, Salamunova, Novotny and Vilcek30 However, the episodic emergence of human coronaviruses (HCoVs) is a pressing concern because they cause infections in all age groups, often leading to respiratory or enteric diseases. Reference Su, Wong, Shi, Liu, Lai and Zhou31 Neurological illness or hepatitis is less frequent. Reference Lai and Cavanagh32 The US Centers for Disease Control (CDC) website 33 lists 7 HCoVs: 2 AlphaCoVs (HCoV-229E and HCoV-NL63) and 5 BetaCoVs (HCoV-OC43, HCoV-HKU1, SARS-CoV, MERS-CoV, and SARS-CoV-2). We added the human enteric coronavirus 4408 (HECV-4408) to the list because it was isolated from a child with acute gastroenteritis. Reference Zhang, Herbst, Kousoulas and Storz34

How did SARS-CoV-2 accelerate the growth of genomic epidemiology?

Coronaviruses were not deemed highly pathogenic to humans until the 2002 SARS-CoV outbreak. Reference Zhong, Zheng and Li35,Reference Ksiazek, Erdman and Goldsmith36 The dangers of HCoVs were made more evident by the 2012 outbreak of Middle East respiratory syndrome (MERS) coronavirus (MERS-CoV). Reference Zumla, Hui and Perlman37 Nevertheless, coronaviruses did not receive the current level of attention until the pandemic coronavirus disease 2019 (COVID-19), caused by SARS-CoV-2, was first reported in humans in Wuhan, China, in December 2019. 38 However, Pekar et al Reference Pekar, Worobey, Moshiri, Scheffler and Wertheim39 inferred that the virus was present in Hubei approximately a month before. On March 11, 2020, the World Health Organization (WHO) declared a pandemic due to the spread of SARS-CoV-2. 38 By October 14, 2021, COVID-19 had caused 4,863,818 deaths worldwide. 40

Understanding the emergence and evolution of SARS-CoV-2 is vital to preventing future pandemics. Reference Yuen, Ye, Fung, Chan and Jin41 The question can be divided into 3 components. First, was the virus purposefully manipulated? Several peer-reviewed publications have concluded that SARS-CoV-2 emerged naturally via zoonosis (see eg, Anderson et al, Reference Andersen, Rambaut, Lipkin, Holmes and Garry42 Liu et al Reference Liu, Saif, Weiss and Su43 , and Holmes et al Reference Holmes, Goldstein and Rasmussen44 ). Moreover, previous serology data indicate natural human infections by bat-hosted, SARS-like viruses. Reference Wang, Li and Yang45

Second, was SARS-CoV-2 an accidental release? If a naturally occurring virus was transported to a laboratory and humans were infected shortly thereafter, the virus may not have accumulated sufficient mutations to record its passage through controlled environments. Reference Zhang, Hasoksuz and Spiro46 However, no evidence indicates that SARS-CoV-2 was known to scientists before December 2019. Reference Rasmussen47,Reference Shi48

Third, what is the natural source of SARS-CoV-2? The most comprehensive phylogenomic analysis of coronavirus Reference Machado, Scott, Guirales and Janies49 (Fig. 3b) addressed the fundamental evolution of HCoVs (Fig. 3c) and showed that SARS-CoV-2 results from bat-hosted viruses infecting humans. Reference Zhao, Zhuang and Cao50 SARS-CoV-2 finds its closest related bat-hosted coronaviruses in the subgenus Sabercovirus, a subgroup of SARS-related coronaviruses (SARSr-CoV) first identified in horseshoe bats (Rhinophulus spp). Reference Li, Shi and Yu51 Bat-hosted viruses similar to SARS-CoV-2 were collected in the Yunnan province, >1,500 km away from Wuhan, but the hosts have a wide geographic range. Reference Wang, Li and Yang45,Reference Lytras, Hughes and Martin52,Reference Lytras, Xia, Hughes, Jiang and Robertson53

Despite a confusing array of reports confirming Reference Lam, Jia and Zhang54Reference Zhang, Wu and Zhang56 and denying Reference Liu, Jiang and Wan57 the origin of SARS-CoV-2 from pangolin (Manis javanica) hosts, pangolins are not involved in the lineage of SARS-CoV-2 that infected humans. Reference Machado, Scott, Guirales and Janies49 This finding is similar to the emergence of SARS-CoV, Reference Janies, Habib, Alexandrov, Hill and Pol9 which also infected humans from bat-hosted viruses without any need for intermediate hosts, including Himalayan palm civets (Parguma larvata) and raccoon dogs (Nyctereutes procyonoides).

Are we sequencing SARS-CoV-2 genomes fast enough?

SARS-CoV-2 was identified on January 7, 2020. Three days later, its genome and metadata were shared via the Global Initiative on Sharing Avian Influenza Data (GISAID) 58 EpiCoV database, 59 before the first peer-reviewed article was published in February 2020. Reference Wu, Zhao and Yu60

To put the SARS-CoV-2 genome sequencing speed into context, consider that SARS-CoV was first reported in November 2002, but its genome was publicly released in April 2003. Reference Marra, Jones and Astell6 The speed at which such data are released was changed by several forces, illustrated by Janies et al. Reference Janies, Voronkin, Das, Hardman, Treseder and Studer16 In brief, the reasons include the increased feasibility of genome sequencing, the willingness to share data before publication, and the rise of the popular GISAID database, which credits submitting laboratories.

Figure 4 shows the accumulation of 4,224,785 complete SARS-CoV-2 genomes in EpiCoV between January 10, 2020, and October 13, 2021. The curve is far from reaching a plateau, indicating that we are not producing coronavirus genomes at total capacity. Efforts to sequence SARS-CoV-2 following international guidelines 61,62 are welcome because these data inform epidemiological forecasts (eg, increased transmission efficiency of SARS-CoV-2 variants has led to projections of the rise of higher numbers of cases Reference Truelove, Smith and Qin63 ).

Fig. 4. Progressive accumulation of 4,224,785 complete SARS-COV-2 genome sequences (>26 Kbp) submitted to the GISAID EpiCoV database (https://www.epicov.org/) between January 10, 2020, and October 13, 2021. These cost estimates do not consider sampling, storage, consumables, equipment, and staff costs (see eg, Schwarze et alReference Schwarze, Buchanan and Fermont 168 ). Nevertheless, the price of raw nucleotide sequencing is a significant component of the cost of genome projects.

Genomic sequencing generates a snapshot of a viral lineage in a place and time. When sequences are collected longitudinally, applications in genomic epidemiology and pandemic responses emerge, which we illustrate with 4 examples. First, profiling mutation fingerprints from the viral pangenome to individual infection quasi-species enables molecular contact tracing. Reference Lau, Pavlichin and Hooker64 Second, genomic sequencing informs the peptide mass fingerprinting (PMF) used to predict novel structures and find inhibitors for viral peptides, Reference Hamza, Ali and Khan65 although results must be tested in randomized controlled trials Reference Hariton and Locascio66 to identify effective antivirals. Reference Boulware, Pullen and Bangdiwala67,Reference Siemieniuk, Bartoszko and Ge68 Third, the data are used to model epidemic or pandemic size and severity. Reference Truelove, Smith and Qin63 Fourth, viral sequences are fundamental for developing mRNA vaccines. 69 For a review on current pitfalls and opportunities in applying HTS to SARS-CoV-2 genomes, see Chiara et al. Reference Chiara, D’Erchia and Gissi70

As SARS-CoV-2 becomes endemic, Reference Shaman and Galanti71,Reference Nakanishi and Yoshio72 sequencing demand will remain high. SARS-CoV-2 infections are decreasing as more people develop immunity through natural infection or vaccination. Reference Phillips73 However, variants may evade infection and vaccine-induced antibodies, Reference Zhou, Dejnirattisai and Supasa74 especially with infections occurring months after vaccination (ie, breakthrough infections). Reference Kustin, Harel and Finkel75,Reference Farinholt, Doddapaneni and Qin76 Given breakthrough infections, increased transmission of some variants, and the lack of full vaccination among eligible people, we can predict that SARS-CoV-2 will continue to evolve. Whether SARS-CoV-2 is evolving toward more severe or more benign COVID-19 phenotypes is a pressing research question for genomic epidemiology.

Effective countermeasures depend on understanding SARS-CoV-2 lineages, such as sampling variants for which phenotype is not fully understood Reference Giovanetti, Benedetti and Campisi77 and addressing sampling bias. Reference To, Sridhar and Chiu78 For example, if we restrict sequencing viral isolates from hospitalized patients, the relationships between any variables associated with hospitalization will be distorted when compared to the general population. Thus, we would miss mutations associated with asymptomatic and symptomatic cases that did not require hospitalization, which could lead to inducing or misinterpreting the evidence for phenotype-genotype associations. Reference Munafò, Tilling, Taylor, Evans and Davey Smith79Reference Tattan-Birch, Marsden, West and Gage81

Brito et al Reference Brito, Semenova and Dudas82 analyzed the spatiotemporal heterogeneity in each country’s SARS-CoV-2 genomic surveillance efforts based on metadata submitted to GISAID until May 30, 2021. These researchers estimated that when the prevalence of a rare lineage is 2%, 300 cases would need to be sequenced to detect at least 1 genome of that lineage with 95% probability. Therefore, sequencing capacity should be at least 0.5% of cases per week when incidence is >100 positive cases per 100,000 people.

Brito et al Reference Brito, Semenova and Dudas82 observed that countries like Denmark, which have a quick turnaround for sequencing, processing, and sharing SARS-CoV-2 genomic data (<18 days) and a high sequencing rate (>32%), observe greater lineage diversity. Many variants may be missed when sampling rates are low. However, disparities in wealth, investment in research and training, coordination, and supply chain logistics affect the ability of countries to perform genomic surveillance, especially LMICs. Therefore, efforts must be made to provide funds, training, and logistic support for researchers based in LMICs to improve their genomic surveillance capacity and public-health decision making.

How do we classify the variants of SARS-CoV-2?

Any genome sequence that is genetically distinct from the reference can be called a variant. In practice, the SARS-CoV-2 variants represent clades that share a set of key mutations while still permitting a small amount of other sequence variation. Reference Lauring and Hodcroft83,Reference Tegally, Wilkinson and Giovanetti84 Moreover, convergent evolution among geographically distant variants has been observed (Table 1). Reference Ford, Scott, Machado and Janies85 Although variants and strains are different, some researchers use these terms interchangeably (eg, Awadasseid et al, Reference Awadasseid, Wu, Tanaka and Zhang86 Hossein et al, Reference Hossain, Hassanzadeganroudsari and Apostolopoulos87 and Ul-Rahman et al Reference Ul-Rahman, Shabbir and Aziz88 ). The term “strain” is typically associated with lineages that became sufficiently divergent to exhibit a changed phenotype. Reference Kuhn, Bao and Bavari89

Table 1. Notable Variants of SARS-CoV-2 and Their Main Attributes a

Note. SIG, US government SARS-CoV-2 Interagency Group; VBM, variant being monitored; VOC, variant of concern; VOI, variant of interest; VUM, variants under monitoring; EUA, emergency use authorization.

a This table was modified and updated from the WHO website, 93 the CDC website, 94 Rambaut et al,Reference Rambaut, Holmes and O’Toole 97 and Soh et al. Reference Soh, Kim, Kim, Jang and Lee167 SIG and WHO classifications are detailed in Table 2.

In late 2020 and throughout 2021, as vaccine availability increased, information on variants began to dominate the COVID-19 response. Reference Parums90Reference Janik, Niemcewicz, Podogrocki, Majsterek and Bijak92 The emergence of variants that might pose an increased risk to global public health prompted the WHO to characterize specific variants of interest (VOIs) and variants of concern (VOCs) to prioritize global monitoring and research. 93 The US government SARS-CoV-2 interagency group (SIG) developed a separate variant classification scheme, 94 which we compare to the WHO system in Table 2.

Table 2. Comparing the Different Categories in the WHO Variant Classification System 93 With the System Used by the US government SARS-CoV-2 Interagency Group (SIG) 94 , a

Note. VBM, variant being monitored; VOC, variant of concern; VOI, variant of interest; VUM, variants under monitoring; VOHC, variant of high consequence; EUA, emergency use authorization.

a Currently, no variants are being classified as VOI or VOHC by the CDC and SIG.

In March 2021, the WHO assigned letters of the Greek alphabet to categorize VOIs and VOCs, 93 for simplicity and to avoid association with particular localities. These labels do not replace existing classifications by GISAID (https://gisaid.org/), Reference Shu and McCauley95 Nextstrain (https://nexstrain.org/), Reference Hadfield, Megill and Bell96 and Pango lineages (https://cov-lineages.org/). Reference Rambaut, Holmes and O’Toole97 SARS-CoV-2 variants were reviewed by Harvey et al. Reference Harvey, Carabelli and Jackson98

Why are vaccines still not enough against COVID-19?

The speed of development and testing of COVID-19 vaccines development is one of history’s most outstanding public health achievements. Vast vaccination of eligible individuals is the best and safest way to control the pandemic. Reference Flanagan, MacIntyre, McIntyre and Nelson99 Although some SARS-CoV-2 variants show a degree of escape from protective antibodies induced by natural infection (and, to a lesser degree, after immunization), T-cell responses are retained. Reference Cevik, Grubaugh, Iwasaki and Openshaw100 Furthermore, first-generation SARS-CoV-2 mRNA-based vaccines induce public antibodies (ie, antibodies with similar genetic elements and modes of recognition against a different antigen observed in multiple individuals) with robust neutralizing and potentially durable protective activity against variants such as alpha (α), beta (β), and gamma (γ). Reference Schmitz, Turner and Liu101

SARS-CoV-2 variants will continue to emerge, Reference Boehm, Kronig and Neher102 requiring close international monitoring to determine the need for vaccination boosters and or redesign. Reference Boehm, Kronig and Neher102 As variants emerge in areas of low vaccination, a global COVID-19 vaccination rollout is imperative. Since the vaccine rollout, new questions have arisen regarding vaccine efficacy against the transmission of different variants, Reference Cevik, Grubaugh, Iwasaki and Openshaw100 the duration of protection, Reference Farooqi, Malik and Mulla103 and the efficacy of prime-boost schedules. Reference Flanagan, MacIntyre, McIntyre and Nelson99,Reference Krause and Gruber104Reference Chen, Zhu and Huang106 A demand has also arisen for studies to determine the immunological correlates of protection against COVID-19 as cases decline and prevention of severe disease gains more importance in vaccine efficacy. Reference Hodgson, Mansatta, Mallett, Harris, Emary and Pollard107 Meanwhile, nonpharmaceutical interventions to reduce the spread of SARS-CoV-2 and other pathogens are still warranted. Reference Boehm, Kronig and Neher102,Reference Zhao, Hu, Ayaz Ahmed, Cheng, Chen and Sun108,Reference Lanzavecchia, Beyer and Evina Bolo109

How can we bridge the knowledge gap between disease origin and transmission?

Genomic epidemiology can be a tool to study emerging infectious diseases (EIDs) in humans, but its effectiveness is maximized when it accounts for animal and environmental components. In the case of zoonosis, there is a knowledge gap between the animal and human components of EID research, and One Health can bridge this gap.

Although most human health researchers have only started focusing on coronaviruses since the emergence of SARS-CoV-2, veterinarians, virologists, and zoologists have been researching animal coronaviruses long before the COVID-19 epidemic. Reference Poudel, Subedi, Pantha and Dhakal110 One Health proposes placing these realms of research (on humans and animals) in the same environmental context. The next steps in pandemic prevention science are to understand factors that create opportunities for zoonosis, Reference Semenza and Menne111,Reference Bartlow, Manore and Xu112 such as entering infectious habitats such as bat caves and the use of wildlife as food and medicine. Reference Mersha and One Health113Reference Kelly, Karesh and Johnson117

Deep sequencing the microbiomes and viromes of taxonomically, geographically, and temporally deep biorepository archives of putative host animals will serve as the basis of new approaches to zoonosis, risk assessment, and threat mitigation. Reference Colella, Stephens, Campbell, Kohli, Parsons and Mclean118Reference Thompson, Phelps and Allard120 Therefore, another step toward furthering the One Health approach is leveraging biorepositories in biomedical research. Although the Global Museum initiative already offers a route of international integration among museum biorepositories in a decentralized and geographically dispersed network, Reference Bakker, Antonelli and Clarke121 the link to EID research is still not fully realized.

The recent creation of the Museums and Emerging Pathogens in the Americas network (MEPA) is vital for linking biorepositories and EID research. Reference Colella, Bates and Burneo122 The overarching goal of the MEPA is to leverage museum biorepositories in a global, decentralized pathogen surveillance system by expanding biodiversity infrastructure and opening communication channels that foster collaboration among biorepositories and biomedical communities.

The need for this host-based approach to genomic epidemiology is made evident by the transmissible nature of SARS-CoV-2, Reference Conceicao, Thakur and Human123 which has the potential to infect a range of hosts, including tigers, Reference Wang, Mitchell and Calle124Reference Bartlett, Diel and Wang126 minks, Reference Oreshkova, Molenaar and Vreman127,Reference Hammer, Quaade and Rasmussen128 domestic cats, Reference Halfmann, Hatta and Chiba129Reference Braun, Moreno and Halfmann131 ferrets, Reference Liu, Yeh and Phan132Reference Kim, Kim and Kim134 raccoon dogs, Reference Freuling, Breithaupt and Müller135 cynomolgus and rhesus macaques, Reference Freuling, Breithaupt and Müller135Reference Rockx, Kuiken and Herfst137 rabbits, Reference Mykytyn, Lamers and Okba138 Egyptian fruit bats, Reference Mykytyn, Lamers and Okba138,Reference Schlottau, Rissmann and Graaf139 Syrian hamsters, Reference Imai, Iwatsuki-Horimoto and Hatta140 and white-tailed deer. Reference Palmer, Martins and Falkenberg141Reference Gryseels, De Bruyn, Gyselings, Calvignac-Spencer, Leendertz and Leirs143

How can we track SARS-CoV-2 variants faster?

Vaccines are still effective in preventing severe outcomes against all SARS-CoV-2 variants, Reference Cevik, Grubaugh, Iwasaki and Openshaw100 which are ravaging unvaccinated people. Reference Griffin, Haddix and Danza144,Reference Del Rio, Malani and Omer145 However, the likelihood of new mutations increases as cases rise, possibly leading to enhanced transmission, immune escape, or increased pathogenicity. This process has resulted in more transmissible variants. Reference Lazarevic, Pravica, Miljanovic and Cupic146,Reference Kemp, Collier and Datir147

Researchers face 2 main challenges in keeping pace with SARS-CoV-2 variants: using resources at optimal capacity and lowering barriers to technology and training in genomic epidemiology across the world. On the one hand, countries with a high positivity rate, like India, are not sequencing isolates at full capacity. Reference Srivastava, Banu, Singh, Sowpati and Mishra148 The United States is an even more extreme example because it has ranked low in SARS-CoV-2 sequencing despite its capacity and expertise. Reference Furuse149,Reference Crawford and Williams150 On the other hand, countries like South Africa have sequencing laboratories struggling with reagent shortages and the scarcity of trained scientists. Reference Adepoju151

Global efforts to strengthen pathogen sequencing capacity are still required to respond to technical, logistical, and financial challenges in resource-limited settings despite increased sequencing feasibility. Moreover, good SARS-CoV-2 sequencing performance for some LMICs (eg, Democratic Republic of the Congo, Brazil, Senegal, and Thailand) further encourages international and domestic collaboration among public health authorities, healthcare facilities, academia, and industries. Reference Furuse149

Additional challenges include consistent handling of isolates as well as metadata and sequence data curation and deposition in a way that facilitates combining data sets from different laboratories. These challenges require coordinated efforts Reference Blomberg and Lauer152 and data standards Reference Conesa and Beck153 to guarantee rapid access to large volumes of raw and processed molecular data at unprecedented scales. Reference Chiara, D’Erchia and Gissi70

We also need to address bioinformatics bottlenecks to respond faster to the threat of emergent diseases and to manage the fast-paced production of genomic information. Most tools are co-opted from evolutionary biology’s arsenal to study the lineages of higher taxa with exemplar approaches. Reference Hodcroft, De Maio and Lanfear154 Although these tools were not designed to manage big data from rapidly evolving pathogens, Reference Hodcroft, De Maio and Lanfear154 some have already started to respond to these demands. For example, the ultrafast sample placement on existing trees (UShER) enables the rapid placement of novel genomes into a reference tree using the parsimony optimality criterion. Reference Turakhia, Thornlow and Hinrichs155 Thus, as phylogenetic principles underpin how we view genetic changes over time, One Health will also include the exchange of knowledge among evolutionary biologists and epidemiologists.

Phylogenetic trees are hard to compute and interpret. The need to consult professional phylogeneticists is made plain by the number of prominent papers that did not adhere to the standards of phylogenetics and failed to identify the fundamental hosts of coronaviruses. Reference Wenzel156 Moreover, a good phylogenetic analysis requires many elements: careful choice of the collected taxa, sequence, and or phenotypic data; method and quality control of sequence data and alignment; evaluation of substitution and indel models; treatment of partitions; tree-search protocol; measures of fit or confidence; and strategies for character coding and optimization. Reference Machado, Scott, Guirales and Janies49,Reference Wenzel156,Reference Machado, Schneider, Guirales and Janies157 Moreover, results may vary with parameterization. Reference Wheeler158 These are only a few of the difficult decisions that go way beyond the level of sophistication of any software manuals and automated systems. Reference Wenzel156,Reference Grant159

Are trees mapped to globes always needed?

In many cases, such as the initial spread of H5N1 influenza, trees and Supramaps were very useful to understand the geographic spread of the pathogen, its multiple geographically and mutationally distinct patterns of zoonosis, Reference Janies, Hill, Guralnick, Habib, Waltari and Wheeler10 and drug resistance. Reference Hill, Guralnick, Wilson, Habib and Janies14 However, due to occlusion, Supramaps were not suitable for the visualization of cosmopolitan diseases, such as strains of Salmonella (eg, Hoffman et al Reference Hoffmann, Luo and Monday3 ), seasonal influenza (eg, H3N2), pandemic influenza (H1N1-2009), Reference Janies, Voronkin, Das, Hardman, Treseder and Studer16 and SARS-CoV-2. In response, researchers have worked on alternative visualization tools, including pointmaps and route maps Reference Janies, Voronkin and Studer13,Reference Hovmöller, Alexandrov, Hardman and Janies160 and eventually moved beyond the need for mapping trees to globes with Strainhub. Reference de Bernardi Schneider and Ford161

Unlike Supramap, Strainhub is less computationally demanding. It can be executed from a web browser; it does not depend on closed source software (Google Earth), and geographical data are optional (Fig. 5). Moreover, Strainhub can be used to test hypotheses on the relative importance of hosts or places in disease spread. Future efforts for Strainhub will focus on usability, interoperability, visual clarity, and quantification of the relative importance of hosts or places in the spread of disease to better understand zoonosis.

Fig. 5. Comparison between Supramap and Strainhub visualizations. (a) Supramap phylogenetic visualization of bat-hosted and pangolin-hosted coronaviruses that share recent ancestry (2005–2019) with human-hosted SARS-CoV-2. The underlying data are genomic sequences, temporal and geographic metadata. (b) Strainhub visualization of the same data plus host metadata in a network using arbitrary space. Arrow colors correspond to different types of transmission (red = bat to human, green = bat to bat, yellow = bat to pangolin). The size of the circle represents the source hub ratio (SHR). SHR is the number of transitions originating from a node as a fraction of the total number of transitions related to that node. A node scoring SHR close to 1 indicates a source (eg, Hubei, Yunnan, and Zhejiang), SHR close to 0.5 a hub and SHR close to 0 a sink for the pathogen. The thickness of the line represents a higher frequency of viral transmission (eg, Hubei to Zhejiang).

How do we prepare for the next pandemic?

The COVID-19 pandemic has illustrated how unprepared our interconnected global society is for zoonotic disease. For the next pandemic, 2 frontiers of investigation are interesting for genomic epidemiology as a tool to survey microbes of pandemic potential to predict, prevent, or respond faster to the emergence of new disease.

First, we must survey the natural diversity of coronaviruses and other microbes of pandemic potential present within animals. Reference Thompson, Phelps and Allard120 Second, we must develop the science of pandemic prevention by moving from tracking pandemics that are occurring to predicting outbreaks. For example, combining artificial intelligence with genomic epidemiology can lead constructing a “viral forecast“ to inform decisions about viruses with pandemic potential. Reference Syrowatka, Kuznetsova and Alsubai162 Moreover, we have proposed a novel mathematical modeling framework based on agent-based modeling to predict pathogen patch dynamics underlying zoonosis. Reference Chen, Owolabi and Li163

Final remarks

The COVID-19 pandemic, while ongoing, has caused 4,863,818 deaths worldwide as of October 14, 2021, 164 and it has surpassed the US death toll from the 1918–1919 H1N1 pandemic, which was ∼675,000. As SARS-CoV-2 becomes endemic, we must remember that it is not as lethal as other pathogens such as H5N1 influenza or Nipah virus. In its last 100 years of existence, smallpox killed 300 million people, and Variola major (the major variant of smallpox) killed 30% of these patients. Reference Henderson165

A novel pathogen at 30% mortality infecting 50% of the US population (166.7 million) would have resulted in 50 million deaths. MERS-CoV, henipaviruses, and hantavirus all have high mortality (>30%) and virulence with no approved vaccines or antivirals available. The 2018 Nipah outbreak had a 91% case-fatality rate, claiming 21 lives. Reference Arunkumar, Chandni and Mourya166 We must heed the warning that pathogens with more severe disease phenotypes than SARS-CoV-2 could resultin a far more devastating pandemic.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/ash.2021.222

Acknowledgments

We acknowledge the support of the following units of the University of North Carolina at Charlotte: the College of Computing and Informatics, the Bioinformatics Research Center, the Ribarsky Center for Visual Analytics, the Department of Bioinformatics and Genomics, Research and Economic Development, Academic Affairs, and University Research Computing. We further acknowledge the support of the North Carolina Research Campus and of the Belk Family. R.A. White III is supported by a UNC Charlotte start-up package. D.J.M. thanks Thiago José Jacob Carnevalli for his example and inspiration.

Financial support

No financial support was provided relevant to this article.

Conflict of interest

All authors report no conflicts of interest relevant to this article.

References

Janies, DA. Phylogenetic concepts and tools applied to epidemiologic investigations of infectious diseases. Microbiol Spectr 2019;7. doi: 10.1128/microbiolspec.AME-0006-2018. Accessed November 10, 2021.CrossRefGoogle Scholar
Janies, DA, Pomeroy, LW, Aaronson, JM, et al. Analysis and visualization of H7 influenza using genomic, evolutionary and geographic information in a modular web service. Cladistics 2012;28:483488.CrossRefGoogle Scholar
Hoffmann, M, Luo, Y, Monday, SR, et al. Tracing origins of the Salmonella Bareilly strain causing a foodborne outbreak in the United States. J Infect Dis 2016;213:502508.CrossRefGoogle ScholarPubMed
Ezeoke, I, Galac, MR, Lin, Y, et al. Tracking a serial killer: integrating phylogenetic relationships, epidemiology, and geography for two invasive meningococcal disease outbreaks. PLoS One 2018;13: e0202615.CrossRefGoogle ScholarPubMed
Allard, MW, Strain, E, Melka, D, et al. Practical value of food pathogen traceability through building a whole-genome sequencing network and database. J Clin Microbiol 2016;54: 19751983.CrossRefGoogle ScholarPubMed
Marra, MA, Jones, SJM, Astell, CR, et al. The genome sequence of the SARS-associated coronavirus. Science 2003;300:13991404.CrossRefGoogle ScholarPubMed
Rota, PA, Oberste, MS, Monroe, SS, et al. Characterization of a novel coronavirus associated with severe acute respiratory syndrome. Science 2003;300:13941399.CrossRefGoogle ScholarPubMed
Boulos, MNK. Descriptive review of geographic mapping of severe acute respiratory syndrome (SARS) on the Internet. Int J Health Geogr 2004;3:2.CrossRefGoogle ScholarPubMed
Janies, DA, Habib, F, Alexandrov, B, Hill, A, Pol, D. Evolution of genomes, host shifts and the geographic spread of SARS-CoV and related coronaviruses. Cladistics 2008;24:111130.CrossRefGoogle ScholarPubMed
Janies, DA, Hill, AW, Guralnick, R, Habib, F, Waltari, E, Wheeler, WC. Genomic analysis and geographic visualization of the spread of avian influenza (H5N1). Syst Biol 2007;56:321329.CrossRefGoogle Scholar
Janies, DA, Treseder, T, Alexandrov, B, et al. The Supramap project: linking pathogen genomes with geography to fight emergent infectious diseases. Cladistics 2011;27:6166.CrossRefGoogle ScholarPubMed
Studer, J, Janies, DA. Global spread and evolution of viral haemorrhagic septicaemia virus. J Fish Dis 2011;34:741747.CrossRefGoogle ScholarPubMed
Janies, DA, Voronkin, IO, Studer, J, et al. Selection for resistance to oseltamivir in seasonal and pandemic H1N1 influenza and widespread co-circulation of the lineages. Int J Health Geogr 2010;9:13.CrossRefGoogle ScholarPubMed
Hill, AW, Guralnick, RP, Wilson, MJC, Habib, F, Janies, D. Evolution of drug resistance in multiple distinct lineages of H5N1 avian influenza. Infect Genet Evol 2009;9:169178.CrossRefGoogle ScholarPubMed
Testimony of Daniel A. Janies, PhD. Local challenges of global proportions: evaluating role, preparedness for, and surveillance for pandemic onfluenza: Hearing before the committee on homeland security and government affairs, United States senate, 1 Sess. (2007). US government website. https://www.govinfo.gov/content/pkg/CHRG-110shrg38846/html/CHRG-110shrg38846.htm. Accessed October 7, 2021.Google Scholar
Janies, DA, Voronkin, IO, Das, M, Hardman, J, Treseder, TW, Studer, J. Genome informatics of influenza A: from data sharing to shared analytical capabilities. Anim Health Res Rev 2010;11:7379.CrossRefGoogle ScholarPubMed
Ruan, YJ, Wei, CL, Ee, AL, et al. Comparative full-length genome sequence analysis of 14 SARS coronavirus isolates and common mutations associated with putative origins of infection. Lancet 2003;361:17791785.CrossRefGoogle ScholarPubMed
Zhao, G-P. SARS molecular epidemiology: a Chinese fairy tale of controlling an emerging zoonotic disease in the genomics era. Philos Trans R Soc Lond B Biol Sci 2007;362:10631081.CrossRefGoogle ScholarPubMed
Reuter, JA, Spacek, DV, Snyder, MP. High-throughput sequencing technologies. Mol Cell 2015;58:586597.CrossRefGoogle ScholarPubMed
Pérez-Losada, M, Arenas, M, Galán, JC, et al. High-throughput sequencing (HTS) for the analysis of viral populations. Infect Genet Evol 2020;80:104208.CrossRefGoogle ScholarPubMed
The cost of sequencing a human genome. National Human Genome Research Institute website. https://www.genome.gov/about-genomics/fact-sheets/Sequencing-Human-Genome-cost. Accessed September 6, 2021.Google Scholar
Woo, PCY, Lau, SKP, Lam, CSF, et al. Discovery of a novel bottlenose dolphin coronavirus reveals a distinct species of marine mammal coronavirus in gamma coronavirus. J Virol 2014;88:13181331.CrossRefGoogle Scholar
Durães-Carvalho, R, Caserta, LC, Barnabé, ACS, et al. Coronaviruses detected in Brazilian wild birds reveal close evolutionary relationships with beta- and deltacoronaviruses isolated from mammals. J Mol Evol 2015;81:2123.CrossRefGoogle ScholarPubMed
Woo, PCY, Lau, SKP, Lam, CSF, et al. Discovery of seven novel mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus. J Virol 2012;86:39954008.CrossRefGoogle ScholarPubMed
Woo, PCY, Huang, Y, Lau, SKP, Yuen, K-Y. Coronavirus genomics and bioinformatics analysis. Viruses. 2010;2:18041820.CrossRefGoogle ScholarPubMed
Irigoyen, N, Firth, AE, Jones, JD, Chung, BY-W, Siddell, SG, Brierley, I. High-resolution analysis of coronavirus gene expression by RNA sequencing and ribosome profiling. PLoS Pathog 2016;12:e1005473.CrossRefGoogle ScholarPubMed
Coronavirinae. ViralZone website. https://viralzone.expasy.org/785?outline=all_by_species. Accessed September 8, 2021.Google Scholar
Li, BX, Ge, JW, Li, YJ. Porcine aminopeptidase N is a functional receptor for the PEDV coronavirus. Virology. 2007;365:166172.CrossRefGoogle ScholarPubMed
Boileau, MJ, Kapil, S. Bovine coronavirus associated syndromes. Vet Clin N Am Food Anim Pract 2010;26:123146.CrossRefGoogle ScholarPubMed
Mandelik, R, Sarvas, M, Jackova, A, Salamunova, S, Novotny, J, Vilcek, S. First outbreak with chimeric swine enteric coronavirus (SeCoV) on pig farms in Slovakia—lessons to learn. Acta Vet Hung 2018;66:488492.CrossRefGoogle Scholar
Su, S, Wong, G, Shi, W, Liu, J, Lai, ACK, Zhou, J, et al. Epidemiology, genetic recombination, and pathogenesis of coronaviruses. Trends Microbiol 2016;24:490502.CrossRefGoogle ScholarPubMed
Lai, MMC, Cavanagh, D. The molecular biology of coronaviruses. In Advances in Virus Research. New York: Elsevier; 1997: 1100.Google Scholar
Human coronavirus types. Centers for Disease Control and Prevention website. https://www.cdc.gov/coronavirus/types.html. Published March 17, 2021. Accessed September 7, 2021.Google Scholar
Zhang, XM, Herbst, W, Kousoulas, KG, Storz, J. Biological and genetic characterization of a hemagglutinating coronavirus isolated from a diarrhoeic child. J Med Virol 1994;44:152161.CrossRefGoogle ScholarPubMed
Zhong, NS, Zheng, BJ, Li, YM, et al. Epidemiology and cause of severe acute respiratory syndrome (SARS) in Guangdong, People’s Republic of China, in February 2003. Lancet 2003;362:13531358.CrossRefGoogle ScholarPubMed
Ksiazek, TG, Erdman, D, Goldsmith, CS, et al. A novel coronavirus associated with severe acute respiratory syndrome. N Engl J Med 2003;348:19531966.CrossRefGoogle ScholarPubMed
Zumla, A, Hui, DS, Perlman, S. Middle East respiratory syndrome. Lancet 2015;386:9951007.CrossRefGoogle ScholarPubMed
World Health Organization director-general’s opening remarks at the media briefing on COVID-19— 11 March 2020. World Health Organization website. https://www.who.int/dg/speeches/detail/who-director-general-s-opening-remarks-at-the-media-briefing-on-covid-19---11-march-2020. Accessed September 7, 2021.Google Scholar
Pekar, J, Worobey, M, Moshiri, N, Scheffler, K, Wertheim, JO. Timing the SARS-CoV-2 index case in Hubei province. Science 2021;372:412417.CrossRefGoogle ScholarPubMed
Coronavirus (COVID-19) dashboard. World Health Organization website. https://covid19.who.int. Accessed September 18, 2021.Google Scholar
Yuen, K-S, Ye, Z-W, Fung, S-Y, Chan, C-P, Jin, D-Y. SARS-CoV-2 and COVID-19: the most important research questions. Cell Biosci 2020;10:40.CrossRefGoogle ScholarPubMed
Andersen, KG, Rambaut, A, Lipkin, WI, Holmes, EC, Garry, RF. The proximal origin of SARS-CoV-2. Nat Med 2020;26:450452.CrossRefGoogle ScholarPubMed
Liu, S-L, Saif, LJ, Weiss, SR, Su, L. No credible evidence supporting claims of the laboratory engineering of SARS-CoV-2. Emerg Microbes Infect 2020;9:505507.CrossRefGoogle ScholarPubMed
Holmes, EC, Goldstein, SA, Rasmussen, AL, et al. The origins of SARS-CoV-2: a critical review. Cell 2021. doi: 10.1016/j.cell.2021.08.017.CrossRefGoogle Scholar
Wang, N, Li, S-Y, Yang, X-L, et al. Serological evidence of bat SARS-related coronavirus infection in humans, China. Virol Sin 2018;33:104107.CrossRefGoogle ScholarPubMed
Zhang, X, Hasoksuz, M, Spiro, D, et al. Quasi-species of bovine enteric and respiratory coronaviruses based on complete genome sequences and genetic changes after tissue culture adaptation. Virology 2007;363:110.CrossRefGoogle ScholarPubMed
Rasmussen, AL. On the origins of SARS-CoV-2. Nat Med 2021. doi: 10.1038/s41591-020-01205-5.CrossRefGoogle Scholar
Shi, Z-L. Origins of SARS-CoV-2: focusing on science. Infect Dis Immun 2021;1:34.Google Scholar
Machado, DJ, Scott, R, Guirales, S, Janies, DA. Fundamental evolution of all including three deadly lineages descendent from Chiroptera-hosted coronaviruses: SARS-CoV, MERS-CoV, and SARS-CoV-2. Cladistics 2021. doi: 10.1111/cla.12454.CrossRefGoogle Scholar
Zhao, S, Zhuang, Z, Cao, P, et al. Quantifying the association between domestic travel and the exportation of novel coronavirus (2019-nCoV) cases from Wuhan, China in 2020: a correlational analysis. J Travel Med 2020;27. doi: 10.1093/jtm/taaa022.CrossRefGoogle Scholar
Li, W, Shi, Z, Yu, M, et al. Bats are natural reservoirs of SARS-like coronaviruses. Science 2005;310:676679.CrossRefGoogle ScholarPubMed
Lytras, S, Hughes, J, Martin, D, et al. Exploring the natural origins of SARS-CoV-2 in the light of recombination. bioRxiv 2021. doi: 10.1101/2021.01.22.427830.CrossRefGoogle Scholar
Lytras, S, Xia, W, Hughes, J, Jiang, X, Robertson, DL. The animal origin of SARS-CoV-2. Science 2021;373:968970.CrossRefGoogle ScholarPubMed
Lam, TT-Y, Jia, N, Zhang, Y-W, et al. Identifying SARS-CoV-2-related coronaviruses in Malayan pangolins. Nature 2020;583:282285.CrossRefGoogle ScholarPubMed
Xiao, K, Zhai, J, Feng, Y, et al. Isolation of SARS-CoV-2-related coronavirus from Malayan pangolins. Nature 2020;583:286289.CrossRefGoogle ScholarPubMed
Zhang, T, Wu, Q, Zhang, Z. Probable pangolin origin of SARS-CoV-2 associated with the COVID-19 outbreak. Curr Biol 2020;30:13461351.e2.CrossRefGoogle ScholarPubMed
Liu, P, Jiang, J-Z, Wan, X-F, et al. Are pangolins the intermediate host of the 2019 novel coronavirus (SARS-CoV-2)? PLoS Pathog 2020;16:e1008421.CrossRefGoogle ScholarPubMed
GISAID—Initiative. Global Initiative on Sharing All Influenza Data website. https://www.gisaid.org. Accessed September 18, 2021.Google Scholar
GISAID—Initiative. EpiCov website. https://www.epicov.org/. Accessed September 18, 2021.Google Scholar
Wu, F, Zhao, S, Yu, B, et al. Author correction: a new coronavirus associated with human respiratory disease in China. Nature 2020;580:E7.CrossRefGoogle ScholarPubMed
Sequencing of SARS-CoV-2. European Centre for Disease Control and Prevention website. https://www.ecdc.europa.eu/sites/default/files/documents/sequencing-of-SARS-CoV-2.pdf Accessed September 18, 2021.Google Scholar
Genomic sequencing of SARS-CoV-2: a guide to implementation for maximum impact on public health. World Health Organization website. https://www.who.int/publications/i/item/9789240018440. Published January 2021. Accessed September 18, 2021.Google Scholar
Truelove, S, Smith, CP, Qin, M, et al. Projected resurgence of COVID-19 in the United States in July–December 2021 resulting from the increased transmissibility of the Delta variant and faltering vaccination. medRxiv 2021. doi: 10.1101/2021.08.28.21262748.CrossRefGoogle Scholar
Lau, BT, Pavlichin, D, Hooker, AC, et al. Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies. Genome Med 2021;13:62.CrossRefGoogle ScholarPubMed
Hamza, M, Ali, A, Khan, S, et al. nCOV-19 peptides mass fingerprinting identification, binding, and blocking of inhibitors flavonoids and anthraquinone of and hydroxychloroquine. J Biomol Struct Dyn 2021;39:40894099.CrossRefGoogle ScholarPubMed
Hariton, E, Locascio, JJ. Randomised controlled trials—the gold standard for effectiveness research: Study design: randomised controlled trials. BJOG 2018;125:1716.CrossRefGoogle ScholarPubMed
Boulware, DR, Pullen, MF, Bangdiwala, AS, et al. A randomized trial of hydroxychloroquine as postexposure prophylaxis for COVID-19. N Engl J Med. 2020;383:517525.CrossRefGoogle ScholarPubMed
Siemieniuk, RA, Bartoszko, JJ, Ge, L, et al. Drug treatments for COVID-19: living systematic review and network meta-analysis. BMJ 2020;370:m2980.CrossRefGoogle ScholarPubMed
COVID-19 mRNA vaccine production. National Human Genome Research Institute website. https://www.genome.gov/about-genomics/fact-sheets/COVID-19-mRNA-Vaccine-Production. Accessed October 12, 2021.Google Scholar
Chiara, M, D’Erchia, AM, Gissi, C, et al. Next-generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities. Brief Bioinform 2021;22:616630.CrossRefGoogle ScholarPubMed
Shaman, J, Galanti, M. Will SARS-CoV-2 become endemic? Science 2020;370:527529.CrossRefGoogle ScholarPubMed
Nakanishi, N, Yoshio, I. The novel coronavirus pandemic and the state of the epidemic in Kobe, Japan. J Disaster Res 2021;16:8487.CrossRefGoogle Scholar
Phillips, N. The coronavirus is here to stay—here’s what that means. Nature 2021;590:382384.CrossRefGoogle ScholarPubMed
Zhou, D, Dejnirattisai, W, Supasa, P, et al. Evidence of escape of SARS-CoV-2 variant B.1.351 from natural and vaccine-induced sera. Cell 2021;184:23482361.e6.CrossRefGoogle ScholarPubMed
Kustin, T, Harel, N, Finkel, U, et al. Evidence for increased breakthrough rates of SARS-CoV-2 variants of concern in BNT162b2-mRNA-vaccinated individuals. Nat Med 2021;27:13791384.CrossRefGoogle ScholarPubMed
Farinholt, T, Doddapaneni, H, Qin, X, et al. Transmission event of SARS-CoV-2 delta variant reveals multiple vaccine breakthrough infections. BMC Med 2021;19:255.CrossRefGoogle ScholarPubMed
Giovanetti, M, Benedetti, F, Campisi, G, et al. Evolution patterns of SARS-CoV-2: snapshot on its genome variants. Biochem Biophys Res Commun 2021;538:8891.CrossRefGoogle ScholarPubMed
To, KK-W, Sridhar, S, Chiu, KH-Y, et al. Lessons learned 1 year after SARS-CoV-2 emergence leading to COVID-19 pandemic. Emerg Microbes Infect 2021;10:507535.CrossRefGoogle ScholarPubMed
Munafò, MR, Tilling, K, Taylor, AE, Evans, DM, Davey Smith, G. Collider scope: when selection bias can substantially influence observed associations. Int J Epidemiol 2018;47:226235.CrossRefGoogle ScholarPubMed
Hernán, MA. Invited commentary: selection bias without colliders. Am J Epidemiol 2017;185:10481050.CrossRefGoogle ScholarPubMed
Tattan-Birch, H, Marsden, J, West, R, Gage, SH. Assessing and addressing collider bias in addiction research: the curious case of smoking and COVID-19. Addiction 2021;116:982984.CrossRefGoogle ScholarPubMed
Brito, AF, Semenova, E, Dudas, G, et al. Global disparities in SARS-CoV-2 genomic surveillance. medRxiv 2021. doi: 10.1101/2021.08.21.21262393.CrossRefGoogle Scholar
Lauring, AS, Hodcroft, EB. Genetic variants of SARS-CoV-2-what do they mean? JAMA 2021;325:529531.CrossRefGoogle ScholarPubMed
Tegally, H, Wilkinson, E, Giovanetti, M, et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. medRxiv 2020. doi: 10.1101/2020.12.21.20248640.CrossRefGoogle Scholar
Ford, CT, Scott, R, Machado, DJ, Janies, D. Sequencing data of North American SARS-CoV-2 isolates shows widespread complex variants. medRxiv 2021. doi: 10.1101/2021.01.27.21250648.CrossRefGoogle Scholar
Awadasseid, A, Wu, Y, Tanaka, Y, Zhang, W. Current advances in the development of SARS-CoV-2 vaccines. Int J Biol Sci 2021;17:819.CrossRefGoogle ScholarPubMed
Hossain, MK, Hassanzadeganroudsari, M, Apostolopoulos, V. The emergence of new strains of SARS-CoV-2. What does it mean for COVID-19 vaccines? Expert Rev Vaccines 2021;20:635638.CrossRefGoogle ScholarPubMed
Ul-Rahman, A, Shabbir, MAB, Aziz, MW, et al. A comparative phylogenomic analysis of SARS-CoV-2 strains reported from non-human mammalian species and environmental samples. Mol Biol Rep 2020;47:92079217.CrossRefGoogle ScholarPubMed
Kuhn, JH, Bao, Y, Bavari, S, et al. Virus nomenclature below the species level: a standardized nomenclature for natural variants of viruses assigned to the family Filoviridae . Arch Virol 2013;158:301311.CrossRefGoogle ScholarPubMed
Parums, D. Editorial: revised World Health Organization (WHO) terminology for variants of concern and variants of interest of SARS-CoV-2. Med Sci Monit 2021;27:e933622.Google Scholar
Konings, F, Perkins, MD, Kuhn, JH, et al. SARS-CoV-2 variants of interest and concern naming scheme conducive for global discourse. Nat Microbiol 2021;6:821823.CrossRefGoogle ScholarPubMed
Janik, E, Niemcewicz, M, Podogrocki, M, Majsterek, I, Bijak, M. The emerging concern and interest SARS-CoV-2 variants. Pathogens 2021;10. doi: 10.3390/pathogens10060633.CrossRefGoogle Scholar
Tracking SARS-CoV-2 variants. World Health Organization website. https://www.who.int/activities/tracking-SARS-CoV-2-variants. Accessed September 20, 2021.Google Scholar
SARS-CoV-2 variant classifications and definitions. Centers for Disease Control and Prevention website. https://www.cdc.gov/coronavirus/2019-ncov/variants/variant-info.html. Published 2021. Accessed September 20, 2021.Google Scholar
Shu, Y, McCauley, J. GISAID: Global initiative on sharing all influenza data—from vision to reality. Euro Surveill 2017;22. doi: 10.2807/1560-7917.ES.2017.22.13.30494.CrossRefGoogle Scholar
Hadfield, J, Megill, C, Bell, SM, et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics 2018;34:41214123.CrossRefGoogle ScholarPubMed
Rambaut, A, Holmes, EC, O’Toole, Á, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol 2020;5:14031407.CrossRefGoogle ScholarPubMed
Harvey, WT, Carabelli, AM, Jackson, B, et al. SARS-CoV-2 variants, spike mutations and immune escape. Nat Rev Microbiol 2021;19:409424.CrossRefGoogle ScholarPubMed
Flanagan, KL, MacIntyre, CR, McIntyre, PB, Nelson, MR. SARS-CoV-2 vaccines: where are we now? J Allergy Clin Immunol Pract 2021. doi: 10.1016/j.jaip.2021.07.016.CrossRefGoogle Scholar
Cevik, M, Grubaugh, ND, Iwasaki, A, Openshaw, P. COVID-19 vaccines: keeping pace with SARS-CoV-2 variants. Cell 2021. doi: 10.1016/j.cell.2021.09.010.CrossRefGoogle Scholar
Schmitz, AJ, Turner, JS, Liu, Z, et al. A vaccine-induced public antibody protects against SARS-CoV-2 and emerging variants. Immunity 2021;54:21592166.CrossRefGoogle ScholarPubMed
Boehm, E, Kronig, I, Neher, RA, et al. Novel SARS-CoV-2 variants: the pandemics within the pandemic. Clin Microbiol Infect 2021;27:11091117.CrossRefGoogle ScholarPubMed
Farooqi, T, Malik, JA, Mulla, AH, et al. An overview of SARS-COV-2 epidemiology, mutant variants, vaccines, and management strategies. J Infect Public Health 2021. doi: 10.1016/j.jiph.2021.08.014.CrossRefGoogle Scholar
Krause, PR, Gruber, MF. Emergency use authorization of COVID vaccines—safety and efficacy follow-up considerations. N Engl J Med 2020;383:e107.CrossRefGoogle ScholarPubMed
Pascual-Iglesias, A, Canton, J, Ortega-Prieto, AM, Jimenez-Guardeño, JM, Regla-Nava, JA. An overview of vaccines against SARS-CoV-2 in the COVID-19 pandemic era. Pathogens 2021;10. doi: 10.3390/pathogens10081030.CrossRefGoogle Scholar
Chen, Y, Zhu, L, Huang, W, et al. Potent RBD-specific neutralizing rabbit monoclonal antibodies recognize emerging SARS-CoV-2 variants elicited by DNA prime-protein boost vaccination. Emerg Microbes Infect 2021;10:13901403.CrossRefGoogle ScholarPubMed
Hodgson, SH, Mansatta, K, Mallett, G, Harris, V, Emary, KRW, Pollard, AJ. What defines an efficacious COVID-19 vaccine? A review of the challenges assessing the clinical efficacy of vaccines against SARS-CoV-2. Lancet Infect Dis 2021;21:e26e35.CrossRefGoogle ScholarPubMed
Zhao, T, Hu, C, Ayaz Ahmed, M, Cheng, C, Chen, Y, Sun, C. Warnings regarding the potential coronavirus disease 2019 (COVID-19) transmission risk: vaccination is not enough. Infect Control Hosp Epidemiol 2021;2. doi: 10.1016/j.xinn.2021.100116.CrossRefGoogle Scholar
Lanzavecchia, S, Beyer, KJ, Evina Bolo, S. Vaccination is not enough: understanding the increase in cases of COVID-19 in Chile despite a high vaccination rate. Epidemiologia 2021;2:377390.CrossRefGoogle Scholar
Poudel, U, Subedi, D, Pantha, S, Dhakal, S. Animal coronaviruses and coronavirus disease 2019: lesson for One Health approach. Open Vet J 2020;10:239251.CrossRefGoogle ScholarPubMed
Semenza, JC, Menne, B. Climate change and infectious diseases in Europe. Lancet Infect Dis 2009;9:365375.CrossRefGoogle ScholarPubMed
Bartlow, AW, Manore, C, Xu, C, et al. Forecasting zoonotic infectious disease response to climate change: mosquito vectors and a changing environment. Vet Sci China 2019;6. doi: 10.3390/vetsci6020040.CrossRefGoogle Scholar
Mersha, C, One Health, Tewodros F., one medicine, one world: co-joint of animal and human medicine with perspectives, a review. Veterinary World 2012. doi: 10.5455/vetworld.2012.238-243 CrossRefGoogle Scholar
Sánchez-Vizcaíno, JM. One world, One Health, one virology. Vet Microbiol 2013. doi: 10.1016/j.vetmic.2013.02.018.CrossRefGoogle Scholar
Reeve-Johnson, L. One Health and a world of opportunity. Veterinary Record 2015. doi: 10.1136/vr.h2117.CrossRefGoogle Scholar
Mwangi, W, de Figueiredo, P, Criscitiello, MF. One Health: addressing global challenges at the nexus of human, animal, and environmental Health. PLoS Pathog 2016;12:e1005731.CrossRefGoogle Scholar
Kelly, TR, Karesh, WB, Johnson, CK, et al. One Health proof of concept: bringing a transdisciplinary approach to surveillance for zoonotic viruses at the human–wild animal interface. Prev Vet Med 2017;137:112118.CrossRefGoogle Scholar
Colella, JP, Stephens, RB, Campbell, ML, Kohli, BA, Parsons, DJ, Mclean, BS. The open-specimen movement. Bioscience 2021;71:405414.CrossRefGoogle Scholar
Cook, JA, Arai, S, Armién, B, et al. Integrating biodiversity infrastructure into pathogen discovery and mitigation of emerging infectious diseases. Bioscience 2020;70:531534.CrossRefGoogle ScholarPubMed
Thompson, CW, Phelps, KL, Allard, MW, et al. Preserve a voucher specimen! The critical need for integrating natural history collections in infectious disease studies. MBio 2021;12. doi: 10.1128/mBio.02698-20.CrossRefGoogle Scholar
Bakker, FT, Antonelli, A, Clarke, JA, et al. The Global Museum: natural history collections and the future of evolutionary science and public education. Peer J 2020;8:e8225.CrossRefGoogle ScholarPubMed
Colella, JP, Bates, J, Burneo, SF, et al. Leveraging natural history biorepositories as a global, decentralized, pathogen surveillance network. PLoS Pathog 2021;17:e1009583.CrossRefGoogle ScholarPubMed
Conceicao, C, Thakur, N, Human, S, et al. The SARS-CoV-2 spike protein has a broad tropism for mammalian ACE2 proteins. PLoS Biol 2020;18:e3001016.CrossRefGoogle Scholar
Wang, L, Mitchell, PK, Calle, PP, et al. Complete genome sequence of SARS-CoV-2 in a tiger from a US zoological collection. Microbiol Resour Announc 2020;9. doi: 10.1128/MRA.00468-20.CrossRefGoogle Scholar
McAloose, D, Laverack, M, Wang, L, et al. From people to: natural SARS-CoV-2 infection in tigers and lions at the Bronx Zoo. MBio 2020;11. doi: 10.1128/mBio.02220-20.CrossRefGoogle Scholar
Bartlett, SL, Diel, DG, Wang, L, et al. SARS-CoV-2 infection and longitudinal fecal screening in Malayan tigers (Panthera tigris jacksoni), Amur tigers (Panthera tigris altaica), and African lions (Panthera leo krugeri) at the Bronx Zoo, New York, USA. J Zoo Wildl Med 2021;51:733744.CrossRefGoogle Scholar
Oreshkova, N, Molenaar, RJ, Vreman, S, et al. SARS-CoV-2 infection in farmed minks, the Netherlands, April and May 2020. Euro Surveill 2020;25. doi: 10.2807/1560-7917.ES.2020.25.23.2001005.CrossRefGoogle Scholar
Hammer, AS, Quaade, ML, Rasmussen, TB, et al. SARS-CoV-2 transmission between mink (Neovison vison) and humans, Denmark. Emerg Infect Dis 2021;27:547551.CrossRefGoogle ScholarPubMed
Halfmann, PJ, Hatta, M, Chiba, S, et al. Transmission of SARS-CoV-2 in domestic cats. N Engl J Med 2020;383:592594.CrossRefGoogle ScholarPubMed
Gaudreault, NN, Trujillo, JD, Carossino, M, et al. SARS-CoV-2 infection, disease and transmission in domestic cats. Emerg Microbes Infect 2020;9:23222332.CrossRefGoogle ScholarPubMed
Braun, KM, Moreno, GK, Halfmann, PJ, et al. Transmission of SARS-CoV-2 in domestic cats imposes a narrow bottleneck. PLoS Pathog 2021;17:e1009373.CrossRefGoogle ScholarPubMed
Liu, H-L, Yeh, I-J, Phan, NN, et al. Gene signatures of SARS-CoV/SARS-CoV-2–infected ferret lungs in short- and long-term models. Infect Genet Evol 2020;85:104438.CrossRefGoogle Scholar
Ryan, KA, Bewley, KR, Fotheringham, SA, et al. Dose-dependent response to infection with SARS-CoV-2 in the ferret model and evidence of protective immunity. Nat Commun 2021;12:81.CrossRefGoogle ScholarPubMed
Kim, Y-I, Kim, S-G, Kim, S-M, et al. Infection and rapid transmission of SARS-CoV-2 in ferrets. Cell Host Microbe 2020;27:704709.e2.CrossRefGoogle ScholarPubMed
Freuling, CM, Breithaupt, A, Müller, T, et al. Susceptibility of raccoon dogs for experimental SARS-CoV-2 infection. Emerg Infect Dis 2020;26:29822985.CrossRefGoogle ScholarPubMed
Munster, VJ, Feldmann, F, Williamson, BN, et al. Respiratory disease in rhesus macaques inoculated with SARS-CoV-2. Nature 2020;585:268272.CrossRefGoogle ScholarPubMed
Rockx, B, Kuiken, T, Herfst, S, et al. Comparative pathogenesis of COVID-19, MERS, and SARS in a nonhuman primate model. Science 2020;368:10121015.CrossRefGoogle Scholar
Mykytyn, AZ, Lamers, MM, Okba, NMA, et al. Susceptibility of rabbits to SARS-CoV-2. Emerg Microbes Infect 2021;10:17.CrossRefGoogle ScholarPubMed
Schlottau, K, Rissmann, M, Graaf, A, et al. SARS-CoV-2 in fruit bats, ferrets, pigs, and chickens: an experimental transmission study. Lancet Microbe 2020;1:e218e225.CrossRefGoogle Scholar
Imai, M, Iwatsuki-Horimoto, K, Hatta, M, et al. Syrian hamsters as a small animal model for SARS-CoV-2 infection and countermeasure development. Proc Nat Acad Sci 2020;117:1658716595.Google ScholarPubMed
Palmer, MV, Martins, M, Falkenberg, S, et al. Susceptibility of white-tailed deer (Odocoileus virginianus) to SARS-CoV-2. J Virol 2021. doi: 10.1128/JVI.00083-21.CrossRefGoogle Scholar
Chandler, JC, Bevins, SN, Ellis, JW, et al. SARS-CoV-2 exposure in wild white-tailed deer (Odocoileus virginianus). bioRxiv 2021. doi: 10.1101/2021.07.29.454326.CrossRefGoogle Scholar
Gryseels, S, De Bruyn, L, Gyselings, R, Calvignac-Spencer, S, Leendertz, FH, Leirs, H. Risk of human-to-wildlife transmission of SARS-CoV-2. Mamm Rev 2020. doi: 10.1111/mam.12225.CrossRefGoogle Scholar
Griffin, JB, Haddix, M, Danza, P, et al. SARS-CoV-2 Infections and hospitalizations among persons aged ≥16 years, by vaccination status—Los Angeles County, California, May 1–July 25, 2021. Morb Mortal Wkly Rep 2021;70:11701176.CrossRefGoogle Scholar
Del Rio, C, Malani, PN, Omer, SB. Confronting the delta variant of SARS-CoV-2, summer 2021. JAMA 2021. doi: 10.1001/jama.2021.14811.CrossRefGoogle Scholar
Lazarevic, I, Pravica, V, Miljanovic, D, Cupic, M. Immune evasion of SARS-CoV-2 emerging variants: what have we learnt so far? Viruses 2021;13. doi: 10.3390/v13071192.CrossRefGoogle Scholar
Kemp, SA, Collier, DA, Datir, RP, et al. SARS-CoV-2 evolution during treatment of chronic infection. Nature 2021;592:277282.CrossRefGoogle ScholarPubMed
Srivastava, S, Banu, S, Singh, P, Sowpati, DT, Mishra, RK. SARS-CoV-2 genomics: an Indian perspective on sequencing viral variants. J Biosci 2021;46. doi: 10.1007/s12038-021-00145-7 CrossRefGoogle Scholar
Furuse, Y. Genomic sequencing effort for SARS-CoV-2 by country during the pandemic. Int J Infect Dis 2021;103:305307.CrossRefGoogle ScholarPubMed
Crawford, DC, Williams, SM. Global variation in sequencing impedes SARS-CoV-2 surveillance. PLoS Genet 2021;17:e1009620.CrossRefGoogle ScholarPubMed
Adepoju, P. Challenges of SARS-CoV-2 genomic surveillance in Africa. Lancet Microbe 2021;2:e139.CrossRefGoogle ScholarPubMed
Blomberg, N, Lauer, KB. Connecting data, tools and people across Europe: ELIXIR’s response to the COVID-19 pandemic. Eur J Hum Genet 2020;28:719723.CrossRefGoogle ScholarPubMed
Conesa, A, Beck, S. Making multiomics data accessible to researchers. Sci Data 2019;6:251.CrossRefGoogle Scholar
Hodcroft, EB, De Maio, N, Lanfear, R, et al. Want to track pandemic variants faster? Fix the bioinformatics bottleneck. Nature 2021;591:3033.CrossRefGoogle ScholarPubMed
Turakhia, Y, Thornlow, B, Hinrichs, AS, et al. Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic. Nat Genet 2021;53:809816.CrossRefGoogle ScholarPubMed
Wenzel, J. Origins of SARS-CoV-1 and SARS-CoV-2 are often poorly explored in leading publications. Cladistics 2020;36:374379.CrossRefGoogle ScholarPubMed
Machado, DJ, Schneider, AB, Guirales, S, Janies, DA. FLAVi: an enhanced annotator for viral genomes of Flaviviridae. Viruses 2020;12. doi: 10.3390/v12080892CrossRefGoogle Scholar
Wheeler, WC. Sequence alignment, parameter sensitivity, and the phylogenetic analysis of molecular data. Syst Biol 1995;44:321331.CrossRefGoogle Scholar
Grant, T. The perils of “point-and-click” systematics. Cladistics 2003;19:276285.CrossRefGoogle Scholar
Hovmöller, R, Alexandrov, B, Hardman, J, Janies, D. Tracking the geographical spread of avian influenza (H5N1) with multiple phylogenetic trees. Cladistics 2010;26:113.CrossRefGoogle Scholar
de Bernardi Schneider, A, Ford, CT, et al. StrainHub: a phylogenetic tool to construct pathogen transmission networks. Bioinformatics 2020;36:945947.CrossRefGoogle ScholarPubMed
Syrowatka, A, Kuznetsova, M, Alsubai, A, et al. Leveraging artificial intelligence for pandemic preparedness and response: a scoping review to identify key use cases. NPJ Digit Med 2021;4:96.CrossRefGoogle ScholarPubMed
Chen, S, Owolabi, Y, Li, A, et al. Patch dynamics modeling framework from pathogens’ perspective: Unified and standardized approach for complicated epidemic systems. PLoS One 2020;15:e0238186.CrossRefGoogle ScholarPubMed
COVID-19. Johns Hopkins Coronavirus Resource Center website. https://coronavirus.jhu.edu/map.html. Accessed October 12, 2021.Google Scholar
Henderson, DA. The eradication of smallpox—an overview of the past, present, and future. Vaccine 2011;29 suppl 4:D79.CrossRefGoogle Scholar
Arunkumar, G, Chandni, R, Mourya, DT, et al. Outbreak investigation of Nipah virus disease in Kerala, India, 2018. J Infect Dis 2019;219:18671878.CrossRefGoogle Scholar
Soh, SM, Kim, Y, Kim, C, Jang, US, Lee, H-R. The rapid adaptation of SARS-CoV-2-rise of the variants: transmission and resistance. J Microbiol 2021;59:807818.CrossRefGoogle ScholarPubMed
Schwarze, K, Buchanan, J, Fermont, JM, et al. The complete costs of genome sequencing: a microcosting study in cancer and rare diseases from a single center in the United Kingdom. Genet Med 2020;22:8594.CrossRefGoogle ScholarPubMed
Figure 0

Fig. 1. Timeline of major events in sequencing technology (green) and genomic epidemiology (purple) alongside the first recorded occurrence of SARS-CoV, H1N1-2009, MERS-CoV, and SARS-CoV-2 in humans. Associated references can be found in Supplementary Table 1.

Figure 1

Fig. 2. The increasing feasibility of sequencing complete coronavirus genomes. (a) Sequencing cost per raw megabase of DNA sequence from September 2001 until August 2020 (data source: genome.gov/sequencingcosts, access date: September 2021). (b) Number of complete coronavirus genomes that can be sequenced with USD 100, assuming a genome size of 32 Kbp. These cost estimates do not consider sampling, storage, consumables, equipment, and staff costs. These plots use a logarithmic scale.

Figure 2

Fig. 3. Fundamental evolution of coronaviruses based on Machado et al.49 (a) Virion and genome structure. The genomic regions indicated in the figure do not represent all the genes in the coronavirus genome, but the genes that are shared among the different genera of Orthocoronavirinae and that were analyzed by Machado et al.49 Note. E, envelope small membrane protein; M, membrane protein; N, nucleoprotein; S, spike glycoprotein. (b) Summarized cladogram from Machado et al.49 The original cladogram contained 2,006 terminals corresponding to unique coronavirus genomes. Terminals indicating the eight species of human coronaviruses (HCoVs) are in bold. (c) Hosts involved in the emergence of all human coronaviruses, including SARS-CoV-2. The HCoVs of special concern to human health (SARS-CoV, MERS-CoV, and SARS-CoV-2) are shown in red. The flow chart indicates that HCoV-NL63, SARS-CoV, and SARS-CoV-2 originated from bat-hosted coronaviruses. Bats were also key to the emergence of MERS-CoV in camels and humans. HCoV-229E, HCoV-HKU1, and HCoV-OC43 originated from viruses hosted in artiodactyls, rodents, and bovids, respectively. All silhouettes were downloaded from PhyloPic (http://phylopic.org). The coronavirus vision structure was modified from https://commons.wikimedia.org/wiki/File:Coronavirus_virion_structure.svg. See Supplementary File 1 for detailed copyright and license information.

Figure 3

Fig. 4. Progressive accumulation of 4,224,785 complete SARS-COV-2 genome sequences (>26 Kbp) submitted to the GISAID EpiCoV database (https://www.epicov.org/) between January 10, 2020, and October 13, 2021. These cost estimates do not consider sampling, storage, consumables, equipment, and staff costs (see eg, Schwarze et al168). Nevertheless, the price of raw nucleotide sequencing is a significant component of the cost of genome projects.

Figure 4

Table 1. Notable Variants of SARS-CoV-2 and Their Main Attributesa

Figure 5

Table 2. Comparing the Different Categories in the WHO Variant Classification System93 With the System Used by the US government SARS-CoV-2 Interagency Group (SIG)94,a

Figure 6

Fig. 5. Comparison between Supramap and Strainhub visualizations. (a) Supramap phylogenetic visualization of bat-hosted and pangolin-hosted coronaviruses that share recent ancestry (2005–2019) with human-hosted SARS-CoV-2. The underlying data are genomic sequences, temporal and geographic metadata. (b) Strainhub visualization of the same data plus host metadata in a network using arbitrary space. Arrow colors correspond to different types of transmission (red = bat to human, green = bat to bat, yellow = bat to pangolin). The size of the circle represents the source hub ratio (SHR). SHR is the number of transitions originating from a node as a fraction of the total number of transitions related to that node. A node scoring SHR close to 1 indicates a source (eg, Hubei, Yunnan, and Zhejiang), SHR close to 0.5 a hub and SHR close to 0 a sink for the pathogen. The thickness of the line represents a higher frequency of viral transmission (eg, Hubei to Zhejiang).

Supplementary material: File

Machado et al. supplementary material

Table S1
Download Machado et al. supplementary material(File)
File 13.4 KB