COVID-19 Genomics: Researchers Complain Of Substantial Biases And Gaps In Global Sequencing Data.
: Scientist from New Zealand, the United States and Australia have in a new study demonstrated the effectiveness of real-time genomic sequencing at tracking the re-emergence of severe SARS-CoV-2 in New Zealand, in August this year. They however pointed at various shortcomings in the current Global SARS-CoV-2 sequencing data available in various open access repositories online.
The researchers generated the genomes of 80% of the laboratory-confirmed samples of SARS-CoV-2 from New Zealand’s August 2020 outbreak and compared these genomes to the available global genomic data.
genomic sequencing was able to rapidly identify that the new COVID-19 cases in New Zealand belonged to a single cluster and hence resulted from a single introduction. However, successful identification of the origin of this outbreak was impeded by substantial biases and gaps in global sequencing data.
The team said that
access to a broader and more heterogenous sample of global genomic data would strengthen efforts to locate the source of any new outbreaks.
The study findings were published on a preprint server and have yet to be peer reviewed. https://www.medrxiv.org/content/10.1101/2020.10.28.20221853v1
The new SARS-CoV-2 coronavirus is the agent responsible for the current COVID-19 pandemic that continues to plague the globe posing a threat to public health and the economy.
Dr Jemma Geoghegan from the University of Otago in Dunedin, New Zealand, and colleagues say real-time genomic sequencing quickly identified that the new cases belonged to a single genomic lineage and were, therefore, the result of a single introduction.
The genome sequencing was used to inform the lockdown measures and track and trace efforts needed to control the outbreak and enable the virus to be eliminated from the community for a second time.
The study team also says substantial biases and gaps in global sequencing data limited the power of the genomics to successfully identify the precise origin of the August outbreak.
The team advises that potential sampling biases and gaps in this sequencing data should always be carefully considered when trying to identify the origin of a specific SARS-CoV-2 outbreak.
The team also says that access to a broader and more heterogeneous sample of global genomic data would improve future efforts to locate the sources of new outbreaks.
There has been rapid progress in field of genomics in this current pandemic as just twelve days after SARS-CoV-2 was first identified, a genome of the virus had been published, and as of September 25th this year, more than 110,000 SARS-CoV-2 genomes had been made publicly available.
Dr Geoghegan told Thailand Medical News, “The underlying genome sequencing has occurred so quickly that, for the first time during an infectious disease outbreak, it has enabled virological and epidemiological data to be integrated in real time.”
Importantly real-time genomic sequencing of these data has been pivotal in infor
ming the response to the pandemic by tracking the global transmission and evolution of SARS-CoV-2, including the identification of the number, source, and timing of introductions into different countries.
There has been significant between-country variation in the number and proportion of positive cases sequenced and genomes published, say the researchers.
Dr Geoghegan and colleagues say such disparities in sequencing efforts can have important implications for data interpretation and must be met with careful consideration.
The study team said, “Real-time sequencing of SARS-CoV-2 genomes has, however, had particular utility in tracking the re-emergence of the virus in New Zealand.”
After the initial outbreak in late February, SARS-CoV-2 had effectively been eliminated in the country by June, with any positive cases limited to those linked to quarantine facilities at the border.
Following more than one hundred days of no detectable community transmission, four new cases emerged on August 12th, none of which could be epidemiologically linked to any known case.
a. Maximum clade credibility phylogenetic tree of 2,000 subsampled global genomes (1,996 most recently sampled B.1.1.1. plus four non-B.1.1.1. used as an outgroup) with an outer ring coloured by sampling region; b. Posterior probability of genomes within the sister clade to New Zealand’s August outbreak, colour-coded by sampling location; c. Proportion of genomes within lineage B.1.1.1. in the global data set over time, colour-coded by sampling location.
Significantly during this second outbreak, genomic sequencing was used to support track and trace efforts in the country for the first time.
The study team generated the genomes of 80% of the laboratory-confirmed SARS-CoV-2-positive samples from the new outbreak. They then compared these to sequenced cases from the first outbreak and to those from quarantine facilities.
No link however was identified, and the study team went on to compare the genomes from the new community outbreak to the global dataset.
The genomic sequencing was able to quickly identify that the new COVID-19 cases and subclusters were linked to the one genomic lineage B.1.1.1, therefore showing that the outbreak had resulted from a single introduction.
But, of the countries that had so far contributed SARS-CoV-2 genomic data, 40% had genomes originating from this lineage.
Detailed phylogenetic analysis of the most recently sampled B.1.1.1. genomes found that those identified in Switzerland, South Africa, and England in August were the closest relatives of the viruses associated with the new outbreak in New Zealand.
Genomic epidemiological analysis on the possible origins of the new outbreak was found to be inconclusive, which the team says is “likely due to missing genomic data within the quarantine border facilities as well as in the global data set.”
For instance, twelve SARS-CoV-2 genomes from people returning to New Zealand from India who all arrived on the same flight spanned at least four genomic lineages, with sequence divergence of up to 34 genomic mutations.
“Such a high level of diversity in just a small sample of positive cases from India suggest that the currently available genomic data fails to encompass the true diversity that existed locally, let alone globally,” says the researchers.
Real-time genomic sequencing following the re-emergence of the virus helped to quickly inform the track and trace efforts and lockdown measures needed to control the outbreak, putting New Zealand on track to eliminate the virus for the second time, they add.
However, the biased nature of global sampling clearly limited the power of genomics to identify the geographical origin of New Zealand’s August 2020 outbreak, says the study team.
The team said, “We, therefore, advocate that careful consideration of the potential sampling biases and gaps in available genomic data be made whenever attempting to determine the geographic origins of a specific outbreak of SARS- CoV-2. Analysis should consider all available evidence, including from genomic and epidemiological sources.”
Currently many countries are not taking genomic sequencings seriously. It is important for as many of sequencings to be done whenever cases emerge and to upload them on various open access online repositories as there is so much data that can be extracted from these sequencings to understand about the spread of the disease to even about mutation developments etc.
For more on COVID-19 Genomics
, keep on logging to Thailand Medical News.