Nanopore sequencing contradicts human genome integration of SARS-CoV-2
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of the ongoing coronavirus disease 2019 (COVID-19) pandemic, is a virus with a ribonucleic acid (RNA) genome.
Recently, Zhang et al. proposed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) hijacks the LINE-1 (L1) retrotransposition machinery to integrate into the DNA of infected cells.
Now a new preprint research paper posted to the bioRxiv* server dismisses the possibility, raised by that study, that the virus inserts its genetic material into the host cell’s genome.
Background
This virus lacks a reverse transcriptase enzyme, which is required to convert the RNA into its complementary deoxyribonucleic acid (DNA) strand. Therefore, SARS-CoV-2 does not incorporate its genetic material into human genomic DNA during its infection lifecycle.
This is a crucial assumption that underlies the current diagnostic techniques used for this infection and for its potential long-term consequences. There are other viruses, such as the human immunodeficiency virus 1 (HIV-1) and hepatitis B virus (HBV), which do integrate with DNA and cause cancerous transformation or chronic infection.
All mammalian cells have LINE-1 (L1) retrotransposons in their genomes. This gives rise to messenger RNA (mRNA) that encodes two proteins required for the movement of L1. These are represented as ORF1p and ORF2p.
The ORF2p acts as an endonuclease as well as a reverse transcriptase (RT), with a strong tendency to catalyze reverse transcription of the L1 mRNA. It can also mobilize cellular RNA, which carries a polyadenylated tail, including RNA from other retrotransposons and from genes encoding proteins.
The latter activity is infrequent, however, probably because it is suppressed by the cell. In fact, less than a single cellular RNA insertion should occur per 2,000 L1 insertions.
Both types of insertions are marked by target site duplications (TSDs) and a 3′ polyA region, occurring at the L1 sequence 5′-TTTT/AA. These signs help to identify true insertions.
Earlier evidence
Earlier, SARS-CoV-2-infected cells with overexpressed L1 were shown to contain DNA identical to viral RNA. This was coupled with other weaker evidence to prove the integration of SARS-CoV-2 RNA into the host cell genome.
This was followed by the identification of 63 supposed sequences integrated from SARS-CoV-2 by the same researcher, using a sequencing technique called Oxford Nanopore Technologies (ONT) long-read sequencing. Only one of these was spanned by an ONT read, on the X chromosome, which showed potential TSDs on either side, and this one lacked the expected 3′ polyA tract.
Moreover, a 28 kb deletion seemed to have occurred from the viral sequence. Exons were enriched 26-fold in these integrants, a finding that cannot be explained by the preference of the L1 enzyme.
The second study was followed by still another, using Illumina short-read sequencing. This was claimed to show supposed junctions where SARS-CoV-2 sequences were joined to the host genome in cells lacking high levels of L1.
Other interpretations
However, the reads did not span the integrated sequence. Again, the preparation of a library for Illumina sequencing is known to tend towards artifact generation. These facts leave the conclusion in doubt.
Earlier, however, the scientists responsible for the current preprint used ONT reads to detect L1-dependent reverse transcription of viral genomes. One example is the incorporation of two hepatitis C virus (HCV) sequences in a liver tumor sample.
Moreover, the cell platform used for ONT sequencing, namely, HEK293T cells, has many features that may make it easy to assess the occurrence of such reverse-transcribed viral sequences. For instance, these cells express the L1 ORF1p, have been used to accept inserted L1 retrotransposons, and serve to support productive SARS-CoV-2 infection.
Study aim
Using ONT sequencing, the current study attempted to analyze genomic DNA from this cell line after infecting them with SARS-CoV-2 at multiple doses. Control cells were the tumor and non-tumor cell lines from a liver tumor positive for the hepatitis B virus.
The ONT sequencing data from the earlier studies were added. As negative controls, the previously identified HCV-positive liver carcinoma cells were used, along with normal liver cells.
No insertions found
All of these supposedly integrated sequences were analyzed to identify all L1 insertions specific for human cells, spanned by at least one ONT lead aligned to a unique locus. Not a single insertion was detected from the SARS-CoV-2, HCV or HBV genomes.
However, the program did identify 575 L1 insertions with the flanking TSDs. Seventy-eight were unique to the SARS-CoV-2-infected cells or mock-infected controls, the majority spanned by a single read.
Among these single-spanning insertions, six, which were chosen at random for manual checking and confirmation by a polymerase chain reaction (PCR) test, were confirmed to show all three features of a true L1 insertion.
Secondly, the researchers found that the program could recover a single HBV integrated sequence.
However, it did not show the presence of any ONT reads that could align to the SARS-CoV-2 or HCV genomes, showing that retrotransposons and insertions from exogenous viruses are reliably distinguished.
This was followed by the discovery that the ONT data from the first study contained 555 reads that could be aligned to the SARS-CoV-2 genome. One of these reads was the putative integrated sequence on the X chromosome that did not have a 3’ polyA tract.
The mean length of these reads was too short, however, compared to the overall dataset. Over half of these reads were composed of SARS-CoV-2 sequence, a high proportion relative to L1 sequences in the reads that were aligned to L1.
The program failed to call supposed integrants from the SARS-CoV-2 genome. Thus, despite the presence of reads that could be aligned to the viral genome in the dataset of the earlier study, these reads were too short compared to the typical read. Thus, the researchers postulate that these could be molecular artifacts wrongly interpreted as SARS-CoV-2 integrated sequences.
These results point to the ability of single spanning ONT reads to detect retrotransposons and to show endogenous L1 insertions in this cell line that lacked overexpressed L1.
What is the conclusion?
“In sum, we do not observe L1-mediated SARS-CoV-2 genomic integration in HEK293T cells, despite the availability of the L1 machinery.”
The reason is that the L1 clearly prefers to reverse transcribe its own mRNA rather than SARS-CoV-2. Thus, although possible, this is probably a rare phenomenon, as with other poly-A-bearing cellular RNAs.
“That we found no evidence of SARS-CoV-2 integration suggests such events in vivo are highly unlikely to drive later oncogenesis or explain post-recovery detection of the virus.”
*Important Notice
bioRxiv publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as conclusive, guide clinical practice/health-related behavior, or treated as established information.
- Smits, N.et al. (2021). Human genome integration of SARS-CoV-2 contradicted by long-read sequencing. bioRxiv preprint. doi: https://doi.org/10.1101/2021.05.28.446065. https://www.biorxiv.org/content/10.1101/2021.05.28.446065v1
Posted in: Device / Technology News | Medical Research News | Disease/Infection News
Tags: Carcinoma, Cell, Cell Line, Chromosome, Chromosome 14, Chromosome 18, Chronic, Coronavirus, Coronavirus Disease COVID-19, Diagnostic, DNA, Endonuclease, Enzyme, Exons, Genes, Genetic, Genome, Genomic, HBV, Hepatitis B, Hepatitis B Virus, Hepatitis C, Hepatocellular Carcinoma, HIV, HIV-1, Illumina, Immunodeficiency, in vivo, Liver, Locus, Mammalian Cells, Nucleotides, Oncogenesis, Pandemic, Polymerase, Polymerase Chain Reaction, Research, Respiratory, Reverse Transcriptase, Ribonucleic Acid, RNA, SARS, SARS-CoV-2, Severe Acute Respiratory, Severe Acute Respiratory Syndrome, Syndrome, Transcription, Tumor, Virus, X chromosome
Written by
Dr. Liji Thomas
Dr. Liji Thomas is an OB-GYN, who graduated from the Government Medical College, University of Calicut, Kerala, in 2001. Liji practiced as a full-time consultant in obstetrics/gynecology in a private hospital for a few years following her graduation. She has counseled hundreds of patients facing issues from pregnancy-related problems and infertility, and has been in charge of over 2,000 deliveries, striving always to achieve a normal delivery rather than operative.
Source: Read Full Article