Ultradeep Pyrosequencing of Hepatitis C Virus to Define Evolutionary Phenotypes

Brendan  A.  Palmer; Zoya  Dimitrova; Pavel  Skums; Orla  Crosbie; Elizabeth  Kenny-Walsh; Liam  J.  Fanning

doi:10.21769/BioProtoc.2284

Improve Research Reproducibility A Bio-protocol resource

Submit a Protocol
Receive Our Alerts
Log in
/
Sign up
- My Bio Page
- Edit My Profile
- Change Password
- Log Out
EN
- EN - English
- CN - 中文

Peer-reviewed

Ultradeep Pyrosequencing of Hepatitis C Virus to Define Evolutionary Phenotypes

EK Elizabeth Kenny-Walsh

LF Liam J. Fanning email

Published: Vol 7, Iss 10, May 20, 2017 DOI: 10.21769/BioProtoc.2284 Views: 6854

Reviewed by: Yannick DebingRaju GhoshVamseedhar RayaproluAnonymous reviewer(s)

PDF

Ask a question

How to cite

Favorite

Cited by

Original research article

The authors used this protocol in:

Cover of Journal of Virology, featuring study using the protocol.

Mar 2016

Bio-protocol welcomes Protocols in Bioinformatics and Computational Biology

Protocol Collections

Cell Imaging - A Special Collection for Cell Bio 2023

See all

Related protocols

A Small RNA Isolation and Sequencing Protocol and Its Application to Assay CRISPR RNA Biogenesis in Bacteria

Sukrit Silas [...] Joshua Arribere

Feb 20, 2018 11007 Views

Whole-genome Identification of Transcriptional Start Sites by Differential RNA-seq in Bacteria

Ramón Cervantes-Rivera and Andrea Puhar

Sep 20, 2020 7154 Views

Enterovirus Competition Assay to Assess Replication Fitness

Valeria Lulla and Andrew E. Firth

May 20, 2019 6127 Views

Abstract

Analysis of hypervariable regions (HVR) using pyrosequencing techniques is hampered by the ability of error correction algorithms to account for the heterogeneity of the variants present. Analysis of between-sample fluctuations to virome sub-populations, and detection of low frequency variants, are unreliable through the application of arbitrary frequency cut offs. Cumulatively this leads to an underestimation of genetic diversity. In the following technique we describe the analysis of Hepatitis C virus (HCV) HVR1 which includes the E1/E2 glycoprotein gene junction. This procedure describes the evolution of HCV in a treatment naïve environment, from 10 samples collected over 10 years, using ultradeep pyrosequencing (UDPS) performed on the Roche GS FLX titanium platform (Palmer et al., 2014). Initial clonal analysis of serum samples was used to inform downstream error correction algorithms that allowed for a greater sequence depth to be reached. PCR amplification of this region has been tested for HCV genotypes 1, 2, 3 and 4.

Keywords: Ultradeep pyrosequencing

Virus

Quasispecies

Hypervariability

Background

Analysis of UDPS datasets derived from virus amplicons frequently relies on software tools that are not optimized for amplicon analysis, assume random incorporation of sequencing mutations and are focused on finding true sequences rather than false variants. These difficulties are further complicated by the presence of hypervariable regions present in RNA virus genomes. Many studies utilizing UDPS look to overcome these issues by applying arbitrary frequency cut offs to the data, resulting in the loss of minor variants. Here, a temporally matched clonal dataset, together with an error correction methodology designed to overcome the problems outlined, facilitated the retention of valuable sequence information.

Materials and Reagents

1.5 ml tube (SARSTEDT, catalog number: 72.690.001 )
200 µl MicroAmp^® PCR tube (Thermo Fisher Scientific, Applied Biosystems^TM, catalog number: N8010840 )
Clean stainless steel blade
One Shot^® TOP10 Competent Cells (Thermo Fisher Scientific, Invitrogen^TM, catalog number: C404003 )
QIAamp^® Viral RNA mini kit (QIAGEN, catalog number: 52904 )
Random primer (Promega, catalog number: C1181 )
Deoxynucleoside triphosphate (dNTP’s, 100 mM) set, PCR grade (Roche Molecular Systems, catalog number: 11969064001 )
AMV reverse transcriptase (Promega, catalog number: M5101 )
RNasin^® Ribonuclease inhibitor (Promega, catalog number: N2511 )
Outer-forward primer: 5’- ATGGCATGGGATATGAT -3’ (10 pmol/µl, Eurofins)
Outer-reverse primer: 5’- AAGGCCGTCCTGTTGA -3’ (10 pmol/µl, Eurofins)
Inner-forward primer: 5’- GCATGGGATATGATGATGAA -3’ (10 pmol/µl, Eurofins)
Inner-reverse primer: 5’- GTCCTGTTGATGTGCCA -3’ (10 pmol/µl, Eurofins)
Pwo DNA polymerase (5 U/µl,) including 10x reaction buffer (- MgSO₄) and MgSO₄ stock solution (25 mM) (Roche Molecular Systems, catalog number: 11644955001 )
dH₂O (Sigma-Aldrich, catalog number: W4502 )
Sybr safe DNA gel stain (Thermo Fisher Scientific, Invitrogen^TM, catalog number: S33102 )
Agarose (Sigma-Aldrich, catalog number: A9539 )
GeneRuler 100 bp Plus DNA ladder (Thermo Fisher Scientific, Thermo Scientific^TM, catalog number: SM0323 )
Gel extraction kit (QIAGEN, catalog number: 28704 )
CloneJet PCR Cloning Kit (Thermo Fisher Scientific, Thermo Scientific^TM, catalog number: K1231 )
GeneJet Plasmid Miniprep Kit (Thermo Fisher Scientific, Thermo Scientific^TM, catalog number: K0503 )
Trizma^® base (Sigma-Aldrich, catalog number: T1503 )
Acetic acid glacial (BDH Laboratory Supplies, catalog number: 10001CU )
Ethylenediaminetetraacetic acid solution 0.5 M (EDTA) (Sigma-Aldrich, catalog number: 03690 )
1x TAE (see Recipes)

Equipment

PCR thermal cycler (Thermo Fisher Scientific, Applied Biosystems^TM, model: Applied Biosystems^® 2720 )
BioPhotometer (Eppendorf, http://arboretum.harvard.edu/wp-content/uploads/Biophotometer-manual.pdf)
Water bath (JULABO, model: SW22 )
Orbital shaker incubator (Grant, model: ES-80 )
Ultraviolet transilluminator (UVP, model: TMW-20 )

Software

SFFFile tools (Roche Molecular Systems)
k-mer error correction (KEC) and empirical threshold (ET) (Skums et al., 2012)
MEGA 6.0 (Tamura et al., 2013)

Procedure

RNA extraction and cDNA generation
1. Whole patient serum, surplus to diagnostic testing requirements and with a mean viral titer of 6 HCV RNA log₁₀ IU/ml, was used as the starting material.
2. RNA was extracted from 140 µl of serum using QIAamp^® Viral RNA mini kit according to the manufacturer’s instructions into 1.5 ml RNase free tubes and a final volume of 50 µl.
3. 11 µl of extracted viral RNA was mixed with 1 µl (0.5 µg) random primer.
4. Samples were incubated at 75 °C for 10 min.
5. To this was added a master mix which contained 2 µl (80 mM) dNTP mix, 1 µl (10 U) AMV reverse transcriptase, 1 µl (40 U) RNasin, 4 µl AMV reaction buffer.
6. cDNA generation took place at 42 °C for 60 min, followed by 94 °C for 3 min.
7. Samples were kept at 4 °C until required.
Nested PCR to amplify the HCV E1/E2 gene junction
1. Prepare the primary PCR master mix to a final volume of 45 µl:
  Outer-forward primer:
  1.5 µl
  Outer-reverse primer:
  1.5 µl
  10x reaction buffer (- MgSO₄):
  5 µl
  dNTP mix:
  1 µl
  MgSO₄ stock solution:
  3 µl
  Pwo:
  0.5 µl
  PCR grade water:
  32.5 µl
2. 5 µl of cDNA is then added to the master mix.
3. 1° PCR cycle parameters:
  1. Initial denaturation: 3 min at 94 °C
  2. Cycle conditions (repeat for 35 cycles):
    Denaturation: 15 sec at 94 °C
    Annealing: 30 sec at 51 °C
    Extension: 30 sec at 72 °C
  3. Final extension: 7 min at 72 °C
4. Keep sample at 4 °C until required.
5. Prepare master mix for secondary PCR to a final volume of 46 µl:
  Inner-forward primer:
  1.5 µl
  Inner-reverse primer:
  1.5 µl
  10x reaction buffer (- MgSO₄):
  5 µl
  dNTP mix:
  1 µl
  MgSO₄ stock solution:
  2 µl
  Pwo:
  0.5 µl
  PCR grade water:
  34.5 µl
6. 4 µl of primary PCR sample is then added to the master mix.
7. 2° PCR cycle parameters:
  1. Initial denaturation: 3 min at 94 °C
  2. Cycle conditions (repeat for 35 cycles):
    Denaturation: 15 sec at 94 °C
    Annealing: 30 sec at 53 °C
    Extension: 30 sec at 72 °C
  3. Final extension: 7 min at 72 °C
8. Samples were kept at 4 °C until required.
9. To ensure that the initial amount of the template was not limiting, 1:100 dilution of the viral RNA was prepared which, when used as the starting template for nested PCR as described, should yield an amplicon visualized by gel electrophoresis for each sample.
Preparation of samples for pyrosequencing
1. Two 2% TAE agarose gels were poured, one containing Sybr safe DNA gel stain and one without.
2. Once set, the gels were split in two, with one half of the gel containing the gel stain joined with the second gel without gel stain.
3. The 50 µl amplicon sample was split in two (10 µl and 40 µl) and resolved on the above gel. The 10 µl sample was stained using Sybr safe, while the 40 µl sample was not stained and went forward for downstream procedures. The resultant amplicon in this instance was 320 bp (Figure 1).
4. The region of the gel containing the unstained band (40 µl sample) was cut out using a clean stainless steel blade using the stained 10 µl sample as a positioning guide and transferred to a clean 1.5 ml tube.
5. The amplicon was gel extracted using a gel extraction kit according to the manufacturer’s instructions.
6. Extracted amplicons were quantified using a BioPhotometer.
7. Samples were prepared in equimolar concentrations and diluted to a final concentration of 1 x 10⁷ molecules/ml.
8. Pyrosequencing was outsourced to Roche 454 Life Sciences (Brandford, CT, USA).
  
  Figure 1. Amplicon visualization. Successful amplification of the 320 bp amplicon was confirmed following agarose gel electrophoresis. 10 µl of the 2° PCR sample was loaded.
Clonal analysis
1. Purified amplicons were cloned using CloneJet PCR Cloning Kit and transformed into One Shot^®TOP10 Competent Cells using the manufacturer’s instructions using a molar ratio of 3:1 insert to vector.
2. 20 clones per sample were generated.
3. Plasmids were purified using GeneJet Plasmid Miniprep Kit as per manufacturer’s instructions.
4. Sequencing of E1/E2 inserts was performed by Eurofins.
5. All trace files were inspected to exclude sequences where double peaks or regions of ambiguous sequence were present.
Data handling and error correction
1. The raw sff data files were managed using SFFFile tools.
2. Low-quality reads and reads shorter than 90% of the expected amplicon lengths were removed.
3. Phylogenetic separation of the clonal data using a general time-reversible model with gamma-distributed and invariant sites (GTR+G+I) using MEGA 6.0 (Tamura et al., 2013).
4. Main branches with bootstrap values (of 1,000 resamplings) > 85 were categorised as (sub-)lineages (Palmer et al., 2014).
5. Two 24-bp motifs, that defined the HVR1 amino acid profile of each (sub-)lineage, were subsequently applied to the sequence analysis pipeline. The first 15-bp of the motif span the conserved 3’-end of E1. The remaining 9-bp include the first three amino acids of the HVR1 at the 5’-end of E2.
6. The overall number of motifs used reflected the observed changes in the dominant HVR1 over time. For each (sub-)lineage, two motif reference sequences were deemed sufficient.
7. To increase the sensitivity of the sequencing error correction algorithms (KEC-ET), the UDPS data was partitioned according to the presence of corresponding motifs.
8. In order to ensure the quality of the analyzed data and the absence of PCR and sequencing chimeras, reads that had more than a 3 bp difference from the best-matching sequence from this motif set were removed.
9. KEC consists of the three stages
10. The following parameters of KEC were used: k = 25 and i = 3.

Data analysis

A more complete description of the data handling and error correction procedure can be found in the original article, http://jvi.asm.org/content/88/23/13709.short (Palmer et al., 2014).

Notes

All serum samples were genotyped and quantified by the Molecular Virology Diagnostic & Research Laboratory at Cork University Hospital, Cork, Ireland. https://www.ucc.ie/en/meddept/people/liam-fanning/mvdrl/

Recipes

1x TAE
4.84 g Tris base
1.15 ml acetic acid glacial
2 ml 0.5 M EDTA
Add dH₂O to 1 L

References

Palmer, B. A., Dimitrova, Z., Skums, P., Crosbie, O., Kenny-Walsh, E. and Fanning, L. J. (2014). Analysis of the evolution and structure of a complex intrahost viral population in chronic hepatitis C virus mapped by ultradeep pyrosequencing. J Virol 88(23): 13709-13721.
Skums, P., Dimitrova, Z., Campo, D. S., Vaughan, G., Rossi, L., Forbi, J. C., Yokosawa, J., Zelikovsky, A. and Khudyakov, Y. (2012). Efficient error correction for next-generation sequencing of viral amplicons. BMC Bioinformatics 13 Suppl 10: S6.
Tamura, K., Stecher, G., Peterson, D., Filipski, A. and Kumar, S. (2013). MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol 30(12): 2725-2729.

Article Information

Copyright

How to cite

Palmer, B. A., Dimitrova, Z., Skums, P., Crosbie, O., Kenny-Walsh, E. and Fanning, L. J. (2017). Ultradeep Pyrosequencing of Hepatitis C Virus to Define Evolutionary Phenotypes. Bio-protocol 7(10): e2284. DOI: 10.21769/BioProtoc.2284.

Download Citation in RIS Format

Outer-forward primer:	1.5 µl
Outer-reverse primer:	1.5 µl
10x reaction buffer (- MgSO₄):	5 µl
dNTP mix:	1 µl
MgSO₄ stock solution:	3 µl
Pwo:	0.5 µl
PCR grade water:	32.5 µl

Inner-forward primer:	1.5 µl
Inner-reverse primer:	1.5 µl
10x reaction buffer (- MgSO₄):	5 µl
dNTP mix:	1 µl
MgSO₄ stock solution:	2 µl
Pwo:	0.5 µl
PCR grade water:	34.5 µl