Validation of Variant Assembly Using HAPHPIPE with Next-Generation Sequence Data from Viruses.

Document Type

Journal Article

Publication Date

7-14-2020

Journal

Viruses

Volume

12

Issue

7

DOI

10.3390/v12070758

Grant Information

ul1tr001876 kl2tr001877

Abstract

Next-generation sequencing (NGS) offers a powerful opportunity to identify low-abundance, intra-host viral sequence variants, yet the focus of many bioinformatic tools on consensus sequence construction has precluded a thorough analysis of intra-host diversity. To take full advantage of the resolution of NGS data, we developed HAplotype PHylodynamics PIPEline (HAPHPIPE), an open-source tool for the de novo and reference-based assembly of viral NGS data, with both consensus sequence assembly and a focus on the quantification of intra-host variation through haplotype reconstruction. We validate and compare the consensus sequence assembly methods of HAPHPIPE to those of two alternative software packages, HyDRA and Geneious, using simulated HIV and empirical HIV, HCV, and SARS-CoV-2 datasets. Our validation methods included read mapping, genetic distance, and genetic diversity metrics. In simulated NGS data, HAPHPIPE generated

Peer Reviewed

1

Open Access

1

Find in your library

Share

COinS