Environmental and Occupational Health Faculty Publications

NASP: an accurate, rapid method for the identification of SNPs in WGS datasets that supports flexible input and output formats

Jason Sahl
Darrin Lemmer
Jason Travis
James Schupp
John Gillece
Maliha Aziz, George Washington University
+several additional authors

Document Type

Journal Article

Publication Date

8-2016

Journal

Microbial Genomics

Volume

Issue

Inclusive Pages

e000074.

DOI

10.1099/mgen.0.000074

Abstract

Whole-genome sequencing (WGS) of bacterial isolates has become standard practice in many laboratories. Applications for WGS analysis include phylogeography and molecular epidemiology, using single nucleotide polymorphisms (SNPs) as the unit of evolution. NASP was developed as a reproducible method that scales well with the hundreds to thousands of WGS data typically used in comparative genomics applications. In this study, we demonstrate how NASP compares with other tools in the analysis of two real bacterial genomics datasets and one simulated dataset. Our results demonstrate that NASP produces similar, and often better, results in comparison with other pipelines, but is much more flexible in terms of data input types, job management systems, diversity of supported tools and output formats. We also demonstrate differences in results based on the choice of the reference genome and choice of inferring phylogenies from concatenated SNPs or alignments including monomorphic positions. NASP represents a source-available, version-controlled, unit-tested method and can be obtained from tgennorth.github.io/NASP.

Comments

Reproduced with permission of the Microbiology Society. Microbial Genomics

Creative Commons License

This work is licensed under a Creative Commons Attribution-Noncommercial 4.0 License

APA Citation

Sahl, J., Lemmer, D., Travis, J., Schupp, J., Gillece, J., Aziz, M., & +several additional authors (2016). NASP: an accurate, rapid method for the identification of SNPs in WGS datasets that supports flexible input and output formats. Microbial Genomics, 2 (8). http://dx.doi.org/10.1099/mgen.0.000074

Peer Reviewed

Open Access

Download

Included in

Bioinformatics Commons, Integrative Biology Commons, Research Methods in Life Sciences Commons, Systems Biology Commons

COinS

Environmental and Occupational Health Faculty Publications

NASP: an accurate, rapid method for the identification of SNPs in WGS datasets that supports flexible input and output formats

Document Type

Publication Date

Journal

Volume

Issue

Inclusive Pages

DOI

Abstract

Comments

Creative Commons License

APA Citation

Peer Reviewed

Open Access

Included in

Search

Browse

Author Corner

Links

Environmental and Occupational Health Faculty Publications

NASP: an accurate, rapid method for the identification of SNPs in WGS datasets that supports flexible input and output formats

Authors

Document Type

Publication Date

Journal

Volume

Issue

Inclusive Pages

DOI

Abstract

Comments

Creative Commons License

APA Citation

Peer Reviewed

Open Access

Included in

Share

Search

Browse

Author Corner

Links