Document Type

Journal Article

Publication Date

6-11-2014

Journal

PLoS ONE

Volume

Volume 9, Issue 6

Inclusive Pages

Article number e99033

DOI

10.1371/journal.pone.0099033

Abstract

Due to the size of Next-Generation Sequencing data, the computational challenge of sequence alignment has been vast. Inexact alignments can take up to 90% of total CPU time in bioinformatics pipelines. High-performance Integrated Virtual Environment (HIVE), a cloud-based environment optimized for storage and analysis of extra-large data, presents an algorithmic solution: the HIVE-hexagon DNA sequence aligner. HIVE-hexagon implements novel approaches to exploit both characteristics of sequence space and CPU, RAM and Input/Output (I/O) architecture to quickly compute accurate alignments. Key components of HIVE-hexagon include non-redundification and sorting of sequences; floating diagonals of linearized dynamic programming matrices; and consideration of cross-similarity to minimize computations.

Comments

Reproduced with permission of PLoS ONE.

Creative Commons License

Creative Commons License
The person who associated a work with this deed has dedicated the work to the public domain by waiving all of his or her rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law.

Peer Reviewed

1

Open Access

1

Table_S1.xlsx (12 kB)
Table S1

Table_S2.xlsx (16 kB)
Table S2

Table_S3.xlsx (9 kB)
Table S3

Table_S4.xlsx (9 kB)
Table S4

Table_S5.xlsx (14 kB)
Table S5

Share

COinS