Benchmarking of viral haplotype reconstruction programmes: an overview of the capacities and limitations of currently available programmes

Schirmer, M., Sloan, W. T. and Quince, C. (2014) Benchmarking of viral haplotype reconstruction programmes: an overview of the capacities and limitations of currently available programmes. Briefings in Bioinformatics, 15(3), pp. 431-442. (doi: 10.1093/bib/bbs081)

Full text not currently available from Enlighten.

Publisher's URL: http://dx.doi.org/10.1093/bib/bbs081

Abstract

Viral haplotype reconstruction from a set of observed reads is one of the most challenging problems in bioinformatics today. Next-generation sequencing technologies enable us to detect single-nucleotide polymorphisms (SNPs) of haplotypes, even if the haplotypes appear at low frequencies. However, there are two major problems. First, we need to distinguish real SNPs from sequencing errors. Second, we need to determine which SNPs occur on the same haplotype, which cannot be inferred from the reads if the distance between SNPs on a haplotype exceeds the read length. We conducted an independent benchmarking study that directly compares the currently available viral haplotype reconstruction programmes. We also present nine in silico data sets that we generated to reflect biologically plausible populations. For these data sets, we simulated 454 and Illumina reads and applied the programmes to test their capacity to reconstruct whole genomes and individual genes. We developed a novel statistical framework to demonstrate the strengths and limitations of the programmes. Our benchmarking demonstrated that all the programmes we tested performed poorly when sequence divergence was low and failed to recover haplotype populations with rare haplotypes.
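
The second problem named in the abstract, that SNPs further apart than the read length cannot be linked directly, can be illustrated with a minimal sketch. It is not taken from the paper or from any of the benchmarked programmes. The function name `snps_linkable`, the example positions, and the read length of 250 are hypothetical, and the sketch assumes single-end reads of fixed length with no insertions or deletions.

```python
def snps_linkable(snp_positions, read_length):
    """For each adjacent pair of SNP positions, report whether one read can cover both.

    Hypothetical helper for illustration only: assumes single-end reads of a
    fixed length and positions given as integer coordinates on one reference.
    """
    positions = sorted(snp_positions)
    result = []
    for left, right in zip(positions, positions[1:]):
        # A read of length L that covers `left` reaches at most left + L - 1,
        # so both SNPs fit on one read only if right - left < L.
        spans = (right - left) < read_length
        result.append((left, right, spans))
    return result


if __name__ == "__main__":
    # Assumed example: three SNPs and 250-base reads.
    for left, right, spans in snps_linkable([100, 180, 900], 250):
        status = "can be linked by a single read" if spans else "cannot be linked directly"
        print(f"SNPs at {left} and {right}: {status}")
```

In this assumed example the SNPs at positions 100 and 180 can sit on one read, whereas those at 180 and 900 cannot, so their phase would have to be inferred indirectly, for instance from overlapping reads that share intermediate SNPs.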

Item Type: Articles
Status: Published
Refereed: Yes
Glasgow Author(s) Enlighten ID: Sloan, Professor William; Schirmer, Ms Melanie; Quince, Dr Christopher
Authors: Schirmer, M., Sloan, W. T., and Quince, C.
College/School: College of Science and Engineering > School of Engineering > Infrastructure and Environment
Journal Name: Briefings in Bioinformatics
Publisher: Oxford University Press
ISSN: 1467-5463
ISSN (Online): 1477-4054


Project Code: 503351
Award No: (none listed)
Project Name: Pioneering the genomics era of environmental microbiology
Principal Investigator: Christopher Quince
Funder's Name: Engineering & Physical Sciences Research Council (EPSRC)
Funder Ref: EP/H003851/1
Lead Dept: ENG - ENGINEERING INFRASTRUCTURE & ENVIR