Iterative correction of reference nucleotides (iCORN) using second generation sequencing technology

Otto, T. D. , Sanders, M., Berriman, M. and Newbold, C. (2010) Iterative correction of reference nucleotides (iCORN) using second generation sequencing technology. Bioinformatics, 26(14), pp. 1704-1707. (doi: 10.1093/bioinformatics/btq269) (PMID:20562415) (PMCID:PMC2894513)

[img]
Preview
Text
148431.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial.

299kB

Abstract

Abstract: Motivation: The accuracy of reference genomes is important for downstream analysis but a low error rate requires expensive manual interrogation of the sequence. Here, we describe a novel algorithm (Iterative Correction of Reference Nucleotides) that iteratively aligns deep coverage of short sequencing reads to correct errors in reference genome sequences and evaluate their accuracy. Results: Using Plasmodium falciparum (81% A + T content) as an extreme example, we show that the algorithm is highly accurate and corrects over 2000 errors in the reference sequence. We give examples of its application to numerous other eukaryotic and prokaryotic genomes and suggest additional applications.

Item Type:Articles
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Otto, Professor Thomas
Authors: Otto, T. D., Sanders, M., Berriman, M., and Newbold, C.
College/School:College of Medical Veterinary and Life Sciences > School of Infection & Immunity
Journal Name:Bioinformatics
Publisher:Oxford University Press
ISSN:1367-4803
ISSN (Online):1460-2059)
Copyright Holders:Copyright ©2010 The Authors
First Published:First published in Bioinformatics 26(14):1704-1707
Publisher Policy:Reproduced under a Creative Commons License

University Staff: Request a correction | Enlighten Editors: Update this record