Hunt, M., Kikuchi, T., Sanders, M., Newbold, C., Berriman, M. and Otto, T. D. (2013) REAPR: a universal tool for genome assembly evaluation. Genome Biology, 14(5), R47. (doi: 10.1186/gb-2013-14-5-r47) (PMID:22678431) (PMCID:PMC3648784)
|
Text
148018.pdf - Published Version Available under License Creative Commons Attribution. 1MB |
Abstract
Methods to reliably assess the accuracy of genome sequence data are lacking. Currently completeness is only described qualitatively and mis-assemblies are overlooked. Here we present REAPR, a tool that precisely identifies errors in genome assemblies without the need for a reference sequence. We have validated REAPR on complete genomes or de novo assemblies from bacteria, malaria and Caenorhabditis elegans, and demonstrate that 86% and 82% of the human and mouse reference genomes are error-free, respectively. When applied to an ongoing genome project, REAPR provides corrected assembly statistics allowing the quantitative comparison of multiple assemblies. REAPR is available at http://www.sanger.ac.uk/resources/software/reapr/.
Item Type: | Articles |
---|---|
Status: | Published |
Refereed: | Yes |
Glasgow Author(s) Enlighten ID: | Otto, Professor Thomas |
Authors: | Hunt, M., Kikuchi, T., Sanders, M., Newbold, C., Berriman, M., and Otto, T. D. |
College/School: | College of Medical Veterinary and Life Sciences > School of Infection & Immunity |
Journal Name: | Genome Biology |
Publisher: | BioMed Central |
ISSN: | 1474-760X |
ISSN (Online): | 465-6906 |
Copyright Holders: | Copyright © 2013 The Authors |
First Published: | First published in Genome Biology 14(5):R47 |
Publisher Policy: | Reproduced under a Creative Commons License |
University Staff: Request a correction | Enlighten Editors: Update this record