Evaluation of the impact of Illumina error correction tools on de novo genome assembly

dc.contributor.authorHeydari, Mahdi
dc.contributor.authorMiclotte, Giles
dc.contributor.authorDemeester, Piet
dc.contributor.authorVan de Peer, Yves
dc.contributor.authorFostier, Jan
dc.date.accessioned2017-09-11T09:15:52Z
dc.date.available2017-09-11T09:15:52Z
dc.date.issued2017-08-18
dc.descriptionAdditional file 1: Supplementary Data. Evaluation of the impact of Illumina error correction tools on de novo genome assembly.en_ZA
dc.description.abstractBACKGROUND : Recently, many standalone applications have been proposed to correct sequencing errors in Illumina data. The key idea is that downstream analysis tools such as de novo genome assemblers benefit from a reduced error rate in the input data. Surprisingly, a systematic validation of this assumption using state-of-the-art assembly methods is lacking, even for recently published methods. RESULTS : For twelve recent Illumina error correction tools (EC tools) we evaluated both their ability to correct sequencing errors and their ability to improve de novo genome assembly in terms of contig size and accuracy. CONCLUSIONS : We confirm that most EC tools reduce the number of errors in sequencing data without introducing many new errors. However, we found that many EC tools suffer from poor performance in certain sequence contexts such as regions with low coverage or regions that contain short repeated or low-complexity sequences. Reads overlapping such regions are often ill-corrected in an inconsistent manner, leading to breakpoints in the resulting assemblies that are not present in assemblies obtained from uncorrected data. Resolving this systematic flaw in future EC tools could greatly improve the applicability of such tools.en_ZA
dc.description.departmentGeneticsen_ZA
dc.description.librarianam2017en_ZA
dc.description.sponsorshipThe Research Foundation - Flanders (FWO) (G0C3914N)en_ZA
dc.description.urihttp://www.biomedcentral.com/bmcbioinformaticsen_ZA
dc.identifier.citationHeydari, M., Miclotte, G., Demeester, P., Van de Peer, Y. & Fostier, J. 2017, 'Evaluation of the impact of Illumina error correction tools on de novo genome assembly', BMC Bioinformatics, vol. 18, art. no. 374, pp. 1-13.en_ZA
dc.identifier.issn1471-2105 (online)
dc.identifier.other10.1186/s12859-017-1784-8
dc.identifier.urihttp://hdl.handle.net/2263/62208
dc.language.isoenen_ZA
dc.publisherBioMed Centralen_ZA
dc.rights© The Author(s). 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License.en_ZA
dc.subjectNext-generation sequencingen_ZA
dc.subjectError correctionen_ZA
dc.subjectIlluminaen_ZA
dc.subjectGenome assemblyen_ZA
dc.subjectState of the arten_ZA
dc.subjectStandalone applicationsen_ZA
dc.subjectSequencing errorsen_ZA
dc.subjectPoor performanceen_ZA
dc.subjectAnalysis toolsen_ZA
dc.subjectGenesen_ZA
dc.subjectError correction tools (EC tools)en_ZA
dc.titleEvaluation of the impact of Illumina error correction tools on de novo genome assemblyen_ZA
dc.typeArticleen_ZA

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Heydari_Evaluation_2017.pdf
Size:
862.96 KB
Format:
Adobe Portable Document Format
Description:
Article
Loading...
Thumbnail Image
Name:
Heydari_EvaluationSuppl_2017.pdf
Size:
628.7 KB
Format:
Adobe Portable Document Format
Description:
Supplemental Data

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: