Cornet, LucZaidi, Syed Shan‑e‑AliLi, JiaNgapout, YvanShakir, SaraMeunier, LoicCallot, CarolineMarande, WilliamHanikenne, MarcRombauts, StephaneVan de Peer, YvesVanderschuren, Hervé2026-03-122026-03-122025-06-29Cornet, L., Zaidi, S.S.-e-A., Li, J. et al. 2025, 'A BAC‑guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava', Genome Biology, vol. 26, no. 185, pp. 1-15. https://doi.org/10.1186/s13059-025-03620-8.1474-7596 (print)1474-760X (online)10.1186/s13059-025-03620-8http://hdl.handle.net/2263/108915DATA AVAILABILITY : L Cornet and H Vanderschuren have summitted raw data to the National Center for Biotechnology Information (NCBI) under the Bioproject accession no. PRJNA981703 [83], Raw nanopore reads no. SRR25338822 for cv. 60444 and no. SRR25338821 for TME3, bionano optical maps accession no. SUPPF_0000005665 for cv. 60444 and no. SUPPF_0000005666 for TME3 (both available from https://ftp.ncbi.nlm.nih.gov/pub/supplementary_data/bionanomaps.csv), and RNAseq reads accession no. SRR25338832. L Cornet and H Vanderschuren have summitted genome assemblies to European Nucleotide Archive (ENA) under the Bioproject PRJEB65447, genome accession GCA_963409065.1 for cv. 60444 and GCA_963409055.1 for TME3.SUPPLEMENTARY INFORMATION Additional file 1: Figs. S1–S2 and Tables S1–S6. Additional file 2: Peer review history.BACKGROUND : Cassava is an important crop for food security in the tropics where its production is jeopardized by several viral diseases, including the cassava mosaic disease (CMD) which is endemic in Sub-Saharan Africa and the Indian subcontinent. Resistance to CMD is linked to a single dominant locus, namely CMD2. The cassava genome contains highly repetitive regions making the accurate assembly of a reference genome challenging. RESULTS : In the present study, we generate BAC libraries of the CMD-susceptible cassava cultivar (cv.) 60444 and the CMD-resistant landrace TME3. We subsequently identify and sequence BACs belonging to the CMD2 region in both cultivars using high-accuracy long-read PacBio circular consensus sequencing (ccs) reads. We then sequence and assemble the complete genomes of cv. 60444 and TME3 using a combination of ONT ultra-long reads and optical mapping. Anchoring the assemblies on cassava genetic maps reveals discrepancies in our, as well as in previously released, CMD2 regions of the cv. 60444 and TME3 genomes. A BAC-guided approach to assess cassava genome assemblies significantly improves the synteny between the assembled CMD2 regions of cv. 60444 and TME3 and the CMD2 genetic maps. We then performed repeat-unmasked gene annotation on CMD2 assemblies and identify 81 stress resistance proteins present in the CMD2 region, among which 31 were previously not reported in publicly available CMD2 sequences. CONCLUSIONS : The BAC-assessed approach improved CMD2 region accuracy and revealed new sequences linked to virus resistance, advancing our understanding of cassava mosaic disease resistance.en© The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License.CassavaVirus resistanceCMD2 sequencesSub-Saharan Africa (SSA)Cassava mosaic disease (CMD)A BAC‑guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassavaArticle