A haplotype-resolved reference genome for Eucalyptus grandis

dc.contributor.authorLötter, Anneri
dc.contributor.authorBruna, Tomas
dc.contributor.authorDuong, Tuan A.
dc.contributor.authorBarry, Kerrie
dc.contributor.authorLipzen, Anna
dc.contributor.authorDaum, Chris
dc.contributor.authorYoshinaga, Yuko
dc.contributor.authorGrimwood, Jane
dc.contributor.authorJenkins, Jerry W.
dc.contributor.authorTalag, Jayson
dc.contributor.authorBorevitz, Justin
dc.contributor.authorLovell , John T.
dc.contributor.authorSchmutz, Jeremy
dc.contributor.authorWegrzyn, Jill L.
dc.contributor.authorMyburg, Alexander A.
dc.date.accessioned2026-03-12T05:34:08Z
dc.date.available2026-03-12T05:34:08Z
dc.date.issued2025-07
dc.descriptionAVAILABILITY DATA : The E. grandis v2.0 reference is available at https://phytozome-next.jgi.doe.gov/. The TAG0014 HAP1 and HAP2 genomes are available on Phytozome (https://phytozome-next.jgi.doe.gov/) and can be downloaded here: https://data.jgi.doe.gov/refine-download/phytozome?q=Eucalyptus+grandis+var.+TAG0014. The raw data is available on NCBI under BioProject PRJNA1217046. Scripts for genome comparisons and summary statistics and to generate images are available on GitLab: https://gitlab.com/Anneri/tag0014_genome.
dc.description.abstractEucalyptus grandis is a hardwood tree used worldwide as pure species or hybrid partner to breed fast-growing plantation forestry crops that serve as feedstocks of timber and lignocellulosic biomass for pulp, paper, biomaterials, and biorefinery products. The current v2.0 genome reference for the species served as the first reference for the genus and has helped drive the development of molecular breeding tools for eucalypts. Using PacBio HiFi long reads and Omni-C proximity ligation sequencing, we produced an improved, haplotype-phased assembly (v4.0) for TAG0014, an early-generation selection of E. grandis. The 2 haplotypes are 571 Mbp (HAP1) and 552 Mbp (HAP2) in size and consist of 37 and 46 contigs scaffolded onto 11 chromosomes (contig N50 of 28.9 and 16.7 Mbp), respectively. These haplotype assemblies are 70–90 Mbp smaller than the diploid v2.0 assembly but capture all except one of the 22 telomeres, suggesting that substantial redundant sequence was included in the previous assembly. A total of 35,929 (HAP1) and 35,583 (HAP2) gene models were annotated, of which 438 and 472 contain long introns (>10 kbp) in gene models previously (v2.0) identified as multiple smaller genes. These and other improvements have increased gene annotation completeness levels from 93.8 to 99.4% in the v4.0 assembly. We found that 6,493 and 6,346 genes are within tandem duplicate arrays (HAP1 and HAP2, respectively, 18.4 and 17.8% of the total) and >43.8% of the haplotype assemblies consists of repeat elements. Analysis of synteny between the haplotypes and the E. grandis v2.0 reference genome revealed extensive regions of collinearity, but also some major rearrangements, and provided a preview of population and pangenome variation in the species.
dc.description.departmentBiochemistry, Genetics and Microbiology (BGM)
dc.description.departmentForestry and Agricultural Biotechnology Institute (FABI)
dc.description.librarianam2026
dc.description.sdgSDG-15: Life on land
dc.description.sponsorshipSupported in part by the Department of Science, Technology and Innovation (DSTI) and Technology Innovation Agency (TIA) of South Africa, the South African Forestry Sector Innovation Fund and South African forestry industry partners through the Forest Molecular Genetics (FMG) Programme at the University of Pretoria (UP). PhD bursary support from the University of Pretoria Postgraduate Research Bursary Programme and funding from the UP Postgraduate Studies Abroad Programme.
dc.description.urihttps://academic.oup.com/g3journal
dc.identifier.citationLotter, A., Bruna, T., Duong, T.A. et al. 2025, 'A haplotype-resolved reference genome for Eucalyptus grandis', G3 Genes|Genomes|Genetics, vol. 15, no. 7, pp. 1-11. https://doi.org/10.1093/g3journal/jkaf112.
dc.identifier.issn2160-1836 (online)
dc.identifier.other10.1093/g3journal/jkaf112
dc.identifier.urihttp://hdl.handle.net/2263/108912
dc.language.isoen
dc.publisherOxford University Press
dc.rights© The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License.
dc.subjectEucalyptus grandis
dc.subjectGenome improvement
dc.subjectPhased assembly
dc.subjectTandem duplications
dc.subjectSynteny
dc.titleA haplotype-resolved reference genome for Eucalyptus grandis
dc.typeArticle

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Lotter_Haplotyperesolved_2025.pdf
Size:
1004.1 KB
Format:
Adobe Portable Document Format
Description:
Article
Loading...
Thumbnail Image
Name:
Lotter_HaplotyperesolvedSuppl_2025.docx
Size:
124.13 KB
Format:
Microsoft Word XML
Description:
Supplementary Material

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: