Haplotype mining panel for genetic dissection and breeding in Eucalyptus

Show simple item record

dc.contributor.author Candotti, Julia
dc.contributor.author Christie, Nanette
dc.contributor.author Ployet, Raphael
dc.contributor.author Mostert-O’Neill, Marja M.
dc.contributor.author Reynolds, Sharon Melissa
dc.contributor.author Neves, Leandro Gomide
dc.contributor.author Naidoo, Sanushka
dc.contributor.author Mizrachi, Eshchar
dc.contributor.author Duong, Tuan A.
dc.contributor.author Myburg, Alexander A.
dc.date.accessioned 2024-09-13T11:36:27Z
dc.date.available 2024-09-13T11:36:27Z
dc.date.issued 2023-01
dc.description DATA AVAILABILITY STATEMENT : All of the short-read sequencing data are being submitted to the NCBI Short Read Archive (SRA) under the BioProject ID number PRJNA873875. en_US
dc.description SUPPORTING INFORMATION : FIGURE S1. Number of genes in categories. FIGURE S2. Number of SNPs and target regions remaining after each round of SNP filtering. FIGURE S3. Example where calling SNPs as diploid lead to the identification of false haplotypes. FIGURE S4. Example where the allelic balance of tetraploid SNPs caused incorrect haplotypes to be identified. FIGURE S5. Distribution of SNP variant allele frequency across seven FS families. FIGURE S6. Location of target regions (haplotype markers) per haplotype quality category across the genome. FIGURE S7. Distribution of haplotype call rate. FIGURE S8. Distribution of average raw read depth per probe set for the three target region categories. FIGURE S9. Percentage of individuals with target regions containing 3 or 4 haplotypes per individual (tetraploid calls) in the four species. FIGURE S10. Average heterozygosity of individuals based on SNPs and haplotypes in Category 1 target regions. FIGURE S11. Number of shared and unique high-quality haplotypes from Category 1 target regions in the four species. FIGURE S12. Haplotype diversity across gene categories and gene regions. FIGURES13. Pairwise correlations of haplotype numbers (proxy for haplotype diversity) among the four gene regions. FIGURE S14. Distribution of allele frequency for haplotypes and SNPs. FIGURE S15. Haplotype diversity of genes in the lignin biosynthetic pathway. FIGURE S16. Overview of panel design process. FIGURE S17. Target regions for probe design for each gene in the haplotype mining panel. TABLE S1. Lines of evidence and gene numbers from published and unpublished data. TABLE S2. Lines of evidence and number of genes for each category in the final gene list. TABLE S3. Statistics for target regions with more than 2 haplotypes per individual for the four species. TABLE S4. Number of target regions at genes known to be duplicated. TABLE S5. Call rate of individuals and target regions for the four species across the three target region categories. TABLE S6. Comparison of low-frequency SNPs and haplotypes across the four species and a single half-sib family. TABLE S7. GO enrichment for the genes with the least (bottom 10%) diverse target regions. TABLE S8. Provenance and species information of 89 species samples. TABLE S9. Pedigree structure of the full-sib (FS) families used in this study. APPENDIX S1. Location and haplotype information for each gene targeted. en_US
dc.description.abstract To improve our understanding of genetic mechanisms underlying complex traits in plants, a comprehensive analysis of gene variants is required. Eucalyptus is an important forest plantation genus that is highly outbred. Trait dissection and molecular breeding in eucalypts currently relies on biallelic single-nucleotide polymorphism (SNP) markers. These markers fail to capture the large amount of haplotype diversity in these species, and thus multi-allelic markers are required. We aimed to develop a gene-based haplotype mining panel for Eucalyptus species. We generated 17 999 oligonucleotide probe sets for targeted sequencing of selected regions of 6293 genes implicated in growth and wood properties, pest and disease resistance, and abiotic stress responses. We identified and phased 195 834 SNPs using a read-based phasing approach to reveal SNP-based haplotypes. A total of 8915 target regions (at 4637 gene loci) passed tests for Mendelian inheritance. We evaluated the haplotype panel in four Eucalyptus species (E. grandis, E. urophylla, E. dunnii and E. nitens) to determine its ability to capture diversity across eucalypt species. This revealed an average of 3.13–4.52 haplotypes per target region in each species, and 33.36% of the identified haplotypes were shared by at least two species. This haplotype mining panel will enable the analysis of haplotype diversity within and between species, and provide multi-allelic markers that can be used for genome-wide association studies and gene-based breeding approaches. en_US
dc.description.department Biochemistry, Genetics and Microbiology (BGM) en_US
dc.description.department Forestry and Agricultural Biotechnology Institute (FABI) en_US
dc.description.librarian am2024 en_US
dc.description.sdg SDG-15:Life on land en_US
dc.description.sponsorship The Department of Science and Innovation and Technology Innovation Agency (DSI/TIA, Strategic Grant for the Eucalyptus Genomics Platform), the Forestry Sector Innovation Fund (FSIF Eucalyptus Genome Diversity Atlas grant), and by Sappi and Mondi through the Forest Molecular Genetics (FMG) Industry Consortium at the University of Pretoria. JC acknowledges PhD scholarship support from the NRF (UID 130369) and the University of Pretoria. en_US
dc.description.uri https://onlinelibrary.wiley.com/journal/1365313x en_US
dc.identifier.citation Candotti, J., Christie, N., Ployet, R. et al. 2023, 'Haplotype mining panel for genetic dissection and breeding in Eucalyptus', The Plant Journal, vol. 113, pp. 174-185. DOI: 10.1111/tpj.16026. en_US
dc.identifier.issn 0960-7412 (print)
dc.identifier.issn 1365-313X (online)
dc.identifier.other 10.1111/tpj.16026
dc.identifier.uri http://hdl.handle.net/2263/98194
dc.language.iso en en_US
dc.publisher Wiley en_US
dc.rights © 2022 The Authors. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License. en_US
dc.subject Haplotype en_US
dc.subject Multi-allelic markers en_US
dc.subject Gene-centric genotyping en_US
dc.subject Eucalyptus en_US
dc.subject SDG-15: Life on land en_US
dc.title Haplotype mining panel for genetic dissection and breeding in Eucalyptus en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record