dc.contributor.author |
Candotti, Julia
|
|
dc.contributor.author |
Christie, Nanette
|
|
dc.contributor.author |
Ployet, Raphael
|
|
dc.contributor.author |
Mostert-O’Neill, Marja M.
|
|
dc.contributor.author |
Reynolds, Sharon Melissa
|
|
dc.contributor.author |
Neves, Leandro Gomide
|
|
dc.contributor.author |
Naidoo, Sanushka
|
|
dc.contributor.author |
Mizrachi, Eshchar
|
|
dc.contributor.author |
Duong, Tuan A.
|
|
dc.contributor.author |
Myburg, Alexander A.
|
|
dc.date.accessioned |
2024-09-13T11:36:27Z |
|
dc.date.available |
2024-09-13T11:36:27Z |
|
dc.date.issued |
2023-01 |
|
dc.description |
DATA AVAILABILITY STATEMENT :
All of the short-read sequencing data are being submitted
to the NCBI Short Read Archive (SRA) under the BioProject
ID number PRJNA873875. |
en_US |
dc.description |
SUPPORTING INFORMATION : FIGURE S1. Number of genes in categories. FIGURE S2. Number of SNPs and target regions remaining after each round of SNP filtering. FIGURE S3. Example where calling SNPs as diploid lead to the identification of false haplotypes. FIGURE S4. Example where the allelic balance of tetraploid SNPs caused incorrect haplotypes to be identified. FIGURE S5. Distribution of SNP variant allele frequency across seven FS families. FIGURE S6. Location of target regions (haplotype markers) per haplotype quality category across the genome. FIGURE S7. Distribution of haplotype call rate. FIGURE S8. Distribution of average raw read depth per probe set for the three target region categories. FIGURE S9. Percentage of individuals with target regions containing 3 or 4 haplotypes per individual (tetraploid calls) in the four species. FIGURE S10. Average heterozygosity of individuals based on SNPs and haplotypes in Category 1 target regions. FIGURE S11. Number of shared and unique high-quality haplotypes from Category 1 target regions in the four species. FIGURE S12. Haplotype diversity across gene categories and gene regions. FIGURES13. Pairwise correlations of haplotype numbers (proxy for haplotype diversity) among the four gene regions. FIGURE S14. Distribution of allele frequency for haplotypes and SNPs. FIGURE S15. Haplotype diversity of genes in the lignin biosynthetic pathway. FIGURE S16. Overview of panel design process. FIGURE S17. Target regions for probe design for each gene in the haplotype mining panel. TABLE S1. Lines of evidence and gene numbers from published and unpublished data. TABLE S2. Lines of evidence and number of genes for each category in the final gene list. TABLE S3. Statistics for target regions with more than 2 haplotypes per individual for the four species. TABLE S4. Number of target regions at genes known to be duplicated. TABLE S5. Call rate of individuals and target regions for the four species across the three target region categories. TABLE S6. Comparison of low-frequency SNPs and haplotypes across the four species and a single half-sib family. TABLE S7. GO enrichment for the genes with the least (bottom 10%) diverse target regions. TABLE S8. Provenance and species information of 89 species samples. TABLE S9. Pedigree structure of the full-sib (FS) families used in this study. APPENDIX S1. Location and haplotype information for each gene targeted. |
en_US |
dc.description.abstract |
To improve our understanding of genetic mechanisms underlying complex traits in plants, a comprehensive
analysis of gene variants is required. Eucalyptus is an important forest plantation genus that is highly outbred.
Trait dissection and molecular breeding in eucalypts currently relies on biallelic single-nucleotide polymorphism
(SNP) markers. These markers fail to capture the large amount of haplotype diversity in these
species, and thus multi-allelic markers are required. We aimed to develop a gene-based haplotype mining
panel for Eucalyptus species. We generated 17 999 oligonucleotide probe sets for targeted sequencing of
selected regions of 6293 genes implicated in growth and wood properties, pest and disease resistance, and
abiotic stress responses. We identified and phased 195 834 SNPs using a read-based phasing approach to
reveal SNP-based haplotypes. A total of 8915 target regions (at 4637 gene loci) passed tests for Mendelian
inheritance. We evaluated the haplotype panel in four Eucalyptus species (E. grandis, E. urophylla, E. dunnii
and E. nitens) to determine its ability to capture diversity across eucalypt species. This revealed an average
of 3.13–4.52 haplotypes per target region in each species, and 33.36% of the identified haplotypes were
shared by at least two species. This haplotype mining panel will enable the analysis of haplotype diversity
within and between species, and provide multi-allelic markers that can be used for genome-wide association
studies and gene-based breeding approaches. |
en_US |
dc.description.department |
Biochemistry, Genetics and Microbiology (BGM) |
en_US |
dc.description.department |
Forestry and Agricultural Biotechnology Institute (FABI) |
en_US |
dc.description.librarian |
am2024 |
en_US |
dc.description.sdg |
SDG-15:Life on land |
en_US |
dc.description.sponsorship |
The Department of Science and Innovation and Technology Innovation Agency (DSI/TIA, Strategic Grant for the Eucalyptus Genomics Platform), the Forestry Sector Innovation Fund (FSIF Eucalyptus Genome Diversity Atlas grant), and by Sappi and Mondi through the Forest Molecular Genetics (FMG) Industry Consortium at the University of Pretoria. JC acknowledges PhD scholarship support from the NRF (UID 130369) and the University of Pretoria. |
en_US |
dc.description.uri |
https://onlinelibrary.wiley.com/journal/1365313x |
en_US |
dc.identifier.citation |
Candotti, J., Christie, N., Ployet, R. et al. 2023, 'Haplotype mining panel for genetic dissection and breeding
in Eucalyptus', The Plant Journal, vol. 113, pp. 174-185. DOI: 10.1111/tpj.16026. |
en_US |
dc.identifier.issn |
0960-7412 (print) |
|
dc.identifier.issn |
1365-313X (online) |
|
dc.identifier.other |
10.1111/tpj.16026 |
|
dc.identifier.uri |
http://hdl.handle.net/2263/98194 |
|
dc.language.iso |
en |
en_US |
dc.publisher |
Wiley |
en_US |
dc.rights |
© 2022 The Authors.
This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License. |
en_US |
dc.subject |
Haplotype |
en_US |
dc.subject |
Multi-allelic markers |
en_US |
dc.subject |
Gene-centric genotyping |
en_US |
dc.subject |
Eucalyptus |
en_US |
dc.subject |
SDG-15: Life on land |
en_US |
dc.title |
Haplotype mining panel for genetic dissection and breeding in Eucalyptus |
en_US |
dc.type |
Article |
en_US |