Genome sequence and genetic diversity analysis of an under-domesticated orphan crop, white fonio (Digitaria exilis)

dc.contributor.authorWang, Xuewen
dc.contributor.authorChen, Shiyu
dc.contributor.authorChen, Shiyu
dc.contributor.authorYssel, A.E.J. (Anna)
dc.contributor.authorChaluvadi, Srinivasa R.
dc.contributor.authorJohnson, Matthew S.
dc.contributor.authorGangashetty, Prakash
dc.contributor.authorHamidou, Falalou
dc.contributor.authorSanogo, Moussa D.
dc.contributor.authorZwaenepoel, Arthur
dc.contributor.authorWallace, Jason
dc.contributor.authorVan de Peer, Yves
dc.contributor.authorBennetzen, Jeffrey L.
dc.contributor.authorVan Deynze, Allen
dc.date.accessioned2022-03-10T11:07:17Z
dc.date.available2022-03-10T11:07:17Z
dc.date.issued2021-03
dc.descriptionSupplementary Figure S1. A. Comparison of the contiguity of the Niatia Genome and CM05836 [21] genome. B. Comparison of contig tig00001331 corresponding to 100 consecutive segments anchored on the same chromosome 3B and tig00010942 corresponding to 65 consecutive segments on the chromosome 5A on the Abrouk et al. [21] genome.en_ZA
dc.descriptionSupplementary Figure S2. The k-mer distribution of raw Illumina reads at k-mer value 33 bp.en_ZA
dc.descriptionSupplementary Figure S3. A. Marginal posterior distributions for 2 independent chains (green and orange) and induced marginal prior distributions (blue) for internal node ages (t n11 to t n19, see panel C), overall mean substitution rate (mu), mean substitution rate for different codon positions (mu1,mu2, andmu3), and variance parameter of the uncorrelated relaxed clock (sigma2 1, sigma2 2, and sigma2 3) for the 3 codon positions. B. Trace plots for the MCMC chains associated with panel (A).en_ZA
dc.descriptionSupplementary Figure S4. There are 10,075 families that have 2 copies in fonio and 1 copy in Setaria italica, and 90% of 2-copy families are located in synteny blocks. The above 4 examples indicate the high degree of collinearity and synteny between S. italica and fonio.en_ZA
dc.descriptionSupplementary Figure S5. GO of single-copy, contracted genes in fonio.en_ZA
dc.descriptionSupplementary Figure S6. GO enrichment for expanded genes in D. exilis and relative to O. sativa.en_ZA
dc.descriptionSupplementary Figure S7. Phylogenetic tree of the SSH-like genes from fonio and related species. The genes shaded in light blue are the family members most closely related to SSH-1 in O. sativa and D. exilis. Genes are named according to their PLAZA identifiers. Abbreviations for species names are as follows: Bradi (Brachypodium distachyon), pgl GLEAN (Cenchrus amercianus), Digex (Digitaria exilis), Oropetium (Oropetium thomaeum), OsR (Oryza sativa), Seita (Setaria italica), Sobic (Sorghum bicolor), and Zm (Zea mays).en_ZA
dc.descriptionSupplementary Figure S8. Phylogenetic tree of the dw3 gene family of fonio and related species.en_ZA
dc.descriptionSupplementary Figure S9. Gene family tree for GW2-A-like genes in fonio and related species. This figure also includes the genes from 2 additional Pooid species, barley (Hordeum vulgare) (HORV) and wheat (Triticum turgidum) (TRITD).en_ZA
dc.descriptionSupplementary Table S1. Comparison of genome assembly statistics of fonio.en_ZA
dc.descriptionSupplementary Table S2. Statistics for the gene annotation.en_ZA
dc.descriptionSupplementary Table S3. Annotated non-coding RNA genes.en_ZA
dc.descriptionSupplementary Table S4. Orthologs for suppression of Shattering1 genes.en_ZA
dc.descriptionSupplementary Table S5. Orthologs of Dwarf Gene-3.en_ZA
dc.descriptionSupplementary Table S6. Orthologs of Grain Weight-2 genes.en_ZA
dc.descriptionSupplementary Table S7. Passport data for accessions and samples used for diversity study (see Supplementary Tables Excel file).en_ZA
dc.descriptionSupplementary Table S8. Single-nucleotide polymorphism database used for diversity study (see Supplementary Tables Excel file).en_ZA
dc.description.abstractBackground: Digitaria exilis, white fonio, is a minor but vital crop of West Africa that is valued for its resilience in hot, dry, and low-fertility environments and for the exceptional quality of its grain for human nutrition. Its success is hindered, however, by a low degree of plant breeding and improvement. FINDINGS : We sequenced the fonio genome with long-read SMRT-cell technology, yielding a ∼761 Mb assembly in 3,329 contigs (N50, 1.73 Mb; L50, 126). The assembly approaches a high level of completion, with a BUSCO score of >99%. The fonio genome was found to be a tetraploid, with most of the genome retained as homoeologous duplications that differ overall by ∼4.3%, neglecting indels. The 2 genomes within fonio were found to have begun their independent divergence ∼3.1 million years ago. The repeat content (>49%) is fairly standard for a grass genome of this size, but the ratio of Gypsy to Copia long terminal repeat retrotransposons (∼6.7) was found to be exceptionally high. Several genes related to future improvement of the crop were identified including shattering, plant height, and grain size. Analysis of fonio population genetics, primarily in Mali, indicated that the crop has extensive genetic diversity that is largely partitioned across a north-south gradient coinciding with the Sahel and Sudan grassland domains. CONCLUSIONS : We provide a high-quality assembly, annotation, and diversity analysis for a vital African crop. The availability of this information should empower future research into further domestication and improvement of fonio.en_ZA
dc.description.departmentBiochemistryen_ZA
dc.description.departmentGeneticsen_ZA
dc.description.departmentMicrobiology and Plant Pathologyen_ZA
dc.description.librarianam2022en_ZA
dc.description.sponsorshipThe Giles Fellowship from the University of Georgia; the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program; the Seed Biotechnology Center, University of California; the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) and the McKnight Foundation.en_ZA
dc.description.urihttps://academic.oup.com/gigascienceen_ZA
dc.identifier.citationWang, X., Chen, S., Ma, X. et al. 2021, 'Genome sequence and genetic diversity analysis of an under-domesticated orphan crop, white fonio (Digitaria exilis)', GigaScience, vol. 10, no. 3, pp. 1-12.en_ZA
dc.identifier.issn2047-217X (online)
dc.identifier.other10.1093/gigascience/giab013
dc.identifier.urihttp://hdl.handle.net/2263/84425
dc.language.isoenen_ZA
dc.publisherOxford University Pressen_ZA
dc.rights© The Author(s) 2021. This is an Open Access article distributed under the terms of the Creative Commons Attribution License.en_ZA
dc.subjectDomesticationen_ZA
dc.subjectGene amplificationen_ZA
dc.subjectGene lossen_ZA
dc.subjectMilleten_ZA
dc.subjectPolyploidyen_ZA
dc.titleGenome sequence and genetic diversity analysis of an under-domesticated orphan crop, white fonio (Digitaria exilis)en_ZA
dc.typeArticleen_ZA

Files

Original bundle

Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
Wang_Performance_2021.pdf
Size:
2.41 MB
Format:
Adobe Portable Document Format
Description:
Article
Loading...
Thumbnail Image
Name:
Wang_PerformanceSuppl1_2021.pdf
Size:
913.07 KB
Format:
Adobe Portable Document Format
Description:
Supplementary Material 1
Loading...
Thumbnail Image
Name:
Wang_PerformanceSuppl2_2021.xlsx
Size:
1.1 MB
Format:
Microsoft Excel XML
Description:
Supplementary Material 2
Loading...
Thumbnail Image
Name:
Wang_PerformanceSuppl3_2021.pdf
Size:
97.58 KB
Format:
Adobe Portable Document Format
Description:
Supplementary Material 3

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: