The genome of the king protea, Protea cynaroides

Loading...
Thumbnail Image

Authors

Chang, Jiyang
Duong, Tuan A.
Schoeman, Cassandra
Ma, Xiao
Roodt, Danielle
Barker, Nigel
Li, Zhen
Van de Peer, Yves
Mizrachi, Eshchar

Journal Title

Journal ISSN

Volume Title

Publisher

Wiley

Abstract

The king protea (Protea cynaroides), an early-diverging eudicot, is the most iconic species from the Megadiverse Cape Floristic Region, and the national flower of South Africa. Perhaps best known for its iconic flower head, Protea is a key genus for the South African horticulture industry and cut-flower market. Ecologically, the genus and the family Proteaceae are important models for radiation and adaptation, particularly to soils with limited phosphorus bio-availability. Here, we present a high-quality chromosome-scale assembly of the P. cynaroides genome as the first representative of the fynbos biome. We reveal an ancestral whole-genome duplication event that occurred in the Proteaceae around the late Cretaceous that preceded the divergence of all crown groups within the family and its extant diversity in all Southern continents. The relatively stable genome structure of P. cynaroides is invaluable for comparative studies and for unveiling paleopolyploidy in other groups, such as the distantly related sister group Ranunculales. Comparative genomics in sequenced genomes of the Proteales shows loss of key arbuscular mycorrhizal symbiosis genes likely ancestral to the family, and possibly the order. The P. cynaroides genome empowers new research in plant diversification, horticulture and adaptation, particularly to nutrient-poor soils.

Description

DATA AVAILABILITY STATEMENT : Protea cynaroides genome assembly and associated annotation files have been deposited at DDBJ/ENA/GenBank under the accession JAMYWD000000000. The version described in this paper is version JAMYWD010000000. Raw sequence reads (whole genome, transcriptome and Hi-C data) generated in this study have been deposited in the Sequence Read Archive (SRA) under the BioProject PRJNA847781.
SUPPORTING INFORMATION : FIGURE S1. Genome profiling with GenomeScope2 using Illumina short-read data. P. cynaroides is estimated to have a haploid genome size of 1.18 Gb with a 1.07% genome-wide heterozygosity level. FIGURE S2. BUSCO scores were obtained from runs with the embryophyta_odb10 dataset (n = 1614) on the intermediate stages and the final assembly. FIGURE S3. Repeat content of the assembly showing the relative proportions of the DNA element, long terminal (LTR), long interspersed (LINE), and other and unclassified repeats. FIGURE S4. Insertion time of Gypsy and Copia in P. cynaroides, M. integrifolia and N. nucifera. FIGURE S5. Dot plot comparing P. cynaroides, M. integrifolia and T. speciosissima with Aristolochia fimbriata. FIGURE S6. Syntenic relationships comparing P. cynaroides, M. integrifolia and T. speciosissima with V. vinifera. FIGURE S7. The conserved motifs of NUP85, NUP133, POLLUX/ DMI1 and NENA using MEME suite. FIGURE S8. The duplications of CCaMK (DMI3) in Proteaceae species derived from the whole-genome duplication event. FIGURE S9. Reconstruction of metabolic pathways involved in fatty acid biosynthesis and terpene biosynthesis in P. cynaroides. (a) GO enrichment of 1345 expanded families in P. cynaroides (P < 0.05). (b) Fatty acid synthesis pathway in P. cynaroides. Number in red means the gene number in P. cynaroides, number in black means the gene number in Arabidopsis. (c) Terpenoid biosynthesis pathway in P. cynaroides. Number in red means the gene number in P. cynaroides, number in black means the gene number in Arabidopsis. Synteny means the expanded genes in the syntenic blocks from the whole-genome duplication. Tandem duplication means the expanded genes are tandem duplication. FIGURE S10. The phylogenetic tree of TPS genes in P. cynaroides, M. integrifolia, A. thaliana and O. sativa. FIGURE S11. The phylogenetic tree of common symbiotic pathway (CSP) genes involved in arbuscular mycorrhizal symbiosis. Different genes are indicated by different colors. FIGURE S12. GO enrichment of retained genes after WGD. TABLE S1. Statistics of repeat predict. TABLE S2. Species used in this study. TABLE S3. Type II MADS-box genes. TABLE S4. AMS genes investigated in this study. TABLE S5. FAS genes in the P. cynaroides. TABLE S6. TPS genes in the P. cynaroides

Keywords

Protea cynaroides, Genome assembly, Genome annotation, Early-divergent eudicot, Comparative genomics, King protea (Protea cynaroides), SDG-15: Life on land

Sustainable Development Goals

SDG-15:Life on land

Citation

Chang, J., Duong, T.A., Schoeman, C. et al. 2023, 'The genome of the king protea, protea cynaroides', The Plant Journal, vol. 113, pp. 262–276. DOI: 10.1111/tpj.16044.