DATA AVAILABILITY STATEMENT :
Protea cynaroides genome assembly and associated annotation
files have been deposited at DDBJ/ENA/GenBank
under the accession JAMYWD000000000. The version
described in this paper is version JAMYWD010000000. Raw
sequence reads (whole genome, transcriptome and Hi-C data)
generated in this study have been deposited in the Sequence
Read Archive (SRA) under the BioProject PRJNA847781.
SUPPORTING INFORMATION : FIGURE S1. Genome profiling with GenomeScope2 using Illumina short-read data. P. cynaroides is estimated to have a haploid genome size of 1.18 Gb with a 1.07% genome-wide heterozygosity level. FIGURE S2. BUSCO scores were obtained from runs with the embryophyta_odb10 dataset (n = 1614) on the intermediate stages and the final assembly. FIGURE S3. Repeat content of the assembly showing the relative proportions of the DNA element, long terminal (LTR), long interspersed (LINE), and other and unclassified repeats. FIGURE S4. Insertion time of Gypsy and Copia in P. cynaroides, M. integrifolia and N. nucifera. FIGURE S5. Dot plot comparing P. cynaroides, M. integrifolia and T. speciosissima with Aristolochia fimbriata. FIGURE S6. Syntenic relationships comparing P. cynaroides, M. integrifolia and T. speciosissima with V. vinifera. FIGURE S7. The conserved motifs of NUP85, NUP133, POLLUX/ DMI1 and NENA using MEME suite. FIGURE S8. The duplications of CCaMK (DMI3) in Proteaceae species derived from the whole-genome duplication event. FIGURE S9. Reconstruction of metabolic pathways involved in fatty acid biosynthesis and terpene biosynthesis in P. cynaroides. (a) GO enrichment of 1345 expanded families in P. cynaroides (P < 0.05). (b) Fatty acid synthesis pathway in P. cynaroides. Number in red means the gene number in P. cynaroides, number in black means the gene number in Arabidopsis. (c) Terpenoid biosynthesis pathway in P. cynaroides. Number in red means the gene number in P. cynaroides, number in black means the gene number in Arabidopsis. Synteny means the expanded genes in the syntenic blocks from the whole-genome duplication. Tandem duplication means the expanded genes are tandem duplication. FIGURE S10. The phylogenetic tree of TPS genes in P. cynaroides, M. integrifolia, A. thaliana and O. sativa. FIGURE S11. The phylogenetic tree of common symbiotic pathway (CSP) genes involved in arbuscular mycorrhizal symbiosis. Different genes are indicated by different colors. FIGURE S12. GO enrichment of retained genes after WGD. TABLE S1. Statistics of repeat predict. TABLE S2. Species used in this study. TABLE S3. Type II MADS-box genes. TABLE S4. AMS genes investigated in this study. TABLE S5. FAS genes in the P. cynaroides. TABLE S6. TPS genes in the P. cynaroides