DATA AVAILABILITY STATEMENT : All sequences described in this manuscript have been submitted to the National Genomics Data Center (NGDC). The raw whole-genome data of E. japonica have been deposited in BioProject/GSA (https://bigd.big.ac.cn/gsa.) under the accession codes PRJCA005268/CRA004271, and the assembly and annotation data have been deposited at BioProject/GWH (https://bigd.big.ac.cn/gwh) under the accession codes PRJCA005268/GWHBCHS00000000. The raw transcriptomes data of E. japonica have been deposited in BioProject/GSA (https://bigd.big.ac.cn/gsa.) under the accession codes PRJCA005298/CRA004272.
SUPPLEMENTARY MATERIAL 1: Supplementary Note 1. Chromosome number assessment.
Supplementary Note 2. Whole-genome duplication identification and dating.
Supplementary Note 3. Observation of E. japonica seed dispersal.
Supplementary Note 4. Determination of pentacyclic triterpene substances.
Figure S1. Cytogenetic analysis of E. japonica.
Figure S2. Genome size and heterozygosity of E. japonica estimation using 17 k-mer distribution.
Figure S3. Interchromosomal of Hi-C chromosome contact map of E. japonica genome.
Figure S4. Gene structure prediction results of E. japonica and other species.
Figure S5. Venn diagram shows gene families of malvids.
Figure S6. Phylogenetic tree constructed by chloroplast genomes from 17 species.
Figure S7. Concatenated- and ASTRAL-based phylogenetic trees.
Figure S8. Ks distribution in E. japonica.
Figure S9. Distributions of synonymous substitutions per synonymous site (Ks) of one-to-one orthologs identified between E. japonica and P. trichocarpa and V. vinifera.
Figure S10. Population structure plot.
Figure S11. Fixation index (FST) heat map among E. japonica populations.
Figure S12. Phylogenetic analysis of MADS-box genes from O. sativa, A. thaliana, E. japonica, and T. cacao.
Figure S13. Observation the fruit development.
Figure S14. Animal seed dispersal.
Figure S15. Anthocyanin biosynthesis in E. japonica fruits.
Figure S16. Carotenoid accumulation and the chlorophyll degradation in E. japonica fruits.
Figure S17. Expression profile of fruit dehiscence-related genes.
Figure S18. Phylogenetic tree of DELLA genes obtained from six malvids species.
Figure S19. Phylogenetic tree of CAD genes obtained from seven malvids species.
Figure S20. Expression pattern of fruit abscission-related genes.
Figure S21. Structure of pentacyclic triterpene compounds separated from Euscaphis.
Figure S22. Phylogenetic tree of HMGR gene in plants.
Figure S23. Phylogenetic tree of P450s gene family obtained from A. thaliana and E. japonica.
SUPPLEMENTARY MATERIAL 2: Table S1. Assembled statistics of E. japonica genome.
Table S2. Evaluation of E. japonica genome assembly.
Table S3. Chromosome length of E. japonica.
Table S4. Prediction of gene structures of the E. japonica genome.
Table S5. Statistics on the function annotation of the E. japonica genome.
Table S6. Non-coding RNA annotation results of E. japonica genome.
Table S7. BUSCO assessment of the E. japonica annotated genome.
Table S8. Statistic of repeat sequence in E. japonica genome.
Table S9. Gene-clustering statistics for 17 species.
Table S10. KEGG enrichment result of unique genes families of E. japonica.
Table S11. Gene Ontology (GO) and KEGG enrichment result of significant shared by malvids species gene families.
Table S12. Gene Ontology (GO) and KEGG enrichment result of significant expansion of E. japonica gene families.
Table S13. Gene Ontology (GO) enrichment result of significant contraction of E. japonica gene families.
Table S14. Statistical sampling population information.
Table S15. Statistics population resequencing information.
Table S16. Statistical nucleotide polymorphisms in the populations.
Table S17. Candidate positive selection genes (PSGs) in the evergreen population.
Table S18. Candidate positive selection genes (PSGs) in the deciduous population.
Table S19. Gene Ontology (GO) enrichment result of significant PSGs in the evergreen population.
Table S20. List of MADS-box genes identified in E. japonica.
Table S21. Genes involved in anthocyanin biosynthesis, carotenoid biosynthesis, and chlorophyll degradation.
Table S22. Identification fruit dehiscence-related genes in E. japonica.
Table S23. Genes related to lignin synthesis that are highly expressed during pericarp dehiscence.
Table S24. Gene expression levels (FPKMs) of fruit abscission-related genes in pericarp.
Table S25. Triterpene compounds separated from Euscaphis.
Table S26. Number of putative pentacyclic triterpene-related genes in the malvids species.
Table S27. Identified pentacyclic triterpene synthesis-related genes in E. japonica genome.
Table S28. Statistical simple sequence repeat.