Abstract:
Cultivated hawthorn (Crataegus pinnatifida var.
major) is an important medicinal and edible plant
with a long history of use for health protection in
China. Herein, we provide a de novo chromosomelevel
genome sequence of the hawthorn cultivar
“Qiu Jinxing.” We assembled an 823.41Mb genome
encoding 40 571 genes and further anchored the
779.24Mb sequence into 17 pseudo‐chromosomes,
which account for 94.64% of the assembled genome.
Phylogenomic analyses revealed that cultivated
hawthorn diverged from other species within
the Maleae (apple tribe) at approximately 35.4 Mya.
Notably, genes involved in the flavonoid and
triterpenoid biosynthetic pathways have been significantly
amplified in the hawthorn genome. In addition,
our results indicated that the Maleae share a
unique ancient tetraploidization event; however, no
recent independent whole‐genome duplication
event was specifically detected in hawthorn. The
amplification of non‐specific long terminal repeat
retrotransposons contributed the most to the expansion
of the hawthorn genome. Furthermore, we
identified two paleo‐sub‐genomes in extant species
of Maleae and found that these two sub‐genomes
showed different rearrangement mechanisms. We
also reconstructed the ancestral chromosomes of
Rosaceae and discussed two possible paleopolyploid
origin patterns (autopolyploidization or allopolyploidization)
of Maleae. Overall, our study
provides an improved context for understanding the
evolution of Maleae species, and this new highquality
reference genome provides a useful resource
for the horticultural improvement of hawthorn.
Description:
SUPPLEMENTARY MATERIAL : FIGURE S1. Information of collected sample. FIGURE S2. Frequency distribution of depth of 17‐mer (upper) and K‐mer (below) in genome survey of cultivated hawthorn. FIGURE S3. The genome annotation of hawthorn. FIGURE S4. Classification statistics of cultivated hawthorn genes. FIGURE S5. The maximum likelihood phylogenetic tree of Rosaceae with bootstraps. FIGURE S6. Changes in messenger RNA (mRNA) expression in hardfleshed hawthorn “Qiujinxing”. FIGURE S7. Changes in messenger RNA (mRNA) expression in soft‐fleshed hawthorn “Ruanrou Shanlihong #3”. FIGURE S8. Triterpene biosynthesis pathway in Crataegus pinnatifida. FIGURE S9. Syntenic dot plot and Ks distribution within the apple genome. FIGURE S10. Syntenic dot plot and Ks distribution between two subgenomes of hawthorn and loquat (A), apple and loquat (B). Syntenic dot plot and Ks distribution between Gillenia trifoliata and sub‐genome A of hawthorn (C) and sub‐genome B of hawthorn (D). TABLE S1. Statistics of genome survey data. TABLE S2. Statistics of paired‐end reads based on Hi‐C technology. TABLE S3. Statistics of the lengths of 17 pseudo‐chromosomes in the cultivated hawthorn genome. TABLE S4. Predicted genes and gene features of the cultivated hawthorn. TABLE S5. Gene functional annotation of the cultivated hawthorn. TABLE S6. Predicted RNA features of the cultivated hawthorn. TABLE S7. Conserved genes using the BUSCO (Benchmarking Universal Single‐Copy Orthologs) method. TABLE S8. The enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) functional categories of species‐specific genes (P‐value <0.05) in the cultivated hawthorn. TABLE S9. The enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) functional categories of significantly (P‐value <0.05) expanded genes in the cultivated hawthorn. TABLE S10. The enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) functional categories of significantly (P‐value <0.05) differentially expressed genes between two fruit developmental stages in the hardfleshed (“Qiu Jinxing”) hawthorn cultivar. TABLE S11. The enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) functional categories of significantly (P‐value <0.05) differentially expressed genes between two fruit developmental stages in the soft‐fleshed (“Ruanrou Shanlihong #3”) hawthorn cultivar. TABLE S12. Statistics of repeat sequences, including transposable elements (TEs) in hawthorn, loquat, apple and pear genomes. TABLE S13. Statistics of orthologs between hawthorn, apple and loquat genomes. The color of the table corresponds to the squares of chromosomes in Figure 4E in the paper.