Abstract:
The genus Emericellopsis is found in terrestrial, but mainly in marine, environments with a worldwide distribution. Although Emericellopsis has been recognized as an important source of bioactive compounds, the range of metabolites expressed by the species of this genus, as well as the genes involved in their production are still poorly known. Untargeted metabolomics, using UPLC- QToF–MS/MS, and genome sequencing (Illumina HiSeq) was performed to unlock E. cladophorae MUM 19.33 chemical diversity. The genome of E. cladophorae is 26.9 Mb and encodes 8572 genes. A large set of genes encoding carbohydrate-active enzymes (CAZymes), secreted proteins, transporters, and secondary metabolite biosynthetic gene clusters were identified. Our analysis also revealed genomic signatures that may reflect a certain fungal adaptability to the marine environment, such as genes encoding for (1) the high-osmolarity glycerol pathway; (2) osmolytes’ biosynthetic processes; (3) ion transport systems, and (4) CAZymes classes allowing the utilization of marine polysaccharides. The fungal crude extract library constructed revealed a promising source of antifungal (e.g., 9,12,13-Trihydroxyoctadec-10-enoic acid, hymeglusin), antibacterial (e.g., NovobiocinA), anticancer (e.g., daunomycinone, isoreserpin, flavopiridol), and anti-inflammatory (e.g., 2’-O-Galloylhyperin) metabolites. We also detected unknown compounds with no structural match in the databases used. The metabolites’ profiles of E. cladophorae MUM 19.33 fermentations were salt dependent. The results of this study contribute to unravel aspects of the biology and ecology of this marine fungus. The genome and metabolome data are relevant for future biotechnological exploitation of the species.
Description:
DATA AVAILABILTY STATEMENT : This Whole-Genome Shotgun project has been deposited in the GenBank database under the accession number JAGIXG000000000. The genome raw sequencing data
and the assembly reported in this paper is associated with NCBI BioProject: PRJNA718178 and
BioSample: SAMN18524397 within GenBank. The SRA accession number is SRR14127580. Data
generated or analyzed during this study are included in this published article and its supplementary
information files.