qtlXplorer : an online systems genetics browser in the eucalyptus genome integrative explorer (EucGenIE)

Show simple item record

dc.contributor.author Christie, Nanette
dc.contributor.author Mannapperuma, Chanaka
dc.contributor.author Ployet, Raphael
dc.contributor.author Van der Merwe, Karen
dc.contributor.author Mahler, Niklas
dc.contributor.author Delhomme, Nicolas
dc.contributor.author Naidoo, Sanushka
dc.contributor.author Mizrachi, Eshchar
dc.contributor.author Street, Nathaniel R.
dc.contributor.author Myburg, Alexander Andrew
dc.date.accessioned 2022-07-15T06:17:44Z
dc.date.available 2022-07-15T06:17:44Z
dc.date.issued 2021-12-15
dc.description ADDITIONAL FILE 1: TABLE S1. Metadata for RNA-seq datasets integrated in EucGenIE. In total 355 transcriptome datasets were integrated: 42 exAtlas datasets, 30 biotic interactions datasets and 283 transcriptomes from different F2 backcross individuals. TABLE S2. The 44 laccase genes detected in Eucalyptus with their corresponding BLASTP, HMMER and SignalP results. Genes encoding proteins with the three truncated domains of a canonical LAC and predicted to have a signal peptide were considered for analysis in this study. TABLE S3. Expression values in transcripts per million (TPM) of the 59 LAC/PRX genes, in the E. grandis exAtlas [35] and across biotic stress conditions [36–38]. TABLE S4. Secondary cell wall-related genes that were used for co-expression analysis. TABLE S5. Nine trait QTLs that co-locate with the eight LAC/PRX candidate genes from subnetwork 2 (Fig. 2, Table S3) or their eQTL positions. TABLE S6. Genes sharing eQTLs at the eleven eQTL positions associated with the eight LAC/PRX candidate genes from subnetwork 2 (Fig. 2, Additional file 1: Table S3). TABLE S7. Fisher’s Exact Test results for enrichment of cell wallrelated genes with eQTLs located at each of eleven eQTL positions, respectively, associated with the eight LAC/PRX candidate genes from subnetwork 2 (Fig. 2, Additional file 1: Table S3). TABLE S8. Transcription factors underlying the eleven eQTL peak positions associated with the eight LAC/PRX candidate genes from subnetwork 2 (Fig. 2, Additional file 1: Table S3). TABLE S9. Final score for ranking the eight LAC/PRX candidate genes from subnetwork 2 (Fig. 2, Additional file 1: Table S3) using multiple lines of evidence. en_US
dc.description ADDITIONAL FILE 2: METHOD S1. Expression profiling of RNA-seq datasets. Method METHOD S2. QTL and eQTL analysis of data in qtlXplorer. METHOD S3. Overview of the EucGenIE tools. METHOD S4. Translating genes between species within PlantGenIE. METHOD S5. Case study: Laccases and peroxidases. en_US
dc.description ADDITIONAL FILE 3: FIGURE S1. Co-expression analysis of 137 LAC/PRX genes using exNet in EucGenIE. (a) Correlation network representing 62 correlations between 42 (out of a total of 137) LAC/PRX genes, filtered at threshold 5. (b) Correlation network representing 1877 correlations between 90 LAC/PRX and 1043 other correlated genes, obtained after expanding at threshold 6. Correlation networks are based on 72 transcriptomic datasets (all exAtlas and biotic interactions datasets; sample collections 2–4 in Table 1) in EucGenIE and were visualized using the exNet tool. en_US
dc.description ADDITIONAL FILE 4: FIGURE S2. Systems genetics analysis of the eight LAC/PRX genes associated with secondary cell wall (SCW) biosynthesis in Eucalyptus xylem formation. The network was built from data exported from qtlXplorer and follows the same structure as presented in Fig. 6. Cis- and trans-eQTL associations connecting candidate genes (blue nodes) or cell wall-related genes (green nodes) to eQTL peak positions (turquoise triangles) are represented by red and blue edges, respectively. Transcription factors (TFs) underlying eQTL peak positions are represented by grey or green squares and connected to eQTL peak positions via grey dashed line edges. Gene expression profile correlations are represented by grey solid edges, with thickness proportional to the absolute value of the correlation. Physical overlap of trait QTLs (purple nodes) with candidate gene or eQTL peak positions are represented by green dashed lines. eQTL peak position node size is proportional to the number of genes having eQTLs (cis or trans) mapped at that genomic position. Underlying TF node size is proportional to the number and average value of the correlations of TFs with genes having cis- or trans-eQTLs mapped at that position (underlying TF score; Additional file 1: Table S8). Candidate gene node size is proportional to its score for prioritization (see Additional file 2: Methods S5; Additional file 1: Table S9), taking into account (i) their correlations (number and average value) with SCW genes across the population-wide transcriptomic data, (ii) the number of physical overlaps with candidate gene/SCWrelated QTLs, (iii) the number of eQTLs mapped at SCW-enriched eQTL positions, (iv) the number of overlaps of their eQTL positions with SCW-related trait QTLs, and (v) the number of SCW-related TFs in the top 10 best candidate TFs underlying their eQTL positions. en_US
dc.description.abstract BACKGROUND : Affordable high-throughput DNA and RNA sequencing technologies are allowing genomic analysis of plant and animal populations and as a result empowering new systems genetics approaches to study complex traits. The availability of intuitive tools to browse and analyze the resulting large-scale genetic and genomic datasets remain a significant challenge. Furthermore, these integrative genomics approaches require innovative methods to dissect the flow and interconnectedness of biological information underlying complex trait variation. The Plant Genome Integrative Explorer (PlantGenIE.org) is a multi-species database and domain that houses online tools for model and woody plant species including Eucalyptus. Since the Eucalyptus Genome Integrative Explorer (EucGenIE) is integrated within PlantGenIE, it shares genome and expression analysis tools previously implemented within the various subdomains (ConGenIE, PopGenIE and AtGenIE). Despite the success in setting up integrative genomics databases, online tools for systems genetics modelling and high-resolution dissection of complex trait variation in plant populations have been lacking. RESULTS : We have developed qtlXplorer (https:// eucge nie. org/ QTLXp lorer) for visualizing and exploring systems genetics data from genome-wide association studies including quantitative trait loci (QTLs) and expression-based QTL (eQTL) associations. This module allows users to, for example, find co-located QTLs and eQTLs using an interactive version of Circos, or explore underlying genes using JBrowse. It provides users with a means to build systems genetics models and generate hypotheses from large-scale population genomics data. We also substantially upgraded the EucGenIE resource and show how it enables users to combine genomics and systems genetics approaches to discover candidate genes involved in biotic stress responses and wood formation by focusing on two multigene families, laccases and peroxidases. CONCLUSIONS : qtlXplorer adds a new dimension, population genomics, to the EucGenIE and PlantGenIE environment. The resource will be of interest to researchers and molecular breeders working in Eucalyptus and other woody plant species. It provides an example of how systems genetics data can be integrated with functional genetics data to provide biological insight and formulate hypotheses. Importantly, integration within PlantGenIE enables novel comparative genomics analyses to be performed from population-scale data. en_US
dc.description.department Biochemistry en_US
dc.description.department Forestry and Agricultural Biotechnology Institute (FABI) en_US
dc.description.department Genetics en_US
dc.description.department Microbiology and Plant Pathology en_US
dc.description.librarian am2022 en_US
dc.description.sponsorship The National Research Foundation of South Africa, the Department of Science and Technology and Technology Innovation Agency of South Africa, the Technology and Human Resources for Industry Programme, Sappi and Mondi South Africa through the Forest Molecular Genetics Programme at the University of Pretoria. en_US
dc.description.uri http://www.biomedcentral.com/bmcbioinformatics en_US
dc.identifier.citation Christie, N., Mannapperuma, C., Ployet, R. et al. 2021, 'qtlXplorer: an online systems genetics browser in the Eucalyptus Genome Integrative Explorer (EucGenIE)', BMC Bioinformatics, vol. 22, art. 595, pp. 1-21. en_US
dc.identifier.issn 1471-2105
dc.identifier.other 10.1186/s12859-021-04514-9
dc.identifier.uri https://repository.up.ac.za/handle/2263/86211
dc.language.iso en en_US
dc.publisher BioMed Central en_US
dc.rights © The Author(s), 2021. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License. en_US
dc.subject qtlXplorer en_US
dc.subject Eucalyptus en_US
dc.subject Systems genetics en_US
dc.subject Co-expression en_US
dc.subject ‘Omics integration en_US
dc.subject Online resource en_US
dc.subject Database en_US
dc.subject Genome browser en_US
dc.subject Quantitative trait loci (QTLs) en_US
dc.subject Expression-based quantitative trait loci (eQTL) en_US
dc.subject Eucalyptus genome integrative explorer (EucGenIE) en_US
dc.title qtlXplorer : an online systems genetics browser in the eucalyptus genome integrative explorer (EucGenIE) en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record