The use of different 16S rRNA gene variable regions in biogeographical studies

Varliero, Gilda; Lebre, Pedro H.; Stevens, Mark I.; Czechowski, Paul; Makhalanyane, Thulani Peter; Cowan, Don A.

UPSpace Home
→
Natural and Agricultural Sciences
→
Biochemistry, Genetics and Microbiology (BGM)
→
Research Articles (Biochemistry, Genetics and Microbiology (BGM))
→
View Item

dc.contributor.author	Varliero, Gilda
dc.contributor.author	Lebre, Pedro H.
dc.contributor.author	Stevens, Mark I.
dc.contributor.author	Czechowski, Paul
dc.contributor.author	Makhalanyane, Thulani Peter
dc.contributor.author	Cowan, Don A.
dc.date.accessioned	2023-09-27T12:26:23Z
dc.date.available	2023-09-27T12:26:23Z
dc.date.issued	2023-06
dc.description	DATA AVAILABILITY STATEMENT : All Illumina sequences generated and analyzed in this study were deposited into the European Nucleotide Archive (accession number PRJEB55051).	en_US
dc.description	SUPPORTING INFORMATION 1 : FIGURE S1. Samples located in four inland areas of the Prince Charles Mountains (ME1 from Mount Rubin, ME2 and ME3 from Mawson Escarpment, MM1 and MM2 from Mount Menzies, LT1 and LT2 from Lake Terrasovoje), in the Reinbolt Hills (RH1), and in coastal sites in proximity of the Prince Charles Mountains (C1 and C2; see Table S1). Map was produced using MODIS mosaic (125 m) imagery distributed by Quantarctica (https://cmr.earthdata.nasa.gov/; https://www.npolar.no/quantarctica/). FIGURE S2. Pearson's pairwise correlations between Bray–Curtis dissimilarity matrices calculated on relative abundance taxonomic dataset (genus level; A), and between Jaccard dissimilarity matrices calculated on presence/absence taxonomic dataset (genus level; B). Correlations were calculated for all the variable region datasets (V1–V3, V3–V4, V4, V4–V5 and V8–V9), and the mixed datasets (Mix 1, Mix 2 and Mix 3) constituted by randomly picked samples from V1–V3, V3–V4, V4, V4–V5 and V8–V9 (Table S4). Pearson's correlation coefficients (r) are reported only in case of significant correlation (p < 0.05).	en_US
dc.description	SUPPORTING INFORMATION 2 : TABLE S1. Sample specifics. TABLE S2. Geochemical data. TABLE S3. Relative abundance (%) of the taxonomic domains Bacteria and Archaea in sample (i.e., ME1, ME2, ME3, MM1, MM2, LT1, LT2, RH1, C1 and C2) for each variable region dataset (i.e., V1–V3, V3–V4, V4, V4–V5 and V8–V9). TABLE S4. Composition of mixed communities. TABLE S5. Number of reads at each step of the 16S rRNA gene processing pipeline. *counts reported as read pairs. TABLE S6. Number and percentage of unknown amplicon sequence variants (ASVs) at genus level for each phylum. TABLE S7. Relative abundance associated to unknown amplicon sequence variants at genus-level for each phylum. TABLE S8. Pearson's correlations from pairwise comparisons of variable region datasets performed on number of genera (A), dominant genera (i.e., genera represented by a relative abundance higher than 1% in at least one sample) (B), rare genera (i.e., genera represented by a relative abundance lower than 0.1% in all samples (C), Shannon index (D) and unique genera (E).	en_US
dc.description.abstract	16S rRNA gene amplicon sequencing is routinely used in environmental surveys to identify microbial diversity and composition of the samples of interest. The dominant sequencing technology of the past decade (Illumina) is based on the sequencing of 16S rRNA hypervariable regions. Online sequence data repositories, which represent an invaluable resource for investigating microbial distributional patterns across spatial, environmental or temporal scales, contain amplicon datasets from diverse 16S rRNA gene variable regions. However, the utility of these sequence datasets is potentially reduced by the use of different 16S rRNA gene amplified regions. By comparing 10 Antarctic soil samples sequenced for five different 16S rRNA amplicons, we explore whether sequence data derived from diverse 16S rRNA variable regions can be validly used as a resource for biogeographical studies. Patterns of shared and unique taxa differed among samples as a result of variable taxonomic resolutions of the assessed 16S rRNA variable regions. However, our analyses also suggest that the use of multi-primer datasets for biogeographical studies of the domain Bacteria is a valid approach to explore bacterial biogeographical patterns due to the preservation of bacterial taxonomic and diversity patterns across different variable region datasets. We deem composite datasets useful for biogeographical studies.	en_US
dc.description.department	Biochemistry	en_US
dc.description.department	Genetics	en_US
dc.description.department	Microbiology and Plant Pathology	en_US
dc.description.librarian	hj2023	en_US
dc.description.sponsorship	Australian Antarctic Division, Australian Research Council and NRF SANAP.	en_US
dc.description.uri	http://wileyonlinelibrary.com/journal/emi4	en_US
dc.identifier.citation	Varliero, G., Lebre, P.H., Stevens, M.I., Czechowski, P., Makhalanyane, T. & Cowan, D.A. (2023) The use of different 16S rRNA gene variable regions in biogeographical studies. Environmental Microbiology Reports, 15(3), 216–228. Available from: https://doi.org/10.1111/1758-2229.13145.	en_US
dc.identifier.issn	1758-2229 (online)
dc.identifier.other	10.1111/1758-2229.13145
dc.identifier.uri	http://hdl.handle.net/2263/92438
dc.language.iso	en	en_US
dc.publisher	Wiley	en_US
dc.rights	© 2023 The Authors. Environmental Microbiology Reports published by Applied Microbiology International and John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License.	en_US
dc.subject	16S rRNA gene amplicon sequencing	en_US
dc.subject	Online sequence data repositories	en_US
dc.subject	16S rRNA gene variable regions	en_US
dc.subject	Biogeographical studies	en_US
dc.subject	Composite datasets	en_US
dc.title	The use of different 16S rRNA gene variable regions in biogeographical studies	en_US
dc.type	Article	en_US