DATA AVAILABILITY STATEMENT: The nonredundant MAGs that form the basis of the DWGC
and species-level representative MAGs are available on
FigShare (DOI: 10.6084/m9.figshare.c.7245403.v1). This
database will be updated routinely on the project
page https://github.com/AshSudarshan/Drinking-WaterGenome-Catalogue.
SUPPORTING INFORMATION: SUPPLEMENTARY TABLE S1: (A) details of metagenomes used to construct the DWGC, (B) accession numbers for isolate genomes obtained from NCBI and their isolation sources; SUPPLEMENTARY TABLE S2: SILVA v138.1 Phreatobacter sequences used for comparative analysis; SUPPLEMENTARY TABLE S3: summary of metagenome assembly and bins across different distribution systems; SUPPLEMENTARY TABLE S4: detailed taxonomy and genomic information for 1141 MAGs within the DWGC; SUPPLEMENTARY TABLE S5: relative abundance of species across the 80 distribution systems used to determine the core microbiome; SUPPLEMENTARY TABLE S6: detection frequency and average relative abundance of genera in the DWGC; SUPPLEMENTARY TABLE S7: metabolism annotation results for Lineage 1 (L1) and Lineage 2 (L2) MAGs; SUPPLEMENTARY TABLE S8: complete list of names proposed in the current SeqCode register list; SUPPLEMENTARY FIGURE S1: (A) detection frequency vs log of average relative abundance of the DWGC genomes at the family level, (B) detection frequency vs log of average relative abundance of the DWGC genomes at the species level; and SUPPLEMENTARY FIGURE S2: relative abundance of MAGs from five genera observed in more than 60% of the systems pre- and post-disinfection in studies where pre-disinfection metagenomes were available (PDF)