SeqWord motif mapper : a tool for rapid statistical analysis and visualization of epigenetic modifications in bacterial genomes
Loading...
Date
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Abstract
Genomic methylation in bacteria plays a crucial role in gene regulation, chromosome replication, pathogenicity, and defense against phages. While single-molecule real-time (SMRT) sequencing technologies have advanced the detection of epigenetically modified bases, the statistical analysis of their distribution and the possible roles they play in bacterial cells remains challenging. To address this gap, we developed SeqWord Motif Mapper (SWMM), a computational tool designed for the statistical analysis and visualization of bacterial methylation patterns. SWMM utilizes PacBio sequencing data to identify sequence coverage, methylation motif distribution, and putative functional associations. Implemented in Python 3.9, the tool is platform-independent and requires minimal dependencies, making it accessible to a wide range of users. The SWMM command-line interface and a web-based version of the program facilitate the exploration of epigenetic modifications across bacterial genomes. Through case studies on different bacterial and archaeal taxa, we demonstrated that genome methylation in microorganisms extends beyond canonical sites and possibly influences gene expression, adaptation, and genome architecture. The tool enables detailed statistical evaluation of methylation motif distribution and provides insights into the potential regulatory roles of epigenetic modifications in bacterial genomes. SWMM is freely available at https://begp.bi.up.ac.za, with source code hosted on GitHub at https://github.com/chrilef/BactEpiGenPro.
HIGHLIGHTS
• Visualizes bacterial methylation using PacBio sequencing data.
• Detects canonical and non-canonical methylation motif distributions.
• Highlights strand-biased and replicon-specific methylation patterns.
• Includes statistical analysis of motif bias in coding and non-coding regions.
• Open-source and web-based tool for epigenetic data exploration.
Description
SUPPLEMENTARY MATERIAL
TABLE S1. Command prompt listings of SeqWord Motif Mapper (SWMM) program calls used to generate the example outputs.
FIGURE S1. Supplementary Figure S1. Circular maps showing the distribution of methylated adenine and cytosine residues associated with cRGKGatC canonical motifs in the following genomes: (A) Escherichia coli 3/145 [CP082827]; (B) E. coli 19/278 [CP082830]; (C) Klebsiella pneumoniae 13/97 [CP082805]; (D) K. pneumoniae 20/245 [CP082796]; and (E) Streptococcus pneumoniae PHRX1 [CP082820].
FIGURE S2. Circular maps showing the distribution of methylated adenine and cytosine residues associated with cRGKGatCMCYg super-palindromes in the following genomes: (A) Escherichia coli 3/145 [CP082827]; (B) E. coli 19/278 [CP082830]; (C) Klebsiella pneumoniae 13/97 [CP082805]; (D) K. pneumoniae 20/245 [CP082796]; and (E) Streptococcus pneumoniae PHRX1 [CP082820].
Keywords
Single-molecule real-time (SMRT), SeqWord Motif Mapper (SWMM), Software, Python, Biostatistics, Epigenetics, Methylomics, Genomic methylation, Bacteria
Sustainable Development Goals
SDG-09: Industry, innovation and infrastructure
SDG-15: Life on land
SDG-15: Life on land
Citation
Lefebvre, C.M.J., Pierneef, R.E. & Reva, O.N. 2025, 'SeqWord motif mapper : a tool for rapid statistical analysis and visualization of epigenetic modifications in bacterial genomes', Journal of Molecular Biology, vol. 437, no. 19, art. 169307, pp. 1-14, doi : 10.1016/j.jmb.2025.169307.
