SeqWord motif mapper : a tool for rapid statistical analysis and visualization of epigenetic modifications in bacterial genomes

dc.contributor.authorLefebvre, Christophe M.J.
dc.contributor.authorPierneef, Rian Ewald
dc.contributor.authorReva, Oleg N.
dc.contributor.emailoleg.reva@up.ac.za
dc.date.accessioned2025-11-18T08:47:32Z
dc.date.available2025-11-18T08:47:32Z
dc.date.issued2025-10
dc.descriptionSUPPLEMENTARY MATERIAL TABLE S1. Command prompt listings of SeqWord Motif Mapper (SWMM) program calls used to generate the example outputs. FIGURE S1. Supplementary Figure S1. Circular maps showing the distribution of methylated adenine and cytosine residues associated with cRGKGatC canonical motifs in the following genomes: (A) Escherichia coli 3/145 [CP082827]; (B) E. coli 19/278 [CP082830]; (C) Klebsiella pneumoniae 13/97 [CP082805]; (D) K. pneumoniae 20/245 [CP082796]; and (E) Streptococcus pneumoniae PHRX1 [CP082820]. FIGURE S2. Circular maps showing the distribution of methylated adenine and cytosine residues associated with cRGKGatCMCYg super-palindromes in the following genomes: (A) Escherichia coli 3/145 [CP082827]; (B) E. coli 19/278 [CP082830]; (C) Klebsiella pneumoniae 13/97 [CP082805]; (D) K. pneumoniae 20/245 [CP082796]; and (E) Streptococcus pneumoniae PHRX1 [CP082820].
dc.description.abstractGenomic methylation in bacteria plays a crucial role in gene regulation, chromosome replication, pathogenicity, and defense against phages. While single-molecule real-time (SMRT) sequencing technologies have advanced the detection of epigenetically modified bases, the statistical analysis of their distribution and the possible roles they play in bacterial cells remains challenging. To address this gap, we developed SeqWord Motif Mapper (SWMM), a computational tool designed for the statistical analysis and visualization of bacterial methylation patterns. SWMM utilizes PacBio sequencing data to identify sequence coverage, methylation motif distribution, and putative functional associations. Implemented in Python 3.9, the tool is platform-independent and requires minimal dependencies, making it accessible to a wide range of users. The SWMM command-line interface and a web-based version of the program facilitate the exploration of epigenetic modifications across bacterial genomes. Through case studies on different bacterial and archaeal taxa, we demonstrated that genome methylation in microorganisms extends beyond canonical sites and possibly influences gene expression, adaptation, and genome architecture. The tool enables detailed statistical evaluation of methylation motif distribution and provides insights into the potential regulatory roles of epigenetic modifications in bacterial genomes. SWMM is freely available at https://begp.bi.up.ac.za, with source code hosted on GitHub at https://github.com/chrilef/BactEpiGenPro. HIGHLIGHTS • Visualizes bacterial methylation using PacBio sequencing data. • Detects canonical and non-canonical methylation motif distributions. • Highlights strand-biased and replicon-specific methylation patterns. • Includes statistical analysis of motif bias in coding and non-coding regions. • Open-source and web-based tool for epigenetic data exploration.
dc.description.departmentBiochemistry, Genetics and Microbiology (BGM)
dc.description.librarianhj2025
dc.description.sdgSDG-09: Industry, innovation and infrastructure
dc.description.sdgSDG-15: Life on land
dc.description.sponsorshipThis project was funded by the National Research Foundation (NRF) of South Africa.
dc.description.urihttps://www.sciencedirect.com/journal/journal-of-molecular-biology
dc.identifier.citationLefebvre, C.M.J., Pierneef, R.E. & Reva, O.N. 2025, 'SeqWord motif mapper : a tool for rapid statistical analysis and visualization of epigenetic modifications in bacterial genomes', Journal of Molecular Biology, vol. 437, no. 19, art. 169307, pp. 1-14, doi : 10.1016/j.jmb.2025.169307.
dc.identifier.issn0022-2836
dc.identifier.other10.1016/j.jmb.2025.169307
dc.identifier.urihttp://hdl.handle.net/2263/105332
dc.language.isoen
dc.publisherElsevier
dc.rights© 2025 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://crea-tivecommons.org/licenses/by-nc-nd/4.0/).
dc.subjectSingle-molecule real-time (SMRT)
dc.subjectSeqWord Motif Mapper (SWMM)
dc.subjectSoftware
dc.subjectPython
dc.subjectBiostatistics
dc.subjectEpigenetics
dc.subjectMethylomics
dc.subjectGenomic methylation
dc.subjectBacteria
dc.titleSeqWord motif mapper : a tool for rapid statistical analysis and visualization of epigenetic modifications in bacterial genomes
dc.typeArticle

Files

Original bundle

Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
Lefebvre_SeqWord_2025.pdf
Size:
3.35 MB
Format:
Adobe Portable Document Format
Description:
Article
Loading...
Thumbnail Image
Name:
Lefebvre_SeqWordTabS1_2025.docx
Size:
15.75 KB
Format:
Microsoft Word XML
Description:
Table S1
Loading...
Thumbnail Image
Name:
Lefebvre_SeqWordFigS1_2025.jpg
Size:
775.98 KB
Format:
Joint Photographic Experts Group/JPEG File Interchange Format (JFIF)
Description:
Figure S1
Loading...
Thumbnail Image
Name:
Lefebvre_SeqWordFigS2_2025.jpg
Size:
1.63 MB
Format:
Joint Photographic Experts Group/JPEG File Interchange Format (JFIF)
Description:
Figure S2

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: