Prediction of recent HIV-1 infections using Shannon entropy analysis of HIV-1 group-specific antigen protein sequence

dc.contributor.authorFortuin, Tumelo L.
dc.contributor.authorNkone, Paballo
dc.contributor.authorLoubser, Shayne
dc.contributor.authorTiemessen, Caroline T.
dc.contributor.authorMayaphi, Simnikiwe Horatious
dc.contributor.emailsim.mayaphi@up.ac.za
dc.date.accessioned2026-03-19T10:41:07Z
dc.date.available2026-03-19T10:41:07Z
dc.date.issued2026-03-03
dc.descriptionDATA AVAILABILITY : All data generated or analysed during this study are included in this manuscript.
dc.description.abstractBACKGROUND : Avidity assays often misclassify chronic HIV-1 infection as recent HIV-1 infection (false recency rate), especially in participants on antiretroviral therapy. The aim of this study was to use Shannon entropy to evaluate HIV-1 group-specific antigen (Gag) sequence diversity for the prediction of recent HIV-1 infections. METHODS : This was a retrospective study that characterised the complete HIV-1 Gag using Sanger sequences obtained from participants with confirmed recent or chronic HIV-1 infection. Shannon entropy was calculated for the entire HIV-1 Gag amino acid (aa) sequence (501aa) and sliding window analysis was computed at intervals of 100aa each. This was followed by searching for aa sites that exhibited a different distribution of mutations between recent and chronic HIV-1 infection stages. Reference sequences were obtained from GenBank and the Los Alamos HIV database to verify the findings obtained from study sequences. RESULTS : Forty-seven participants with a mean age of 28.7 years (18 – 44) were enrolled, and fourteen (30%) of them had recent HIV-1 infection. Shannon entropy analysis showed a significantly higher aa diversity in chronic HIV-1 infection compared to recent HIV-1 infection (p = 0.0003). Analysis of sliding windows led to identification of four aa positions; S54, E55, I256, and S451; with different pattern of distribution between recent and chronic HIV-1 infection stages; however statistical significance was only observed for three of these aa, p values = 0.094, 0.027, 0.027 and 0.045, respectively. The performance of these informative sites for detection of recent HIV-1 infection in study sequences ranged from 71—86%, however, they had a high false recency rate (FRR) ranging from 39%—52%. Similar performance was observed in reference sequences. The combination of some informative aa sites reduced FRR in study sequences to below 24%. CONCLUSIONS : Our data show that a Gag-based molecular strategy can be used to detect recent HIV-1 infections where Gag sequences are available. However, the results would have to be interpreted with caution due to an association with a high FRR. Further studies are needed to develop a molecular-based strategy with better performance for detection of recent HIV-1 infections.
dc.description.departmentMedical Virology
dc.description.librarianhj2026
dc.description.sdgSDG-03: Good health and well-being
dc.description.sponsorshipFunded by the National Research Foundation (NRF); Poliomyelitis Research Foundation; University of Pretoria Research Development Fund; and the South African Research Chairs Initiative of the Department of Science and Innovation and National Research Foundation of South Africa .
dc.description.urihttps://link.springer.com/journal/12985
dc.identifier.citationortuin, T.L., Nkone, P., Loubser, S. et al. Prediction of recent HIV-1 infections using Shannon entropy analysis of HIV-1 group-specific antigen protein sequence. Virology Journal 23, 61: 1-11 (2026). https://doi.org/10.1186/s12985-026-03080-x.
dc.identifier.issn1743-422X (online)
dc.identifier.other10.1186/s12985-026-03080-x
dc.identifier.urihttp://hdl.handle.net/2263/109078
dc.language.isoen
dc.publisherBioMed Central
dc.rights© The Author(s) 2026. Open Access. This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
dc.subjectHuman immunodeficiency virus (HIV)
dc.subjectHIV-1 group-specific antigen (Gag)
dc.subjectShannon entropy
dc.subjectRecent HIV
dc.subjectHIV-1 infection
dc.subjectFalse recency rate
dc.subjectChronic HIV-1 infection
dc.subjectGag diversity
dc.titlePrediction of recent HIV-1 infections using Shannon entropy analysis of HIV-1 group-specific antigen protein sequence
dc.typeArticle

Files

Original bundle

Now showing 1 - 5 of 5
Loading...
Thumbnail Image
Name:
Fortuin_Prediction_2026.pdf
Size:
1.62 MB
Format:
Adobe Portable Document Format
Description:
Article
Loading...
Thumbnail Image
Name:
Fortuin_PredictionSuppl1_2026.pdf
Size:
211.3 KB
Format:
Adobe Portable Document Format
Description:
Supplementary Material 1
Loading...
Thumbnail Image
Name:
Fortuin_PredictionSuppl2_2026.pdf
Size:
293.53 KB
Format:
Adobe Portable Document Format
Description:
Supplementary Material 2
Loading...
Thumbnail Image
Name:
Fortuin_PredictionSuppl3_2026.pdf
Size:
177.44 KB
Format:
Adobe Portable Document Format
Description:
Supplementary Material 3
Loading...
Thumbnail Image
Name:
Fortuin_PredictionSuppl4_2026.pdf
Size:
596.15 KB
Format:
Adobe Portable Document Format
Description:
Supplementary Material 4

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: