Bayesian nonparametric estimation of differential entropy for toroidal data

Abstract

Entropy is a widely used information-theoretic measure; however, the estimation of entropy for observations of a periodic nature has not received much attention thus far. In this paper, we implement a Bayesian approach to obtain nonparametric estimates of Shannon entropy for toroidal data. This paves the way for its use in protein structure validation through an approach based on information theory and the distribution of backbone dihedral angles in the 3D structure of proteins. In addition, the kernel density estimation proposed in this paper can be applied alongside available parametric models for modeling toroidal observations. Simulation studies and the analysis of real datasets provide insights into this proposed method for protein structure validation. HIGHLIGHTS • Estimation of entropy for observations of a periodic nature specifically toroidal data. • A Bayesian nonparametric density estimator for toroidal data using a Dirichlet infinite mixture model. • Considering the possible dependencies between two circular variables. • An alternative information-theory based method for protein structure validation. • Diagnosing of the disordered pattern of wind direction in univariate case.

Description

DATA AVAILABILITY : The wind data is available upon request, and the protein data is available at http://scop.mrc-lmb.cam.ac.uk/scop/. .

Keywords

Bayesian nonparamertic inference, Dirichlet infinite mixture model, Modified Gibbs sampling, Plug-in estimates, Shannon entropy

Sustainable Development Goals

SDG-15: Life on land

Citation

Rad, N.N., Arashi, M., Bekker, A. & Millard, S. 2025, 'Bayesian nonparametric estimation of differential entropy for toroidal data', Applied Mathematical Modelling, vol. 148, art. 116241, pp. 1-15, doi : 10.1016/j.apm.2025.116241.