dc.contributor.author |
Modupe, Abiodun
|
|
dc.contributor.author |
Celik, Turgay
|
|
dc.contributor.author |
Marivate, Vukosi
|
|
dc.contributor.author |
Olugbara, Oludayo O.
|
|
dc.date.accessioned |
2023-04-24T07:47:39Z |
|
dc.date.available |
2023-04-24T07:47:39Z |
|
dc.date.issued |
2022-07-26 |
|
dc.description.abstract |
Post-authorship attribution is a scientific process of using stylometric features to identify
the genuine writer of an online text snippet such as an email, blog, forum post, or chat log. It has
useful applications in manifold domains, for instance, in a verification process to proactively detect
misogynistic, misandrist, xenophobic, and abusive posts on the internet or social networks. The
process assumes that texts can be characterized by sequences of words that agglutinate the functional
and content lyrics of a writer. However, defining an appropriate characterization of text to capture the
unique writing style of an author is a complex endeavor in the discipline of computational linguistics.
Moreover, posts are typically short texts with obfuscating vocabularies that might impact the accuracy
of authorship attribution. The vocabularies include idioms, onomatopoeias, homophones, phonemes,
synonyms, acronyms, anaphora, and polysemy. The method of the regularized deep neural network
(RDNN) is introduced in this paper to circumvent the intrinsic challenges of post-authorship attribution.
It is based on a convolutional neural network, bidirectional long short-term memory encoder,
and distributed highway network. The neural network was used to extract lexical stylometric features
that are fed into the bidirectional encoder to extract a syntactic feature-vector representation. The
feature vector was then supplied as input to the distributed high networks for regularization to
minimize the network-generalization error. The regularized feature vector was ultimately passed
to the bidirectional decoder to learn the writing style of an author. The feature-classification layer
consists of a fully connected network and a SoftMax function to make the prediction. The RDNN
method was tested against thirteen state-of-the-art methods using four benchmark experimental
datasets to validate its performance. Experimental results have demonstrated the effectiveness of the
method when compared to the existing state-of-the-art methods on three datasets while producing
comparable results on one dataset. |
en_US |
dc.description.department |
Computer Science |
en_US |
dc.description.librarian |
am2023 |
en_US |
dc.description.sponsorship |
The Department of Science and Technology (DST) and the Council for Scientific and Industrial Research (CSIR). |
en_US |
dc.description.uri |
https://www.mdpi.com/journal/applsci |
en_US |
dc.identifier.citation |
Modupe, A.; Celik, T.;
Marivate, V.; Olugbara, O.O.
Post-Authorship Attribution Using
Regularized Deep Neural Network.
Applied Sciences2022, 12, 7518. https://DOI.org/10.3390/app12157518. |
en_US |
dc.identifier.issn |
2076-3417 |
|
dc.identifier.other |
10.3390/app12157518 |
|
dc.identifier.uri |
http://hdl.handle.net/2263/90426 |
|
dc.language.iso |
en |
en_US |
dc.publisher |
MDPI |
en_US |
dc.rights |
© 2022 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license. |
en_US |
dc.subject |
Authorship attribution |
en_US |
dc.subject |
Character embedding |
en_US |
dc.subject |
Bidirectional decoder |
en_US |
dc.subject |
Bidirectional encoder |
en_US |
dc.subject |
Deep learning |
en_US |
dc.subject |
Neural network |
en_US |
dc.subject |
Social media |
en_US |
dc.subject |
Regularized deep neural network (RDNN) |
en_US |
dc.title |
Post-authorship attribution using regularized deep neural network |
en_US |
dc.type |
Article |
en_US |