arXiv Open Access 2024

Grammar as a Behavioral Biometric: Using Cognitively Motivated Grammar Models for Authorship Verification

Andrea Nini Oren Halvani Lukas Graner Valerio Gherardi Shunichi Ishihara

Lihat Sumber

Abstrak

Authorship Verification (AV) is a key area of research in digital text forensics, which addresses the fundamental question of whether two texts were written by the same person. Numerous computational approaches have been proposed over the last two decades in an attempt to address this challenge. However, existing AV methods often suffer from high complexity, low explainability and especially from a lack of clear scientific justification. We propose a simpler method based on modeling the grammar of an author following Cognitive Linguistics principles. These models are used to calculate $λ_G$ (LambdaG): the ratio of the likelihoods of a document given the candidate's grammar versus given a reference population's grammar. Our empirical evaluation, conducted on twelve datasets and compared against seven baseline methods, demonstrates that LambdaG achieves superior performance, including against several neural network-based AV methods. LambdaG is also robust to small variations in the composition of the reference population and provides interpretable visualizations, enhancing its explainability. We argue that its effectiveness is due to the method's compatibility with Cognitive Linguistics theories predicting that a person's grammar is a behavioral biometric.

Topik & Kata Kunci

cs.CL cs.LG

Penulis (5)

Andrea Nini

Oren Halvani

Lukas Graner

Valerio Gherardi

Shunichi Ishihara

Format Sitasi

APA MLA BibTeX

Nini, A., Halvani, O., Graner, L., Gherardi, V., Ishihara, S. (2024). Grammar as a Behavioral Biometric: Using Cognitively Motivated Grammar Models for Authorship Verification. https://arxiv.org/abs/2403.08462

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓