Uma Abordagem Computacional Baseada em Níveis Sintáticos da Língua na Atribuição de Autoria (A Computational Approach Based on Syntactic Levels of Language in Authorship Attribution)

Paulo Júnior Varela (paulovarela@utfpr.edu.br)1, Edson José Rodrigues Justino (edsonjustino@ppgia.pucpr.br)2, Flavio Bortolozzi (flaviobortolozzi@ppgia.pucpr.br)2, Luiz Eduardo Soares Oliveira (les.oliveira@ufpr.br)3


1Universidade Tecnológica Federal do Paraná
2Pontifícia Universidade Católica do Paraná
3Universidade Federal do Paraná

This paper appears in: Revista IEEE América Latina

Publication Date: Jan. 2016
Volume: 14, Issue: 1
ISSN: 1548-0992


Abstract:
This paper introduces a new approach to authorship attribution based on syntactic features of the language, namely those relating to the essential, integral and accessory terms of a sentence, such as subject, predicate and accessories. To this end, a database of texts in Portuguese was collected for the experiments. The proposed approach consists of experiments with several fusion methods (sum, mean, median and majority vote) to determine which method performs best for both authorship verification and authorship identification. The approach was evaluated using the dependent and independent models. With classifiers based on Support Vector Machines (SVM), accuracy ranged from 89% to 95% for authorship verification and from 81% to 90% for authorship identification.
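
As an illustration of the four fusion rules named in the abstract, the following is a minimal sketch of how per-classifier scores might be combined into a single decision. It is not the authors' implementation: the function name `fuse`, the score layout (one list of per-class scores per classifier) and the example values are all assumptions made for this sketch, and ties are simply resolved in favor of the lowest class index.

```python
from collections import Counter
from statistics import mean, median


def fuse(scores, method):
    """Combine classifier outputs and return the index of the winning class.

    scores[i][c] is the (hypothetical) score classifier i gives to class c.
    method is one of "sum", "mean", "median", "majority_vote".
    """
    n_classes = len(scores[0])
    per_class = list(zip(*scores))  # per_class[c] = scores for class c across classifiers

    if method == "sum":
        fused = [sum(col) for col in per_class]
    elif method == "mean":
        fused = [mean(col) for col in per_class]
    elif method == "median":
        fused = [median(col) for col in per_class]
    elif method == "majority_vote":
        # Each classifier votes for its own top-scoring class.
        votes = Counter(max(range(n_classes), key=row.__getitem__) for row in scores)
        fused = [votes.get(c, 0) for c in range(n_classes)]
    else:
        raise ValueError(f"unknown fusion method: {method}")

    # Ties go to the lowest class index (an arbitrary choice for this sketch).
    return max(range(n_classes), key=fused.__getitem__)


# Three hypothetical classifiers scoring two candidate authors (made-up values).
example_scores = [
    [0.90, 0.10],
    [0.45, 0.55],
    [0.40, 0.60],
]
decisions = {m: fuse(example_scores, m) for m in ("sum", "mean", "median", "majority_vote")}
```

With these made-up scores, the sum and mean rules follow the one highly confident classifier, while the median and majority vote follow the two classifiers that agree. This kind of disagreement is why comparing fusion rules can matter.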

Index Terms:
Authorship Attribution, Syntactic Features, Fusion Methods.   



