Use este identificador para citar ou linkar para este item: http://repositorio.ufla.br/jspui/handle/1/56513
Título: Individuação de autoria e identificação de estilo: análise de obras literárias com auxílio do R
Título(s) alternativo(s): Individuation of authorshipand style identification: analysis of literary works carried with R
Individuación de autoría e identificación de estilo: análisis de obras literárias con R
Palavras-chave: Detecção de autoria
Análise estilométrica
Linguagem de programação R
Authorship detection
Stylometric analysis
R programming language
Detección de autoría
Análsis estilométrico
Lenguaje de programación R
Data do documento: 23-Nov-2022
Editor: Universidade Federal de Santa Catarina (UFSC), Campus Universitário Trindade
Citação: SILVA, L. F. L. E.; CIRÍACO, L. Individuação de autoria e identificação de estilo: análise de obras literárias com auxílio do R. Fórum Linguístico, Florianópolis, v. 19, n. 3, p. 8214-8231, 2022. DOI: 10.5007/1984-8412.2022.e79086.
Resumo: This paper adds to the works available on Natural Language Processing by providing a demonstration of how programming languages such as R (R CORE TEAM, 2020) can be useful in detecting authorship and identifying the style of the author in literary works. Two authors and two works each were selected, namely: The Adventures of Tom Sawyer (1876) and Adventures of Huckleberry Finn (1884) by author Mark Twain (1835-1910), and Typee: A Peep at Polynesian Life (1846) and Omoo: A Narrative of Adventures in the South Seas (1847) by author Herman Melville (1819-1891). Subsequently, the data were analyzed following the same methodology as Eder et al. (2016), in order to test the effectiveness of the stylo package and apply the Principal Component Analysis, Cluster Analysis and Consensus Tree methods. The results showed that each of the tested methods was able to distinguish the works of the authors, thus evidencing the effectiveness of the package used. In addition, a stylometric analysis is performed based on Craig's Zeta and Rolling Delta methods. For the latter, works by two German-speaking authors, Frank Kafka and Heinrich von Kleist, were used. The results pointed to a stylistic similarity of von Kleist, especially in Kafka’s first work. Additionally, Rolling Delta was used to explore an analysis carried by Juola (2013a, 2013b) regarding a work by J. K. Rowling written under the pseudonym of Robert Galbraith.
URI: http://repositorio.ufla.br/jspui/handle/1/56513
Aparece nas coleções:DCH - Artigos publicados em periódicos

Arquivos associados a este item:
Arquivo Descrição TamanhoFormato 
ARTIGO_Individuação de autoria e identificação de estilo análise de obras literárias com auxílio do R.pdf3,54 MBAdobe PDFVisualizar/Abrir


Este item está licenciada sob uma Licença Creative Commons Creative Commons