Literary Research with R
Quantitative Analysis of Textual Data, Quanteda taking as an example The Book of Disquiet
DOI:
https://doi.org/10.14195/1647-8622_22_7Keywords:
Quanteda, R, Fernando Pessoa, textual data, distant readingAbstract
This article aims to offer a research methodology with the Quanteda package, which uses the R language. The corpus for the analysis is the work of Fernando Pessoa. Quanteda (Quantitative Analysis of Textual Data) is an R package for the manipulation and analysis of textual data. The program was created by R users who needed to apply natural language processing to texts. Also, R is a programming language for statistical computing supported by the R Core Team and the R Foundation for Statistical Computing. The tool, therefore, allows the quantitative textual analysis of the corpus and offers visualization tools that represent the corpus analyses. From topic modeling to semantic networks or analysis of co-occurrences, the tools enable detailed studies of textual structures.
Downloads
Downloads
Published
Issue
Section
License
Copyright (c) 2022 Revista Estudos do Século XX

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows sharing the work with recognition of authorship and initial publication in this journal. The journal retains the copyright to the publication "Revista Estudos do Século XX" as a whole, while the author retains the rights to his individual publication.