Literary Research with R

Quantitative Analysis of Textual Data, Quanteda taking as an example The Book of Disquiet

Authors

DOI:

https://doi.org/10.14195/1647-8622_22_7

Keywords:

Quanteda, R, Fernando Pessoa, textual data, distant reading

Abstract

This article aims to offer a research methodology with the Quanteda package, which uses the R language. The corpus for the analysis is the work of Fernando Pessoa. Quanteda (Quantitative Analysis of Textual Data) is an R package for the manipulation and analysis of textual data. The program was created by R users who needed to apply natural language processing to texts. Also, R is a programming language for statistical computing supported by the R Core Team and the R Foundation for Statistical Computing. The tool, therefore, allows the quantitative textual analysis of the corpus and offers visualization tools that represent the corpus analyses. From topic modeling to semantic networks or analysis of co-occurrences, the tools enable detailed studies of textual structures.

Downloads

Download data is not yet available.

Published

2022-12-06

Issue

Section

Caderno temático