Tutorial Package for: Text as Data in Economic Analysis

DOI

This tutorial package, comprising both data and code, accompanies the article and is designed primarily to allow readers to explore the various vocabulary-building methods discussed in the paper. The article discusses how to apply computational linguistics techniques to analyze largely unstructured corporate-generated text for economic analysis. As a core example, we illustrate how textual analysis of earnings conference call transcripts can provide insights into how markets and individual firms respond to economic shocks, such as a nuclear disaster or a geopolitical event: insights that often elude traditional non-text data sources. This approach enables extracting actionable intelligence, supporting both policy-making and strategic corporate decision-making. We also explore applications using other sources of corporate-generated text, including patent documents and job postings. By incorporating computational linguistics techniques into the analysis of economic shocks, new opportunities arise for real-time economic data, offering a more nuanced understanding of market and firm responses in times of economic volatility.

Python, 3.12.7

Identifier
DOI https://doi.org/10.34894/KNDZ9T
Metadata Access https://dataverse.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34894/KNDZ9T
Provenance
Creator Hassan, Tarek; Hollander, Stephan; Kalyani, Aakash; Van Lent, Laurence; Schwedeler, Markus; Tahoun, Ahmed
Publisher DataverseNL
Contributor TiU Dataverse Admins; Hollander, Stephan; Tilburg University; DataverseNL
Publication Year 2025
Funding Reference Deutsche Forschungsgemeinschaft (403041268-TRR 266) ; Institute for New Economic Thinking
Rights CC-BY-ND-4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by-nd/4.0
OpenAccess true
Contact TiU Dataverse Admins (Tilburg University); Hollander, Stephan (Tilburg University)
Representation
Resource Type Other, text; Dataset
Format text/x-python; application/vnd.openxmlformats-officedocument.spreadsheetml.sheet; text/csv; application/octet-stream; text/markdown; text/plain; application/x-ipynb+json
Size 3800; 10436; 6678744; 43909246; 493802528; 89054804; 8712017; 405; 1743; 179; 148; 136; 131; 194; 1600; 952; 1706; 56525
Version 2.0
Discipline Business and Management; Economics; Humanities; Linguistics; Social and Behavioural Sciences