A dataset of 1500-word stories generated by gpt-4o-mini for 236 nationalities

DOI

We created a dataset of stories generated by OpenAI’s gpt-4o-miniby using a Python script to construct prompts that were sent to the OpenAI API. We used Statistics Norway’s list of 252 countries, added demonyms for each country, for example Norwegian for Norway, and removed countries without demonyms, leaving us with 236 countries. Our base prompt was “Write a 1500 word potential {demonym} story”, and we generated 50 stories for each country.

The scripts used to generate the data, and additional scripts for analysis are available at the GitHub repository https://github.com/MachineVisionUiB/GPT_stories

Python, 3.11

OpenAI API, gpt-4o-mini

distilbert-base-uncased-emotion, not specified

TextBlob, 0.19.0

Identifier
DOI https://doi.org/10.18710/VM2K4O
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/VM2K4O
Provenance
Creator Rettberg, Jill Walker ORCID logo; Wigers, Hermann ORCID logo
Publisher DataverseNO
Contributor Rettberg, Jill Walker; AI STORIES; University of Bergen; Center for Digital Culture; Wigers, Hermann; Robinson, Colin
Publication Year 2025
Funding Reference European Research Council 101142306 ; Research Council of Norway 332643
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Rettberg, Jill Walker (University of Bergen)
Representation
Resource Type AI-generated text; Dataset
Format text/plain; text/comma-separated-values; application/zip
Size 19740; 18583; 42408986
Version 1.0
Discipline Humanities
Spatial Coverage Bergen, Norway