Dataset: tweets and events linked to the paper 'Open-domain extraction of future events from Twitter'

Input data and research output of the study described in the paper:

F. Kunneman and A. Van den Bosch (2016), Open-domain extraction of future events from Twitter, Natural Language Engineering, doi: 10.1017/S1351324916000036

The paper describes a system that extracts future-referring time expressions and entities from Twitter messages, and subsequently detects events as pairs of a date and an entity that are often mentioned in the same tweet. This dataset contains the ids of a large set of Dutch tweets posted in August 2014, which was used as input to the system, as well as the time expression and/or entity that was extracted from each tweet, if any. The detected events are also included, each represented as a date, one or more describing terms, the ids of the tweets that refer to it, and the assessment of the event by human annotators.
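
The idea of an event as a date and entity that share tweets can be illustrated with a minimal Python sketch. It is not the system from the paper: the record layout, the function name detect_events, the min_tweets parameter and the sample values are all hypothetical, and a plain count threshold stands in for whatever scoring the paper applies to decide that a pair is mentioned "often".

```python
from collections import defaultdict


def detect_events(tweets, min_tweets=2):
    """Collect tweet ids per (date, entity) pair and keep well-supported pairs.

    `tweets` is an iterable of (tweet_id, date, entities) records, where
    `date` is the extracted future date (or None) and `entities` is a list
    of extracted entities (possibly empty). This layout is an assumption
    made for illustration, not the format of the dataset files.
    """
    support = defaultdict(set)
    for tweet_id, date, entities in tweets:
        if date is None:
            # A tweet without a time expression cannot contribute a pair.
            continue
        for entity in entities:
            support[(date, entity)].add(tweet_id)
    # Simple count threshold (assumption): keep pairs seen in enough tweets.
    return {
        pair: sorted(ids)
        for pair, ids in support.items()
        if len(ids) >= min_tweets
    }


# Hypothetical sample records, not taken from the dataset.
tweets = [
    ("1", "2014-08-30", ["concert"]),
    ("2", "2014-08-30", ["concert", "amsterdam"]),
    ("3", None, ["concert"]),
    ("4", "2014-08-31", ["marathon"]),
]
events = detect_events(tweets)
```

With these sample records, only the pair ("2014-08-30", "concert") is kept, together with the ids of the two tweets that mention both; this mirrors how each event in the dataset carries a date, describing terms and the ids of the tweets referring to it.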

Identifier
DOI https://doi.org/10.17026/dans-227-36wn
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-wfmy-w8
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:68091
Provenance
Creator Kunneman, F.A.; Bosch, A.P.J. van den
Publisher Data Archiving and Networked Services (DANS)
Contributor Radboud University
Publication Year 2017
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Language Dutch; Flemish
Resource Type Dataset
Format PDF; TXT; INI
Discipline Computer Science; Computer Science, Electrical and System Engineering; Engineering Sciences
Spatial Coverage The Netherlands