Dataset: output related to the paper 'Event detection in Twitter: A machine-learning approach based on term pivoting'

This dataset features the output of intermediate steps and the final output of the research that is described in the paper: F. Kunneman and A. Van den Bosch (2014), Event detection in Twitter: A machine-learning approach based on term pivoting, In: F. Grootjen, M. Otworowska, and J. Kwisthout (Eds.), Proceedings of the 26th Benelux Conference on Artificial Intelligence, pp. 65-72, http://hdl.handle.net/2066/132203 The paper describes an approach to extracting events (of all types) from Twitter by identifying words that are suddenly mentioned frequently in tweets, clustering similar terms together and training a model to distinguish significant from insignificant event clusters. The current dataset features the word counts that are used to find words with a sudden high frequency, the clusters of such words, and the classification decisions for event significance.   Tweets relate to the following period: June 22nd 2013 - August 22nd 2013.

Identifier
DOI https://doi.org/10.17026/dans-zk4-aq5x
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-9aus-kx
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:68149
Provenance
Creator Kunneman, F.A.; Bosch, A.P.J. van den
Publisher Data Archiving and Networked Services (DANS)
Contributor Radboud University
Publication Year 2017
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Language Dutch; Flemish
Resource Type Dataset
Format PDF; TXT
Discipline Computer Science; Computer Science, Electrical and System Engineering; Engineering Sciences
Spatial Coverage The Netherlands, Flanders