Dataset: tweets and analyses related to the paper 'The (Un)Predictability of Emotional Hashtags in Twitter'

This dataset features all the tweetids and labels that were used to model the language of 24 hashtags, and test the performance on predicting the hashtags in unseen tweets. This study is described in:

Kunneman, F.A., Liebrecht, C.C. & Bosch, A.P.J. van den (2014). The (Un)Predictability of Emotional Hashtags in Twitter. In Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM) @ EACL 2014 (pp. 26-34). s.l.: Association for Computational Linguistics, http://hdl.handle.net/2066/127067

In addition to the train and test data, this dataset includes the most indicative features (words and phrases) for four of the hashtags, as well as the human judgement whether the tweets that contain or are classified with these hashtags convey the presumed emotion of the hashtags.

Subject period: December 16th 2010 until February 1st 2013

Identifier
DOI https://doi.org/10.17026/dans-zs9-fj3t
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-lp1m-xp
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:72554
Provenance
Creator Kunneman, F.A.; Liebrecht, C.C.; Bosch, A.P.J. van den
Publisher Data Archiving and Networked Services (DANS)
Contributor Radboud University
Publication Year 2017
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Language English
Resource Type Dataset
Format TXT; INI; PDF
Discipline Communication Science; Computer Science; Computer Science, Electrical and System Engineering; Engineering Sciences; Social Sciences; Social and Behavioural Sciences
Spatial Coverage The Netherlands; Flanders