Dataset - B2FIND

TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (Part 10, J...

TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for nearly 3.0 billion tweets, spanning more...

Dataset: tweets and analyses related to the paper 'The (Un)Predictability of ...

This dataset features all the tweetids and labels that were used to model the language of 24 hashtags, and test the performance on predicting the hashtags in unseen tweets. This...

Dataset: tweets and analysis related to the paper 'Signaling sarcasm: From hy...

This dataset features training and test tweets as well as insights into the classifier model related to the paper: Kunneman, F.A., Liebrecht, C.C., Mulken, M.J.P. van & Bosch,...

Dataset: tweets and analysis related to the paper 'Signaling sarcasm: From hy...

This dataset features training and test tweets as well as insights into the classifier model related to the paper: Kunneman, F.A., Liebrecht, C.C., Mulken, M.J.P. van & Bosch,...

Data: Timely identification of event start dates from Twitter

This directory features data that is discussed in the paper: F. Kunneman, A. Hürriyetoglu, N. Oostdijk and A. Van den Bosch (2014), Timely identification of event start dates...

Dataset: tweets and analyses related to the paper 'The (Un)Predictability of ...

This dataset features all the tweetids and labels that were used to model the language of 24 hashtags, and test the performance on predicting the hashtags in unseen tweets. This...

Geopolitics of artivism datafiles

Data files for the research project Geopolitics of Artivism (ISBN: 978-90-361-0641-2). This dataset contains the following files:- tweets_final.txt: all tweets that mention...

Data: Timely identification of event start dates from Twitter

This directory features data that is discussed in the paper: F. Kunneman, A. Hürriyetoglu, N. Oostdijk and A. Van den Bosch (2014), Timely identification of event start dates...

Under His Thumb. The Effect of President Donald Trump's Twitter Messages on t...

Does president Trump’s use of Twitter affect financial markets? The president frequently mentions companies in his tweets and, as such, tries to gain leverage over their...

Sarcastic Soulmates: Intimacy and irony markers in social media messaging

We research the use of sarcasm on Twitter, and show that a computer has more difficulty to detect sarcasm shared among peers than sarcasm shared with any interested audience....

Dataset: output related to the paper 'Event detection in Twitter: A machine-l...

This dataset features the output of intermediate steps and the final output of the research that is described in the paper: F. Kunneman and A. Van den Bosch (2014), Event...

Dataset: Events and periodicity analysis related to the paper 'Automatically ...

This dataset features information on all the events that were automatically extracted from Twitter and used as input to periodicity detection, as described in the paper: F....

Dataset: tweets and events linked to the paper 'Open-domain extraction of fut...

Input data and output of research conducted in the study described in the paper: F. Kunneman and A. Van den Bosch (2016), Open-domain extraction of future events from Twitter,...

Multilingual dataset of COVID tweets for relation-level metaphor analysis TCM...

TCMeta is a dataset of noun phrase constructions from COVID-related tweets, annotated for relation-level metaphor. It contains 2,138 Slovene and 2,221 English instances in...

Replication Data for: Climate Nags: Affect and the Convergence of Global Risk...

This data set contains the IDs of the 1,186,322 tweets used in "Climate Nags: Affect and the Convergence of Global Risk in Online Networks" (published in Continuum, 2023). The...

Slovenian Twitter debate ahead of 2019 European Parliament elections

Namen raziskave je bil s pomočjo analize omrežij preučiti strukturo slovenske politične razprave na Twitterju pred volitvami v Evropski parlament leta 2019, pri čemer je bil...

Dataset: input and results related to the paper 'Anticipointment detection in...

This dataset features the training models, emotion classifications and emotion patterns before and after events, related to the paper: F. Kunneman, M. van Mulken and A. Van den...

Comparative Dataset of International Media Activity on Twitter and Instagram ...

This dataset contains Twitter (2022) and Instagram (2023) posts from five leading international news outlets (The New York Times, The Guardian, USA Today, The Independent, and...

Slovenian Day of Resistance X & news corpus

The dataset contains social media posts from X and traditional media articles from online news sources related to the Slovenian commemorations of the Day of Resistance. We used...

Pre-trained POS tagging models for German social media

Pre-trained POS tagging models for the HunPos tagger (Halácsy et al. 2007) the biLSTM-char-CRF tagger (Reimers & Gurevych 2017) Online-Flors (Yin et al. 2015)....

54 datasets found