-
Monitor corpus of Slovene Trendi 2025-08
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 58 publishers. Trendi 2025-08 covers the period from January... -
Monitor corpus of Slovene Trendi 2025-07
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 57 publishers. Trendi 2025-07 covers the period from January... -
Monitor corpus of Slovene Trendi 2025-06
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 57 publishers. Trendi 2025-06 covers the period from January... -
Monitor corpus of Slovene Trendi 2025-05
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 57 publishers. Trendi 2025-05 covers the period from January... -
Frequency lists of word-level n-grams from the Trendi corpus 2021
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus of Slovene (version 2022-05: http://hdl.handle.net/11356/1590) using the LIST... -
Monitor corpus of Slovene Trendi 2023-11
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 70 publishers. Trendi 2023-11 covers the period from January... -
Frequency lists of word-level n-grams from the Trendi corpus 2019
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus of Slovene (version 2022-05: http://hdl.handle.net/11356/1590) using the LIST... -
Manually sentiment annotated Slovenian news corpus SentiNews 1.0
Between 2 and 6 annotators independently sentiment annotated a stratified random sample of 10,427 documents from the Slovenian news portals 24ur, Dnevnik, Finance, Rtvslo, and... -
Corpus of Croatian news portals ENGRI (2014-2018)
The corpus consists of texts collected from the most popular (based on the Reuters Institute Digital News Report for 2018, retrieved from http://www.digitalnewsreport.org in... -
Multilingual IPTC Media Topic dataset EMMediaTopic 1.0
The multilingual IPTC Media Topic dataset EMMediaTopic 1.0 is a collection of news articles in Catalan, Croatian, Greek, and Slovenian, automatically annotated with the 17... -
Corpus of Bosnia and Herzegovina language-related news articles MetaLangNEWS-Bs
A comprehensive corpus of news articles on the topic of language, published in major daily newspapers and news portals in Bosnia and Herzegovina in the five-year period of... -
The news articles reporting on the 2021 Tokyo Olympics data set OG2021 (public)
The OG2021 corpus contains multilingual news articles that are reporting on the events happening during the 2021 Tokyo Olympics. The data set was created to evaluate the... -
Annotated corpus of Croatian language-related news articles MetaLangNEWS-Hr
A comprehensive corpus of news articles on the topic of language, published in major Croatian daily newspapers and news portals in the five-year period of January 1, 2015 -... -
Frequency lists of word-level n-grams from the Trendi corpus 2020
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus of Slovene (version 2022-05: http://hdl.handle.net/11356/1590) using the LIST... -
Monitor corpus of Slovene Trendi 2023-09
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 70 publishers. Trendi 2023-09 covers the period from January... -
Monitor corpus of Slovene Trendi 2024-01
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 70 publishers. Trendi 2024-01 covers the period from January... -
Annotated corpus of Serbian language-related news articles MetaLangNEWS-Sr
A comprehensive corpus of news articles on the topic of language, published in major Serbian daily newspapers and news portals in the five-year period of January 1, 2015 -... -
Annotated corpus of Slovenian language-related news articles MetaLangNEWS-Sl
A comprehensive corpus of news articles on the topic of language, published in major Slovenian daily newspapers and news portals in the five-year period of January 1, 2015 -... -
Corpus of Montenegrin language-related news articles MetaLangNEWS-Me
A comprehensive corpus of news articles on the topic of language, published in major Montenegrin daily newspapers and news portals in the five-year period of January 1, 2015 -... -
Monitor corpus of Slovene Trendi 2024-06
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 74 publishers. Trendi 2024-06 covers the period from January...