-
SentiLex-PT 02
SentiLex-PT is a sentiment lexicon for Portuguese, made up of 7,014 lemmas and 82,347 inflected forms. In detail, the lexicon describes: 4,779 (16,863) adjectives, 1,081... -
HELLO CAMPANIA! Philippines Collection
The Philippines collection contains data for 66 speakers: 32 first-generation (G1), 28 second-generation (G2), and 6 homeland (G0). The collection contains three folders for each... -
HELLO CAMPANIA! Bangladesh Collection
The collection contains 11 interviews with first-generation Bangladeshi migrants in Naples. It also contains language portraits of the migrants. -
HELLO CAMPANIA! Ukraina Collection
The Ukrainian collection contains data for 26 speakers of first generation (G1), 19 females and 6 males. The collection contains three folders for each group: the... -
Concerto di Caterina Bueno - CB-CONC-042-01
Concert by Caterina Bueno - CB-CONC-042-01 -
Registrazione di un incontro con più testimoni - CB-RIC-002-01
Recording of a meeting with several witnesses - CB-RIC-002-01 -
Prove (voce maschile e chitarra) - CB-PROV-189-01
Rehearsal (male voice and guitar) - CB-PROV-189-01 -
Slovenian commonsense reasoning model SloMET-ATOMIC 2020
SloMET-ATOMIC 2020 is a Slovene commonsense reasoning model that can predict commonsense descriptions in natural language for a given input sentence. The model is... -
Monitor corpus of Slovene Trendi 2024-10
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 76 publishers. Trendi 2024-10 covers the period from January... -
Monitor corpus of Slovene Trendi 2024-11
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 76 publishers. Trendi 2024-11 covers the period from January... -
Monitor corpus of Slovene Trendi 2024-12
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 76 publishers. Trendi 2024-12 covers the period from January... -
Monitor corpus of Slovene Trendi 2025-01
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 77 publishers. Trendi 2025-01 covers the period from January... -
Monitor corpus of Slovene Trendi 2025-02
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 77 publishers. Trendi 2025-02 covers the period from January... -
Monitor corpus of Slovene Trendi 2025-03
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 56 publishers. Trendi 2025-03 covers the period from January... -
Monitor corpus of Slovene Trendi 2025-04
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 56 publishers. Trendi 2025-04 covers the period from January... -
Lithuanian Hate Speech Corpus v.1
This corpus consists of (1) examples of hate speech based on ethnicity, nationality, or race, and (2) a collection of neutral comments, including both general comments and...