-
Database of Catalan Adjectives
The database contains 2,296 alphabetically ordered adjective lemmata (rows) and 45 columns with various types of linguistic information about each lemma. The adjectives... -
ParlaMintCAT corpus
Parliamentary speeches are considered to be of interest for different research areas because they are publicly available transcriptions, produced under controlled and regulated... -
Corpus de les construccions comparatives intensificadores de la lletjor en ca...
Corpus de les construccions comparatives intensificadores en català, espanyol, anglès i francés. Les ocurrències que composen cadascun dels corpus han estat extretes a partir... -
CatCoLA - Catalan Corpus of Linguistic Acceptability
We introduce CatCoLA, the Catalan Corpus of Linguistic Acceptability that will contribute to the Catalan Language Understanding Benchmark (CLUB) to assess and compare the... -
MultiBooked_Corpora [research data]
We release two corpora of hotel reviews annotated for aspect-level sentiment analysis in Catalan and Basque. We also include scripts which allow the conversion to sentence-level... -
Catalan in a bilingual context (PhonCAT)
Audio recordings of prompted, read and spontaneous speech data from L1 Catalan speakers from Barcelona. The data is stratified according to three different city districts and...