-
PANACEA Labour Legislation Corpus n-grams EN (English)
This data set contains English word n-grams and English word/tag/lemma n-grams in the "labour Legislation" (LAB) domain. N-grams are accompanied by their observed frequency... -
PANACEA Annotated Dependency Greek Labour Legislation Corpus Version 2
PANACEA Annotated Greek Labour Legislation Corpus Version 2 consists of Greek texts in the Labour Legislation (LAB) domain that were collected and automatically annotated in the... -
PANACEA Environment Corpus n-grams IT (Italian)
This data set contains Italian word n-grams and Italian word/tag/lemma n-grams in the "Environment" (ENV) domain. N-grams are accompanied by their observed frequency counts. The... -
PANACEA Spanish automatically acquired lexicon for ENV domain: Subcategorizat...
This is a domain-specific lexicon for Spanish for environment (ENV) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns.... -
PANACEA Environment Bilingual Glossary FR-EN (French-English)
This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for FR-EN language pair. The tables are firstly... -
PANACEA Labour and Repubblica merged Italian Lexicon
The Italian PANACEA_rep_lab_merged.lmf.xml is SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (labour) PANACEA_SCF_IT_labour.lmf.xml and a... -
PANACEA Italian V-SUBCAT gold-standard for LAB domain
The PANACEA_SCF_Gold_LAB_IT is a manually created "gold-standard" lexicon of verbal subcategorisation frames for 27 verb lemmas. The language is Italian and the domain is Labour... -
PANACEA Italian Parole V-SUBCAT Gold Standard lexicon
The PAROLE-SCF-31-IT is a lexicon of verb subcategorisation frames for 31 verb lemmas extracted from the PAROLE Italian Lexicon (Ruimy et a. 2003). -
PANACEA Labour and Parole merged Italian Lexicon
The Italian PAROLE_lab_merged.lmf.xml is SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (labour) pANACEA_SCF_IT_labour.lmf.xml and a the... -
PANACEA English V-SUBCAT gold-standard for LAB domain
This is a domain-specific gold-standard for English subcategorization frames, in the case, for labour (LAB) domain. This gold-standard was manually developed, choosing a set of... -
Mridangam stroke dataset
The Mridangam Stroke dataset is a collection of 7162 audio examples of individual strokes of the Mridangam in various tonics. The dataset comprises of 10 different strokes... -
Home-to-school pedestrian mobility GPS data from a citizen science experiment...
This data-set contains high resolution GPS records from a single day home-to-school pedestrian mobility of 10 schools in the Barcelona Metropolitan Area (Spain). The experiment... -
Database of Catalan Adjectives
The database contains 2,296 alphabetically ordered adjective lemmata (rows) and 45 columns with various types of linguistic information about each lemma. The adjectives... -
Raw data of the study Cannabis and public health: A study assessing regular c...
This is the raw data used for the analyses of the study. It includes all the variables analyzed in the manuscript. The file "Main data" corresponds to all variables relative to... -
Dataset about the Spanish academic libraries’ perceptions of Open Science. Dr...
This data set is made up of the data collected in two files ( odt., with the survey instrument and csv., with the responses to this survey) to carry out the analysis of the... -
Dades comparatives de formació presencial i virtual asíncrona de la SGPC
Dades provinents de formularis d'inscripció, d'enquestes de satisfacció i d'un qüestionari de percepció de l'usuari dissenyat ad-hoc i validat per experts per analitzar la taxa... -
Dendrograma de los sellos de ánforas olearias Dressel 20 de la serie de C. Iu...
Los dendrogramas son una nueva herramienta de estudio que nos permite organizar los sellos de un determinado taller de manera más eficiente que la que conseguimos con los... -
Platform cooperativism: cases and stakeholders analysis
This data set will include three sources of data: 1) Data that records both the web collection of 60 platform economy cases around Europe and the results of 20 interviews to... -
ERC Artsoundscapes project – rough data related to the Facebook page
Datos recogidos en el marco del proyecto de la ERC "The sound of special places: exploring rock art soundscapes and the sacred" (acrónimo: Artsoundscapes) que tiene como... -
Dataset la diada de Sant Jordi dels arxius públics de TV3
[eng] This a dataset related to the Diada de Sant Jordi through TV3 public audiovisual archive. This dataset contains for a total of 572 videos with title, link, length in time...