Dataset - B2FIND

Replication data for: CHIR99021 causes inactivation of Tyrosine Hydroxylase a...

CHIR99021, also known as laduviglusib or CT99021, is a Glycogen-synthase kinase 3β (GSK3β) inhibitor, which has been reported as a promising drug for cardiomyocyte regeneration...

Database of Catalan Adjectives

The database contains 2,296 alphabetically ordered adjective lemmata (rows) and 45 columns with various types of linguistic information about each lemma. The adjectives...

Corpus de les construccions comparatives intensificadores de la lletjor en ca...

Corpus of intensifying comparative constructions in Catalan, Spanish, English, and French. The occurrences that make up each of the corpora have been extracted from...

PANACEA Environment Corpus n-grams IT (Italian)

This data set contains Italian word n-grams and Italian word/tag/lemma n-grams in the "Environment" (ENV) domain. N-grams are accompanied by their observed frequency counts. The...

PANACEA Labour Legislation Corpus n-grams EN (English)

This data set contains English word n-grams and English word/tag/lemma n-grams in the "Labour Legislation" (LAB) domain. N-grams are accompanied by their observed frequency...

PANACEA Environment Corpus n-grams ES (Spanish)

This data set contains Spanish word n-grams and Spanish word/tag/lemma n-grams in the "Environment" (ENV) domain. N-grams are accompanied by their observed frequency counts. The...

PANACEA Environment Corpus n-grams FR (French)

This data set contains French word n-grams and French word/tag/lemma n-grams in the "Environment" (ENV) domain. N-grams are accompanied by their observed frequency counts. The...

PANACEA Labour Legislation Corpus n-grams IT (Italian)

This data set contains Italian word n-grams and Italian word/tag/lemma n-grams in the "Labour" (LAB) domain. N-grams are accompanied by their observed frequency counts. The...

PANACEA Environment Corpus n-grams EN (English)

This data set contains English word n-grams and English word/tag/lemma n-grams in the "Environment" (ENV) domain. N-grams are accompanied by their observed frequency counts....

PANACEA Labour Legislation Corpus n-grams FR (French)

This data set contains French word n-grams and French word/tag/lemma n-grams in the "Labour" (LAB) domain. N-grams are accompanied by their observed frequency counts. The length...

PANACEA Labour Legislation Corpus n-grams ES (Spanish)

Replication Data for: “Threat” in Russian – A Linguistic Perspective

The dataset includes examples of usages of groza and ugroza from the Russian National Corpus (RNC). The dataset covers the period from 1700 to 2020 and consists of 4858...

KIParla - KIP transcripts

The KIP corpus is part of the larger KIParla collection (www.kiparla.it), which can be freely queried through the NoSketch Engine interface. The KIP corpus was compiled within...

ChiSCor: Children's Story Corpus

ChiSCor is a new corpus containing 619 fantasy stories, told freely by 442 Dutch children aged 4-12. ChiSCor was compiled for studying how children render character...

Pre-trained POS tagging models for German social media

Pre-trained POS tagging models for the HunPos tagger (Halácsy et al. 2007), the biLSTM-char-CRF tagger (Reimers & Gurevych 2017), and Online-Flors (Yin et al. 2015)....

Corpus from the Aozora Bunko Library

This corpus contains a subset of available texts from the Aozora Bunko public library project, which contains various works of mostly older literature in Japanese. A custom...

Acts of ʾAzqir

Acts of ʾAzqir from the Corpus of the Classical Ethiopic Language (Ge'ez), produced by the TraCES project (https://www.traces.uni-hamburg.de/en/about.html) in...

How to use an EXMARaLDA corpus

This document explains how to use EXMARaLDA corpora from www.exmaralda.org, www.corpora.uni-hamburg.de and other sites. Examples are taken from the EXMARaLDA demo corpus. Other...

Book of Mystery

Book of Mystery from the Corpus of the Classical Ethiopic Language (Ge'ez), produced by the TraCES project (https://www.traces.uni-hamburg.de/en/about.html) in...

56 datasets found