-
Khresmoi Query Translation Test Data 2.0
This package contains data sets for development and testing of machine translation of medical queries between Czech, English, French, German, Hungarian, Polish, Spanish and... -
English Model (CoNLL-2003) for NameTag
English model for NameTag, a named entity recognition tool. The model is trained on CoNLL-2003 training data. Recognizes PER, ORG, LOC and MISC named entities. Achieves... -
Motion Encoding Lexicalization Patterns: Portuguese and English Learners
General Information: Data collector: Jean Costa Silva (University of Georgia) Date of collection: September-December 2022 Manner of collection: Online questionnaire via... -
Khresmoi Summary Translation Test Data 2.0
This package contains data sets for development (Section dev) and testing (Section test) of machine translation of sentences from summaries of medical articles between Czech,... -
Khresmoi Summary Translation Test Data 1.1
This package contains data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, and German. -
Bosworth-Toller’s Anglo-Saxon Dictionary online
Description: This is an online edition of An Anglo-Saxon Dictionary, or a dictionary of "Old English". The dictionary records the state of the English language as it was used... -
English Models (Morphium + WSJ) for MorphoDiTa
English models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from Morphium and... -
Türkisch-Englisch-Deutsch bei Herkunftssprechern (TEDH)
The TEDH has been created as part of the project "Foreign Language Acquisition in German-Turkish bilinguals". The TEDH Corpus contains interviews in three languages:... -
Interactions of native vs. modified protamines with Gram-negative model biome...
Protamines are cationic antimicrobial peptides found commonly in the sperm cells of fish such as herring and salmon that are particularly active against food-borne Gram-negative... -
Manufacture and standardisation of English Monumental letters
English brass letters form the earliest remains from decorated tomb slabs of the period 1250-1350 AD. The letters can be classified in several stylistic groups, and a certain... -
Covert translation: Business Communication (new)
Translation corpora of original texts with translations and comparable texts from the genre external business communication. Translation and comparison corpus with authentic... -
Covert translation: popular science
Translation corpora of original texts with translations and comparable texts from the genre popular scientific prose. Translation and comparison corpus with authentic... -
EXMARaLDA Demo corpus 1.1
A selection of short audio and video recordings in various languages to be used for instruction or demonstration of the EXMARaLDA system. The EXMARaLDA Demo Corpus is a small... -
Covert translation: Business Communication (old)
Translation corpora of original texts with translations and comparable texts from the genre external business communication. -
SkandSemikoAdmin
Data from the SkandSemiko project. -
The Spoken Wikipedia Corpora
The Spoken Wikipedia project unites volunteer readers of Wikipedia articles. Hundreds of spoken articles in multiple languages are available to users who are – for one... -
Community Interpreting Database Pilot Corpus (ComInDat)
Audio and video recordings of various types of community interpreted discourse (doctor-patient communication, simulated doctor-patient communication, courtroom communication) in...