Dataset - B2FIND

NameTag 3 Multilingual Model 260521

This is a trained model for the supervised machine learning tool NameTag 3 (https://ufal.mff.cuni.cz/nametag/3/). NameTag 3 is an open-source tool for both flat and nested named...

Extracted and NER-ed Pi Newspaper Articles

JSONL records for each issue of digitised Pi (student periodical from UCL Special Collections) at UCL*. The issues are grouped into folders by publication date. *Disclaimer: The...

A Human-Annotated Dataset for Language Modeling and Named Entity Recognition ...

This is an open dataset of sentences from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains a corpus for language modeling and...

NameTag 3 Multilingual CoNLL Model

This is a trained model for the supervised machine learning tool NameTag 3 (https://ufal.mff.cuni.cz/nametag/3/), trained jointly on several NE corpora: English CoNLL-2003,...

NameTag 3 Multilingual Model 250203

This is a trained model for the supervised machine learning tool NameTag 3 (https://ufal.mff.cuni.cz/nametag/3/). NameTag 3 is an open-source tool for both flat and nested named...

Czech Named Entity Corpus 1.0

The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large body of manually annotated named entities in Czech sentences, including a...

NameTag 3 Czech CNEC 2.0 Model

This is a trained model for the supervised machine learning tool NameTag 3 (https://ufal.mff.cuni.cz/nametag/3/), trained on the Czech Named Entity Corpus 2.0...

A Human-Annotated Dataset for Language Modeling and Named Entity Recognition ...

This is an open dataset of sentences from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains a corpus for language modeling and...

NameTag 2 Models (2021-09-16)

NER models for NameTag 2, a named entity recognition tool, for English, German, Dutch, Spanish and Czech. Model documentation including performance can be found here:...

DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking

We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains...

It-Sr-NER: CLARIN compatible NER and geoparsing web services for parallel tex...

It-Sr-NER-corp is the Italian/Serbian bilingual corpus with 10,000 aligned sentences compiled in the scope of the It-Sr-project from samples of several Italian novels translated...

It-Sr-NER

The It-Sr-NER tool is a CLARIN-compatible NER web service for parallel texts, with a case study on Italian and Serbian; it can be used for recognizing and classifying named entities in...

French ELTEC NER Open Dataset

This dataset is derived from the annotation of named entities in a collection of 100 French novels from the "long" 19th century. The collection was assembled in the framework of...

Swe-NERC

A resource for training and evaluation of Named Entity Recognition for Swedish

Liner2.6 model NER NKJP

Liner2.6 NER NKJP model. The package contains a pre-trained Liner2 (https://github.com/CLARIN-PL/Liner2) model for recognizing named entities according to the NKJP guidelines. The...

Liner2.5 model NER

Prepared by: Michał Marcińczuk marcinczuk@gmail.com Date: 25.05.2016 Project:...

Liner2.5

Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions and events.

Liner2.5 rc3

A framework for multitask sequence labeling dedicated to natural language processing tasks.

18 datasets found