-
New Gospels
Nowe Ateny -
NLP Web services and NLP workflow engine
Web based system for natural language processing of texts in Polish. It allows running complex workflows of language and machine learning tools. Making it avaliable via REST Web... -
MWE Sygietyński
Antoni Sygietyński -
Polish corpus of plWordNet usage examples
Corpus of 83k usage examples taken from plWordNet 3.0. All annotated with specific sense. All published on open licences. -
International Women's Day Corpus
The corpus contains articles form the daily "Trybuna Ludu" from years 1949-1956.The articles dealt with the situation of women, they were especially concerned with the... -
Indexes for djview4poliqarp
This is the archive of the mercurial repositories formerly available at https://bitbucket.org/jsbien/. They contain indexes to various resources in the DjVu format, in... -
Serel (WS)
Serel is a Python framework for recognition relations between annotations in text. -
Świgra — a parser of Polish
Świgra is a parser of Polish generating constituency trees using a DCG style grammar stemming from Marek Świdziński’s grammar “Gramatyka formalna języka polskiego” (1992). The... -
WoSeDon
WoSeDon is a tool for Word Sense Disambiguation. It works for polish texts and as a source of possible senses using plWordNet. -
Korpus testowy - ludzie
Korpus osób testowy -
Big Data language model in Word2Vec CBOW format.
Big Data language model in Word2Vec CBOW format. -
Liner2 events model
Liner2 model for event and event relation recognition -
ENIAM
ENIAM: Categorial Syntactic-Semantic Parser for Polish -
PoLitBert_v50k_linear_50k - Polish RoBERTa model
Polish RoBERTa model trained on Polish Wikipedia, Polish literature and Oscar. -
Bulhakov - corpus of events, temporal expressions and temporal relations
The corpus contains the text of the short story "Fatalne jaja" by Michaił Bułhakov (http:// www.wolnelektury.pl). The corpus is manually annotated with temporal expressions,... -
Keyword Extractor
Tool for extracting key phrases for text, using TextRank algorithm. -
Korpusiątko_Mateusz_Adamczyk(warsztaty)
Mały korpus -
HerBERT Large Pre-trained on KGR10 Data
HerBERT-large model fine-tuned on KGR10 data. The model was trained using DeepSpeed technology. -
Iwo Gall -teksty teatralne
Teksty teatralne Iwona Galla -
Big Data language model - subword - BPE - ARPA
Big data language model based on subword units, based on byte pair encoding in ARPA format