-
Informcje niefinansowe
Raporty informacji niefinansowych spółek notowanych na GPW w Warszawie -
Cleaned Polish Oscar corpus (128M lines)
Cleaned Polish Oscar corpus (part: 128M lines, 3.53 GB). Data was prepared with a few cleaning heuristics: - remove sentences shorter than - remove non-polish sentences... -
Feminism
How do Poles understand the concepts of feminism and feminist and how do they use these terms? Reconnaissance. -
Wcrft test
Wcrft test -
Korpus test - Wikinews
Testowa baza na zajęcia -
Teksty reklam TVP ABC
teksty reklam emitowane na kanale TVP ABC miedzy lipcem 2014 a styczniem 2015 -
SpatialPL
SpatialPl is a tool for automatic recognition of spatial expressions in Polish texts -
Aleksander Zelwerowicz - teksty teatralne
Teksty teatralne Aleksandra Zelwerowicza -
MWE Żeromski
Stefan Żeromski -
małesermony
małe sermony -
The procedure of the correction of plWordNet (ver. 1)
The pdf entails the specificationof tipical errors of lexicographic description of lexical units and synsets in plWordNet, and the procedure of them manual correction. -
Liner2 temporal expressions model
Liner2 model for temporal expression recognition and normalisation -
SermonsEN
Sermons in English -
Big Data language model - subword - BPE - ARPA
Big data language model based on subword units, based on byte pair encoding in ARPA format -
Iwo Gall -teksty teatralne
Teksty teatralne Iwona Galla -
HerBERT Large Pre-trained on KGR10 Data
HerBERT-large model fine-tuned on KGR10 data. The model was trained using DeepSpeed technology. -
Bulhakov - corpus of events, temporal expressions and temporal relations
The corpus contains the text of the short story "Fatalne jaja" by Michaił Bułhakov (http:// www.wolnelektury.pl). The corpus is manually annotated with temporal expressions,... -
Linguistic presentation of object structure and relations in OSL language
OSL is a markup universal language for linguistic description of any object in terms of structure and behavior. The kernel is presented and subsets for IT system, business... -
Korpusiątko_Mateusz_Adamczyk(warsztaty)
Mały korpus -
Keyword Extractor
Tool for extracting key phrases for text, using TextRank algorithm.
