-
plWordNet 3.0 Słowosieć 3.0
plWordNet is a lexico-semantic network which reflects the lexical system of the Polish language. plWN currently contains 178 000 nouns, verbs, adjectives, and adverbs, 259 000... -
Walenty (2016-04-28)
Walenty is a valence dictionary of Polish developed at the Institute of Computer Science, Polish Academy of Sciences (IPI PAN). The original formalism of Walenty was established... -
PoLitBert_v32k_linear_125k - Polish RoBERTa model
Polish RoBERTa model trained on Polish Wikipedia, Polish literature and Oscar. -
Toposław
Toposław is an editor of multi-word unit inflection lexicons. -
MWE Zapolska
Gabriela Zapolska -
Genology
Corpus -
WCRFT Webservice
Webservice for Weblicht -
TimeAssign
TimeAssign is a program which recognizes temporal expressions and assigns TimeML labels to words in Polish text using a Bi-LSTM based neural net and wordform embeddings. -
Korpus nagrań radiowych
A collection of radio 192 recordings, with around 200 speakers, each no longer than 40 minutes long. Audio saved as RAW 16-bit 16 kHz sampling frequency. -
War with striped beetle in main Polish communist party newspaper "Trybuna Lud...
Articles from main Polish communist party newspaper "Trybuna Ludu" concerning battle with potato beetle allegedly drop down by US Government to Poland and other socialists... -
WCCL
WCCL (Wrocław Corpus Constraint Language) is a formalism for writing functional expressions evaluated on morpho-syntactically annotated text. These expressions may be used... -
Life Story Interview - Polish sample
Life story interview -
Korpus - specyfikacje
This dataset has no description
-
Badanie infosfery Kronik Fallathanu - Inforex, materiały.
Korpus, w którego skład wchodzą przykładowe teksty pozyskane ze strony Kroniki Fallathanu. Posłużyły one analizie najczęściej występujących anotacji, które pozwoliły na... -
Blogi_zip 02
blogi zip -
NPSemRel
NPSemrel is a tool for recognizing semantic roles into nominal Phrases. -
wikinewsy
wikinewsy -
MWE Reymont, Chłopi
Władysław Reymont -
KPWr n82 NER model (on Polish RoBERTa base)
The named entity recognition model for fine-grained categories of entities (82 types) was trained on the KPWr corpus using Polish RoBERTa base language model. Details can be... -
Transkrypcja fonetyczna Kronik RP
This is a phonetic transcription of the "Kroniki RP" data set using the G2P tool available at mowa.clarin-pl.eu.