Big Data language model in FastText CBOW format
Big Data language model in FastText CBOW format -
MWE 10 Największych
dabrowska_nocednie3_1933.txt prus_emancypantki_1894.txt sienkiewicz_ogniem_1884.txt kaczkowski_grob_1857.txt prus_faraon_1897.txt sienkiewicz_rodzina_1894.txt... -
Street name changes in Poznań, Słubice and Zbąszyń, Poland 1916-2018
The corpus presents a historical overview of street and place (park, bridge, square) name changes in the years 1916-2018 for three Polish cities: Poznań, Słubice and Zbąszyń.... -
POLFIE-OT: an LFG grammar of Polish with OT marks
POLFIE-OT is a version of POLFIE, an LFG grammar of Polish implemented in the XLE system (Xerox Linguistic Environment), enriched with OT (Optimality Theory) constraints for the... -
WCRFT WebLichtService
WCRFT service for WebLicht -
Wiki train - 34 categories
Wikipedia, 34 categories: a training set for a classifier -
ENIAMtoolkit
ENIAMtoolkit is a collection of libraries that perform tokenization, lemmatization, and part-of-speech tagging; detect MWEs and abbreviations; and split text into sentences. -
Vector Extractor
Collocations presented are based on co-occurrences of a selected noun with several features describing it and linked with it by syntactic dependencies. The recognised features... -
New Gospels
Nowe Ateny -
NLP Web services and NLP workflow engine
Web-based system for natural language processing of texts in Polish. It allows running complex workflows of language and machine learning tools. Making it available via REST Web... -
MWE Sygietyński
Antoni Sygietyński -
Polish corpus of plWordNet usage examples
A corpus of 83k usage examples taken from plWordNet 3.0, each annotated with a specific sense and published under open licences. -
International Women's Day Corpus
The corpus contains articles from the daily "Trybuna Ludu" from the years 1949-1956. The articles dealt with the situation of women, they were especially concerned with the... -
Indexes for djview4poliqarp
This is the archive of the Mercurial repositories formerly available at https://bitbucket.org/jsbien/. They contain indexes to various resources in the DjVu format, in... -
Serel (WS)
Serel is a Python framework for recognizing relations between annotations in text. -
Świgra — a parser of Polish
Świgra is a parser of Polish generating constituency trees using a DCG style grammar stemming from Marek Świdziński’s grammar “Gramatyka formalna języka polskiego” (1992). The... -
WoSeDon
WoSeDon is a tool for Word Sense Disambiguation. It works on Polish texts and uses plWordNet as its source of possible senses. -
Test corpus - people
Test corpus of persons -
Big Data language model in Word2Vec CBOW format.
Big Data language model in Word2Vec CBOW format. -
Liner2 events model
Liner2 model for event and event relation recognition