-
AspectEmo 1.0: Multi-Domain Corpus of Consumer Reviews for Aspect-Based Senti...
The AspectEmo 1.0 corpus is an extended version of the publicly available PolEmo 2.0 corpus of Polish customer reviews, which was used in many projects on the use of different methods... -
ELMo Embeddings for Polish
A model of ELMo embeddings for the Polish language, trained on large textual corpora (KGR10). To retrain the model, please use the checkpoint and vocabulary files available at:... -
Korpus przemówień przedwyborczych Baracka Obamy (Corpus of Barack Obama's Pre-Election Speeches)
A text corpus of Barack Obama's speeches from 2006-2015. -
fronda
Some texts from fronda.pl -
Wordlist of the Contemporary Corpus of Lithuanian Language in the Face of War...
We present a comparative wordlist based on the Corpus of the Contemporary Lithuanian Language (CCLL2 version 2, pre-2020), supplemented by the media (courtesy of the news... -
English-French-Lithuanian Parallel Corpus of EU Financial Documents
The corpus comprises 154 EU legislative documents (English documents and their translations into French and Lithuanian) related to various financial issues and enacted in... -
Wordlist of Lemmas from the Joint Corpus of Lithuanian
The resource is a wordlist of lemmas from the Joint Corpus of Lithuanian (JCL). The JCL is a merger of three corpora: 1) the Vilnius University corpus compiled from the Lithuanian... -
Lithuanian 4-gram dataset
Dataset of 4-grams with frequencies extracted from the Delfi.lt corpus (~70 million words, period: March 2014 - November 2016). First, the corpus was split into sentences, then symbol... -
Corpus KLASIUS v.02
900 extracts for the corpus were collected from manuals and publications for secondary school students included in the compulsory bibliographic descriptions of the university... -
Lithuanian 2-gram dataset
Dataset of 2-grams with frequencies extracted from the Delfi.lt corpus (~70 million words, period: March 2014 - November 2016). First, the corpus was split into sentences, then symbol... -
Lithuanian Treebank ALKSNIS
ALKSNIS v2.1 consists of 2,355 syntactically annotated sentences in the PML (Prague Markup Language) format. The format allows researchers to visualise and edit... -
Lithuanian Spelling Checker V.1.0.45 for LibreOffice and OpenOffice
Lithuanian spelling checker for LibreOffice / OpenOffice, version 1.0.45 (2020-04-09). -
Lithuanian Spelling Checker V.1.0.45 for Linux
Lithuanian spelling checker for Linux, version 1.0.45 (2020-04-07). -
LITIS v.1
Corpus of user-generated comments collected from two Lithuanian portals: www.delfi.lt and www.lrytas.lt. Each comment is in a separate file (TXT). Each file contains: a comment,... -
Frequency lists of pivot words and GSE counts
The resource contains data used to estimate the number of words in Lithuanian texts indexed by the selected Global Search Engines (GSE), namely Google (by Alphabet Inc.), Bing... -
Lithuanian Corpus of the EU Primary and Secondary Law Acts of the Period 2015...
A 274,460-word corpus comprising selected primary and secondary law acts of the EU from the period 2015-2017. The corpus was compiled from documents containing words with the root... -
Lithuanian font family AISTIKA
Original TrueType font designed and hinted in Lithuania. The font complies with the ISO/IEC 10646 (Unicode) standard and has the full set of casual and accented Lithuanian... -
Lithuanian 3-gram dataset
Dataset of 3-grams with frequencies extracted from the Delfi.lt corpus (~70 million words, period: March 2014 - November 2016). First, the corpus was split into sentences, then symbol... -
Lithuanian 1-gram dataset
Dataset of 1-grams with frequencies extracted from the Delfi.lt corpus (~70 million words, period: March 2014 - November 2016). First, the corpus was split into sentences, then... -
Lithuanian Spelling Checker V.1.0.45 for macOS
Lithuanian spelling checker for macOS, version 1.0.45 (2020-04-10).
