-
Source code and data for the PhD Thesis "Metrics of Graph-Based Meaning Repre...
This dataset contains source code and data used in the PhD thesis "Metrics of Graph-Based Meaning Representations with Applications from Parsing Evaluation to Explainable NLG... -
Source code and data for the PhD Thesis "Measuring the Contributions of Visio...
This dataset contains source code and data used in the PhD thesis "Measuring the Contributions of Vision and Text Modalities in Multimodal Transformers". The dataset is split... -
Training and development dataset for information extraction in plant epidemio...
The “Training and development dataset for information extraction in plant epidemiomonitoring” is the annotation set of the “Corpus for the epidemiomonitoring of plant”. The... -
KPWr chunks 2021
357 documents from KPWr corpus annotated manually at syntactic level (chunks). Please cite as: Oleksy, M., Walentynowicz, W., & Wieczorek, J. (2021). New approach to the... -
Terminological dictionary of artificial intelligence
The terminological dictionary was compiled within the framework of the project Development of Slovene in the Digital Environment. It is an example collection of 413 terms from... -
Slovenian commonsense reasoning model SloMET-ATOMIC 2020
SloMET-ATOMIC 2020 is a Slovene commonsense reasoning model that can predict commonsense descriptions in natural language for a given input sentence. The model is... -
Corpus for identifying sex education concepts SemSex 1.0
The SemSex corpus is designed to facilitate the automated recognition of sexual education concepts within curriculum description documents. The corpus contains two components:... -
Pretrained models for recognising sex education concepts SemSEX 1.0
Pretrained language models for detecting and classifying the presence of sex education concepts in Slovene curriculum documents. The models are PyTorch neural network models,... -
Natural Language 2 Semantic Hypergraph Dataset NL2SH 1.0
The NL2SH (Natural Language to Semantic Hypergraph) dataset can be used to build and evaluate methods for knowledge extraction and representation based on a semantic hypergraph.... -
Slovene translation of the SQuAD2.0 dataset
The Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to... -
Extensions to the Slovene translation of SuperGLUE
SuperGLUE is a benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a public leaderboard. It is comprised of 8... -
Temporal Model mBERT_Tweets
Temporal Model "mBERT" finetuned on the "Tweets" dataset to solve the tasks of extraction and classification of temporal entities. Model produced in the... -
Temporal Model BERT-Large_Fullpate
Temporal Model "BERT-Large" finetuned on the "Fullpate" dataset to solve the tasks of extraction and classification of temporal entities. Model produced in... -
UIE Base Model
Clean UIE-Base model in English as proposed in the original paper "Unified Structure Generation for Universal Information Extraction" [Lu et al., 2022]. -
Temporal Model mBERT_WikiWars
Temporal Model "mBERT" finetuned on the "WikiWars" dataset to solve the tasks of extraction and classification of temporal entities. Model produced in the... -
Temporal Model BERT-Base_Tweets
Temporal Model "BERT-Base" finetuned on the "Tweets" dataset to solve the tasks of extraction and classification of temporal entities. Model produced in the... -
Temporal Model BERT-Large_Tweets
Temporal Model "BERT-Large" finetuned on the "Tweets" dataset to solve the tasks of extraction and classification of temporal entities. Model produced in the... -
Temporal Model BERT-Large_WikiWars
Temporal Model "BERT-Large" finetuned on the "WikiWars" dataset to solve the tasks of extraction and classification of temporal entities. Model produced in...