-
Lexicalised and Non-lexicalized Multi-word Expressions in WordNet: a Cross-enc...
Focusing on recognition of multi-word expressions (MWEs), we address the problem of recording MWEs in WordNet. In fact, not all MWEs recorded in that lexical database could with... -
Adverbs in plWordNet: Theory and Implementation
Adverbs are seldom well represented in wordnets. Princeton WordNet, for example, derives practically all its adverbs, and whatever involvement they have, from adjectives. GermaNet... -
Propagation of emotions, arousal and polarity in WordNet using Heterogeneous S...
In this paper we present a novel method for emotive propagation in a wordnet based on a large emotive seed. We introduce a sense-level emotive lexicon annotated with polarity,... -
Context-sensitive Sentiment Propagation in WordNet
In this paper we present a comprehensive overview of recent methods of sentiment propagation in a wordnet. Next, we propose a fully automated method called Classifier-based... -
Neural Language Models vs Wordnet-based Semantically Enriched Representation ...
Neural language models, including transformer-based models, that are pretrained on very large corpora have become a common way to represent text in various tasks, including... -
Word Sense Disambiguation Based on Iterative Activation Spreading with Contex...
Many knowledge-based solutions have been proposed to solve the Word Sense Disambiguation (WSD) problem with limited annotated resources. Such WSD algorithms are able to cover very large... -
epic-uds
This dataset has no description
-
Tourism Corpus TURK 3.0
The Tourism Corpus TURK 3.0 is a multilingual corpus of tourism-related texts in Slovenian, accompanied by some texts (about 6% of the corpus) in English, Italian and German.... -
CooccurrenceFieldSampler (CFS)
The CooccurrenceFieldSampler (CFS) was developed for sampling from corpora to facilitate lexicographical data analysis. It works with corpora from different sources, text types... -
Lithuanian Hate Speech Corpus v.1
This corpus consists of (1) examples of hate speech based on ethnicity, nationality, or race, and (2) a collection of neutral comments, including both general comments and... -
WordNet-based Data Augmentation for Hybrid WSD Models
Recent advances in Word Sense Disambiguation suggest neural language models can be successfully improved by incorporating knowledge base structure. Such class of models are... -
Discriminating Homonymy from Polysemy in Wordnets: English, Spanish and Polis...
We propose a novel method of homonymy-polysemy discrimination for three Indo-European languages (English, Spanish and Polish). Support vector machines and LASSO logistic... -
Testing Zipf’s meaning-frequency law with wordnets as sense inventories
According to George K. Zipf, more frequent words have more senses. We have tested this law using corpora and wordnets of English, Spanish, Portuguese, French, Polish, Japanese,... -
Extraction and description of multi-word lexical units in plWordNet 3.0
In this paper, we present methods of extraction of multi-word lexical units (MWLUs) from large text corpora and their description in plWordNet 3.0. MWLUs are filtered from... -
Enriching plWordNet with morphology
In the paper, we present the process of adding morphological information to the Polish WordNet (plWordNet). We describe the reasons for this connection and the intuitions behind... -
Wordnet – a Basic Resource for Natural Language Processing: the Case of plWor...
This paper presents a wide scope of wordnet applications on the example of applications of plWordNet – a wordnet of Polish. Wordnets are large lexical-semantic databases... -
plWordNet 4.1 – a Linguistically Motivated, Corpus-based Bilingual Resource
The paper presents the latest release of the Polish WordNet, namely plWordNet 4.1. The most significant developments since version 3.0 include new relations for nouns and verbs,... -
Terminology in WordNet and in plWordNet
We examine the strategies of organizing terminological information in WordNet, and describe an analogous strategy of adding terminological senses of lexical units to plWordNet,... -
Addenda to the inventory of female names in Słowosieć: The case of biskupka ‘...
Due to the dynamic social discussion and the observed increase in the use of feminatives, we deemed it appropriate to modify the current way of describing these units in the... -
The lexicographic description of feminine forms in plWordNet: the current sta...
The aim of the study is to present a method of describing feminine forms (nouns referring to humans with female gender) in plWordNet and to indicate possible directions of its...
