Dataset abstract
This dataset contains one data file used to create the graphs and tables in the paper "Reflexiones metodológicas y teóricas sobre el análisis de marcadores pragmáticos: ilustraciones a través del estudio de «es que»". It includes 200 tokens of the pragmatic marker es que. These were retrieved from CORMA, a conversational corpus of peninsular Spanish compiled between 2016 and 2019. The data are annotated for: (i) the position of es que in the speech act, (ii) the function of es que on the metadiscursive dimension, (iii) the presence or absence of a function on the modal dimension, (iv) the function of es que on the modal dimension, (v) the subvalue of es que with regard to attenuation.
Article abstract
Although pragmatic markers were considered a marginal linguistic category until the late 1980s, their study has gained considerable attention in recent decades. However, analyzing their pragmatic functions involves multiple challenges. These include the choice between emphasizing macro- or microfunctional categories, deciding on a semasiological or onomasiological approach, addressing their polyfunctionality in specific contexts, and establishing formal criteria to identify concrete pragmatic functions. This study aims to explore these theoretical and methodological options. It is exemplified through a case study of the marker es que, as observed in the colloquial speech of Madrid. Using a representative sample from the CORMA corpus (Corpus Oral de Madrid), it is argued that es que functions as a polyfunctional pragmatic marker with procedural meaning, whose interpretation is shaped by context, and whose analysis requires a multidimensional approach.