The submission comprises the Women’s Empowerment Pilot Corpus, a curated collection of 30 short texts and dialogue excerpts that trace the communicative journey of empowerment. The corpus is organized along two dimensions: (a) internal dialogue, which captures expressions of self-reflection, emotional recognition, and inner transformation, and (b) external expression, which covers assertion, resistance, self-definition, and boundary-setting. Each utterance is annotated according to a pragmatic-functional schema whose categories include Inner Realization (IR), Resistance (R), Assertive Act (AA), and Identity Redefinition (ID).
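As an illustration only, the two dimensions and four annotation categories described above could be modelled as follows. This is a minimal sketch, not the corpus's actual data model: the class names, the field names, and the `parse_label` helper are hypothetical, and the example utterance is a placeholder rather than corpus text.

```python
from dataclasses import dataclass
from enum import Enum


class Dimension(Enum):
    """The two corpus dimensions named in the description."""
    INTERNAL = "internal dialogue"
    EXTERNAL = "external expression"


class Function(Enum):
    """The four pragmatic-functional categories, keyed by abbreviation."""
    IR = "Inner Realization"
    R = "Resistance"
    AA = "Assertive Act"
    ID = "Identity Redefinition"


@dataclass(frozen=True)
class AnnotatedUtterance:
    # Field names are assumptions made for this sketch.
    text: str
    dimension: Dimension
    function: Function


def parse_label(code: str) -> Function:
    """Map an abbreviation such as "AA" to its category (KeyError if unknown)."""
    return Function[code.strip().upper()]


# Placeholder record; the pairing of dimension and category is illustrative.
example = AnnotatedUtterance(
    text="(placeholder utterance, not taken from the corpus)",
    dimension=Dimension.EXTERNAL,
    function=parse_label("aa"),
)
print(example.function.value)  # Assertive Act
```

Keeping the abbreviations as enum member names means that an unknown label fails loudly instead of passing silently into downstream counts.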
The resource is encoded in TEI/XML and accompanied by CMDI metadata to ensure CLARIN compliance. A JSON version is also provided to facilitate integration into NLP pipelines. The corpus is designed as a proof-of-concept resource that turns theoretical insights from semantics and pragmatics into computationally reusable linguistic data. It contributes to CLARIN-IT by enriching the infrastructure with gender-sensitive communication models and by supporting applications in education, digital humanities, and cross-cultural studies.
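To suggest how the JSON version might be consumed in an NLP pipeline, the sketch below tallies annotation labels over a few records. The record layout (keys `id`, `dimension`, `label`, `text`) is an assumption made for illustration and may differ from the distributed file; the sample records are placeholders, not corpus data.

```python
import json
from collections import Counter

# Hypothetical record layout; the real JSON file may use different keys.
SAMPLE = """
[
  {"id": "u01", "dimension": "internal", "label": "IR", "text": "(placeholder)"},
  {"id": "u02", "dimension": "external", "label": "AA", "text": "(placeholder)"},
  {"id": "u03", "dimension": "internal", "label": "IR", "text": "(placeholder)"}
]
"""


def count_labels(records):
    """Count how often each annotation label occurs."""
    return Counter(record["label"] for record in records)


def by_dimension(records, dimension):
    """Return only the records that belong to the given dimension."""
    return [record for record in records if record["dimension"] == dimension]


records = json.loads(SAMPLE)
counts = count_labels(records)
internal = by_dimension(records, "internal")

print(counts["IR"], counts["AA"])  # 2 1
print(len(internal))               # 2
```

In practice the `SAMPLE` string would be replaced by `json.load` on the distributed file, with the key names adjusted to its actual layout.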