Does standardization matter? Evaluating the potential of the Common European Framework of Reference for Languages (CEFR) to foster labour market inclusion of immigrants (DISCEFRN): Web-scraped dataset

Dataset

DOI

This dataset contains the web-scraped information on the language requirements of about 90,000 job-ads collected within the DISCEFRN project (see metadata section Funding Information). Within DISCEFRN, we collected and analysed the language requirements for all job-ads that were released in one of Norway’s main online job-sites over the course of one year (01/2024-12/2024). We were specifically interested in the prevalence of CEFR-based requirements in the Norwegian labour market, and in the CEFR levels that were required for different professions. This dataset contains all the web-scraped information, most notably on language requirements (CEFR level, non-subjective formulations) and occupational type (ISCO, SIC). It is a stand-alone dataset and contains all relevant data to re-produce associated publications (See metadata field on Publications) or be reused for other research interests. Yet, it can still be linked to additional DISCEFRN datasets, i.e. the vignette-study data set, where a subsample of employers who advertised these job-ads participated in a survey experiment (https://doi.org/10.18710/6YMZLS).

STATA, Version 17

Identifier
DOI	https://doi.org/10.18710/K6WA0V
Metadata Access	https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/K6WA0V

Provenance
Creator	Schmaus, Miriam
Publisher	DataverseNO
Contributor	Schmaus, Miriam; Miriam Schmaus; Western Norway University of Applied Sciences
Publication Year	2025
Funding Reference	The European Union's Marie Skłodowska-Curie Actions (MSCA, Horizon Europe Actions) 101065566
Rights	CC0 1.0; info:eu-repo/semantics/restrictedAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess	false
Contact	Schmaus, Miriam (Western Norway University of Applied Sciences)

Representation
Resource Type	Encoded Data; Dataset
Format	text/plain; application/pdf; text/comma-separated-values; application/x-stata
Size	4548; 220645; 203707; 32474164; 198853252
Version	1.2
Discipline	Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Life Sciences; Social Sciences; Social and Behavioural Sciences; Soil Sciences
Spatial Coverage	Norway