This dataset contains the web-scraped information on the language requirements of about 90,000 job-ads collected within the DISCEFRN project (see metadata section Funding Information). Within DISCEFRN, we collected and analysed the language requirements for all job-ads that were released in one of Norway’s main online job-sites over the course of one year (01/2024-12/2024). We were specifically interested in the prevalence of CEFR-based requirements in the Norwegian labour market, and in the CEFR levels that were required for different professions.
This dataset contains all the web-scraped information, most notably on language requirements (CEFR level, non-subjective formulations) and occupational type (ISCO, SIC). It is a stand-alone dataset and contains all relevant data to re-produce associated publications (See metadata field on Publications) or be reused for other research interests. Yet, it can still be linked to additional DISCEFRN datasets, i.e. the vignette-study data set, where a subsample of employers who advertised these job-ads participated in a survey experiment (https://doi.org/10.18710/6YMZLS).
STATA, Version 17