ParlLawSpeech

Parliaments are key institutions of democracy. The documents they produce, including plenary protocols, legislative bills, and ultimately adopted laws, thus contain invaluable information for understanding how democratic governance works, while advances in natural language processing offer ever-increasing potential to extract and analyse this information systematically. Yet despite the significance of such systematic analysis, accessing parliamentary documents often involves considerable hurdles in applied projects (Sebők et al., 2025). To conduct computational text analysis effectively and to compare across legislatures, researchers require comprehensive, consistent, and machine-readable text corpora. ParlLawSpeech (PLS) contributes to and extends recent efforts to make such corpora available to the research community, e.g. the CLARIN project (De Jong et al., 2022), the Comparative Agendas Project (Baumgartner et al., 2019), ParlSpeech (Rauh & Schwalbach, 2020), and Parl-EE (Sylvester et al., 2022). Besides extending the coverage of readily available parliamentary text corpora, PLS innovates in two respects. First, it is not restricted to one type of parliamentary document but provides full texts of speeches, bills, and laws. Second, it offers data linkage across these document types in a way that is typically not offered even by institutional repositories and data providers (Kiss & Sebők, 2022). These two innovations allow users to map and combine different parliamentary outputs, enabling them to address research questions that span the full process of parliamentary decision-making, from draft documents through plenary debates to the content of the finally adopted rules.
In sum, PLS offers machine-readable full-text vectors of parliamentary speeches, bills, and laws, as well as relevant metadata, for seven European countries (Austria, Czech Republic, Croatia, Denmark, Germany, Hungary, and Spain) and the supranational European Parliament (EP).

Keywords: text as data, legislative politics, computational social sciences, parliamentary debates, quantitative text analysis, law-making, parliamentary speech, political communication

Total Universe / Complete enumeration

Web Scraping

Identifier
DOI https://doi.org/10.7802/2824
Source https://search.gesis.org/research_data/SDN-10.7802-2824?lang=de
Metadata Access https://datacatalogue.cessda.eu/oai-pmh/v0/oai?verb=GetRecord&metadataPrefix=oai_ddi25&identifier=926c830eb9b3b8b0e893bfe872a76af2d4d25f3c43f8ad1b2481f65bbde976f3
Provenance
Creator Schwalbach, Jan; Hetzer, Lukas; Proksch, Sven-Oliver; Rauh, Christian; Sebők, Miklós
Publisher GESIS Data Archive for the Social Sciences; GESIS Datenarchiv für Sozialwissenschaften
Publication Year 2025
Funding Reference European Union’s Horizon 2020 program (Grant agreement 951832)
Rights Free access (without registration) - The research data can be downloaded directly by anyone without further limitations. CC BY 4.0: Attribution (https://creativecommons.org/licenses/by/4.0/deed.de)
OpenAccess true
Contact http://www.gesis.org/
Representation
Discipline Social Sciences
Spatial Coverage Austria; Czech Republic; Croatia; Denmark; Germany; Hungary; Spain
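
The Metadata Access link listed under Identifier is an OAI-PMH `GetRecord` request. As a minimal sketch, the following Python snippet splits that URL into its endpoint and query parameters using only the standard library. The helper names `describe_oai_request` and `fetch_record` are hypothetical and introduced here for illustration; the structure of the returned record is not described in this entry, so the sketch returns raw bytes and does not attempt to parse them.

```python
from urllib.parse import urlsplit, parse_qs
from urllib.request import urlopen

# Metadata Access URL, copied from the Identifier block of this record.
METADATA_URL = (
    "https://datacatalogue.cessda.eu/oai-pmh/v0/oai"
    "?verb=GetRecord&metadataPrefix=oai_ddi25"
    "&identifier=926c830eb9b3b8b0e893bfe872a76af2d4d25f3c43f8ad1b2481f65bbde976f3"
)


def describe_oai_request(url: str) -> dict:
    """Split an OAI-PMH request URL into its endpoint and query parameters.

    Hypothetical helper for illustration; it performs no network access.
    """
    parts = urlsplit(url)
    # parse_qs maps each key to a list of values; keep the first value of each.
    query = {key: values[0] for key, values in parse_qs(parts.query).items()}
    return {
        "endpoint": f"{parts.scheme}://{parts.netloc}{parts.path}",
        "verb": query.get("verb"),
        "metadata_prefix": query.get("metadataPrefix"),
        "identifier": query.get("identifier"),
    }


def fetch_record(url: str, timeout: float = 30.0) -> bytes:
    """Download the raw response body for the request (assumes network access).

    Hypothetical helper; it is defined but not called in this sketch.
    """
    with urlopen(url, timeout=timeout) as response:
        return response.read()


request = describe_oai_request(METADATA_URL)
for field, value in request.items():
    print(f"{field}: {value}")
```

Calling `fetch_record(METADATA_URL)` would retrieve the record itself, assuming the endpoint is reachable from the caller's network.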