StateParl includes the speeches in all 16 German state parliaments between 2000 and 2022. The database consists of 9,531,215 paragraphs and 345,068,110 words. Stenographic protocols of German state parliaments have become an important source for researchers, students, and journalists. StateParl integrates these protocols in an accessible, coherent, and machine-readable format that enables systematic applications. The codebook contains information about each variable in the database. The documentation explains how StateParl was developed. It outlines the acquisition of data and the processing methodology, and it then presents the validation of the database as well as its advantages and (current) limitations. It is important to note that this release of StateParl should be considered as the first beta release of the database. Future iterations will aim to address current limitations and further enhance the quality and scope of the database.
Vollerhebung