The data is part of our JCDL paper with the title "Diachronic Analysis of German Parliamentary Proceedings: Ideological Shifts through the Lens of Political Biases". This is the raw data underlying our corpus, the German Reichs- und Bundestagsprotokolle. It was crawled from https://www.reichstagsprotokolle.de/ and https://www.bundestag.de/protokolle Code can be found here: https://github.com/umanlp/crosstemporal_bias
In our revision of the data, we (i) removed XML and (ii) corrected obvious OCR errors (e.g., negation sign instead of dash in line ends). Further modifications are indicated in the accompanying paper.