The dataset was created using a large number of Serbian Legislation texts gathered from the https://www.pravno-informacioni-sistem.rs/ website. The gathered texts were used for fine-tuning a neural network called SRBerta on the masked language modeling task. The dataset contains texts which are part of the following legislation categories:
• Constitution of the Republic of Serbia and state regulation
• Justice
• Defense, military and internal affairs
• Public incomes
• Monetary system, financial organizations and business