Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection initiative (https://arr-data.aclweb.org/protocol/). All included data is explicitly licensed by the authors and reviewers for publication. This dataset is not meant for commercial purposes. This dataset should not be used for pre-training of neural models such as large language models.
V1 contains accepted paper data from COLING 2025 and NAACL 2025.
V1.1 and V1.1.1 contain accepted paper data from ACL 2025 (V1.1.1 contatins data from ARR 2024 December and ARR 2025 February, and V1.1 contains only data from ARR 2025 February).
V1.2 contains ARR 2024 April and June submissions that did not appear in EMNLP 2024, released after a one-year grace period with explicit author consent.
V1.3 contains accepted paper data from EMNLP 2025.