-
eacl2026-assessing-paper-novelty
Dataset for evaluating automated novelty assessment in academic papers. Contains 182 ICLR submissions with human annotations, LLM-derived novelty assessments from reviewer... -
CORE-T: COherent REtrieval of Tables for Text-to-SQL
We present three preprocessed text-to-SQL benchmarks (BIRD, SPIDER and MMQA). We preprocessed these datasets to follow our open-book setting by merging tables from multiple DBs... -
Author-in-the-Loop Response Generation and Evaluation: Integrating Author Exp...
Re3Align, a new large-scale dataset for author-in-the-loop response generation, comprising 3.4k complete paper records (review, response, paper and revised paper) with 440k... -
Aletheia: What Makes RLVR For Code Verifiers Tick?
Multi-domain thinking verifiers trained via Reinforcement Learning from Verifiable Rewards (RLVR) are a prominent fixture of the Large Language Model (LLM) post-training... -
ARR Data Collection Initiative 2025
Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection... -
Reward Modeling for Scientific Writing Evaluation
The components of this dataset are used in the experiments of the paper "Reward Modeling for Scientific Writing Evaluation". Please see README.md for more information. -
Exposía: Academic Writing Assessment of Exposés and Peer Feedback
Exposía is a publicly available research dataset that captures the full, pedagogically grounded process of academic writing and feedback in higher education. The dataset... -
Dataset for automated material flow characterization of shredded WEEE: RGB-ca...
Three datasets containing data from particles of shredded WEEE, including ferrous metals, non-ferrous metals, plastics and printed circuit boards in two particle size ranges of... -
Aspects in Peer Reviews
The code and files for the paper "Identifying Aspects in Peer Reviews". -
RevUtil
Providing constructive feedback to paper authors is a core component of peer review. With reviewers increasingly having less time to perform reviews, automated support systems... -
Low Voltage Main Distribution Board (LV MDB) Data of ETA Research Factory - I...
The dataset contains the active electrical power (in kW) measured at the Low Voltage Main Distribution Board (LV MDB) of the ETA Research Factory - Institute for Production... -
Systematic Task Exploration with LLMs: A Study in Citation Text Generation
The components of this dataset are used in the experiments of the paper "Systematic Task Exploration with LLMs: A Study in Citation Text Generation" published at main conference... -
LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversaria...
In this paper, we propose LiOn-XA, an unsupervised domain adaptation (UDA) approach that combines LiDAR-Only Cross-Modal (X) learning with Adversarial training for 3D LiDAR... -
Attribute or Abstain: Large Language Models as Long Document Assistants
This folder contains data to run experiments on LAB, the Long document Attribution Benchmark introduced in "Attribute or Abstain: Large Language Models as Long Document... -
Real-world misleading visualizations QA dataset
The real-world misleading visualization QA dataset accompanies the paper "Protecting multimodal large language models against misleading visualizations". The dataset contains... -
PeerQA: A Scientific Question Answering Dataset from Peer Reviews
We present PeerQA, a real-world, scientific, document-level Question Answering (QA) dataset. PeerQA questions have been sourced from peer reviews, which contain questions that... -
NLPEER: A Unified Resource for the Computational Study of Peer Review
Dataset of peer review reports and paper drafts from diverse domains and venues. We provide multiple versions of the dataset; when in doubt, download the newest version. You can... -
M-Stance: A Multi-Target, Multilingual and Multi-Cultural Stance Detection Da...
M-STANCE is a multilingual, multi-target and multi-cultural stance detection (SD) dataset. It covers social media posts from 2014 and 2019 related to the migration crisis in EU... -
ARR Data Collection Initiative 2024
Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection...
