-
Author-in-the-Loop Response Generation and Evaluation: Integrating Author Exp...
Re3Align, a new large-scale dataset for author-in-the-loop response generation, comprising 3.4k complete paper records (review, response, paper and revised paper) with 440k... -
SciCoQA: Quality Assurance for Scientific Paper--Code Alignment
We present SciCoQA, a dataset for detecting discrepancies between scientific publications and their codebases to ensure faithful implementations. We construct SciCoQA from... -
eacl2026-assessing-paper-novelty
Dataset for evaluating automated novelty assessment in academic papers. Contains 182 ICLR submissions with human annotations, LLM-derived novelty assessments from reviewer... -
Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Rev...
A dataset of aligned scientific paper revisions manually labeled according to their action and intent, and supplemented with the respective peer reviews and human-written edit... -
Are Large Language Models Good Classifiers? A Study on Edit Intent Classifica...
Re3-Sci2.0, a new large-scale dataset of 1,780 scientific document revisions with over 94k labeled edits intent. This dataset is a supplement to the EMNLP24 paper: Are Large... -
LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews
We release the dataset associated with our paper "LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews".
