Dataset - B2FIND

Author-in-the-Loop Response Generation and Evaluation: Integrating Author Exp...

Re3Align, a new large-scale dataset for author-in-the-loop response generation, comprising 3.4k complete paper records (review, response, paper and revised paper) with 440k...
SciCoQA: Quality Assurance for Scientific Paper--Code Alignment

We present SciCoQA, a dataset for detecting discrepancies between scientific publications and their codebases to ensure faithful implementations. We construct SciCoQA from...
eacl2026-assessing-paper-novelty

Dataset for evaluating automated novelty assessment in academic papers. Contains 182 ICLR submissions with human annotations, LLM-derived novelty assessments from reviewer...
Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Rev...

A dataset of aligned scientific paper revisions manually labeled according to their action and intent, and supplemented with the respective peer reviews and human-written edit...
Are Large Language Models Good Classifiers? A Study on Edit Intent Classifica...

Re3-Sci2.0, a new large-scale dataset of 1,780 scientific document revisions with over 94k labeled edits intent. This dataset is a supplement to the EMNLP24 paper: Are Large...
LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews

We release the dataset associated with our paper "LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews".

You can also access this registry using the API (see API Docs).

6 datasets found