-
PeerQA-XT
The rapid growth of scientific publications makes it increasingly difficult for researchers to keep up with new findings. Scientific question answering (QA) systems aim to... -
Author-in-the-Loop Response Generation and Evaluation: Integrating Author Exp...
Re3Align, a new large-scale dataset for author-in-the-loop response generation, comprising 3.4k complete paper records (review, response, paper and revised paper) with 440k... -
SciCoQA: Quality Assurance for Scientific Paper--Code Alignment
We present SciCoQA, a dataset for detecting discrepancies between scientific publications and their codebases to ensure faithful implementations. We construct SciCoQA from... -
No Needles Attached? Inferring Energy Metabolism Zones and Lactate Accumulati...
These are the supplementary materials to the publication "No Needles Attached? Inferring Energy Metabolism Zones and Lactate Accumulation from Touchscreen Input". The repository... -
M4FC dataset
M4FC dataset, accompanying the paper "M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset". The dataset contains annotations for 4,982... -
eacl2026-assessing-paper-novelty
Dataset for evaluating automated novelty assessment in academic papers. Contains 182 ICLR submissions with human annotations, LLM-derived novelty assessments from reviewer... -
ARR Data Collection Initiative 2025
Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection... -
Reward Modeling for Scientific Writing Evaluation
The components of this dataset are used in the experiments of the paper "Reward Modeling for Scientific Writing Evaluation". Please see README.md for more information. -
Is this chart lying to me? Automating the detection of misleading visualizations
The Misviz and Misviz-synth datasets accompany the paper "Is this chart lying to me? Automating the detection of misleading visualizations'". The datasets contain examples of... -
CORE-T: COherent REtrieval of Tables for Text-to-SQL
We present three preprocessed text-to-SQL benchmarks (BIRD, SPIDER and MMQA). We preprocessed these datasets to follow our open-book setting by merging tables from multiple DBs... -
Aletheia: What Makes RLVR For Code Verifiers Tick?
Multi-domain thinking verifiers trained via Reinforcement Learning from Verifiable Rewards (RLVR) are a prominent fixture of the Large Language Model (LLM) post-training... -
Exposía: Academic Writing Assessment of Exposés and Peer Feedback
Exposía is a publicly available research dataset that captures the full, pedagogically grounded process of academic writing and feedback in higher education. The dataset... -
ARR Data Collection Initiative 2025
Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection... -
Dataset for automated material flow characterization of shredded WEEE: RGB-ca...
Three datasets containing data from particles of shredded WEEE, including ferrous metals, non-ferrous metals, plastics and printed circuit boards in two particle size ranges of... -
Aspects in Peer Reviews
The code and files for the paper Identifying Aspects in Peer Reviews. -
RevUtil
Providing constructive feedback to paper authors is a core component of peer review. With reviewers increasingly having less time to perform reviews, automated support systems... -
Low Voltage Main Distribution Board (LV MDB) Data of ETA Research Factory - I...
The dataset contains the active electrical power (in kW) measured at the Low Voltage Main Distribution Board (LV MDB) of the ETA Research Factory - Institute for Production... -
Systematic Task Exploration with LLMs: A Study in Citation Text Generation
The components of this dataset are used in the experiments of the paper "Systematic Task Exploration with LLMs: A Study in Citation Text Generation" published at main conference... -
LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversaria...
In this paper, we propose LiOn-XA, an unsupervised domain adaptation (UDA) approach that combines LiDAR-Only Cross-Modal (X) learning with Adversarial training for 3D LiDAR... -
Attribute or Abstain: Large Language Models as Long Document Assistants
This folder contains data to run experiments on LAB, the Long document Attribution Benchmark introduced in "Attribute or Abstain: Large Language Models as Long Document...
