- FEDI v2 Dataset
FEDI is the first task-oriented document-grounded dialogue dataset for learning from demographic information, user emotions and implicit user feedback. FEDI v2 improves the...
- NLPEERv2: A Unified Resource for the Computational Study of Peer Review
Dataset of peer review reports and paper drafts from diverse domains and venues. It extends the previous dataset NLPEER to version 2. We provide all sub-versions of the dataset...
- ARR Data Collection Initiative 2024
Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection...
- NLPEER: A Unified Resource for the Computational Study of Peer Review
Dataset of peer review reports and paper drafts from diverse domains and venues. We provide multiple versions of the dataset; when in doubt, download the newest version. You can...
- Real-world misleading visualizations QA dataset
The real-world misleading visualization QA dataset accompanies the paper "Protecting multimodal large language models against misleading visualizations". The dataset contains...
- Attribute or Abstain: Large Language Models as Long Document Assistants
This folder contains data to run experiments on LAB, the Long document Attribution Benchmark introduced in "Attribute or Abstain: Large Language Models as Long Document...
- Systematic Task Exploration with LLMs: A Study in Citation Text Generation
The components of this dataset are used in the experiments of the paper "Systematic Task Exploration with LLMs: A Study in Citation Text Generation" published at main conference...