-
Reward Modeling for Scientific Writing Evaluation
The components of this dataset are used in the experiments of the paper "Reward Modeling for Scientific Writing Evaluation". Please see README.md for more information. -
Is this chart lying to me? Automating the detection of misleading visualizations
The Misviz and Misviz-synth datasets accompany the paper "Is this chart lying to me? Automating the detection of misleading visualizations". The datasets contain examples of... -
CORE-T: COherent REtrieval of Tables for Text-to-SQL
We present three preprocessed text-to-SQL benchmarks (BIRD, SPIDER and MMQA). We preprocessed these datasets to follow our open-book setting by merging tables from multiple DBs... -
Aletheia: What Makes RLVR For Code Verifiers Tick?
Multi-domain thinking verifiers trained via Reinforcement Learning from Verifiable Rewards (RLVR) are a prominent fixture of the Large Language Model (LLM) post-training... -
ARR Data Collection Initiative 2025
Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection... -
Dataset for automated material flow characterization of shredded WEEE: RGB-ca...
Three datasets containing data from particles of shredded WEEE, including ferrous metals, non-ferrous metals, plastics and printed circuit boards in two particle size ranges of... -
Aspects in Peer Reviews
The code and files for the paper "Identifying Aspects in Peer Reviews". -
Low Voltage Main Distribution Board (LV MDB) Data of ETA Research Factory - I...
The dataset contains the active electrical power (in kW) measured at the Low Voltage Main Distribution Board (LV MDB) of the ETA Research Factory - Institute for Production... -
Systematic Task Exploration with LLMs: A Study in Citation Text Generation
The components of this dataset are used in the experiments of the paper "Systematic Task Exploration with LLMs: A Study in Citation Text Generation" published at main conference... -
LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversaria...
In this paper, we propose LiOn-XA, an unsupervised domain adaptation (UDA) approach that combines LiDAR-Only Cross-Modal (X) learning with Adversarial training for 3D LiDAR... -
Attribute or Abstain: Large Language Models as Long Document Assistants
This folder contains data to run experiments on LAB, the Long document Attribution Benchmark introduced in "Attribute or Abstain: Large Language Models as Long Document... -
Real-world misleading visualizations QA dataset
The real-world misleading visualization QA dataset accompanies the paper "Protecting multimodal large language models against misleading visualizations". The dataset contains... -
NLPEER: A Unified Resource for the Computational Study of Peer Review
Dataset of peer review reports and paper drafts from diverse domains and venues. We provide multiple versions of the dataset; when in doubt, download the newest version. You can... -
M-Stance: A Multi-Target, Multilingual and Multi-Cultural Stance Detection Da...
M-STANCE is a multilingual, multi-target and multi-cultural stance detection (SD) dataset. It covers social media posts from 2014 and 2019 related to the migration crisis in EU... -
ARR Data Collection Initiative 2024
Dataset of peer review reports, meta-reviews, reviewer-author discussions, and paper drafts collected from ACL Rolling Review within the context of the new data collection... -
NLPEERv2: A Unified Resource for the Computational Study of Peer Review
Dataset of peer review reports and paper drafts from diverse domains and venues. It extends the previous dataset NLPEER to version 2. We provide all sub-versions of the dataset... -
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation ...
Omnidirectional depth perception is essential for mobile robotics applications that require scene understanding across a full 360° field of view. Camera-based setups offer a... -
“Can You Handle the Truth?”: Investigating the Effects of AR-Based Visualizat...
Code and user study dataset for the paper: "“Can You Handle the Truth?”: Investigating the Effects of AR-Based Visualization of the Uncertainty of Deep Learning Models on Users... -
FEDI v2 Dataset
FEDI is the first task-oriented document-grounded dialogue dataset for learning from demographic information, user emotions and implicit user feedback. FEDI v2 improves the... -
Conformal Prediction for Semantically-Aware Autonomous Perception in Urban En...
This repository contains the raw code accompanying the paper "Conformal Prediction for Semantically-Aware Autonomous Perception in Urban Environments", published in the...
