Author profiling resources
The zip file contains all the resources generated during the development of the thesis. On the one hand, there is the code, which can be used to extract the set of features as...
How2Sign: a large-scale multimodal dataset for continuous American Sign Language
How2Sign consists of a parallel corpus of 80 hours of sign language videos (collected with multi-view RGB and depth sensor data) with corresponding speech transcriptions and...
Replication Data for: Sign language translation for instructional videos
This repo contains the I3D data used for the paper "Sign Language Translation from Instructional Videos". Together with the data and the model weights, we also provide the .tsv...
MARD: Multimodal Album Reviews Dataset
MARD contains texts and accompanying metadata originally obtained from a much larger dataset of Amazon customer reviews, which have been enriched with music metadata from...
Supplementary Model Files for "Tasty Burgers, Soggy Fries: Probing Aspect Rob...
Paper: "Tasty Burgers, Soggy Fries: Probing Aspect Robustness in Aspect-Based Sentiment Analysis" (EMNLP 2020) by Xiaoyu Xing, Zhijing Jin, Di Jin, Bingning Wang, Qi Zhang, and...
Supplementary data for Corr2Cause: "Can Large Language Models Infer Causation...
Paper: "Can Large Language Models Infer Causation from Correlation?" (2023) by Zhijing Jin, Jiarui Liu, Zhiheng Lyu, Spencer Poff, Mrinmaya Sachan, Rada Mihalcea, Mona Diab,...
CLadder: Assessing Causal Reasoning in Language Models
Paper: "CLadder: Assessing Causal Reasoning in Language Models" (NeurIPS 2023) by Zhijing Jin, Yuen Chen, Felix Leeb, Luigi Gresele, Ojasv Kamal, Zhiheng Lyu, Kevin Blin,...
GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsuper...
Paper: "GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation" (COLING 2020) by Zhijing Jin, Qipeng Guo, Xipeng Qiu, and...