Dataset - B2FIND

COSTRA 1.0: A Dataset of Complex Sentence Transformations

COSTRA 1.0 is a dataset of Czech complex sentence transformations. The dataset is intended for the study of sentence-level embeddings beyond simple word alternations or standard...

COSTRA 1.1: A Dataset of Complex Sentence Transformations and Comparisons

COSTRA 1.1 is a new dataset for testing geometric properties of sentence embedding spaces. In particular, it concentrates on examining how well sentence embeddings capture...

You can also access this registry using the API (see API Docs).

2 datasets found