The Pile Roberta Large Index

DOI

The repository contains files for a nearest neighbor index of text embeddings for the entire Pile dataset. For more information see: https://github.com/socialfoundations/tttlm

Identifier
DOI https://doi.org/10.17617/3.EJQGAK
Metadata Access https://edmond.mpg.de/api/datasets/export?exporter=dataverse_json&persistentId=doi:10.17617/3.EJQGAK
Provenance
Creator Hardt
Publisher Edmond
Publication Year 2023
OpenAccess true
Contact HARDT(at)IS.MPG.DE
Representation
Language English
Resource Type Dataset
Version 1
Discipline Other