enTenTen

PID

Very large English web corpus enTenTEn, comprising 3,268,798,627 tokens.

Identifier
PID http://hdl.handle.net/11858/00-097C-0000-0001-CCDF-8
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-CCDF-8
Provenance
Creator (:unav) Unknown author
Publisher Masaryk University, NLP Centre
Publication Year 2011
Rights NLP Centre Web Corpus License; https://lindat.mff.cuni.cz/repository/static/license-NLPC-WeC.html; ACA
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language English
Resource Type corpus
Format application/octet-stream; application/x-gzip; downloadable_files_count: 1
Discipline Linguistics