Skip to content
GO TO EUDAT WEBSITE
DATA CATALOGUE
REPOSITORIES
PROJECTS
ABOUT
Service Documentation
EUDAT Core Metadata Schema
EUDAT Support Request
Home
Datasets
Order by
Relevance
Name Ascending
Name Descending
Last Modified
Go
1 dataset found
Keywords:
multilingual corpora
Filter Results
W2C – Web to Corpus – Corpora
A set of corpora for 120 languages automatically collected from wikipedia and the web. Collected using the W2C toolset:
http://hdl.handle.net/11858/00-097C-0000-0022-60D6-1
You can also access this registry using the
API
(see
API Docs
).