Wikipedia Human Medicine Corpus

Wikipedia Human Medicine Corpus is a bilingual—Spanish-English—single-label corpus composed of 2,143 documents extracted from Wikipedia about human medicine written in English, and 469 documents written in Spanish, classified into the following 22 categories: Alternative medicine, Cardiology, Endocrinology, Forensics, Gastroenterology, Human genetics, Geriatrics, Gerontology, Gynecology, Hematology, Nephrology, Neurology, Obstetrics, Oncology, Ophthalmology, Orthopedical surgical procedures, Pathology, Pediatrics, Psychiatry, Rheumatology, Surgery and Urology.

Identifier
DOI https://doi.org/10.17632/sp9mcx5594.2
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-3noz-i2
Source https://nbn-resolving.org/urn:nbn:nl:ui:13-3noz-i2
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:75325
Provenance
Creator Mouriño García, M (via Mendeley Data)
Publisher Data Archiving and Networked Services (DANS)
Contributor Marcos Mouriño García
Publication Year 2017
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/publicdomain/zero/1.0; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Representation
Resource Type Dataset
Discipline Other