Information structure and historical English OV/VO variation

This dataset contains the data that is used in:

Struik, Tara and Ans van Kemenade. Information structure and OV word order in Old and Middle English: a phase-based approach. To appear in Journal of Comparative Germanic Linguistics

The history of English is characterised by variation in the position of the object with the regard to the verb. Where Present-day English only allows VO word orders, earlier stages display a significant number of OV structures as well, which are lost after the 13th century. It has frequently been suggested that this variation is influenced by information structure. We test the hypothesis that OV orders are the result of the discourse-givenness of the object. Our understanding of what fuels this variation feeds the formal analysis that is proposed; OV orders are derived from a VO base. The syntactic trigger to move objects leftward is lost.

The hypothesis was tested by retrieving subclauses with a finite verb, a non-finite verb and a direct object from historical English corpora (850-1570). The objects in the Old English and early Middle English period were subsequently annotated for information structure and length. This dataset contains the datafiles, divided by period. Translations are included in separate datafiles. Non-referential objects are included in separate datafiles. This dataset also contains the data on quantified and negated objects and the (non-annotated) data from the late Middle English and early Modern English period.

Structure of this dataset: The data collection procedure and codebook is described in datacollection.pdf The annotation scheme is that used to annotate the Old English and early Middle English data is described in annotation.pdf The source materials that are included in the study textsincluded.pdf

Identifier
DOI https://doi.org/10.17026/dans-2z6-67z4
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-m5-dcr5
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:216597
Provenance
Creator Struik, T.
Publisher Data Archiving and Networked Services (DANS)
Contributor Radboud University
Publication Year 2022
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Language English
Resource Type Dataset
Format application/pdf; .txt; .csv
Discipline Humanities