Database of sentences with deeply embedded clauses

DOI

The database was compiled for the Estonian Research Council project STP2 “Exploring Deep Clausal Embeddings in Finno-Ugric.” It contains samples of complex sentences with deeply embedded clauses (DECs) from Estonian, Moksha Mordvin, and Komi Zyryan literary languages (fiction and journalese). DECs are clauses embedded within clauses that are themselves embedded. The samples are organized in Excel files, with each row containing a complex sentence in which each DEC and its superordinate clause are annotated for seven variables, described in Variables.docx.

Andmebaas koostati Eesti Teadusagentuuri projekti STP2 „Sügavale uputatud kõrvallaused soomeugri keeltes“ jaoks. See sisaldab sügavale uputatud kõrvallausetega liitlausete valimeid eesti, mokša ja sürjakomi kirjakeeltest (proosa ja ajakirjandus). Sügavale uputatud lause on kõrvallause, mille pealause on ise kõrvallause. Valimid on Exceli failide kujul, kus igal real on liitlause, milles iga sügavale uputatud lause ja selle pealause on märgendatud seitsme muutuja järgi, mis on kirjeldatud failis Variables.docx.

Identifier
DOI https://datadoi.ee/handle/33/673
Metadata Access https://datadoi.ee/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:datadoi.ee:33/673
Provenance
Creator Kehayov, Petar; Todesk, Triin
Publisher University of Tartu, Institute of Estonian and General Linguistics
Publication Year 2025
Rights info:eu-repo/semantics/openAccess
OpenAccess true
Contact University of Tartu, Institute of Estonian and General Linguistics
Representation
Language English
Resource Type info:eu-repo/semantics/dataset
Format xlsx; docx; txt; text/plain; application/vnd.openxmlformats-officedocument.spreadsheetml.sheet; application/vnd.openxmlformats-officedocument.wordprocessingml.document
Discipline Other