English-Slovene term candidates KAS-biterm 1.0

PID

KAS-biterm is an automatically generated glossary of English terms with their translations into Slovene. The pairs, possibly with their English and Slovene acronyms, were extracted from the Corpus of Academic Slovene KAS 1.0 (http://hdl.handle.net/11356/1244), where they have been annotated with the kas-biterm tool (https://github.com/clarinsi/kas-biterm) trained on the Bilingual terminology extraction dataset KAS-biterm 1.0 (http://hdl.handle.net/11356/1199). Note that only Query 1 was used for pre-selection of the sentences and for training the tool, and that the bi-lingual terms from the KAS corpus have been filtered to remove noise. The glossary is encoded in TEI-Lex0 (https://github.com/DARIAH-ERIC/lexicalresources) and gives, for each entry, also up to three examples of use, together with their bibliographic information. Various parts of the lexical entries also have links to the appropriate queries to CLARIN.SI noSketch Engine conconrdancer. The TEI encoded corpus is also available in a variant that is a much smaller document as it does not contain the examples of use and links.

Identifier
PID http://hdl.handle.net/11356/1263
Related Identifier http://www.sdjt.si/wp/wp-content/uploads/2018/09/JTDH-2018_Ljubesic-et-al_KAS-term-and-KAS-biterm-Datasets-and-baselines-for-monolingual-and-bilingual-terminology-extraction-from-academic-writing.pdf
Related Identifier http://nl.ijs.si/kas/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1263
Provenance
Creator Erjavec, Tomaž; Ljubešić, Nikola; Fišer, Darja
Publisher Jožef Stefan Institute
Publication Year 2020
Rights Creative Commons - Attribution 4.0 International (CC BY 4.0); https://creativecommons.org/licenses/by/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene; English
Resource Type lexicalConceptualResource
Format application/zip; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics