CASH - Corpus management Annotation and SearcH

PID

CASH (Corpus, Annotation, and SearcH server) is a back-end software for managing text collections, annotations, and associated metadata. The system was developed to handle richly annotated document collections, including both primary texts and extensive metadata related to their historical and contextual information. Its native use case is to deal with a corpus of EpiDoc XML digital critical editions of archaic inscriptions, but it can ingest also CoNLL-x and plain text. CASH is designed to be modular and extensible in multiple ways, including document ingestion, annotation and metadata semantics, data export, and multi-level queries. The back-end services expose APIs documented via Swagger. CASH was developed in the context of the PRIN 2017 project "Languages and Cultures of Ancient Italy. Historical Linguistics and Digital Models". ILC supervisor: Valeria Quochi.

Identifier
PID http://hdl.handle.net/20.500.11752/ILC-1028
Related Identifier https://github.com/DigItAnt/CASH-server
Metadata Access http://dspace-clarin-it.ilc.cnr.it/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:dspace-clarin-it.ilc.cnr.it:20.500.11752/ILC-1028
Provenance
Creator Tommasi, Alessandro; Zavattari, Cesare
Publisher Istituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR)
Publication Year 2024
OpenAccess true
Contact dspace-clarin-it-ilc-help(at)ilc.cnr.it
Representation
Resource Type toolService
Format downloadable_files_count: 0
Discipline Linguistics