A terminological "journey" in the Grey Literature domain

Please cite as: Bartolini R., Pardelli G., Goggi S., Giannini S., Biagioni S. A terminological "journey" in the Grey Literature domain. In: GL18 - Eighteenth International Conference on Grey Literature: Leveraging Diversity in Grey Literature. (New York, USA, 28-29 November 2016). Proceedings, pp. 117-130. Dominic Farace, Jerry Frantzen (eds.). (GL-Conference series. ISSN: 1385-2308, vol. 18). TextRelease, Amsterdam, The Netherlands, 2017.

The work analyzes a corpus consisting of all the full research papers published in the GL conference series over more than a decade (2003-2014), with the aim of creating a terminological map of the relevant words in the various GL research topics. The corpus, made up of 231 research papers, was processed using a Natural Language Processing (NLP) tool for term extraction. This tool is a “pipeline”, that is, a sequence of different tools, which extracts lexical knowledge from texts: in short, it is a rule-based system for knowledge extraction and document indexing that combines NLP technologies for term extraction. From our corpus of GL articles, the tool extracts a list of single-word terms (monograms) and multi-word terms (bigrams and trigrams), ordered by frequency with respect to the context.
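The kind of output described above (monograms, bigrams and trigrams ordered by frequency) can be illustrated with a minimal Python sketch. This is not the rule-based pipeline used by the authors: it is plain frequency counting over lowercase word tokens, with no linguistic analysis, and the function names and the sample sentence are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def ngram_counts(tokens, n):
    """Count every contiguous sequence of n tokens."""
    return Counter(
        " ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
    )


def extract_terms(text, max_n=3):
    """Return, for n = 1..max_n, the n-grams sorted by descending frequency."""
    tokens = tokenize(text)
    return {
        n: ngram_counts(tokens, n).most_common()
        for n in range(1, max_n + 1)
    }


# Hypothetical sample text, not taken from the GL corpus.
sample = (
    "Grey literature repositories index grey literature. "
    "Grey literature matters."
)
terms = extract_terms(sample)

for n, ranked in terms.items():
    print(n, ranked[:3])
```

In this sketch n-grams may span sentence boundaries and function words are not filtered out. A real term extractor would typically add part-of-speech patterns and a relevance measure on top of raw counts.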

Identifier
DOI https://doi.org/10.17026/dans-z2n-8et3
Metadata Access https://ssh.datastations.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.17026/dans-z2n-8et3
Provenance
Creator SG Giannini
Publisher DANS Data Station Social Sciences and Humanities
Contributor S.G. Giannini; RB Bartolini (CNR-ILC, Pisa, Italy); GP Pardelli (CNR-ILC, Pisa, Italy); SG Goggi (CNR-ILC, Pisa, Italy); SB Biagioni (CNR-ISTI, Pisa, Italy)
Publication Year 2017
Rights CC BY 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Contact S.G. Giannini (National Research Council, Pisa, Italy)
Representation
Resource Type Dataset
Format text/csv; application/vnd.openxmlformats-officedocument.spreadsheetml.sheet; application/zip; application/pdf
Size 52075; 4191; 14201; 16602356; 590490; 5529; 1111383; 10438195; 24508; 446553; 24646
Version 2.1
Discipline Humanities
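
The Metadata Access link in the record above is an OAI-PMH GetRecord request. As a minimal sketch, the same URL can be assembled from its three query parameters with the Python standard library. The base URL, parameter names and values are copied from the record; the helper name `get_record_url` is hypothetical, and the sketch only builds and parses the URL without sending a request.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint and DOI as they appear in the record above.
BASE_URL = "https://ssh.datastations.nl/oai"
DOI = "10.17026/dans-z2n-8et3"


def get_record_url(doi, metadata_prefix="oai_datacite"):
    """Build the GetRecord URL for one DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # Keep ':' and '/' unescaped so the result matches the record's link.
    return f"{BASE_URL}?{urlencode(params, safe=':/')}"


url = get_record_url(DOI)

# Split the URL back into its query parameters to inspect them.
query = parse_qs(urlsplit(url).query)
print(url)
print(query)
```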