Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"

Twenty participants installed and interacted with a thematic analysis coding assistant (TACA), an interactive machine learning desktop application designed to train a classifier on user-defined coded datasets and generate additional coding suggestions. Interviews were conducted with the participants after they had interacted with the tool for 20 minutes, or until they perceived no further benefit. The questions aimed to understand the participants' experience with TACA and their perceptions of the ML model.

The coded_transcripts.docx file contains the anonymised interview transcripts, with codes appearing as comments. The document is split into Study 1 (5 participants) and Study 2 (15 participants). The participants in Study 1 imported their own dataset into TACA, while the participants in Study 2 used a set of newspaper restaurant reviews given to them by the researchers. Participant IDs follow the structure "S[study number]_P[participant number]", e.g. "S2_P1".

The themes.csv file lists all the codes under their corresponding themes, the result of conducting thematic analysis on the interview transcripts.

The restaurant_reviews.docx file is the collection of 21 restaurant reviews from the newspaper The Guardian (Restaurants + Reviews | Food | The Guardian) that was given to the 15 participants (of 20) who did not have their own dataset available for the study.

The logs folder contains, for each participant, an anonymised log file of their interactions with the TACA interface, named with the corresponding participant ID. The interaction logs for participants S1_P4 and S2_P5 are missing due to a data storage issue.
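The participant ID scheme and the two missing logs described above can be handled programmatically. The following Python sketch is illustrative only and is not part of the dataset: the names `parse_participant_id` and `expected_log_ids` are hypothetical, and it assumes that participant numbers within each study start at 1 and run without gaps, which the description does not state explicitly. It makes no assumption about the log file extension.

```python
import re

# Pattern for IDs of the form "S[study number]_P[participant number]".
ID_PATTERN = re.compile(r"^S(\d+)_P(\d+)$")

# Participants per study, as stated in the description.
STUDY_SIZES = {1: 5, 2: 15}

# Logs reported missing due to a data storage issue.
MISSING_LOGS = {"S1_P4", "S2_P5"}


def parse_participant_id(pid):
    """Return (study, participant) as integers, or raise ValueError."""
    match = ID_PATTERN.match(pid)
    if match is None:
        raise ValueError(f"not a participant ID: {pid!r}")
    return int(match.group(1)), int(match.group(2))


def expected_log_ids():
    """List the IDs that should have a log file.

    Assumption: numbering starts at 1 in each study and has no gaps.
    """
    ids = [
        f"S{study}_P{number}"
        for study, size in sorted(STUDY_SIZES.items())
        for number in range(1, size + 1)
    ]
    return [pid for pid in ids if pid not in MISSING_LOGS]
```

Under these assumptions, `expected_log_ids()` yields 18 IDs, which can be compared against the file names actually present in the logs folder.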

Identifier
DOI https://doi.org/10.5522/04/28182962.v1
Related Identifier HasPart https://ndownloader.figshare.com/files/51595142
Related Identifier HasPart https://ndownloader.figshare.com/files/51595190
Related Identifier HasPart https://ndownloader.figshare.com/files/51595217
Related Identifier HasPart https://ndownloader.figshare.com/files/51596465
Related Identifier HasPart https://ndownloader.figshare.com/files/51596468
Related Identifier HasPart https://ndownloader.figshare.com/files/51596471
Related Identifier HasPart https://ndownloader.figshare.com/files/51596474
Related Identifier HasPart https://ndownloader.figshare.com/files/51596477
Related Identifier HasPart https://ndownloader.figshare.com/files/51596480
Related Identifier HasPart https://ndownloader.figshare.com/files/51596483
Related Identifier HasPart https://ndownloader.figshare.com/files/51596486
Related Identifier HasPart https://ndownloader.figshare.com/files/51596489
Related Identifier HasPart https://ndownloader.figshare.com/files/51596492
Related Identifier HasPart https://ndownloader.figshare.com/files/51596495
Related Identifier HasPart https://ndownloader.figshare.com/files/51596498
Related Identifier HasPart https://ndownloader.figshare.com/files/51596501
Related Identifier HasPart https://ndownloader.figshare.com/files/51596504
Related Identifier HasPart https://ndownloader.figshare.com/files/51596507
Related Identifier HasPart https://ndownloader.figshare.com/files/51596510
Related Identifier HasPart https://ndownloader.figshare.com/files/51596513
Related Identifier HasPart https://ndownloader.figshare.com/files/51596516
Metadata Access https://api.figshare.com/v2/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:figshare.com:article/28182962
Provenance
Creator Milana, Federico (ORCID: 0009-0000-8890-268X); Costanza, Enrico; Musolesi, Mirco; Ayobi, Amid
Publisher University College London (UCL)
Contributor Figshare
Publication Year 2025
Rights https://creativecommons.org/publicdomain/zero/1.0/
OpenAccess true
Contact researchdatarepository(at)ucl.ac.uk
Representation
Language English
Resource Type Dataset
Discipline Other