Vis enkel innførsel

dc.contributor.authorThuestad, Jens Andreas
dc.contributor.authorGrutle, Øyvind
dc.date.accessioned2023-08-10T06:13:23Z
dc.date.available2023-08-10T06:13:23Z
dc.date.issued2023-06-01
dc.date.submitted2023-08-08T22:00:25Z
dc.identifier.urihttps://hdl.handle.net/11250/3083251
dc.description.abstractThis thesis is part of the larger project “AI-Support in Medical Emergency Calls (AISMEC)”, which aims to develop a decision support system for Emergency Medical Communication Center (EMCC) operators to better identify and respond to acute brain stroke. The system will utilize historical health data and the transcription from the emergency call to assist the EMCC operator in whether or not to dispatch an ambulance and with what priority and urgency. Our research primarily focuses on adapting the Automatic Speech Recognition (ASR) model, Whisper, to create a robust and accurate ASR model to transcribe Norwegian emergency calls. The model was fine-tuned on simulated emergency calls and recordings done by ourselves. Furthermore, a proof-of-concept ASR web application was developed with the goal of streamlining the manual task of transcribing emergency calls. After demonstrating the application to the involved researchers in AISMEC, and the potential users, both suggested optimism about the potential of this solution to streamline the transcription process. As part of our research, we conducted an experiment where we utilized the suggested transcriptions provided by the application and then corrected them for accuracy. This approach showed a notable reduction in our transcription time. We also found that establishing a machine learning pipeline to fine-tune the model on historical emergency calls was feasible. Further work would involve training the model on actual emergency calls. To investigate the efficiency of the ASR web application further, a larger scale of the semi-automatic transcription experiment could be conducted by the professional audio transcribers at Haukeland universitetssjukehus.
dc.language.isoeng
dc.publisherThe University of Bergen
dc.rightsCopyright the Author. All rights reserved
dc.titleSpeech-to-text models to transcribe emergency calls
dc.typeMaster thesis
dc.date.updated2023-08-08T22:00:25Z
dc.rights.holderCopyright the Author. All rights reserved
dc.description.degreeMaster's Thesis in Joint Master's Programme in Software Engineering - collaboration with HVL
dc.description.localcodePROG399
dc.description.localcodeMAMN-PROG
dc.subject.nus754199
fs.subjectcodePROG399
fs.unitcode12-12-0


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel