Vis enkel innførsel

dc.contributor.authorSheikhi, Ghazaal
dc.contributor.authorOpdahl, Andreas Lothe
dc.contributor.authorTouileb, Samia
dc.contributor.authorSetty, Vinay
dc.date.accessioned2023-12-20T11:57:32Z
dc.date.available2023-12-20T11:57:32Z
dc.date.created2023-09-21T15:56:59Z
dc.date.issued2023
dc.identifier.issn1613-0073
dc.identifier.urihttps://hdl.handle.net/11250/3108380
dc.description.abstractAnalysing long text documents of political discourse to identify check-worthy claims (claim detection) is known to be an important task in automated fact-checking systems, as it saves the precious time of fact-checkers, allowing for more fact-checks. However, existing methods use black-box deep neural NLP models to detect check-worthy claims, which limits the understanding of the model and the mistakes they make. The aim of this study is therefore to leverage an explainable neural NLP method to improve the claim detection task. Specifically, we exploit well known integrated gradient-based input reduction on textCNN and BiLSTM to create two different reduced claim data sets from ClaimBuster. We observe that a higher recall in check-worthy claim detection is achieved on the data reduced by BiLSTM compared to the models trained on claims. This is an important remark since the cost of overlooking check-worthy claims is high in claim detection for fact-checking. This is also the case when a pre-trained BERT sequence classification model is fine-tuned on the reduced data set. We argue that removing superfluous tokens using explainable NLP could unlock the true potential of neural language models for claim detection, even though the reduced claims might make no sense to humans. Our findings provide insights on task formulation, design of annotation schema and data set preparation for check-worthy claim detection.en_US
dc.language.isoengen_US
dc.publisherCEURen_US
dc.relation.ispartofProceedings of the 5th Symposium of the Norwegian AI Society (NAIS 2023)
dc.relation.urihttps://ceur-ws.org/Vol-3431/paper8.pdf
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleMaking sense of nonsense : Integrated gradient-based input reduction to improve recall for check-worthy claim detectionen_US
dc.typeChapteren_US
dc.description.versionpublishedVersionen_US
dc.rights.holderCopyright 2023 The Author(s)en_US
cristin.ispublishedtrue
cristin.fulltextoriginal
dc.identifier.cristin2177670
dc.relation.projectNorges forskningsråd: 309339en_US
dc.identifier.citationIn: NAIS 2023: The 2023 symposium of the Norwegian AI Societyen_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal