• Automated Claim Detection for Fact-checking: A Case Study using Norwegian Pre-trained Language Models 

      Sheikhi, Ghazaal; Touileb, Samia; Khan, Sohail Ahmed (Chapter, 2023)
      We investigate to what extent pre-trained language models can be used for automated claim detection for fact-checking in a low resource setting. We explore this idea by fine-tuning four Norwegian pre-trained language models ...
    • Identifying Token-Level Dialectal Features in Social Media 

      Barnes, Jeremy Claude; Touileb, Samia; Mæhlum, Petter; Lison, Pierre (Chapter, 2023)
      Dialectal variation is present in many human languages and is attracting a growing interest in NLP. Most previous work concentrated on either (1) classifying dialectal varieties at the document or sentence level or (2) ...
    • JSEEGraph: Joint Structured Event Extraction as Graph Parsing 

      You, Huiling; Touileb, Samia; Øvrelid, Lilja (Chapter, 2023)
      We propose a graph-based event extraction framework JSEEGraph that approaches the task of event extraction as general graph parsing in the tradition of Meaning Representation Parsing. It explicitly encodes entities and ...
    • Learning Horn envelopes via queries from language models 

      Blum, Sophie; Koudijs, Raoul; Ozaki, Ana; Touileb, Samia (Journal article; Peer reviewed, 2024)
      We present an approach for systematically probing a trained neural network to extract a symbolic abstraction of it, represented as a Boolean formula. We formulate this task within Angluin's exact learning framework, where ...
    • Making sense of nonsense : Integrated gradient-based input reduction to improve recall for check-worthy claim detection 

      Sheikhi, Ghazaal; Opdahl, Andreas Lothe; Touileb, Samia; Setty, Vinay (Chapter, 2023)
      Analysing long text documents of political discourse to identify check-worthy claims (claim detection) is known to be an important task in automated fact-checking systems, as it saves the precious time of fact-checkers, ...
    • NorDiaChange: Diachronic Semantic Change Dataset for Norwegian 

      Kutuzov, Andrei; Touileb, Samia; Mæhlum, Petter; Enstad, Tita; Wittemann, Alexandra (Chapter, 2022)
      We describe NorDiaChange: the first diachronic semantic change dataset for Norwegian. NorDiaChange comprises two novel subsets, covering about 80 Norwegian nouns manually annotated with graded semantic change over time. ...