• Identifying Token-Level Dialectal Features in Social Media 

      Barnes, Jeremy Claude; Touileb, Samia; Mæhlum, Petter; Lison, Pierre (Chapter, 2023)
      Dialectal variation is present in many human languages and is attracting a growing interest in NLP. Most previous work concentrated on either (1) classifying dialectal varieties at the document or sentence level or (2) ...
    • Making sense of nonsense : Integrated gradient-based input reduction to improve recall for check-worthy claim detection 

      Sheikhi, Ghazaal; Opdahl, Andreas Lothe; Touileb, Samia; Setty, Vinay (Chapter, 2023)
      Analysing long text documents of political discourse to identify check-worthy claims (claim detection) is known to be an important task in automated fact-checking systems, as it saves the precious time of fact-checkers, ...
    • NorDiaChange: Diachronic Semantic Change Dataset for Norwegian 

      Kutuzov, Andrei; Touileb, Samia; Mæhlum, Petter; Enstad, Tita; Wittemann, Alexandra (Chapter, 2022)
      We describe NorDiaChange: the first diachronic semantic change dataset for Norwegian. NorDiaChange comprises two novel subsets, covering about 80 Norwegian nouns manually annotated with graded semantic change over time. ...