• Characteristics of 454 pyrosequencing data—enabling realistic simulation with flowsim 

      Balzer, Susanne Mignon; Malde, Ketil; Lanzén, Anders; Sharma, Animesh; Jonassen, Inge (Peer reviewed; Journal article, 2010)
      Motivation: The commercial launch of 454 pyrosequencing in 2005was a milestone in genome sequencing in terms of performance and cost. Throughout the three available releases, average read lengths have increased to ∼500 ...
    • Filtering duplicate reads from 454 pyrosequencing data 

      Balzer, Susanne Mignon; Malde, Ketil; Grohme, Markus A.; Jonassen, Inge (Peer reviewed; Journal article, 2013)
      Motivation: Throughout the recent years, 454 pyrosequencing has emerged as an efficient alternative to traditional Sanger sequencing and is widely used in both de novo whole-genome sequencing and metagenomics. Especially ...
    • Precrec: fast and accurate precision-recall and ROC curve calculations in R 

      Saito, Takaya; Rehmsmeier, Marc (Peer reviewed; Journal article, 2017)
      The precision–recall plot is more informative than the ROC plot when evaluating classifiers on imbalanced datasets, but fast and accurate curve calculation tools for precision–recall plots are currently not available. We ...
    • Systematic exploration of error sources in pyrosequencing flowgram data 

      Balzer, Susanne Mignon; Malde, Ketil; Jonassen, Inge (Peer reviewed; Journal article, 2011)
      Motivation: 454 pyrosequencing, by Roche Diagnostics, has emerged as an alternative to Sanger sequencing when it comes to read lengths, performance and cost, but shows higher per-base error rates. Although there are several ...