Show simple item record

dc.contributor.authorXue, Yaxin
dc.contributor.authorLanzén, Anders
dc.contributor.authorJonassen, Inge
dc.date.accessioned2021-06-14T11:44:01Z
dc.date.available2021-06-14T11:44:01Z
dc.date.created2020-06-17T14:17:07Z
dc.date.issued2020-03-13
dc.PublishedBioinformatics. 2020, 36 (11), 3365-3371.
dc.identifier.issn1367-4803
dc.identifier.urihttps://hdl.handle.net/11250/2759287
dc.description.abstractMotivation Technological advances in meta-transcriptomics have enabled a deeper understanding of the structure and function of microbial communities. ‘Total RNA’ meta-transcriptomics, sequencing of total reverse transcribed RNA, provides a unique opportunity to investigate both the structure and function of active microbial communities from all three domains of life simultaneously. A major step of this approach is the reconstruction of full-length taxonomic marker genes such as the small subunit ribosomal RNA. However, current tools for this purpose are mainly targeted towards analysis of amplicon and metagenomic data and thus lack the ability to handle the massive and complex datasets typically resulting from total RNA experiments. Results In this work, we introduce MetaRib, a new tool for reconstructing ribosomal gene sequences from total RNA meta-transcriptomic data. MetaRib is based on the popular rRNA assembly program EMIRGE, together with several improvements. We address the challenge posed by large complex datasets by integrating sub-assembly, dereplication and mapping in an iterative approach, with additional post-processing steps. We applied the method to both simulated and real-world datasets. Our results show that MetaRib can deal with larger datasets and recover more rRNA genes, which achieve around 60 times speedup and higher F1 score compared to EMIRGE in simulated datasets. In the real-world dataset, it shows similar trends but recovers more contigs compared with a previous analysis based on random sub-sampling, while enabling the comparison of individual contig abundances across samples for the first time.en_US
dc.language.isoengen_US
dc.publisherOxford University Pressen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleReconstructing ribosomal genes from large scale total RNA meta-transcriptomic dataen_US
dc.typeJournal articleen_US
dc.typePeer revieweden_US
dc.description.versionpublishedVersionen_US
dc.rights.holderCopyright The Author(s) 2020en_US
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode2
dc.identifier.doi10.1093/bioinformatics/btaa177
dc.identifier.cristin1815981
dc.source.journalBioinformaticsen_US
dc.source.4036
dc.source.1411
dc.source.pagenumber3365-3371en_US
dc.identifier.citationBioinformatics. 2020, 36 (11), 3365–3371en_US
dc.source.volume36en_US
dc.source.issue11en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Navngivelse 4.0 Internasjonal
Except where otherwise noted, this item's license is described as Navngivelse 4.0 Internasjonal