Show simple item record

dc.contributor.authorDe Souza, Gustavo A.en_US
dc.contributor.authorMålen, Hiwaen_US
dc.contributor.authorSøfteland, Tinaen_US
dc.contributor.authorSælensminde, Gisleen_US
dc.contributor.authorPrasad, Swatien_US
dc.contributor.authorJonassen, Ingeen_US
dc.contributor.authorWiker, Harald G.en_US
dc.date.accessioned2013-05-14T09:54:46Z
dc.date.available2013-05-14T09:54:46Z
dc.date.issued2008-07-02eng
dc.PublishedBMC Genomics 9(316)eng
dc.identifier.issn1471-2164
dc.identifier.urihttps://hdl.handle.net/1956/6603
dc.description.abstractBackground: While the genomic annotations of diverse lineages of the Mycobacterium tuberculosis complex are available, divergences between gene prediction methods are still a challenge for unbiased protein dataset generation. M. tuberculosis gene annotation is an example, where the most used datasets from two independent institutions (Sanger Institute and Institute of Genomic Research-TIGR) differ up to 12% in the number of annotated open reading frames, and 46% of the genes contained in both annotations have different start codons. Such differences emphasize the importance of the identification of the sequence of protein products to validate each gene annotation including its sequence coding area. Results: With this objective, we submitted a culture filtrate sample from M. tuberculosis to a highaccuracy LTQ-Orbitrap mass spectrometer analysis and applied refined N-terminal prediction to perform comparison of two gene annotations. From a total of 449 proteins identified from the MS data, we validated 35 tryptic peptides that were specific to one of the two datasets, representing 24 different proteins. From those, 5 proteins were only annotated in the Sanger database. In the remaining proteins, the observed differences were due to differences in annotation of transcriptional start sites. Conclusion: Our results indicate that, even in a less complex sample likely to represent only 10% of the bacterial proteome, we were still able to detect major differences between different gene annotation approaches. This gives hope that high-throughput proteomics techniques can be used to improve and validate gene annotations, and in particular for verification of high-throughput, automatic gene annotations.en_US
dc.language.isoengeng
dc.publisherBioMed Centraleng
dc.rightsAttribution CC BYeng
dc.rights.urihttp://creativecommons.org/licenses/by/2.0/eng
dc.subjectMycobacteriaeng
dc.subjectProteomicseng
dc.subjectMykobakteriernob
dc.subjectProteomikknob
dc.titleHigh accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using Mycobacterium tuberculosis as an exampleen_US
dc.typePeer reviewed
dc.typeJournal article
dc.description.versionpublishedVersionen_US
dc.rights.holderCopyright 2008 de Souza et al; licensee BioMed Central Ltd.
dc.identifier.doihttps://doi.org/10.1186/1471-2164-9-316
dc.identifier.cristin421210
dc.source.journalBMC Genomics
dc.source.409
dc.source.14316


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution CC BY
Except where otherwise noted, this item's license is described as Attribution CC BY