Bsgatlas: A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Bsgatlas : A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access. / Geissler, Adrian Sven; Anthon, Christian; Alkan, Ferhat; González-Tortuero, Enrique; Poulsen, Line Dahl; Kallehauge, Thomas Beuchert; Breüner, Anne; Seemann, Stefan Ernst; Vinther, Jeppe; Gorodkin, Jan.

In: Microbial Genomics, Vol. 7, No. 2, 000524, 2021.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Geissler, AS, Anthon, C, Alkan, F, González-Tortuero, E, Poulsen, LD, Kallehauge, TB, Breüner, A, Seemann, SE, Vinther, J & Gorodkin, J 2021, 'Bsgatlas: A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access', Microbial Genomics, vol. 7, no. 2, 000524. https://doi.org/10.1099/mgen.0.000524

APA

Geissler, A. S., Anthon, C., Alkan, F., González-Tortuero, E., Poulsen, L. D., Kallehauge, T. B., Breüner, A., Seemann, S. E., Vinther, J., & Gorodkin, J. (2021). Bsgatlas: A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access. Microbial Genomics, 7(2), [000524]. https://doi.org/10.1099/mgen.0.000524

Vancouver

Geissler AS, Anthon C, Alkan F, González-Tortuero E, Poulsen LD, Kallehauge TB et al. Bsgatlas: A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access. Microbial Genomics. 2021;7(2). 000524. https://doi.org/10.1099/mgen.0.000524

Author

Geissler, Adrian Sven ; Anthon, Christian ; Alkan, Ferhat ; González-Tortuero, Enrique ; Poulsen, Line Dahl ; Kallehauge, Thomas Beuchert ; Breüner, Anne ; Seemann, Stefan Ernst ; Vinther, Jeppe ; Gorodkin, Jan. / Bsgatlas : A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access. In: Microbial Genomics. 2021 ; Vol. 7, No. 2.

Bibtex

@article{b3ab8dd5f20e4158a71b617bf43751fc,
title = "Bsgatlas: A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access",
abstract = "A large part of our current understanding of gene regulation in Gram-positive bacteria is based on Bacillus subtilis, as it is one of the most well studied bacterial model systems. The rapid growth in data concerning its molecular and genomic biology is distributed across multiple annotation resources. Consequently, the interpretation of data from further B. subtilis experiments becomes increasingly challenging in both low-and large-scale analyses. Additionally, B. subtilis annotation of structured RNA and non-coding RNA (ncRNA), as well as the operon structure, is still lagging behind the annotation of the coding sequences. To address these challenges, we created the B. subtilis genome atlas, BSGatlas, which integrates and unifies multiple existing annotation resources. Compared to any of the individual resources, the BSGatlas contains twice as many ncRNAs, while improving the positional annotation for 70% of the ncRNAs. Furthermore, we combined known transcription start and termination sites with lists of known co-transcribed gene sets to create a comprehensive transcript map. The combination with transcription start/termination site annotations resulted in 717 new sets of co-transcribed genes and 5335 untranslated regions (UTRs). In comparison to existing resources, the number of 5′ and 3′ UTRs increased nearly fivefold, and the number of internal UTRs doubled. The transcript map is organized in 2266 operons, which provides transcriptional annotation for 92% of all genes in the genome compared to the at most 82% by previous resources. We predicted an off-target-aware genome-wide library of CRISPR–Cas9 guide RNAs, which we also linked to polycistronic operons. We provide the BSGatlas in multiple forms: as a website (https://​rth.​dk/​resources/​bsgatlas/), an annotation hub for display in the UCSC genome browser, supplementary tables and standardized GFF3 format, which can be used in large scale-omics studies. By complementing existing resources, the BSGatlas supports analyses of the B. subtilis genome and its molecular biology with respect to not only non-coding genes but also genome-wide transcriptional relationships of all genes.",
keywords = "B. subtilis, Genome annotation, Non-coding and structured RNAs, Operons",
author = "Geissler, {Adrian Sven} and Christian Anthon and Ferhat Alkan and Enrique Gonz{\'a}lez-Tortuero and Poulsen, {Line Dahl} and Kallehauge, {Thomas Beuchert} and Anne Bre{\"u}ner and Seemann, {Stefan Ernst} and Jeppe Vinther and Jan Gorodkin",
year = "2021",
doi = "10.1099/mgen.0.000524",
language = "English",
volume = "7",
journal = "Microbial Genomics",
issn = "2057-5858",
publisher = "Microbiology Society",
number = "2",

}

RIS

TY - JOUR

T1 - Bsgatlas

T2 - A unified bacillus subtilis genome and transcriptome annotation atlas with enhanced information access

AU - Geissler, Adrian Sven

AU - Anthon, Christian

AU - Alkan, Ferhat

AU - González-Tortuero, Enrique

AU - Poulsen, Line Dahl

AU - Kallehauge, Thomas Beuchert

AU - Breüner, Anne

AU - Seemann, Stefan Ernst

AU - Vinther, Jeppe

AU - Gorodkin, Jan

PY - 2021

Y1 - 2021

N2 - A large part of our current understanding of gene regulation in Gram-positive bacteria is based on Bacillus subtilis, as it is one of the most well studied bacterial model systems. The rapid growth in data concerning its molecular and genomic biology is distributed across multiple annotation resources. Consequently, the interpretation of data from further B. subtilis experiments becomes increasingly challenging in both low-and large-scale analyses. Additionally, B. subtilis annotation of structured RNA and non-coding RNA (ncRNA), as well as the operon structure, is still lagging behind the annotation of the coding sequences. To address these challenges, we created the B. subtilis genome atlas, BSGatlas, which integrates and unifies multiple existing annotation resources. Compared to any of the individual resources, the BSGatlas contains twice as many ncRNAs, while improving the positional annotation for 70% of the ncRNAs. Furthermore, we combined known transcription start and termination sites with lists of known co-transcribed gene sets to create a comprehensive transcript map. The combination with transcription start/termination site annotations resulted in 717 new sets of co-transcribed genes and 5335 untranslated regions (UTRs). In comparison to existing resources, the number of 5′ and 3′ UTRs increased nearly fivefold, and the number of internal UTRs doubled. The transcript map is organized in 2266 operons, which provides transcriptional annotation for 92% of all genes in the genome compared to the at most 82% by previous resources. We predicted an off-target-aware genome-wide library of CRISPR–Cas9 guide RNAs, which we also linked to polycistronic operons. We provide the BSGatlas in multiple forms: as a website (https://​rth.​dk/​resources/​bsgatlas/), an annotation hub for display in the UCSC genome browser, supplementary tables and standardized GFF3 format, which can be used in large scale-omics studies. By complementing existing resources, the BSGatlas supports analyses of the B. subtilis genome and its molecular biology with respect to not only non-coding genes but also genome-wide transcriptional relationships of all genes.

AB - A large part of our current understanding of gene regulation in Gram-positive bacteria is based on Bacillus subtilis, as it is one of the most well studied bacterial model systems. The rapid growth in data concerning its molecular and genomic biology is distributed across multiple annotation resources. Consequently, the interpretation of data from further B. subtilis experiments becomes increasingly challenging in both low-and large-scale analyses. Additionally, B. subtilis annotation of structured RNA and non-coding RNA (ncRNA), as well as the operon structure, is still lagging behind the annotation of the coding sequences. To address these challenges, we created the B. subtilis genome atlas, BSGatlas, which integrates and unifies multiple existing annotation resources. Compared to any of the individual resources, the BSGatlas contains twice as many ncRNAs, while improving the positional annotation for 70% of the ncRNAs. Furthermore, we combined known transcription start and termination sites with lists of known co-transcribed gene sets to create a comprehensive transcript map. The combination with transcription start/termination site annotations resulted in 717 new sets of co-transcribed genes and 5335 untranslated regions (UTRs). In comparison to existing resources, the number of 5′ and 3′ UTRs increased nearly fivefold, and the number of internal UTRs doubled. The transcript map is organized in 2266 operons, which provides transcriptional annotation for 92% of all genes in the genome compared to the at most 82% by previous resources. We predicted an off-target-aware genome-wide library of CRISPR–Cas9 guide RNAs, which we also linked to polycistronic operons. We provide the BSGatlas in multiple forms: as a website (https://​rth.​dk/​resources/​bsgatlas/), an annotation hub for display in the UCSC genome browser, supplementary tables and standardized GFF3 format, which can be used in large scale-omics studies. By complementing existing resources, the BSGatlas supports analyses of the B. subtilis genome and its molecular biology with respect to not only non-coding genes but also genome-wide transcriptional relationships of all genes.

KW - B. subtilis

KW - Genome annotation

KW - Non-coding and structured RNAs

KW - Operons

U2 - 10.1099/mgen.0.000524

DO - 10.1099/mgen.0.000524

M3 - Journal article

C2 - 33539279

AN - SCOPUS:85102398935

VL - 7

JO - Microbial Genomics

JF - Microbial Genomics

SN - 2057-5858

IS - 2

M1 - 000524

ER -

ID: 259044243