Reproducible analysis of sequencing-based RNA structure probing data with user-friendly tools

Research output: Chapter in Book/Report/Conference proceedingBook chapterResearchpeer-review

RNA structure-probing data can improve the prediction of RNA secondary and tertiary structure and allow structural changes to be identified and investigated. In recent years, massive parallel sequencing has dramatically improved the throughput of RNA structure probing experiments, but at the same time also made analysis of the data challenging for scientists without formal training in computational biology. Here, we discuss different strategies for data analysis of massive parallel sequencing-based structure-probing data. To facilitate reproducible and standardized analysis of this type of data, we have made a collection of tools, which allow raw sequencing reads to be converted to normalized probing values using different published strategies. In addition, we also provide tools for visualization of the probing data in the UCSC Genome Browser and for converting RNA coordinates to genomic coordinates and vice versa. The collection is implemented as functions in the R statistical environment and as tools in the Galaxy platform, making them easily accessible for the scientific community. We demonstrate the usefulness of the collection by applying it to the analysis of sequencing-based hydroxyl radical probing data and comparing different normalization strategies.

Original languageEnglish
Title of host publicationStructures of large RNA molecules and their complexes
EditorsSarah A. Woodson, Frédéric H.T. Allain
Number of pages28
PublisherElsevier
Publication date2015
Pages153-180
Chapter6
ISBN (Electronic)978-0-12-801934-4
DOIs
Publication statusPublished - 2015
SeriesMethods in Enzymology
Volume558
ISSN0076-6879

ID: 154364159