# Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction

Janssen S, Schudoma C, Steger G, Giegerich R (2011)

BMC Bioinformatics 12(1).

Download

*Journal Article*|

*Published*|

*English*

Author

Department

Abstract

BACKGROUND:Many bioinformatics tools for RNA secondary structure analysis are based on a thermodynamic model of RNA folding. They predict a single, "optimal" structure by free energy minimization, they enumerate near-optimal structures, they compute base pair probabilities and dot plots, representative structures of different abstract shapes, or Boltzmann probabilities of structures and shapes. Although all programs refer to the same physical model, they implement it with considerable variation for different tasks, and little is known about the effects of heuristic assumptions and model simplifications used by the programs on the outcome of the analysis.RESULTS:We extract four different models of the thermodynamic folding space which underlie the programs RNAfold, RNAshapes, and RNAsubopt. Their differences lie within the details of the energy model and the granularity of the folding space. We implement probabilistic shape analysis for all models, and introduce the shape probability shift as a robust measure of model similarity. Using four data sets derived from experimentally solved structures, we provide a quantitative evaluation of the model differences.CONCLUSIONS:We find that search space granularity affects the computed shape probabilities less than the over- or underapproximation of free energy by a simplified energy model. Still, the approximations perform similar enough to implementations of the full model to justify their continued use in settings where computational constraints call for simpler algorithms. On the side, we observe that the rarely used level 2 shapes, which predict the complete arrangement of helices, multiloops, internal loops and bulges, include the "true" shape in a rather small number of predicted high probability shapes. This calls for an investigation of new strategies to extract high probability members from the (very large) level 2 shape space of an RNA sequence. We provide implementations of all four models, written in a declarative style that makes them easy to be modified. Based on our study, future work on thermodynamic RNA folding may make a choice of model based on our empirical data. It can take our implementations as a starting point for further program development.

Publishing Year

ISSN

Financial disclosure

Article Processing Charge funded by the Deutsche Forschungsgemeinschaft and the Open Access Publication Fund of Bielefeld University.

PUB-ID

### Cite this

Janssen S, Schudoma C, Steger G, Giegerich R. Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction.

*BMC Bioinformatics*. 2011;12(1).Janssen, S., Schudoma, C., Steger, G., & Giegerich, R. (2011). Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction.

*BMC Bioinformatics*,*12*(1).Janssen, S., Schudoma, C., Steger, G., and Giegerich, R. (2011). Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction.

*BMC Bioinformatics*12.Janssen, S., et al., 2011. Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction.

*BMC Bioinformatics*, 12(1).S. Janssen, et al., “Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction”,

*BMC Bioinformatics*, vol. 12, 2011.Janssen, S., Schudoma, C., Steger, G., Giegerich, R.: Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction. BMC Bioinformatics. 12, (2011).

Janssen, Stefan, Schudoma, Christian, Steger, Gerhard, and Giegerich, Robert. “Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction”.

*BMC Bioinformatics*12.1 (2011).**Main File(s)**

File Name

Access Level

Open Access

Last Uploaded

2012-03-23 11:35:18

This data publication is cited in the following publications:

This publication cites the following data publications:

### 8 Citations in Europe PMC

Data provided by Europe PubMed Central.

Thermodynamic matchers for the construction of the cuckoo RNA family.

Reinkensmeier J, Giegerich R.,

PMID: 25779873

Reinkensmeier J, Giegerich R.,

*RNA Biol*12(2), 2015PMID: 25779873

Analysing RNA-kinetics based on folding space abstraction.

Huang J, Voß B.,

PMID: 24575751

Huang J, Voß B.,

*BMC Bioinformatics*15(), 2014PMID: 24575751

A silent exonic SNP in kdm3a affects nucleic acids structure but does not regulate experimental autoimmune encephalomyelitis.

Gillett A, Bergman P, Parsa R, Bremges A, Giegerich R, Jagodic M.,

PMID: 24312603

Gillett A, Bergman P, Parsa R, Bremges A, Giegerich R, Jagodic M.,

*PLoS ONE*8(12), 2013PMID: 24312603

Bellman's GAP--a language and compiler for dynamic programming in sequence analysis.

Sauthoff G, Mohl M, Janssen S, Giegerich R.,

PMID: 23355290

Sauthoff G, Mohl M, Janssen S, Giegerich R.,

*Bioinformatics*29(5), 2013PMID: 23355290

Direct updating of an RNA base-pairing probability matrix with marginal probability constraints.

Hamada M.,

PMID: 23210474

Hamada M.,

*J. Comput. Biol.*19(12), 2012PMID: 23210474

Abstract folding space analysis based on helices.

Huang J, Backofen R, Voß B.,

PMID: 23104999

Huang J, Backofen R, Voß B.,

*RNA*18(12), 2012PMID: 23104999

Computational methods for ab initio detection of microRNAs.

Allmer J, Yousef M.,

PMID: 23087705

Allmer J, Yousef M.,

*Front Genet*3(), 2012PMID: 23087705

### 34 References

Data provided by Europe PubMed Central.

Computational approaches for RNA energy parameter estimation.

Andronescu M, Condon A, Hoos HH, Mathews DH, Murphy KP.,

PMID: 20940338

Andronescu M, Condon A, Hoos HH, Mathews DH, Murphy KP.,

*RNA*16(12), 2010PMID: 20940338

Faster computation of exact RNA shape probabilities.

Janssen S, Giegerich R.,

PMID: 20080511

Janssen S, Giegerich R.,

*Bioinformatics*26(5), 2010PMID: 20080511

Mfold web server for nucleic acid folding and hybridization prediction.

Zuker M.,

PMID: 12824337

Zuker M.,

*Nucleic Acids Res.*31(13), 2003PMID: 12824337

UNAFold: software for nucleic acid folding and hybridization.

Markham NR, Zuker M.,

PMID: 18712296

Markham NR, Zuker M.,

*Methods Mol. Biol.*453(), 2008PMID: 18712296

RNAstructure: software for RNA secondary structure prediction and analysis.

Reuter JS, Mathews DH.,

PMID: 20230624

Reuter JS, Mathews DH.,

*BMC Bioinformatics*11(), 2010PMID: 20230624

Coaxial stacking of helixes enhances binding of oligoribonucleotides and improves predictions of RNA folding.

Walter AE, Turner DH, Kim J, Lyttle MH, Muller P, Mathews DH, Zuker M.,

PMID: 7524072

Walter AE, Turner DH, Kim J, Lyttle MH, Muller P, Mathews DH, Zuker M.,

*Proc. Natl. Acad. Sci. U.S.A.*91(20), 1994PMID: 7524072

Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs.

Xia T, SantaLucia J Jr, Burkard ME, Kierzek R, Schroeder SJ, Jiao X, Cox C, Turner DH.,

PMID: 9778347

Xia T, SantaLucia J Jr, Burkard ME, Kierzek R, Schroeder SJ, Jiao X, Cox C, Turner DH.,

*Biochemistry*37(42), 1998PMID: 9778347

Strong correlation between SHAPE chemistry and the generalized NMR order parameter (S2) in RNA.

Gherghe CM, Shajani Z, Wilkinson KA, Varani G, Weeks KM.,

PMID: 18710236

Gherghe CM, Shajani Z, Wilkinson KA, Varani G, Weeks KM.,

*J. Am. Chem. Soc.*130(37), 2008PMID: 18710236

A comprehensive comparison of comparative RNA structure prediction approaches.

Gardner PP, Giegerich R.,

PMID: 15458580

Gardner PP, Giegerich R.,

*BMC Bioinformatics*5(), 2004PMID: 15458580

### Export

0 Marked Publications### Web of Science

View record in Web of Science®### Sources

PMID: 22051375

PubMed | Europe PMC