27 February 2012 to 23 March 2012
Nordita
Europe/Stockholm timezone

Probabilistic models of protein structure: from theory to applications

20 Mar 2012, 10:00
1h
132:028 (Nordita)

132:028

Nordita

Speaker

Prof. Thomas Hamelryck (University of Copenhagen)

Description

The so-called protein folding problem is the loose denominator for an amalgam of closely related, unsolved problems that include protein structure prediction, protein design and the simulation of the protein folding process. We adopt a unique probabilistic approach to modeling bio-molecular structure, based on graphical models and directional statistics [1,2,3,4,5]. Notably, we developed the first probabilistic model of protein structure in full atomic detail [1, 4]. In this talk, I will give an overview of how rigorous probabilistic models of something as complicated as a protein structure can be formulated, focusing on the use of graphical models and directional statistics to model angular degrees of freedom. I will also discuss the reference ratio method [6], a novel statistical method that can be seen as a surprising Bayesian variant of the maximum entropy method. This method also sheds an entirely new light on the in protein structure prediction widely used potentials of mean force, which were up to now poorly understood and justified. Finally, I will describe some applications, including the investigation of protein dynamics and the statistical inference of protein structure from nuclear magnetic resonance (NMR) data [7] and small angle X-ray scattering (SAXS) data [8]. [1] Hamelryck, T., Kent, J., Krogh, A. (2006) Sampling realistic protein conformations using local structural bias. PLoS Comput. Biol., 2(9): e131. [2] W. Boomsma, K.V. Mardia, C.C. Taylor, J. Ferkinghoff-Borg, A. Krogh, and T. Hamelryck. (2008) A generative, probabilistic model of local protein structure. Proc. Natl. Acad. Sci. U S A, 105(26):8932–8937. [3] Frellsen, J., Moltke, I., Thiim, M., Mardia, KV., Ferkinghoff-Borg, J., Hamelryck, T. (2009) A probabilistic model of RNA conformational space. PLoS Computational Biology, 5(6), e1000406. [4] Tim Harder, Wouter Boomsma, Martin Paluszewski, Jes Frellsen, Kristoffer Johansson, and Thomas Hamelryck. (2010) Beyond rotamers: a generative, probabilistic model of side chains in proteins. BMC Bioinformatics, 11(1):306. [5] M. Paluszewski and T. Hamelryck. (2010) Mocapy++ – a toolkit for inference and learning in dynamic bayesian networks. BMC bioinformatics, 11(1):126. [6] Thomas Hamelryck, Mikael Borg, Martin Paluszewski, Jonas Paulsen, Jes Frellsen, Christian Andreetta, Wouter Boomsma, Sandro Bottaro, and Jesper Ferkinghoff-Borg (2010). Potentials of mean force for protein structure prediction vindicated, formalized and generalized. PLoS ONE, 5(11):e13714. [7] Simon Olsson, Wouter Boomsma, Jes Frellsen, Sandro Bottaro, Tim Harder, Jesper Ferkinghoff-Borg, and Thomas Hamelryck. (2011) Generative probabilistic models extend the scope of inferential structure determination. J. Magn. Reson., 213(1):182-6. [8] Stovgaard, K., Andreetta, C., Ferkinghoff-Borg, J., Hamelryck, T. (2010) Calculation of accurate small angle X-ray scattering curves from coarse-grained protein models. BMC Bioinformatics, 11:429.

Presentation materials

There are no materials yet.