Statistical Mechanics of Learning and Inference
from
Wednesday 26 May 2010 (16:00)
to
Saturday 29 May 2010 (19:00)
Monday 24 May 2010
Tuesday 25 May 2010
Wednesday 26 May 2010
17:00
From gene expressions to genetic networks
-
Marek Cieplak
(
Institute of Physics, Polish Academy of Sciences
)
From gene expressions to genetic networks
Marek Cieplak
(
Institute of Physics, Polish Academy of Sciences
)
17:00 - 18:00
A method based on the principle of entropy maximization is used to identify the gene interaction network with the highest probability of giving rise to experimentally observed transcript profiles [1]. In its simplest form, the method yields the pairwise gene interaction network, but it can also be extended to deduce higher order correlations. Analysis of microarray data from genes in Saccharomyces cerevisiae chemostat cultures exhibiting energy metabollic oscillations identifies a gene interaction network that reflects the intracellular communication pathways. These pathways adjust cellular metabolic activity and cell division to the limiting nutrient conditions that trigger metabolic oscillations. The success of the present approach in extracting meaningful genetic connections suggests that the maximum entropy principle is a useful concept for understanding living systems, as it is for other complex, nonequilibrium systems. The time-dependent behavior of the genetic network is found to involve only a few fundamental modes [2,3]. REFERENCES: [1] T. R. Lezon, J. R. Banavar, M. Cieplak, A. Maritan, and N. Fedoroff, Using the principle of entropy maximization to infer genetic interaction networks from gene expression patterns, Proc. Natl. Acad. Sci. (USA) 103, 19033-19038 (2006) [2] N. S. Holter, M. Mitra, A. Maritan, M. Cieplak, J. R. Banavar, and N. V. Fedoroff, Fundamental patterns underlying gene expression profiles: simplicity from complexity, Proc. Natl. Acad. Sci. USA 97, 8409-8414 (2000) [3] N. S. Holter, A. Maritan, M. Cieplak, N. V. Fedoroff, and J. R. Banavar, Dynamic modeling of gene expression data, Proc. Natl. Acad. Sci. USA 98, 1693-1698 (2001)
18:00
coffee break
coffee break
18:00 - 18:30
18:30
Information, interaction and inference in finance
-
Matteo Marsili
(
ICTP
)
Information, interaction and inference in finance
Matteo Marsili
(
ICTP
)
18:30 - 19:30
Thursday 27 May 2010
09:15
High-dimensional Ising model selection using ell_1 regularized regression
-
Pradeep Ravikumar
(
University of Texas, Austin
)
High-dimensional Ising model selection using ell_1 regularized regression
Pradeep Ravikumar
(
University of Texas, Austin
)
09:15 - 10:00
10:00
coffee break
coffee break
10:00 - 10:30
10:30
TBA
-
Venkat Chandrasekaran
(
MIT
)
TBA
Venkat Chandrasekaran
(
MIT
)
10:30 - 11:15
11:15
TBA
-
Samuel Kaski
(
Aalto University
)
TBA
Samuel Kaski
(
Aalto University
)
11:15 - 12:00
12:00
Lunch
Lunch
12:00 - 14:30
14:30
Optimal control as a graphical model inference problem
-
Bert Kappen
(
Radboud University Nijmegen
)
Optimal control as a graphical model inference problem
Bert Kappen
(
Radboud University Nijmegen
)
14:30 - 15:15
To compute a course of actions in the presence of uncertainty is the topic of stochastic optimal control theory. Such computations require the solution of complex partial differential equations and these computations become intractable for most problems. I will introduce a class of control problems that can be expressed as a KL divergence and that can be mapped onto a graphical model inference problem. In this talk, we show how to apply this theory in the context of a delayed choice task and for collaborating agents. We first introduce the KL control framework. Then we show that in a delayed reward task when the future is uncertain it is optimal to delay the timing of your decision. We show preliminary results on human subjects that confirm this prediction. Subsequently, we discuss two player games, such as the stag-hunt game, where collaboration can improve or worsen as a result of recursive reasoning about the opponents actions. The Nash equilibria appear as local minima of the optimal cost to go, but may disappear when monetary gain decreases. This behaviour is in agreement with experimental findings in humans. We subsequently extend the setting to delayed rewards and show how cooperation develops as a result of recursive reasoning.Suboptimal cooperation arise as local minima of the objective function.
15:15
Modeling Viral Evolution in competition with the Immune System
-
Silvio Franz
(
Université Paris-Sud
)
Modeling Viral Evolution in competition with the Immune System
Silvio Franz
(
Université Paris-Sud
)
15:15 - 16:00
Influenza viruses evolve at a high speed to escape acquired immunity and infect the same host several time. Contrary to naive expectation, this does not lead to a large diversity in the viral population. Phylogenetic studies show that the viral population display the characters of an "evolving quasispecies" with reduced instantaneous diversity. In this talk I will discuss a simple stochastic model of and evolving viral population that allows to rationalize the evolving quasispecies behavior as an emerging feature of the competition between strains with different level of infectivity.
16:00
coffee break
coffee break
16:00 - 16:30
16:30
The fluctuation-dissipation relations as an inference tool
-
Angelo Vulpiani
(
Università di Roma "La Sapienza"
)
The fluctuation-dissipation relations as an inference tool
Angelo Vulpiani
(
Università di Roma "La Sapienza"
)
16:30 - 17:15
As first we discuss as the Fluctuation Dissipation Relations (FDRs) hold in a generalized form for any systems with a stationary probability distribution. One can say that the essence of the FDRs is the possibility to establish a bridge between equilibrium and non equilibrium properties. We show how FDRs are useful tools to understand the statistical behaviour of complex systems such as driven granular gases and protein models. In the first case although the velocity response function of a particle and its velocity self-correlation are not proportional, a generalized form of fluctuation-dissipation relation holds, this is due to the presence of strong correlations between velocities and spatial density. This happens at high densities and strong inelasticities, but still in the fluid-like (and ergodic) regime. In addition we discuss how, following the Jarzynski- like approach, i.e. perfoming a series of "pulling experiments", one can infer the basic feature of the underlying dynamics. (Joint work with F. Cecconi, A. Puglisi and D. Villamaina)
17:15
TBA
-
Shaomeng Qin
(
Aalto University
)
TBA
Shaomeng Qin
(
Aalto University
)
17:15 - 18:00
Friday 28 May 2010
09:15
Intrinsic Limitations of Inverse Inference in Spin Glasses
-
Enzo Marinari
(
Università di Roma "La Sapienza"
)
Intrinsic Limitations of Inverse Inference in Spin Glasses
Enzo Marinari
(
Università di Roma "La Sapienza"
)
09:15 - 10:00
Abstract. We analyze the limits inherent to the inverse reconstruction of a pairwise Ising spin glass based on susceptibility propagation. We establish the conditions under which the susceptibility propagation algorithm is able to reconstruct the characteristics of the network given first- and second-order local observables, evaluate eventual errors due to various types of noise in the originally observed data, and discuss the scaling of the problem with the number of degrees of freedom.
10:00
coffee break
coffee break
10:00 - 10:30
10:30
Statistical physics of optimization under uncertainty
-
Riccardo Zecchina
(
Politecnico di Torino
)
Statistical physics of optimization under uncertainty
Riccardo Zecchina
(
Politecnico di Torino
)
10:30 - 11:15
Optimization under uncertainty deals with the problem of optimizing stochastic cost functions given some partial information on their inputs. These problems are extremely difficult to solve and yet pervade all areas of technological and natural sciences. We propose a general approach to solve such large-scale stochastic optimization problems and a Survey Propagation based algorithm that implements it. As an illustration, we apply our method to the stochastic bipartite matching problem, in the two-stage and multi-stage cases. The efficiency of our approach, which does not rely on sampling techniques, allows us to validate the analytical predictions with large-scale numerical simulations. (joint work with Fabrizio Altarelli, Alfredo Braunstein and Abolfazl Ramezanpour)
11:15
Dynamical TAP equations and the inverse Ising problem
-
Yasser Roudi
(
Nordita
)
Dynamical TAP equations and the inverse Ising problem
Yasser Roudi
(
Nordita
)
11:15 - 12:00
Recent advances in recording technology allow simultaneous measurement of the activity of many elements in a biological system, e.g. many neurons, genes etc. This has inspired people to study how this recorded data can be used to learn something about the connectivity between these elements. A useful and powerful platform for studying this problem is the inverse Ising problem: finding the coupling of an Ising model given the means and pairwise correlation or samples from the distribution. In this talk, after briefly describing exact and approximate methods for finding the couplings of an equilibrium Ising model, I will describe how we can use a non-equilibrium model to improve the inference of the connections.
12:00
Lunch
Lunch
12:00 - 14:30
14:30
Cluster expansion for the Inverse Ising Problem: application to synthetic and real data.
-
Simona Cocco
(
École Normale Supérieure
)
Cluster expansion for the Inverse Ising Problem: application to synthetic and real data.
Simona Cocco
(
École Normale Supérieure
)
14:30 - 15:15
I will introduce a procedure to infer the fields and the couplings of a spatially-distributed Ising model, given the magnetizations and pairwise correlations of spins. The algorithm is based on the recursive decomposition of the entropy into contributions coming from clusters of spins. I will explain and validate the procedure on synthetic data sets,and then apply it to experimental data coming from multi-electrode recordings of neural activity. (Work done in collaboration with R. Monasson and S. Leibler)
15:15
Reading out the activity of large neural ensembles: The Ising Decoder
-
Simon Schultz
(
Imperial College
)
Reading out the activity of large neural ensembles: The Ising Decoder
Simon Schultz
(
Imperial College
)
15:15 - 16:00
New technologies such as high-density multi-electrode array recording and multiphoton calcium imaging allow the activity of large numbers of neurons to be monitored. However, analysis tools have lagged behind the experimental technology, with most approaches limited to very small population sizes. In the limit of short time windows, where neuronal activity can be binarized without loss of information, the Ising model provides a useful approach towards capturing the information content of large neural ensembles. I will show how maximum entropy models including the Ising model fit with the information component analysis theoretical framework for studying neural coding, and how the Ising model can be used to decode large neural ensembles. I will highlight some recent advances we have made in scaling up our decoders, and demonstrate the algorithms on in vivo multielectrode array and two photon calcium imaging data.
16:00
coffee break
coffee break
16:00 - 16:30
16:30
Exploring Nash Equilibria in Network Games
-
Luca Dall'Asta
(
ICTP
)
Exploring Nash Equilibria in Network Games
Luca Dall'Asta
(
ICTP
)
16:30 - 17:15
Game theoretic problems defined on graphs may admit many Nash equilibria, with very different properties. An example is provided by strategic substitutes game on network. Searching for (socially) optimal Nash equilibria in these games is a non-trivial task. I will discuss some algorithmic techniques based on Monte Carlo and Belief Propagation as well as learning methods by means of which players endogeneously organize toward one or more Nash equilibria.
17:15
Poster Session
Poster Session
17:15 - 19:15
Saturday 29 May 2010
09:15
Inference of protein-protein interactions from multi-species sequence data using statistical-physics inspired approaches
-
Martin Weigt
(
Institute for Scientific Interchange, Torino
)
Inference of protein-protein interactions from multi-species sequence data using statistical-physics inspired approaches
Martin Weigt
(
Institute for Scientific Interchange, Torino
)
09:15 - 10:00
Experimental approaches to transient protein interactions are laborious and serendipitous, and our understanding of fundamental questions like the identification of interaction surfaces or the specificity of molecular recognition between interacting proteins is far from being complete. We propose a computational approach based on recent techniques from the statistical physics of disordered systems, which exploits the natural sequence variability of homologous proteins across hundreds of species species. Using bacterial two-component signal transduction (TCS) as a test case, we show that our method is able (i) to identify inter-protein residue contacts and to facilitate the prediction of protein complex strutures, and (ii) to reconstruct a molecular recognition code which elucidates specificity in signal transduction in bacteria.
10:00
break
break
10:00 - 10:30
10:30
TBA
-
WITOELAR, Aree
(
Comp. Science, University of Groningen
)
TBA
WITOELAR, Aree
(
Comp. Science, University of Groningen
)
10:30 - 11:15
11:15
Mixture modeling of DNA copy number aberrations
-
Jaakko Hollmen
(
Aalto University
)
Mixture modeling of DNA copy number aberrations
Jaakko Hollmen
(
Aalto University
)
11:15 - 12:00
DNA copy number aberrations, i.e. copy number amplifications and copy number deletions, are hallmarks of nearly all advanced tumors. We present the data collection of genome-wide DNA copy number amplification data consisting of data of over 4500 cases of human neoplasms. The data set has been gathered from scientific journal articles covering a period of ten years and is naturally represented as 0-1 data. We motivate the use of mixture models in probabilistic clustering of amplification data and present a mixture model of multivariate Bernoulli distributions to yield patterns that are relevant to all cancer types. Appropriate complexity for the mixture model for each chromosome is selected with a model selection procedure. A methodology to create a naming scheme for the identified patterns is also presented. Results are interpreted and the diagnostic value of the findings is further investigated in the light of background risk factors.