Thursday, January 21, 2010

Changing the Blog's Screen Width

http://bloggersentral.blogspot.com/2009/06/how-to-change-blog-width.html

Learning in Games - Some Notes

Strategic Learning and its limits - Peyton Young
One school points optimistically to particular classes of games that can in fact be learned by simple updating procedures, such as learning minimax equilibria by fictitious play (see Chapter 6). This school also points to the fact that any game can be learned by Bayesian methods provided that the priors are sufficiently aligned to begin with (Kalai and Lehrer, 1993).
The other school views the situation more pessimistically. Its adherents point out that simple learning procedures tend to work only in special cases, such as zero-sum games, potential games, and so forth. Moreover, they are unimpressed by the fact that Bayesian methods lead to learning if the priors are sufficiently aligned to begin with. First, Bayesian reasoning makes extreme demands on the computational capacities of the players, and hence is not a plausible model of decision making by human beings. Secondly, even if one is willing to overlook the empirical implausibility of this approach, the amount of coordination required to learn equilibrium effectively assumes some notion of pre-equilibrium in the prior beliefs. Hence it does not solve the learning problem in a robust sense. (These matters are treated in Chapter 7.)
The purpose of these lectures is to survey results on both sides of the question. On balance, I shall give a little more weight to the positive side, though not in a way that will please all of the positivists. I begin by examining simple forms of learning behavior that depend only on realized payoffs and simple statistical summaries of historical data. These include forms of reinforcement and no regret learning, which turn out to be closely connected. Some of these rules converge in long-run average behavior to correlated forms of equilibrium, including standard correlated equilibrium and variants thereof (see Chapters 2–5). An important property of these learning rules is that they are robust, that is, they require no prior knowledge of the opponents' payoffs or strategies. Significantly, however, they do not yield Nash equilibrium behavior except under special circumstances.
In Chapter 6 we study fictitious play and its variants. These can be viewed as primitive forms of Bayesian learning in which players use simple statistical models based on past evidence and choose best (or almost best) replies given the evidence. Smoothed versions of fictitious play have long-run average behavior that is similar to that of no regret learning, but they do not converge to Nash equilibrium except in special cases.
We then examine the case in which players are very sophisticated in order to see how much can be gained from a high rationality perspective (Chapter 7). The major positive result here is due to Kalai and Lehrer (1993), who show that Bayesian learning leads to Nash equilibrium.
Our concern here is with the learnability of Nash equilibrium in games with finite action spaces. If one allows infinite action spaces, there exist games in which one cannot even determine in finite time whether a Nash equilibrium exists, let alone learn it by a decentralized process (Rabin, 1957; Prasad, 1991, 1997).