Author Abstract
In this paper we address the question of learning in a two-sided matching mechanism that utilizes the deferred acceptance algorithm. We consider a repeated matching game where at each period agents observe their match and have the opportunity to revise their strategy (i.e., the preference list they will submit to the mechanism). We focus in this paper on better-reply dynamics. To this end, we first provide a characterization of better-replies and a comprehensive description of the dominance relation between strategies. Better-replies are shown to have a simple structure and can be decomposed into four types of changes. We then present a simple better-reply dynamics with myopic and boundedly rational agents and identify conditions that ensure that limit outcomes are outcome equivalent to the outcome obtained when agents play their dominant strategies. Better-reply dynamics may not converge, but if they do converge, then the limit strategy profiles constitute a subset of the Nash equilibria of the stage game.
Paper Information
- Full Working Paper Text
- Working Paper Publication Date: June 2011
- HBS Working Paper Number: 11-126
- Faculty Unit(s): Strategy