29 Jun 2011Working Paper Summaries

Better-reply Dynamics in Deferred Acceptance Games

by Guillaume Haeringer and Hanna Halaburda

There's an inherent problem in the market design theory known as mechanism design, in that the players in the market may not understand the design, and thus may make bad choices until they learn to work the system better. This paper explores the issue of learning the design. It focuses on a particular mechanism, the Deferred Acceptance algorithm for two-sided matching markets, which is used in many real-life markets. Research was conducted by Guillaume Haeringer of Universitat Autonoma de Barcelona and Hanna Halaburda of Harvard Business School. Key concepts include:

In the Deferred Acceptance algorithm, matches are made in a series of rounds, until everyone is matched up. The matching achieved through DA has a special property of "stability." In a stable matching, if an individual tries for a better choice than the one initially assigned by the matching, he learns that his ideal choice is already taken—matched to someone more preferred than he is.
The researchers discuss several possibilities of "better-reply dynamics," in which savvy players figure out what is the optimal strategy for getting the best possible match.
They find that even in simple two-sided matching models, the learning is difficult and takes a long time.

Author Abstract

In this paper we address the question of learning in a two-sided matching mechanism that utilizes the deferred acceptance algorithm. We consider a repeated matching game where at each period agents observe their match and have the opportunity to revise their strategy (i.e., the preference list they will submit to the mechanism). We focus in this paper on better-reply dynamics. To this end, we first provide a characterization of better-replies and a comprehensive description of the dominance relation between strategies. Better-replies are shown to have a simple structure and can be decomposed into four types of changes. We then present a simple better-reply dynamics with myopic and boundedly rational agents and identify conditions that ensure that limit outcomes are outcome equivalent to the outcome obtained when agents play their dominant strategies. Better-reply dynamics may not converge, but if they do converge, then the limit strategy profiles constitute a subset of the Nash equilibria of the stage game.

Paper Information

Full Working Paper Text
Working Paper Publication Date: June 2011
HBS Working Paper Number: 11-126
Faculty Unit(s): Strategy

Trending

- 15 Apr 2024
- Book
Struggling With a Big Management Decision? Start by Asking What Really Matters
- 11 Apr 2024
- In Practice
Why Progress on Immigration Might Soften Labor Pains
- 02 Apr 2024
- What Do You Think?
What's Enough to Make Us Happy?
- 24 Jan 2024
- Op-Ed
Why Boeing’s Problems with the 737 MAX Began More Than 25 Years Ago
- 09 Apr 2024
- Book
Why Work Rituals Bring Teams Together and Create More Meaning

Find Related Articles

Sign up for our weekly newsletter

Interested in improving your business? Learn about fresh research and ideas from Harvard Business School faculty.

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Better-reply Dynamics in Deferred Acceptance Games

Author Abstract

Paper Information

Struggling With a Big Management Decision? Start by Asking What Really Matters

Why Progress on Immigration Might Soften Labor Pains

What's Enough to Make Us Happy?

Why Boeing’s Problems with the 737 MAX Began More Than 25 Years Ago

Why Work Rituals Bring Teams Together and Create More Meaning

Sign up for our weekly newsletter