Asaf Amit and Shaul Markovitch. Learning to Bid in Bridge. Machine Learning, 63:287-327 2006.
Bridge bidding is considered to be one of the most difficult problems for game-playing programs. It involves four agents rather than two, including a cooperative agent. In addition, the partial observability of the game makes it impossible to predict the outcome of each action. In this paper we present a new decision-making algorithm that is capable of overcoming these problems. The algorithm allows models to be used for both opponent agents and partners, while utilizing a novel model-based Monte Carlo sampling method to overcome the problem of hidden information. The paper also presents a learning framework that uses the above decision-making algorithm for co-training of partners. The agents refine their selection strategies during training and continuously exchange their refined strategies. The refinement is based on inductive learning applied to examples accumulated for classes of states with conflicting actions. The algorithm was empirically evaluated on a set of bridge deals. The pair of agents that co-trained significantly improved their bidding performance to a level surpassing that of the current state-of-the-art bidding algorithm.
@article{Amit:2006:LBB,
Author = {Asaf Amit and Shaul Markovitch},
Title = {Learning to Bid in Bridge},
Year = {2006},
Journal = {Machine Learning},
Volume = {63},
Number = {3},
Month = {},
Pages = {287--327},
Url = {http://www.cs.technion.ac.il/~shaulm/papers/pdf/Amit-Markovitch-mlj2005.pdf},
Keywords = {Opponent Modeling, Games, Learning in Games, Multi-Agent Systems, Bridge},
Secondary-keywords = {Adversary Search},
Abstract = {
Bridge bidding is considered to be one of the most difficult
problems for game-playing programs. It involves four agents rather
than two, including a cooperative agent. In addition, the partial
observability of the game makes it impossible to predict the
outcome of each action. In this paper we present a new decision-
making algorithm that is capable of overcoming these problems. The
algorithm allows models to be used for both opponent agents and
partners, while utilizing a novel model-based Monte Carlo sampling
method to overcome the problem of hidden information. The paper
also presents a learning framework that uses the above decision-
making algorithm for co-training of partners. The agents refine
their selection strategies during training and continuously
exchange their refined strategies. The refinement is based on
inductive learning applied to examples accumulated for classes of
states with conflicting actions. The algorithm was empirically
evaluated on a set of bridge deals. The pair of agents that co-
trained significantly improved their bidding performance to a
level surpassing that of the current state-of-the-art bidding
algorithm.
}
}