Model-Based Reinforcement Learning in Multiagent Systems with Sequential Action Selection
スポンサーリンク
概要
- 論文の詳細を見る
Model-based reinforcement learning uses the gathered information, during each experience, more efficiently than model-free reinforcement learning. This is especially interesting in multiagent systems, since a large number of experiences are necessary to achieve a good performance. In this paper, model-based reinforcement learning is developed for a group of self-interested agents with sequential action selection based on traditional prioritized sweeping. Every single situation of decision making in this learning process, called extensive Markov game, is modeled as n-person general-sum extensive form game with perfect information. A modified version of backward induction is proposed for action selection, which adjusts the tradeoff between selecting subgame perfect equilibrium points, as the optimal joint actions, and learning new joint actions. The algorithm is proved to be convergent and discussed based on the new results on the convergence of the traditional prioritized sweeping.
著者
-
AKRAMIZADEH Ali
Dept. EE., Computational Intelligence and Large Scale System Lab., Amir Kabir University of Technolo
-
AFSHAR Ahmad
Dept. EE., Computational Intelligence and Large Scale System Lab., Amir Kabir University of Technolo
-
MENHAJ Mohammad
Dept. EE., Computational Intelligence and Large Scale System Lab., Amir Kabir University of Technolo
-
JAFARI Samira
Dept. EE., Computational Intelligence and Large Scale System Lab., Amir Kabir University of Technolo