Learning the Balance between Exploration and Exploitation via Reward (Special Section on Concurrent Systems Technology)
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes a new method to improve cooperation in concurrent systems within the framework of Multi-Agent Systems (MAS) by utilizing reinforcement learning. When subsystems work independently and concurrently, achieving appropriate cooperation among them is important to improve the effectiveness of the overall system. Treating subsystems as agents makes it easy to explicitly deal with the interactions among them since they can be modeled naturally as communication among agents with intended information. In our approach agents try to learn the appropriate balance between exploration and exploitation via reward, which is important in distributed and concurrent problem solving in general. By focusing on how to give reward in reinforcement learning, not the learning equation, two kinds of reward are defined in the context of cooperation between agents, in contrast to reinforcement learning within the framework of single agent. In our approach reward for insistence by individual agent contributes to facilitating exploration and reward for concession to other agents contributes to facilitating exploitation. Our cooperation method was examined through experiments on the design of micro satellites and the result showed that it was effective to some extent to facilitate cooperation among agents by letting agents themselves learn the appropriate balance between insistence and concession. The result also suggested the possibility of utilizing the relative magnitude of these rewards as a new control parameter in MAS to control the overall behavior of MAS.
- 社団法人電子情報通信学会の論文
- 1999-11-25
著者
-
NAKASUKA Shinichi
Department of Aeronautics and Astronautics, the University of Tokyo
-
Nakasuka S
Department Of Aeronautics And Astronautics The University Of Tokyo
-
Nakasuka Shinichi
Department Of Aeronautics And Astronautics The University Of Tokyo
-
Nakasuka S
University Of Tokyo
-
Nakasuka Shinichi
Rcast University Of Tokyo
-
Nakasuka Shinichi
Research Center For Advanced Science And Technology University Of Tokyo
-
Hori K
Rcast University Of Tokyo
-
Yoshida T
Graduate School Of Engineering Science Osaka University
-
Nakamura Shinobu
Mitsubishi Heavy Industries Ltd.
-
YOSHIDA Tetsuya
Graduate School of Engineering Science, Osaka University
-
HORI Koichi
RCAST, University of Tokyo
-
Nishida Shinichiro
Space Technology Demonstration Research Center Jaxa
-
Yoshida Tetsuya
Graduate School Of Engineering Science Osaka University
関連論文
- University of Tokyo Nano Satellite Project “PRISM”
- Space Technology Demonstration by Small and Micro Satellites in Jaxa(WSANE2006)
- University of Tokyo's CubeSat Project - Mission Concept and Subsystem Design -
- On-orbit servicing and its breakthrough approach using small satellite : Rendezvous and fly-around technology for uncooperative target(WSANE2006)
- BS-6-12 Development of Equipments for Rendezvous Fly-around Experiments on SmartSat-1(BS-6.Observation, communication and positioning technology of equipments mounted on satellite, aircraft, ship and ground,ENGLISH SESSION)
- SmartSat-1 : On Orbit Experiment Plan Using Mini-Satellite(Small satellite (2), Workshop for Space, Aeronautical and Navigational Electronics (W S A N E 2005))
- SmartSat-1 : On Orbit Experiment Plan Using Mini-Satellite
- On-Orbit Experiment of Vision-Based Motion Estimation and Tracking of Tumbling Object in Space
- Moon Tracking Attitude Control Experiment of a Bias Momentum Micro Satellite "μ-LabSat"
- Results of the MicroLabSat project(Small satellite (2), Workshop for Space, Aeronautical and Navigational Electronics (W S A N E 2005))
- Results of the MicroLabSat project
- Moon Tracking Attitude Control Experiment of a Bias Momentum Micro Satellite “μ-LabSat”
- SB-8-5 In-orbit Demonstration of Orbital Maintenance System Using COTS Technologies(Small Satellite 1)(SB-8. Results of Various Satellite Missions)
- VISUAL FEEDBACK ATTITUDE CONTROL EXPERIMENT OF A BIAS MOMENTUM MICRO SATELLITE
- Preliminary experiments on technologies for satellite orbital maintenance using Micro-LabSat 1
- SB-8-9 UNIVERSITY OF TOKYO'S NANO-SATELLITE CUBESAT-XI : RESULTS OF LAUNCH AND ON-ORBIT OPERATIONS(Small Satellite 2)(SB-8. Results of Various Satellite Missions)
- ANOMALY DETECTION METHOD FOR SPACECRAFTS BASED ON ASSOCIATION RULE MINING
- Learning the Balance between Exploration and Exploitation via Reward (Special Section on Concurrent Systems Technology)
- A Cooperation Method via Metaphor of Explanation(Special Section on Concurrent Systems Technology)
- AUTOMATIC ACQUISITION OF REACTIVE BEHAVIOR FOR PLANETARY ROVERS
- Feasibility Study on Enhancement of OTV Capability by Tether for Orbital Transfer Operations
- MICRO SATELLITE AND SPACE EDUCATION