An Accelerated k-Certainty Exploration Method
スポンサーリンク
概要
- 論文の詳細を見る
Reinforcement learning aims to adapt an agent system to an unknown environment according to rewards. On Markov decision processes(MDPs), if a correct environment model is identified, an optimal policy can be derived by applying policy iteration algorithm. For identifying an environment on MDPs, k-Certainty Exploration Method has been proposed. But the k-certainty exploration does not give the direct path to the states with k-uncertainty rules that may result in an agent taking useless steps as exploring an environment. In this paper, we propose an Accelerated k-Certainty Exploration Method which speeds up the k-certainty exploration rate. By simulations, this method is demonstrated to be efficient, and, an satisfied exploration result is also gotten in larger environment with rules numbet 5x10^4.
- 社団法人人工知能学会の論文
- 1999-05-01
著者
-
TATSUMI Shoji
Faculty of Engineering, Osaka City University
-
Tatsumi Shoji
Faculty Of Engineering Osaka City University
-
Tatsumi S
Osaka City Univ. Osaka‐shi Jpn
-
Zhao Gang
Fujitsu Kansai-chubu Net-tech Limited
-
Zhao Gang
National Astronomical Observatories Chinese Acad. Of Sci. Beijing Chn
-
SUN Ruoying
Faculty of Engineering, Osaka City University
-
ZHAO Gang
Faculty of Engineering, Osaka City University
-
SUN Ruoying
College of Industry and Commerce Management, Liaoning University
-
Sun Ruoying
Faculty Of Engineering Osaka City University
関連論文
- Parallel Genetic Algorithm for Constrained Clustering
- Parallel Genetic Algorithms Based on a Multiprocessor System FIN and Its Application
- Boltzmann Machine and Parallel Genetic Algorithms Based on the Fin
- A PARALLEL IMPLEMENTATION OF THE LEARNING CLASSIFIER SYSTEMS ON THE FIN-1
- Substellar Companions to Evolved Intermediate-Mass Stars : HD 145457 and HD 180314
- Detection of Small-Amplitude Oscillations in the G-Giant HD 76294 (ζ Hydrae)
- Calculation of Photoionized Plasmas with a Detailed-Configuration-Accounting Atomic Model
- Multiagent Cooperating Learning Methods by Indirect Media Communication(Neural Netoworks and Bioengineering)
- On the Spectroscopic Determination of Atmospheric Parameters and O/Fe Abundances of RR Lyrae Stars
- Multiagent Cooperating Learning Methods by Indirect Media Communication
- Na I D Lines in the SN 2002ap Spectrum
- On the Abundance of Potassium in Metal-Poor Stars
- α Element Abundances in Mildly Metal-Poor Stars
- Convergence of the Q-ae Learning on Deterministic MDPs and Its Efficiency on the Stochastic Environment
- RTP-Q: A Reinforcement Learning System with Time Constraints Exploration Planning for Accelerating the Learning Rate
- Q-ee Learning : A Novel Q-Learning Method with Exploitation and Exploration
- An Accelerated k-Certainty Exploration Method
- Electron Impact Excitation of Ti XVIII
- Applying Genetic Algorithm to Conceptual Clustering
- Algorithms for Matrix Multiplication and the FFT on a Processor Array with Separable Buses(Regular Section)
- Solving an All-Pairs Shortest Paths Problem on a Processor Array with Separable Buses
- A Pattern Defect Inspection Method by Grayscale Image Comparison without Precise Image Alignment
- Electron Impact Excitation of N-like Ca XIV