Q-ee Learning : A Novel Q-Learning Method with Exploitation and Exploration
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes a Q-ee learning method which combines Exploitation and Exploration in Q-learning. By propagating Q-values to every rule implemented in one episode to utilize Q-values efficiently and with the action-selecting mechanism to explore actively, this method takes the advantage of both exploitation and exploration. The Q-ee learning method accelerates the learning rate and guarantees to obtain an optimum in a deterministic MDPs. In learning process, an agent uses an unified strategy. Experiments demonstrate this method is efficient.
- 社団法人人工知能学会の論文
- 1999-07-01
著者
-
TATSUMI Shoji
Faculty of Engineering, Osaka City University
-
Tatsumi Shoji
Faculty Of Engineering Osaka City University
-
Tatsumi S
Osaka City Univ. Osaka‐shi Jpn
-
Zhao Gang
Fujitsu Kansai-chubu Net-tech Limited
-
Zhao Gang
National Astronomical Observatories Chinese Acad. Of Sci. Beijing Chn
-
SUN Ruoying
Faculty of Engineering, Osaka City University
-
ZHAO Gang
Faculty of Engineering, Osaka City University
-
SUN Ruoying
College of Industry and Commerce Management, Liaoning University
-
Sun Ruoying
Faculty Of Engineering Osaka City University
関連論文
- Parallel Genetic Algorithm for Constrained Clustering
- Parallel Genetic Algorithms Based on a Multiprocessor System FIN and Its Application
- Boltzmann Machine and Parallel Genetic Algorithms Based on the Fin
- A PARALLEL IMPLEMENTATION OF THE LEARNING CLASSIFIER SYSTEMS ON THE FIN-1
- Substellar Companions to Evolved Intermediate-Mass Stars : HD 145457 and HD 180314
- Detection of Small-Amplitude Oscillations in the G-Giant HD 76294 (ζ Hydrae)
- Calculation of Photoionized Plasmas with a Detailed-Configuration-Accounting Atomic Model
- Multiagent Cooperating Learning Methods by Indirect Media Communication(Neural Netoworks and Bioengineering)
- On the Spectroscopic Determination of Atmospheric Parameters and O/Fe Abundances of RR Lyrae Stars
- Multiagent Cooperating Learning Methods by Indirect Media Communication
- Na I D Lines in the SN 2002ap Spectrum
- On the Abundance of Potassium in Metal-Poor Stars
- α Element Abundances in Mildly Metal-Poor Stars
- Convergence of the Q-ae Learning on Deterministic MDPs and Its Efficiency on the Stochastic Environment
- RTP-Q: A Reinforcement Learning System with Time Constraints Exploration Planning for Accelerating the Learning Rate
- Q-ee Learning : A Novel Q-Learning Method with Exploitation and Exploration
- An Accelerated k-Certainty Exploration Method
- Electron Impact Excitation of Ti XVIII
- Applying Genetic Algorithm to Conceptual Clustering
- Algorithms for Matrix Multiplication and the FFT on a Processor Array with Separable Buses(Regular Section)
- Solving an All-Pairs Shortest Paths Problem on a Processor Array with Separable Buses
- A Pattern Defect Inspection Method by Grayscale Image Comparison without Precise Image Alignment
- Electron Impact Excitation of N-like Ca XIV