On the Effects of Domain Size and Complexity in Empirical Distribution of Reinforcement Learning(Artificial Intelligence and Cognitive Science)
スポンサーリンク
概要
- 論文の詳細を見る
We regard the events of a Markov decision process as the outputs from a Markov information source in order to analyze the randomness of an empirical sequence by the codeword length of the sequence. The randomness is an important viewpoint in reinforcement learning since the learning is to eliminate the randomness and to find an optimal policy. The occurrence of optimal empirical sequence also depends on the randomness. We then introduce the Lempel-Ziv coding for measuring the randomness which consists of the domain size and the stochastic complexity. In experimental results, we confirm that the learning and the occurrence of optimal empirical sequence depend on the randomness and show the fact that in early stages the randomness is mainly characterized by the domain size and as the number of time steps increases the randomness depends greatly on the complexity of Markov decision processes.
- 社団法人電子情報通信学会の論文
- 2005-01-01
著者
-
Iwata Kazunori
The Department Of Systems Science Graduate School Of Informatics Kyoto University
-
Ikeda Kazushi
The Department Of Systems Science Graduate School Of Informatics Kyoto University
-
Sakai Hideaki
The Department Of Systems Science Graduate School Of Informatics Kyoto University
-
Sakai H
Kyoto Univ. Kyoto‐shi Jpn
-
Ikeda K
Kyoto Univ. Kyoto‐shi Jpn
-
Sakai Hideaki
The Department Of Internal Medicine National Nishitaga Hospital
関連論文
- Propagation Characteristics of Boolean Functions and Their Balancedness
- Blind Identification of Multichannel Systems by Exploiting Prior Knowledge of the Channel (Special Section on Digital Signal Processing)
- Studies on the Convergence Speed of Over-Sampled Subband Adaptive Digital Filters(Special Section on Digital Signal Processing)
- The Asymptotic Equipartition Property in Reinforcement Learning and its Relation to Return Maximization
- On the Effects of Domain Size and Complexity in Empirical Distribution of Reinforcement Learning(Artificial Intelligence and Cognitive Science)
- Progressive Multifocal Leukoencephalopathy in a Patient with Acquired Immunodeficiency Syndrome (AIDS) Manifesting Gerstmann's Syndrome