Inertial Estimator Learning Automata
Abstract
This paper presents an inertial estimator learning automata scheme in which both the short-term and long-term perspectives of the environment are incorporated in the stochastic estimator: the long-term information is crystallized in the running reward-probability estimates, and the short-term information is captured by considering whether the most recent response was a reward or a penalty. Thus, when the short-term perspective is considered, the stochastic estimator becomes pertinent in the context of the estimator algorithms. The proposed automata employ an inertial weight estimator as the short-term perspective to achieve rapid and accurate convergence when operating in stationary random environments. Under the proposed inertial estimator scheme, the estimates of the reward probabilities of the actions are affected by the last response from the environment. In this way, actions that have recently received a positive response from the environment have the opportunity to be estimated as “optimal”, to increase their choice probability and, consequently, to be selected. The estimates become more reliable and, consequently, the automaton converges rapidly and accurately to the optimal action. The asymptotic behavior of the proposed scheme is analyzed, and it is proved to be ε-optimal in every stationary random environment. Extensive simulation results indicate that the proposed algorithm converges faster than the traditional stochastic-estimator-based SERI scheme and the deterministic-estimator-based DGPA and DPRI schemes when operating in stationary random environments.
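The idea of combining a long-term running estimate with a short-term "last response" term can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not give the update equations, so the combination rule (`inertia * last / pulls`), the continuous pursuit-style reward-inaction update, the warm-up of 10 samples per action, and all names and parameter values (`run_inertial_pursuit`, `rate`, `inertia`) are assumptions made for illustration only.

```python
import random


def run_inertial_pursuit(reward_probs, steps=5000, rate=0.01, inertia=0.1, seed=0):
    """Toy estimator automaton in a stationary Bernoulli environment.

    ASSUMPTION: the estimate used to pick the "best" action is
        rewards[i] / pulls[i] + inertia * last[i] / pulls[i]
    which is a hypothetical stand-in for the paper's inertial estimator.
    """
    rng = random.Random(seed)
    r = len(reward_probs)
    probs = [1.0 / r] * r   # action-selection probabilities
    rewards = [0] * r       # number of rewards received per action
    pulls = [0] * r         # number of times each action was tried
    last = [0] * r          # 1 if the most recent response to the action was a reward

    def respond(i):
        """Query the environment for action i and update the statistics."""
        beta = 1 if rng.random() < reward_probs[i] else 0
        rewards[i] += beta
        pulls[i] += 1
        last[i] = beta
        return beta

    # Warm-up (assumed): sample every action a few times so estimates exist.
    for i in range(r):
        for _ in range(10):
            respond(i)

    for _ in range(steps):
        i = rng.choices(range(r), weights=probs)[0]
        beta = respond(i)
        # Long-term running estimate plus a short-term term from the last response.
        est = [(rewards[j] + inertia * last[j]) / pulls[j] for j in range(r)]
        best = max(range(r), key=est.__getitem__)
        if beta == 1:
            # Reward-inaction pursuit step: move probability mass toward `best`.
            probs = [(1.0 - rate) * p for p in probs]
            probs[best] += rate
    return probs


if __name__ == "__main__":
    print(run_inertial_pursuit([0.9, 0.2, 0.1]))
```

In this sketch the short-term term shrinks as an action is tried more often, so it mainly influences the ranking early on; whether the paper's inertial weight behaves this way is not stated in the abstract.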
Authors
-
NI Lina
College of Info Sci & Engi, Shandong University of Science & Technology
-
Gao Shangce
Department of Intellectual Information Systems Engineering, University of Toyama
-
XIE Chen
Department of Computer Science and Technology, the Key Laboratory of Embedded System and Service Computing, Ministry of Education; Tongji University
-
TANG Zheng
Department of Computer Science and Technology, the Key Laboratory of Embedded System and Service Computing, Ministry of Education; Tongji University
-
ZHANG Junqi
Department of Computer Science and Technology, the Key Laboratory of Embedded System and Service Computing, Ministry of Education; Tongji University
Related papers
- Hybrid Uniform Distribution of Particle Swarm Optimizer
- Affinity Based Lateral Interaction Artificial Immune System(Human-computer Interaction)
- Avoiding the Local Minima Problem in Backpropagation Algorithm with Modified Error Function(Neural Networks and Bioengineering)
- An Improved Maximum Neural Network with Stochastic Dynamics Characteristic for Maximum Clique Problem
- An Artificial Immune System with Feedback Mechanisms for Effective Handling of Population Size
- High accuracy microprocessor controlled sine-wave generator
- Quantum Interference Crossover-Based Clonal Selection Algorithm and Its Application to Traveling Salesman Problem
- An Improved Local Search Learning Method for Multiple-Valued Logic Network Minimization with Bi-objectives
- An Expanded Lateral Interactive Clonal Selection Algorithm and Its Application
- Improved Clonal Selection Algorithm Combined with Ant Colony Optimization
- An Improved Clonal Selection Algorithm and Its Application to Traveling Salesman Problems(Neural Networks and Bioengineering)
- A Novel Clonal Selection Algorithm and Its Application to Traveling Salesman Problem(Neural Networks and Bioengineering)
- A stochastic dynamic local search method for learning Multiple-Valued Logic networks
- An Improved Artificial Immune System (AIS) by Considering Different Affinities among Th Cells and Antigens
- A Chaotic Clonal Selection Algorithm and its Application to Synthesize Multiple-Valued Logic Functions
- Local Search with Probabilistic Modeling for Learning Multiple-Valued Logic Networks
- A study of chinese accounting system
- T-model natural network for PCM encoding
- An MOS four-quadrant analog multiplier using +-0.75-V power supply
- A fully-piped multiplier using the redundant binary addition tree
- AMT-PSO : An Adaptive Magnification Transformation Based Particle Swarm Optimizer
- Construction-and-extraction Based Index for Images Retrieval
- Adaptive Bare Bones Particle Swarm Inspired by Cloud Model
- Inertial Estimator Learning Automata