Multi-order Rule Accumulation for an Agent Control Problem in Non-Markov Environments
スポンサーリンク
概要
- 論文の詳細を見る
Multi-agent control in non-Markov environments is difficult because the environment information is partially observable. Agents suffer from the perceptual aliasing problem and couldn't take proper actions. In order to solve this problem, this paper proposes a rule-based model named "multi-order rule accumulation" to guide agent's actions in non-Markov environments. The advantages are, firstly, each multi-order rule memorizes the past environment information and agent's actions, which serves as the additional information to distinguish the aliasing situations, secondly, multi-order rules are very general, so that they are competent for guiding agents' actions in Partial Observable Markov Decision Process (POMDP), thirdly, multi-order rules are accumulated throughout the generations, which could cover many situations experienced in different generations. This also helps agents to take proper actions. Simulations on the tile-world problem prove that this rule-based model outperforms the conventional methods and the previous research.
- 電気学会 ; 1972-の論文
著者
-
Hirasawa Kotaro
Graduate School Of Information Production And System Waseda University
-
Mabu Shingo
Graduate School Of Information Production And System Waseda University
-
WANG Lutao
Graduate School of Information, Production and Systems, Waseda University
関連論文
- Real Time Updating Genetic Network Programming for Adapting to the Change of Stock Prices
- Elevator Group Supervisory Control System Using Genetic Network Programming with Macro Nodes and Reinforcement Learning
- A Double-Deck Elevator Group Supervisory Control System with Destination Floor Guidance System Using Genetic Network Programming
- Network Intrusion Detection Using Class Association Rule Mining Based on Genetic Network Programming
- Application of Universal Learning Networks to PV-Supplied DC Moter Drives
- Universal Learning Network-Based Fuzzy System and Its Application to Non-Linear Control System
- 1C3-4 Benchmark Test of RasID-GA for Inequality/Equality Constrained Optimization
- Task-Oriented Reinforcement Learning for Continuing Task in Dynamic Environment
- Propagation and control of stochastic signals through universal learning networks
- A functions localized neural network with branch gates
- Hybrid Universal Learning Networks
- Genetic Network Programming with Reinforcement Learning and Its Application to Making Mobile Robot Behavior
- EvoCMAR : A New Evolutionary Method to Directly Mine Association Rules for Classification
- Time Related Class Association Rule Mining and Its Application to Traffic Prediction
- Stock Price Prediction using Neural Networks with RasID-GA
- A Nonlinear Model to Rank Association Rules Based on Semantic Similarity and Genetic Network Programing
- Buying and Selling Stocks of Multi Brands Using Genetic Network Programming with Control Nodes
- A Traffic-Flow-Adaptive Controller of Double-Deck Elevator Systems using Genetic Network Programming
- Association Rule Mining for Continuous Attributes using Genetic Network Programming
- KDI-Based Robust Fault Detection in Presence of Nonlinear Undermodeling
- A Hybrid Quasi-ARMAX Modeling Scheme for Identification of Nonlinear Systems
- Learning Petri Network and Its Application to Non-linear System Control
- Elevator Group Control Using Multiagent Task-Oriented Reinforcement Learning
- A New Learning Method Using Local and Global Information for Neural Networks
- Increasing Robustness of Binary-coded Genetic Algorithm
- An Incremental Learning of Neural Network with Multiplication Units for Function Approximation
- Behavior Learning of Autonomous Robots by Modified Learning Vector Quantization
- Multiple-Round English Auction Agent Based on Genetic Network Programming
- Enhancing the Generalization Ability of Neural Networks by Using Gram-Schmidt Orthogonalization Algorithm
- Mining Fuzzy Association Rules : A General Model Based on Genetic Network Programming and its Applications
- Fuzzy Intertransaction Class Association Rule Mining using Genetic Network Programming for Stock Market Prediction
- システム/情報 A New Method Based on Determining Error Surface for Designing Three Layer Neural Networks
- A New Learning Method Using Prior Information of Neural Networks
- An Efficient Preprocessing Method for Suboptimal Route Computation
- Multicar Elevator Group Supervisory Control System using Genetic Network Programming
- Robust Control for System Parameter Perturbation Using Second Order Derivatives of Universal Learning Network
- A new learning method using local and global information for neural networks
- Evolutionary Selection of Interesting Class Association Rules Using Genetic Relation Algorithm
- MBFP Generalized Association Rule Mining and Classification in Traffic Volume Prediction
- Analysis of Energy Consumption of Elevator Group Supervisory Control System Based on Genetic Network Programming
- Image Denoising Using Pulse Coupled Neural Network with an Adaptive Pareto Genetic Algorithm
- A Portfolio Selection Model Using Genetic Relation Algorithm and Genetic Network Programming
- Network Structure Oriented Evolutionary Model : Genetic Network Programming : Its Comparison with Genetic Programming
- Dynamic Optimal Route Search Algorithm for Car Navigation Systems with Preferences by Dynamic Programming
- A Neurofuzzy-Based Adaptive Predictor for Control of Nonlinear Systems
- Dynamic Traffic Management using Temperature Parameter Control in Q value-based Dynamic Programming with Boltzmann Distribution
- Pruning of Redundant Information to Improve Performance for Agent Control in A Changing Environment
- Pruning of Redundant Information to Improve Performance for Agent Control in A Changing Environment
- Multi-order Rule Accumulation for an Agent Control Problem in Non-Markov Environments
- Multi-order Rule Accumulation for an Agent Control Problem in Non-Markov Environments
- A Genetic Network Programming-based Bidding Strategy with Adjusting Parameters for Large-scale Continuous Double Auction
- Distance-based Classification using Average Matching Degree and its Application to Intrusion Detection Systems
- Efficient Pruning of Class Association Rules Using Statistics and Genetic Relation Algorithm
- Genetic Network Programming with Reconstructed Individuals
- A Genetic Network Programming-based Bidding Strategy with Adjusting Parameters for Large-scale Continuous Double Auction
- Q Value-Based Dynamic Programming with Boltzmann Distribution in Large Scale Road Network
- Intertransaction Class Association Rule Mining Based on Genetic Network Programming and Its Application to Stock Market Prediction
- Algorithm for Route Planning with Multiple Intermediate Destinations
- A Genetic Algorithm Based Clustering Method for Optimal Route Calculation on Multilevel Networks
- An Evolutionary Model of Multilateral Negotiation System