Accelerate Learning Processes by Avoiding Inappropriate Rules in Transfer Learning for Actor-Critic
スポンサーリンク
概要
- 論文の詳細を見る
This paper aims to accelerate processesof actor-critic method, which is one of majorreinforcement learning algorithms, by a transferlearning. In general, reinforcement learning is usedto solve optimization problems. Learning agentsacquire a policy to accomplish the target task autonomously.To solve the problems, agents requirelong learning processes for trial and error. Transferlearning is one of effective methods to acceleratelearning processes of machine learning algorithms.It accelerates learning processes by usingprior knowledge from a policy for a source task. Wepropose an effective transfer learning algorithm foractor-critic method. Two basic issues for the transferlearning are method to select an effective sourcepolicy and method to reuse without negative transfer.In this paper, we mainly discuss the latter. We proposedthe reuse method which based on the selectionmethod that uses the forbidden rule set. Forbiddenrule set is the set of rules that cause immediate failureof tasks. It is used to foresee similarity betweena source policy and the target policy. Agents shouldnot transfer the inappropriate rules in the selectedpolicy. In actor-critic, a policy is constructed by twoparameter sets: action preferences and state values.To avoid inappropriate rules, agents reuse only reliableaction preferences and state values that implypreferred actions. We perform simple experimentsto show the effectiveness of the proposed method. Inconclusion, the proposed method accelerates learningprocesses for the target tasks.
論文 | ランダム
- 成人発症の亜急性硬化性全脳炎の終夜睡眠ポリグラフ
- 戦史検定で戦地慰霊碑を保全 政府がやらなきゃ僕らがやります
- 分布定数FETを用いた超広帯域MMICスイッチの開発
- 分布定数FETを用いた超広帯域MMICスイッチの開発
- SC-2-6 76GHz帯車載レーダ用MMICの開発