A learning strategy using simulator for real hardware of swing-up pendulum
Abstract
We propose a hybrid machine learning method that uses both a simulator and real hardware. First, a simulator of the hardware is built from data acquired on the real hardware, using neural networks trained by back-propagation. Next, the target controller is trained by reinforcement learning using only the built simulator. Finally, the controller is applied to the real hardware. After the data sampling, neither the simulator learning nor the controller learning uses the real hardware, so the load on the hardware is lower than when learning directly on it, and the controller can be optimized faster than with real-time learning. As an example, we applied the method to the pendulum swing-up task, a typical nonlinear control problem, and the proposed method worked successfully.
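The three stages described in the abstract (data sampling, simulator learning, controller learning on the simulator only) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `real_step` is a hypothetical stand-in for the real hardware (an ideal pendulum), all constants and the network size are made up, and the controller is tuned by simple hill-climbing random search as a placeholder for the reinforcement learning method, whose details the abstract does not give.

```python
import numpy as np

rng = np.random.default_rng(0)
DT, OMEGA_MAX, U_MAX = 0.05, 6.0, 2.0  # assumed time step and value ranges

def real_step(theta, omega, u):
    """Hypothetical stand-in for the real hardware: an ideal pendulum,
    theta = 0 hanging down, theta = pi upright."""
    omega_next = omega + DT * (-9.8 * np.sin(theta) + u)
    return theta + DT * omega_next, omega_next

def features(theta, omega, u):
    # Inputs scaled to roughly [-1, 1] so plain gradient descent behaves.
    return np.stack([np.cos(theta), np.sin(theta),
                     omega / OMEGA_MAX, u / U_MAX], axis=-1)

# 1) Data sampling: the only stage that touches the "hardware".
N = 2000
th = rng.uniform(-np.pi, np.pi, N)
om = rng.uniform(-OMEGA_MAX, OMEGA_MAX, N)
uu = rng.uniform(-U_MAX, U_MAX, N)
th2, om2 = real_step(th, om, uu)
X = features(th, om, uu)
Y = np.stack([th2 - th, om2 - om], axis=-1)  # targets: change in state

# 2) Simulator: one-hidden-layer network trained by back-propagation.
H = 16
W1 = rng.normal(0.0, 0.5, (4, H)); b1 = np.zeros(H)
W2 = rng.normal(0.0, 0.5, (H, 2)); b2 = np.zeros(2)

def forward(x):
    h = np.tanh(x @ W1 + b1)
    return h, h @ W2 + b2

def mse(x, y):
    return float(np.mean((forward(x)[1] - y) ** 2))

loss_before = mse(X, Y)
lr = 0.05
for _ in range(2000):  # full-batch gradient descent on the mean squared error
    h, pred = forward(X)
    d_out = 2.0 * (pred - Y) / pred.size
    d_hid = (d_out @ W2.T) * (1.0 - h ** 2)
    W2 -= lr * (h.T @ d_out); b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * (X.T @ d_hid); b1 -= lr * d_hid.sum(axis=0)
loss_after = mse(X, Y)

def sim_step(theta, omega, u):
    # One step of the learned simulator; no hardware access.
    d = forward(features(theta, omega, u))[1]
    return theta + d[0], omega + d[1]

# 3) Controller learning on the simulator only (hill climbing as a placeholder
#    for reinforcement learning). Reward -cos(theta) is largest when upright.
def episode_return(params, step, steps=100):
    theta, omega, total = 0.0, 0.0, 0.0
    for _ in range(steps):
        obs = np.array([np.cos(theta), np.sin(theta), omega / OMEGA_MAX])
        u = U_MAX * np.tanh(params @ obs)
        theta, omega = step(theta, omega, u)
        total += -np.cos(theta)
    return total

params = np.zeros(3)
start_return = best = episode_return(params, sim_step)
for _ in range(50):
    cand = params + 0.5 * rng.normal(size=3)
    r = episode_return(cand, sim_step)
    if r > best:  # keep a candidate only if it improves the simulated return
        params, best = cand, r

# 4) Apply the tuned controller to the (stand-in) real hardware.
real_return = episode_return(params, real_step)
```

Only stage 1 and stage 4 call `real_step`; everything in between runs against the learned model, which is the point of the proposed scheme. Whether a controller tuned this way actually swings the pendulum up depends on the model quality and the search method, so the sketch makes no claim about the result.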
Papers from the Japan Society for Fuzzy Theory and Intelligent Informatics (日本知能情報ファジィ学会)
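- Behavior control and behavior analysis of autonomous agents using FCN: application to the Tartarus problem
- Conflict, hesitation, and decision making (Decision Making)
- Similarity research in cognitive psychology (Similarity Measures and Information Retrieval)
- An account of studying abroad in the United States
- A dialogue system using the positioning of meaning within context, and its evaluation (Intelligent Information Processing of Language and Text)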