| ชื่อเรื่อง | : | Geodesic Gaussian kernels for value function approximation |
| นักวิจัย | : | Sugiyama, Masashi , Hachiya, Hirotaka , Towell, Christopher , Vijayakumar, Sethu |
| คำค้น | : | Reinforcement learning , Value function approximation , Markov decision process , Least-squares policy iteration , Gaussian kernel |
| หน่วยงาน | : | Edinburgh Research Archive, United Kingdom |
| ผู้ร่วมงาน | : | - |
| ปีพิมพ์ | : | 2551 |
| อ้างอิง | : | http://www.springerlink.com/content/4j2g52m1272hj185/ , http://hdl.handle.net/1842/3697 , 10.1007/s10514-008-9095-6 , 09295593 |
| ที่มา | : | - |
| ความเชี่ยวชาญ | : | - |
| ความสัมพันธ์ | : | - |
| ขอบเขตของเนื้อหา | : | - |
| บทคัดย่อ/คำอธิบาย | : | The least-squares policy iteration approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular and useful choice as a basis function. However, it does not allow for discontinuity which typically arises in real-world reinforcement learning tasks. In this paper, we propose a new basis function based on geodesic Gaussian kernels, which exploits the non-linear manifold structure induced by the Markov decision processes. The usefulness of the proposed method is successfully demonstrated in simulated robot arm control and Khepera robot navigation. |
| บรรณานุกรม | : |
Sugiyama, Masashi , Hachiya, Hirotaka , Towell, Christopher , Vijayakumar, Sethu . (2551). Geodesic Gaussian kernels for value function approximation.
กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom . Sugiyama, Masashi , Hachiya, Hirotaka , Towell, Christopher , Vijayakumar, Sethu . 2551. "Geodesic Gaussian kernels for value function approximation".
กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom . Sugiyama, Masashi , Hachiya, Hirotaka , Towell, Christopher , Vijayakumar, Sethu . "Geodesic Gaussian kernels for value function approximation."
กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom , 2551. Print. Sugiyama, Masashi , Hachiya, Hirotaka , Towell, Christopher , Vijayakumar, Sethu . Geodesic Gaussian kernels for value function approximation. กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom ; 2551.
|
