Receding horizon cache and extreme learning machine based reinforcement learning

ridm@nrct.go.th ระบบคลังข้อมูลงานวิจัยไทย รายการโปรดที่คุณเลือกไว้

Receding horizon cache and extreme learning machine based reinforcement learning

หน่วยงาน Nanyang Technological University, Singapore

รายละเอียด

ชื่อเรื่อง	:	Receding horizon cache and extreme learning machine based reinforcement learning
นักวิจัย	:	Shao, Zhifei , Er, Meng Joo , Huang, Guang-Bin
คำค้น	:	DRNTU::Engineering::Electrical and electronic engineering
หน่วยงาน	:	Nanyang Technological University, Singapore
ผู้ร่วมงาน	:	-
ปีพิมพ์	:	2555
อ้างอิง	:	Shao, Z., Er, M. J., & Huang, G.-B. (2012). Receding Horizon Cache and Extreme Learning Machine Based Reinforcement Learning. 2012 12th International Conference on Control Automation Robotics & Vision (ICARCV), 1591-1596. , http://hdl.handle.net/10220/11704 , http://dx.doi.org/10.1109/ICARCV.2012.6485384
ที่มา	:	-
ความเชี่ยวชาญ	:	-
ความสัมพันธ์	:	-
ขอบเขตของเนื้อหา	:	-
บทคัดย่อ/คำอธิบาย	:	Function approximators have been extensively used in Reinforcement Learning (RL) to deal with large or continuous space problems. However, batch learning Neural Networks (NN), one of the most common approximators, has been rarely applied to RL. In this paper, possible reasons for this are laid out and a solution is proposed. Specifically, a Receding Horizon Cache (RHC) structure is designed to collect training data for NN by dynamically archiving state-action pairs and actively updating their Q-values, which makes batch learning NN much easier to implement. Together with Extreme Learning Machine (ELM), a new RL with function approximation algorithm termed as RHC and ELM based RL (RHC-ELM-RL) is proposed. A mountain car task was carried out to test RHC-ELM-RL and compare its performance with other algorithms.
บรรณานุกรม	:	APA Chicago MLA Vancouver Shao, Zhifei , Er, Meng Joo , Huang, Guang-Bin . (2555). Receding horizon cache and extreme learning machine based reinforcement learning. กรุงเทพมหานคร : Nanyang Technological University, Singapore. Shao, Zhifei , Er, Meng Joo , Huang, Guang-Bin . 2555. "Receding horizon cache and extreme learning machine based reinforcement learning". กรุงเทพมหานคร : Nanyang Technological University, Singapore. Shao, Zhifei , Er, Meng Joo , Huang, Guang-Bin . "Receding horizon cache and extreme learning machine based reinforcement learning." กรุงเทพมหานคร : Nanyang Technological University, Singapore, 2555. Print. Shao, Zhifei , Er, Meng Joo , Huang, Guang-Bin . Receding horizon cache and extreme learning machine based reinforcement learning. กรุงเทพมหานคร : Nanyang Technological University, Singapore; 2555.