| Title | : | Hierarchical dialogue optimization using semi-Markov decision processes. |
| Researchers | : | Cuayáhuitl, Heriberto; Renals, Steve; Lemon, Oliver; Shimodaira, Hiroshi |
| Keywords | : | speech technology |
| Organization | : | Edinburgh Research Archive, United Kingdom |
| Collaborators | : | - |
| Year of publication | : | 2007 (B.E. 2550) |
| Reference | : | Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Hierarchical dialogue optimization using semi-Markov decision processes. In Proc. of INTERSPEECH, August 2007. http://hdl.handle.net/1842/1994 |
| Source | : | - |
| Expertise | : | - |
| Relation | : | - |
| Coverage | : | - |
| Abstract/Description | : | This paper addresses the problem of dialogue optimization on large search spaces. To this end, we propose to learn dialogue strategies using multiple Semi-Markov Decision Processes and hierarchical reinforcement learning. This approach factorizes state variables and actions in order to learn a hierarchy of policies. Our experiments are based on a simulated flight booking dialogue system and compare flat versus hierarchical reinforcement learning. Experimental results show that the proposed approach produced a dramatic search space reduction (99.36%) compared with flat reinforcement learning, with a very small loss in optimality (on average 0.3 system turns). The results also show that the learnt policies outperformed a hand-crafted one under three different conditions of ASR confidence levels. This approach is appealing for dialogue optimization due to faster learning, reusable subsolutions, and scalability to larger problems. |
| Bibliography | : |
Cuayáhuitl, Heriberto, Renals, Steve, Lemon, Oliver, Shimodaira, Hiroshi. (2007). Hierarchical dialogue optimization using semi-Markov decision processes. Edinburgh Research Archive, United Kingdom.
Cuayáhuitl, Heriberto, Renals, Steve, Lemon, Oliver, Shimodaira, Hiroshi. 2007. "Hierarchical dialogue optimization using semi-Markov decision processes." Edinburgh Research Archive, United Kingdom.
Cuayáhuitl, Heriberto, Renals, Steve, Lemon, Oliver, Shimodaira, Hiroshi. "Hierarchical dialogue optimization using semi-Markov decision processes." Edinburgh Research Archive, United Kingdom, 2007. Print.
Cuayáhuitl, Heriberto, Renals, Steve, Lemon, Oliver, Shimodaira, Hiroshi. Hierarchical dialogue optimization using semi-Markov decision processes. Edinburgh Research Archive, United Kingdom; 2007.
|
