| Title | : | A comparison of phone and grapheme-based spoken term detection |
| Researchers | : | Wang, Dong; Frankel, Joe; Tejedor, Javier; King, Simon |
| Keywords | : | - |
| Organization | : | Edinburgh Research Archive, United Kingdom |
| Collaborators | : | - |
| Year of publication | : | 2008 (2551 BE) |
| References | : | 978-1-4244-1483-3, http://hdl.handle.net/1842/3837, 10.1109/ICASSP.2008.4518773, 1520-6149 |
| Source | : | - |
| Expertise | : | - |
| Relation | : | - |
| Coverage | : | - |
| Abstract/Description | : | We propose grapheme-based sub-word units for spoken term detection (STD). Compared to phones, graphemes have a number of potential advantages. For out-of-vocabulary search terms, phone-based approaches must generate a pronunciation using letter-to-sound rules. Using graphemes obviates this potentially error-prone hard decision, shifting pronunciation modelling into the statistical models describing the observation space. In addition, long-span grapheme language models can be trained directly from large text corpora. We present experiments on Spanish and English data, comparing phone and grapheme-based STD. For Spanish, where phone and grapheme-based systems give similar transcription word error rates (WERs), grapheme-based STD significantly outperforms a phone-based approach. The converse is found for English, where the phone-based system outperforms a grapheme approach. However, we present additional analysis which suggests that phone-based STD performance levels may be achieved by a grapheme-based approach despite lower transcription accuracy, and that the two approaches may usefully be combined. We propose a number of directions for future development of these ideas, and suggest that if grapheme-based STD can match phone-based performance, the inherent flexibility in dealing with out-of-vocabulary terms makes this a desirable approach. |
| Bibliography | : |
Wang, Dong, Frankel, Joe, Tejedor, Javier, King, Simon. (2008). A comparison of phone and grapheme-based spoken term detection. Bangkok: Edinburgh Research Archive, United Kingdom.
Wang, Dong, Frankel, Joe, Tejedor, Javier, King, Simon. 2008. "A comparison of phone and grapheme-based spoken term detection". Bangkok: Edinburgh Research Archive, United Kingdom.
Wang, Dong, Frankel, Joe, Tejedor, Javier, King, Simon. "A comparison of phone and grapheme-based spoken term detection." Bangkok: Edinburgh Research Archive, United Kingdom, 2008. Print.
Wang, Dong, Frankel, Joe, Tejedor, Javier, King, Simon. A comparison of phone and grapheme-based spoken term detection. Bangkok: Edinburgh Research Archive, United Kingdom; 2008.
|
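
The abstract's point about out-of-vocabulary search terms can be illustrated with a minimal sketch. It is not the authors' system: the toy lexicon, the function names, and the one-letter-per-phone fallback rule are all hypothetical stand-ins chosen for illustration. The sketch only shows where the hard pronunciation decision arises in a phone-based query and why a grapheme-based query avoids it.

```python
from typing import Callable, Dict, List

# Hypothetical toy pronunciation lexicon (assumption; not from the paper).
TOY_LEXICON: Dict[str, List[str]] = {
    "spoken": ["s", "p", "ow", "k", "ah", "n"],
    "term": ["t", "er", "m"],
}


def naive_letter_to_sound(word: str) -> List[str]:
    """Placeholder letter-to-sound rule: one pseudo-phone per letter.

    Real letter-to-sound rules are far more complex; this stand-in only
    marks the point where a hard pronunciation decision has to be made.
    """
    return [ch for ch in word.lower() if ch.isalpha()]


def term_to_phones(
    term: str,
    lexicon: Dict[str, List[str]],
    letter_to_sound: Callable[[str], List[str]],
) -> List[str]:
    """Map a search term to phone units.

    In-vocabulary words use the lexicon; out-of-vocabulary words fall
    back to the (potentially error-prone) letter-to-sound rules.
    """
    units: List[str] = []
    for word in term.lower().split():
        if word in lexicon:
            units.extend(lexicon[word])
        else:
            units.extend(letter_to_sound(word))
    return units


def term_to_graphemes(term: str) -> List[str]:
    """Map a search term to grapheme units: its letters, no lexicon needed."""
    return [ch for ch in term.lower() if ch.isalpha()]
```

In this sketch, `term_to_graphemes` never consults a lexicon, so an unseen term is handled exactly like a known one. `term_to_phones` has to commit to a single pronunciation for any word missing from `TOY_LEXICON` before search can begin.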
