ridm@nrct.go.th   ระบบคลังข้อมูลงานวิจัยไทย   รายการโปรดที่คุณเลือกไว้

A Posterior Approach for Microphone Array Based Speech Recognition

หน่วยงาน Edinburgh Research Archive, United Kingdom

รายละเอียด

ชื่อเรื่อง : A Posterior Approach for Microphone Array Based Speech Recognition
นักวิจัย : Wang, Dong , Himawan, Ivan , Frankel, Joe , King, Simon
คำค้น : -
หน่วยงาน : Edinburgh Research Archive, United Kingdom
ผู้ร่วมงาน : -
ปีพิมพ์ : 2551
อ้างอิง : http://hdl.handle.net/1842/3907
ที่มา : -
ความเชี่ยวชาญ : -
ความสัมพันธ์ : -
ขอบเขตของเนื้อหา : -
บทคัดย่อ/คำอธิบาย :

Automatic speech recognition (ASR) becomes rather difficult in meetings domains because of the adverse acoustic conditions, including more background noise, more echo and reverberation and frequent cross-talking. Microphone arrays have been demonstrated able to boost ASR performance dramatically in such noisy and reverberant environments, with various beamforming algorithms. However, almost all existing beamforming measures work in the acoustic domain, resorting to signal processing theories and geometric explanation. This limits their application, and induces significant performance degradation when the geometric property is unavailable or hard to estimate, or if heterogenous channels exist in the audio system. In this paper, we preset a new posterior-based approach for array-based speech recognition. The main idea is, instead of enhancing speech signals, we try to enhance the posterior probabilities that frames belonging to recognition units, e.g., phones. These enhanced posteriors are then transferred to posterior probability based features and are modeled by HMMs, leading to a tandem ANN-HMM hybrid system presented by Hermansky et al.. Experimental results demonstrated the validity of this posterior approach. With the posterior accumulation or enhancement, significant improvement was achieved over the single channel baseline. Moreover, we can combine the acoustic enhancement and posterior enhancement together, leading to a hybrid acoustic-posterior beamforming approach, which works significantly better than just the acoustic beamforming, especially in the scenario with moving-speakers.

บรรณานุกรม :
Wang, Dong , Himawan, Ivan , Frankel, Joe , King, Simon . (2551). A Posterior Approach for Microphone Array Based Speech Recognition.
    กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom .
Wang, Dong , Himawan, Ivan , Frankel, Joe , King, Simon . 2551. "A Posterior Approach for Microphone Array Based Speech Recognition".
    กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom .
Wang, Dong , Himawan, Ivan , Frankel, Joe , King, Simon . "A Posterior Approach for Microphone Array Based Speech Recognition."
    กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom , 2551. Print.
Wang, Dong , Himawan, Ivan , Frankel, Joe , King, Simon . A Posterior Approach for Microphone Array Based Speech Recognition. กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom ; 2551.