Statistical Parametric Speech Synthesis Using Deep Neural Network



Details

Title	:	Statistical Parametric Speech Synthesis Using Deep Neural Network
Researcher	:	Ge, Mengtian
Keywords	:	Deep Neural Network, Speech Synthesis
Organization	:	Edinburgh Research Archive, United Kingdom
Collaborators	:	Lu, Heng; King, Simon
Year of publication	:	2013 (B.E. 2556)
Reference	:	http://hdl.handle.net/1842/8658
Source	:	-
Expertise	:	-
Relation	:	-
Coverage	:	-
Abstract/Description	:	In this work, we implement a deep neural network (DNN) for a text-to-speech (TTS) system. We tried different settings for the number of DNN layers and the number of units per layer, and found that the three-layer DNN works better than the four-layer ones. We also pre-trained the best three-layer system (1000-1000-1000), and both objective and subjective test results show a significant improvement in synthesis quality after pre-training. The final pre-trained system obtains an average line spectral pair (LSP) root mean square error (RMSE) of 0.179096, beating the DNN-TTS benchmark of 0.187225.
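
The abstract reports an average LSP RMSE as its objective measure. As a rough illustration of how such a figure could be computed, the sketch below takes predicted and reference LSP frames, computes one RMSE per utterance, and averages over utterances. The function names, the per-utterance averaging, and the toy values are assumptions made for illustration; the record does not say how the thesis aggregates the error, and none of the numbers below come from it.

```python
import math
from typing import Iterable, Sequence, Tuple

# One inner sequence of LSP coefficients per frame (hypothetical layout).
Frames = Sequence[Sequence[float]]


def lsp_rmse(predicted: Frames, reference: Frames) -> float:
    """Root mean square error over every coefficient of every frame."""
    if len(predicted) != len(reference):
        raise ValueError("frame counts differ")
    squared_error = 0.0
    count = 0
    for pred_frame, ref_frame in zip(predicted, reference):
        if len(pred_frame) != len(ref_frame):
            raise ValueError("LSP orders differ")
        for p, r in zip(pred_frame, ref_frame):
            squared_error += (p - r) ** 2
            count += 1
    if count == 0:
        raise ValueError("no coefficients to compare")
    return math.sqrt(squared_error / count)


def average_lsp_rmse(utterances: Iterable[Tuple[Frames, Frames]]) -> float:
    """Mean of the per-utterance RMSE values (assumed aggregation)."""
    scores = [lsp_rmse(pred, ref) for pred, ref in utterances]
    if not scores:
        raise ValueError("no utterances given")
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Toy data only: two frames of three made-up LSP values each.
    predicted = [[0.10, 0.25, 0.40], [0.12, 0.27, 0.41]]
    reference = [[0.11, 0.24, 0.42], [0.10, 0.29, 0.40]]
    print(average_lsp_rmse([(predicted, reference)]))
```

Lower values mean the predicted spectral parameters lie closer to the reference, which is the sense in which 0.179096 improves on the 0.187225 benchmark.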