Page 151 - 《应用声学》2020年第3期

P. 151

第 39 卷第 3 期李鹏等：基于双向循环神经网络的汉语语音识别 471

(2) 在本文中使用 DNN 与 Bi-RNN 相结合用 Dai Lirong, Zhang Shiliang, Huang Zhiying. Deep learn-
以构建模型。在使用 DNN 时，由于参数太多，易出 ing for speech recognition: review of state-of-the-arts tech-
nologies and prospects[J]. Journal of Data Acquisition and
现过拟合现象，为了更好地解决这一问题，在接下来
Processing, 2017, 32(7): 221–231.
的学习与探索中，将CNN与Bi-RNN 相结合来构建 [9] Deng L, Yu D, Dahl G E. Deep belief network for large vo-
模型，并进行实验。 cabulary continuous speech recognition: US, 8972253[P].
2015–03–03.
[10] 张仕良. 基于深度神经网络的语音识别模型研究 [D]. 合肥:
参考文献中国科学技术大学, 2017.
[11] 邢吉亮. 结合注意力机制的 Bi-LSTM 循环神经网络对关系分
[1] Vensko G, Lieu K B, Meloche S A, et al. Dynamic time 类的研究 [D]. 长春: 吉林大学, 2018.
warping (DTW) apparatus for use in speech recognition [12] Schuster M, Paliwal K K. Bidirectional recurrent neu-
systems: US, 5073939[P]. 1991–12–17.
ral networks[J]. IEEE Transactions on Signal Processing,
[2] Itakura F. Mnimum prediction residual principle applied
1997, 45(11): 2673–2681.
to speech recognition[J]. IEEE Transactions on Acoustics,
[13] 石颖. 基于循环神经网络的语音识别方案的优化与设计 [D].
Speech & Signal Processing, 1975, 23(1): 67–72.
北京: 北京交通大学, 2017.
[3] 赵力. 语音信号处理 [M]. 北京: 机械工业出版社, 2006.
[14] 汪优升. 基于深度学习的语音识别及其交互应用研究 [D]. 长
[4] Rabiner L R. A tutorial on hidden Markov models and se-
沙: 湖南大学, 2017.
lected applications in speech recognition[J]. Proceedings of
[15] 梁静. 基于深度学习的语音识别研究 [D]. 北京: 北京邮电大
the IEEE, 1989, 77(2): 257–286.
学, 2014.
[5] Waibel A, Hanazawa T, Hinton G, et al. Phoneme
recognition using time-delay neural networks[J]. [16] 黄积杨. 基于双向 LSTMN 神经网络的中文分词研究分
IEEE Transactions on Acoustics, Speech and Signal 析 [D]. 南京: 南京大学, 2016.
Processing, 1989, 3(3): 328–339. [17] 杨洋, 汪毓铎. 基于改进卷积神经网络算法的语音识别 [J]. 应
[6] 杨华民, 姜会林, 李平. 基于神经网络的语音识别技术应用研用声学, 2018, 37(6): 940–946.
究 [J]. 电子技术应用, 1997(9): 7–9. Yang Yang, Wang Yuduo. Speech recognition based
[7] Hinton G E, Osindero S, Teh Y. A fast learning algorithm on improved convolutional neural network algorithm[J].
for deep belief nets[J]. Neural Computation, 2006, 18(3): Journal of Applied Acoustics, 2018, 37(6): 940–946.
1527–1554． [18] Maas A L, Qi P, Xie Z, et al. Building DNN acoustic mod-
[8] 戴礼荣, 张仕良, 黄智颖. 基于深度学习的语音识别技术现状 els for large vocabulary speech recognition[J]. Computer
与展望 [J]. 数据采集与处理, 2017, 32(7): 221–231. Speech & Language, 2015, 41: 195–213.

146 147 148 149 150 151 152 153 154 155 156