文章摘要
和椿皓,常铁原,潘立冬,王珺.基于密集连接时延神经网络的说话人识别算法*[J].,2024,43(2):378-384
基于密集连接时延神经网络的说话人识别算法*
A speaker recognition algorithm based on densely connected time delay neural network
投稿时间:2022-11-17  修订日期:2024-03-04
中文摘要:
      说话人识别技术是一项重要的生物特征识别技术。近年来,使用深度神经网络提取发声特征的说话人识别算法取得了突出成果。时延神经网络作为其中的典型代表之一已被证明具有出色的特征提取能力。为进一步提升识别准确率并节约计算资源,通过对现有的说话人识别算法进行研究,提出一种带有注意力机制的密集连接时延神经网络用于说话人识别。密集连接的网络结构在增强不同网络层之间的信息复用的同时能有效控制模型体积。通道注意力机制和帧注意力机制帮助网络聚焦于更关键的细节特征,使得通过统计池化提取出的说话人特征更具有代表性。实验结果表明,在VoxCeleb1测试数据集上取得了1.40%的等错误率(EER)和0.15的最小检测代价标准(DCF),证明了在说话人识别任务上的有效性。
英文摘要:
      Speaker recognition is an important biometric identification technology. In recent years, speaker recognition algorithms that use deep neural networks to extract vocal features have achieved outstanding results. As one of the typical representatives, time-delay neural networks have proven to have excellent feature extraction capabilities. To further improve recognition accuracy and save computational resources, a densely connected time-delay neural network with an attention mechanism is proposed for speaker recognition by investigating existing speaker recognition algorithms. The densely connected structure enhances the information reuse between different network layers while effectively controlling the model size. The channel attention mechanism and frame attention mechanism help the network to focus on more critical details of the features, making the speaker features extracted by statistical pooling more representative. Experimental results show that an equal error rate(EER) of 1.40% and a minimum detection cost criterion(DCF) of 0.15 were achieved on the VoxCeleb1 test dataset, demonstrating effectiveness on the speaker recognition task.
DOI:10.11684/j.issn.1000-310X.2024.02.016
中文关键词: 说话人识别  深度学习  神经网络  密集连接  注意力机制
英文关键词: Speaker recognition  Deep learning  Neural network  Dense connectivity  Attention mechanism
基金项目:开放场景下基于时空层次图卷积网络的行人跟踪算法研究(河北省自然科学基金(F2022201013))
作者单位E-mail
和椿皓 河北大学电子信息工程学院 hchofficial@outlook.com 
常铁原 河北大学电子信息工程学院 tieyuan_chang@hbu.edu.cn 
潘立冬* 河北大学电子信息工程学院 panlidong@vip.163.com 
王珺 河北大学电子信息工程学院 jun_wang@hbu.edu.cn 
摘要点击次数: 1052
全文下载次数: 551
查看全文   查看/发表评论  下载PDF阅读器
关闭