the gap remains fairly large, because the original gap between the teacher and student models is wider than on the small dataset, which makes it relatively hard for the teacher to guide the student. Compared with NSNet and RNNoise, real-time algorithms of similarly low complexity, the model proposed in this paper achieves better metric scores while keeping a low parameter count.

4 Conclusion

To address the large parameter count and high computational complexity of speech enhancement models, a teacher-student learning framework was built on the DCCRN structure. For the complex LSTM module, [...] to narrow the distance between the teacher and student models. In addition, the MRSTFT loss is used as the student model's base loss to improve its enhancement performance. Experimental results show that the proposed method outperforms the baseline student model on every metric. The student model trained under teacher-student guidance reaches performance close to that of large-scale models with a low parameter count, and obtains competitive results on public datasets. [...] applications.
