Page 61 - 《应用声学》 (Applied Acoustics), 2025, No. 3


Vol. 44, No. 3. Yang Xuetong et al.: An attention-based dual-level parallel method for acoustic scene classification. p. 595

Processing (ICASSP). IEEE, 2019: 56–60.
[14] Gou J, Yu B, Maybank S J, et al. Knowledge distillation: A survey[J]. International Journal of Computer Vision, 2021, 129: 1789–1819.
[15] Wang K, Gao X, Zhao Y, et al. Pay attention to features, transfer learn faster CNNs[C]// International Conference on Learning Representations, 2019.
[16] Liu M, Wang W, Li Y. The system for acoustic scene classification using ResNet[C]// IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE), 2019.
[17] Salamon J, Bello J P. Deep convolutional neural networks and data augmentation for environmental sound classification[J]. IEEE Signal Processing Letters, 2017, 24(3): 279–283.
[18] Hershey S, Chaudhuri S, Ellis D P W, et al. CNN architectures for large-scale audio classification[C]// 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2017: 131–135.
[19] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[C]// Advances in Neural Information Processing Systems, 2017: 5998–6008.
[20] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137–1149.
[21] Woo S, Park J, Lee J Y, et al. CBAM: Convolutional block attention module[C]// European Conference on Computer Vision (ECCV), 2018: 3–19.
[22] Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018: 7132–7141.
[23] Plata M. Deep neural networks with supported clusters preclassification procedure for acoustic scene recognition[C]// IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE), 2019.
[24] Ma S, Liu W. Acoustic scene classification based on binaural deep scattering spectra with neural network[C]// IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE), 2019.
[25] Hou Y, Kang B, Mitchell A, et al. Cooperative scene-event modelling for acoustic scene classification[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 32: 68–82.
