Page 60 - 《应用声学》2024年第1期

P. 60

56 2024 年 1 月

的数据集仍为 3 种目标的混合通道图，将其中 398 [2] Isaacs J C. Sonar automatic target recognition for un-
张图像作为训练集进行模型训练，剩余部分作为测 derwater UXO remediation[C]. 2015 IEEE Conference on
Computer Vision and Pattern Recognition Workshops
试集，参数设置与前述实验相同，对3 个模型均进行
(CVPRW), 2015: 134–140.
100 次的运行测试，取平均值作为最终模型准确率， [3] Dura E, Bell J, Lane D. Superellipse ﬁtting for the re-
实验结果如表5所示。 covery and classiﬁcation of mine-like shapes in sidescan
sonar images[J]. IEEE Journal of Oceanic Engineering,
表 5 不同模型分类准确率 2008, 33(4): 434–444.
[4] 许文海, 续元君, 董丽丽, 等. 基于水平集和支持向量机的图
Table 5 Model classiﬁcation accuracy of
像声呐目标识别 [J]. 仪器仪表学报, 2012, 33(1): 49–55.
diﬀerent network models
Xu Wenhai, Xu Yuanjun, Dong Lili, et al. Level-set and
SVM based target recognition of image sonar[J]. Instru-
模型准确率/%
mentation, 2012, 33(1): 49–55.
VGGNet16 91.07
[5] Neupane D, Seok J. A review on deep learning-based ap-
ResNet50 88.79 proaches for automatic sonar target recognition[J]. Elec-
MobileNetV2 91.13 tronics, 2020, 9(11): 1972.
[6] Mckay J, Gerg I, Monga V. What’s mine is yours:
由此可见，本文 MobileNetV2 模型与 VG- pretrained CNNs for limited training sonar ATR[J].
GNet16 模型对上述 3 类目标的分类准确率达到 OCEANS 2017 - Anchorage, 2017: 1–7.
91%以上，相比ResNet50模型要高出2.3%左右。但 [7] Alshalali T, Josyula D. Fine-tuning of pre-trained deep
learning models with extreme learning machine[C]. 2018
是综合考虑 4.2 节及 4.3 节中 VGGNet16 较高的参
International Conference on Computational Science and
数量和计算代价，本文方法在该水下目标分类任务 Computational Intelligence (CSCI), 2018.
中具有更好的实用性。 [8] 巩文静, 田杰, 李宝奇, 等. 基于改进 MobileNetV2 网络的
声光图像融合水下目标分类方法 [J]. 应用声学, 2022, 41(3):
5 结论 462–470.
Gong Wenjing, Tian Jie, Li Baoqi, et al. Acoustic-optical
image fusion underwater target classiﬁcation method
本文对三维成像声呐目标分类这一问题进行
based on improved MobilenetV2[J]. Journal of Applied
了研究。首先，针对三维点云目标分类网络计算量 Acoustics, 2022, 41(3): 462–470.
大的问题，从三维成像声呐波束形成原理出发，提出 [9] Yin Z, Tuzel O. Voxelnet: end-to-end learning for point
cloud based 3D object detection[C]. Proceedings of the
了一种在不丢失目标散射点位置信息和强度信息
IEEE Conference on Computer Vision and Pattern Recog-
的前提下将三维成像声呐点云投影至二维图像的 nition, 2018: 4490–4499.
方法，从而可以使用二维图像分类网络实现三维成 [10] Qi C R, Liu W, Wu C, et al. Frustum pointnets for 3D
像声呐点云目标分类任务。接着使用 MobileNetV2 object detection from RGB-d data[C]. Proceedings of the
IEEE Conference on Computer Vision and Pattern Recog-
网络实现了三维成像声呐快速目标分类，通过对比
nition, 2018: 918–927.
深度图，强度图以及混合通道图分别作为网络输入 [11] Shi S, Wang X, Li H. Pointrcnn: 3D object proposal gen-
的实验结果，可以看出混合通道图比单独的深度图 eration and detection from point cloud[C]. Proceedings of
和强度图收敛速度更快，目标分类准确率也更高， the IEEE/CVF Conference on Computer Vision and Pat-
tern Recognition, 2019: 770–779.
说明了三维成像声呐所获取目标的位置信息和强
[12] Murino V, Trucco A. Three-dimensional image generation
度信息对于目标分类都非常重要。然后通过和VG- and processing in underwater acoustic vision[J]. Proceed-
GNet16、ResNet50 对比，可以看出 MobileNetV2 在 ings of the IEEE, 2002, 88(12): 1903–1948.
[13] 王朋, 迟骋, 纪永强, 等. 二维解卷积波束形成水下高分辨三
保证目标识别正确率的情况下具有网络参数少、运
维声成像 [J]. 声学学报, 2019, 44(4): 613–625.
行速度快的特点。 Wang Peng, Chi Cheng, Ji Yongqiang, et al. Two-
dimensional deconvolved beamforming for high-resolution
underwater three-dimensional acoustical imaging[J]. Acta
参考文献 Acustica, 2019, 44(4): 613–625.
[14] 王朋, 张扬帆, 黄勇, 等. 基于稀疏布阵的实时三维成像声纳
[1] Einsidler D, Dhanak M, Beaujean P. A deep learning ap- 系统 [J]. 仪器仪表学报, 2016, 37(4): 843–851.
proach to target recognition in side-scan sonar imagery[C]. Wang Peng, Zhang Yangfan, Huang Yong, et al. Real-
Ocean 2018 MTS/IEEE Charleston, 2019: 8–11. time 3D acoustical imaging sonar system based on sparse

55 56 57 58 59 60 61 62 63 64 65