
REFNet: reparameterized feature enhancement and fusion network for underwater blur target recognition

Published online by Cambridge University Press: 16 May 2025

Benshun Li
Affiliation:
School of Mechanical and Electrical Engineering, Henan University of Technology, Zhengzhou, P.R. China
Lei Cai*
Affiliation:
School of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, P.R. China
Corresponding author: Lei Cai; Email: cailei2014@126.com

Abstract

Underwater target detection is hampered by image blurring caused by suspended particles in the water body and light-scattering effects. To tackle this issue, this paper proposes a reparameterized feature enhancement and fusion network (REFNet) for underwater blur object recognition. First, this paper proposes the reparameterized feature enhancement and gathering (REG) module, which is designed to improve the performance of the backbone network. This module integrates the concepts of reparameterization and global response normalization to strengthen the network’s feature extraction capabilities, addressing the challenge that image blurriness poses to feature extraction. Next, this paper proposes the cross-channel information fusion (CIF) module to enhance the neck network. This module combines detailed information from shallow features with semantic information from deeper layers, mitigating the loss of image detail caused by blurring. Additionally, this paper replaces the CIoU loss function with the Shape-IoU loss function to improve target localization accuracy, addressing the difficulty of accurately locating bounding boxes in blurry images. Experimental results indicate that REFNet outperforms state-of-the-art methods, achieving higher mAP scores on the Underwater Robot Professional Competition (URPC) and Detection of Underwater Objects (DUO) datasets. REFNet surpasses YOLOv8 by approximately 1.5% in $mAP_{50:95}$ on the URPC dataset and by about 1.3% on the DUO dataset, without significantly increasing the model’s parameter count or computational load. This approach enhances the precision of target detection in challenging underwater environments.
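The abstract leans on two building blocks that are well documented in the broader literature: structural reparameterization (folding training-time branches into a single inference-time convolution, as in RepVGG-style networks) and global response normalization (GRN, introduced with ConvNeXt V2). As a rough PyTorch sketch of these two generic ideas only, not the authors' REG module itself, the following example folds a BatchNorm layer into its preceding convolution and applies GRN across channels; the names (`GRN`, `fuse_conv_bn`) are illustrative assumptions, and grouped or dilated convolutions are ignored for brevity.

```python
import torch
import torch.nn as nn

class GRN(nn.Module):
    """Global response normalization (ConvNeXt V2 style) for channels-last tensors (N, H, W, C)."""
    def __init__(self, dim: int):
        super().__init__()
        self.gamma = nn.Parameter(torch.zeros(1, 1, 1, dim))
        self.beta = nn.Parameter(torch.zeros(1, 1, 1, dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        gx = torch.norm(x, p=2, dim=(1, 2), keepdim=True)   # per-channel spatial L2 energy
        nx = gx / (gx.mean(dim=-1, keepdim=True) + 1e-6)     # divisive normalization across channels
        return self.gamma * (x * nx) + self.beta + x         # learnable scale/shift plus residual

def fuse_conv_bn(conv: nn.Conv2d, bn: nn.BatchNorm2d) -> nn.Conv2d:
    """Fold a trained BatchNorm into the preceding conv: the core step of reparameterization."""
    scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)  # per-output-channel rescaling
    fused = nn.Conv2d(conv.in_channels, conv.out_channels, conv.kernel_size,
                      stride=conv.stride, padding=conv.padding, bias=True)
    fused.weight.data = conv.weight.data * scale.reshape(-1, 1, 1, 1)
    bias = conv.bias.data if conv.bias is not None else torch.zeros_like(bn.running_mean)
    fused.bias.data = bn.bias.data + (bias - bn.running_mean) * scale
    return fused
```

In eval mode the fused convolution reproduces the conv-plus-BN output exactly, so multi-branch training blocks can be collapsed into a single convolution at inference with no accuracy cost, which is consistent with the abstract's claim of accuracy gains without a significant increase in parameters or computation.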

Type
Research Article
Copyright
© The Author(s), 2025. Published by Cambridge University Press

