Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
  • Explore by
    • Research Outputs
    • Researchers
    • Organizations
    • Projects
  • Communities & Collections
  • SDGs
  • Sign in
  • 中文
  • English
  1. National Taiwan Ocean University Research Hub
  2. 電機資訊學院
  3. 資訊工程學系
Please use this identifier to cite or link to this item: http://scholars.ntou.edu.tw/handle/123456789/25862
Title: Multi-scale detection of underwater objects using attention mechanisms and normalized Wasserstein distance loss
Authors: Tsai, Yu-Shiuan 
Tsai, Chia-Tung
Huang, Jian-Hong
Keywords: Multi-scale underwater object detection;Attention mechanism;Normalized Wasserstein distance loss;Underwater attention-PRB model;Marine biodiversity detection;Feature representation
Issue Date: 27-Apr-2025
Publisher: SPRINGER
Journal Volume: 81
Journal Issue: 6
Source: JOURNAL OF SUPERCOMPUTING
Abstract: 
Underwater imaging faces challenges such as light attenuation, scattering, and water turbidity, which degrade image quality and hinder accurate organism recognition. The detecting underwater objects dataset, with resolutions from 586 x 482 to 3840 x 2160 pixels, highlights significant object scale variation, including a high proportion of small objects (27.38%). This study introduces the underwater attention-parallel residual bi-fusion feature pyramid network model, which improves detection accuracy for small- and medium-sized objects in complex underwater environments. The proposed model incorporates a spatial pyramid pooling module with attention mechanisms to enhance multi-scale feature representation and integrates the normalized Wasserstein distance into the loss function for better detection flexibility. Experimental results demonstrate that the model outperforms state-of-the-art methods, achieving a mean average precision at intersection over union threshold of 0.5 of 88.8% and a mean average precision at intersection over union threshold range of 0.5-0.95 of 68.3%, representing a 2.5-9% improvement over baseline models. Furthermore, the model achieved a precision of 85.5%, recall of 82.9%, and an F1-score of 0.8417. These results highlight the model's robustness and effectiveness, offering significant contributions to underwater biodiversity studies, environmental assessments, and marine ecosystem management. By addressing scale variability and achieving high accuracy even for rare species such as scallops, the proposed model supports practical applications in underwater monitoring and conservation.
URI: http://scholars.ntou.edu.tw/handle/123456789/25862
ISSN: 0920-8542
DOI: 10.1007/s11227-025-07251-5
Appears in Collections:資訊工程學系

Show full item record

Page view(s)

18
checked on Jun 30, 2025

Google ScholarTM

Check

Altmetric

Altmetric

Related Items in TAIR


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Explore by
  • Communities & Collections
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback