Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
  • Explore by
    • Research Outputs
    • Researchers
    • Organizations
    • Projects
  • Communities & Collections
  • SDGs
  • Sign in
  • 中文
  • English
  1. National Taiwan Ocean University Research Hub
  2. 電機資訊學院
  3. 電機工程學系
Please use this identifier to cite or link to this item: http://scholars.ntou.edu.tw/handle/123456789/23871
Title: MSCE-Net: Multi-scale Spatial and Channel Enhancing Net based on Attention for Cloud Image Classification
Authors: Chih-Wei Lin 
Lingjie Jin
Issue Date: Aug-2022
Publisher: IEEE
Conference: 2022 26th International Conference on Pattern Recognition (ICPR)
Montreal, QC, Canada
Abstract: 
The difference between cloud types is mainly present in appearance but has problems of high similarity and insignificant appearance between classes. In addition, the existing approaches separately use the channel and spatial attention machines in a stage of the network that loses the attention in multiple scales and the relationship between channel and spatial attention. Therefore, this study proposed a framework, namely multi-scale spatial and channel enhancing Net (MSCE-Net), based on attention mechanisms to enhance the feature learning ability of the network for cloud image classification. The designed multi-scale spatial and channel attentions consider information between scales and concatenate spatial and temporal attention at each scale to consider the relationship between spatial and channel attention and make the network focus on the appearance of the cloud image, which is significantly different between various clouds. The main contributions of this paper are as follows: (1) we construct a new cloud image dataset with ten categories with a total of 6000 images; (2) we generate multi-scale spatial-based and channel-based enhancing factors to enhance the spatial and channel features, respectively, at each scale; (3) we concatenate spatial and temporal attention at each scale to consider the relationship between spatial and channel attention at each scale. Experimental results show that the accuracies of the proposed framework suppress the state-of-the-art approaches and achieve 85.11%. Moreover, the ablation experiments prove the efficiency of the proposed strategies.
URI: http://scholars.ntou.edu.tw/handle/123456789/23871
DOI: 10.1109/ICPR56361.2022.9956204
Appears in Collections:電機工程學系

Show full item record

Page view(s)

135
checked on Jun 30, 2025

Google ScholarTM

Check

Altmetric

Altmetric

Related Items in TAIR


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Explore by
  • Communities & Collections
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback