Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
  • Explore by
    • Research Outputs
    • Researchers
    • Organizations
    • Projects
  • Communities & Collections
  • SDGs
  • Sign in
  • 中文
  • English
  1. National Taiwan Ocean University Research Hub
  2. 電機資訊學院
  3. 通訊與導航工程學系
Please use this identifier to cite or link to this item: http://scholars.ntou.edu.tw/handle/123456789/22351
Title: Hand gesture recognition via image processing techniques and deep CNN
Authors: Chung, Yao-Liang 
Chung, Hung-Yuan
Tsai, Wei-Feng
Keywords: Deep CNN;gesture recognition;VGGNet;AlexNet
Issue Date: 1-Jan-2020
Publisher: IOS PRESS
Journal Volume: 39
Journal Issue: 3
Start page/Pages: 4405-4418
Source: JOURNAL OF INTELLIGENT & FUZZY SYSTEMS
Abstract: 
In the present study, we sought to enable instant tracking of the hand region as a region of interest (ROI) within the image range of a webcam, while also identifying specific hand gestures to facilitate the control of home appliances in smart homes or issuing of commands to human-computer interaction fields. To accomplish this objective, we first applied skin color detection and noise processing to remove unnecessary background information from the captured image, before applying background subtraction for detection of the ROI. Then, to prevent background objects or noise from influencing the ROI, we utilized the kernelized correlation filters (KCF) algorithm to implement tracking of the detected ROI. Next, the size of the ROI image was resized to 100x120 and input into a deep convolutional neural network (CNN) to enable the identification of various hand gestures. In the present study, two deep CNN architectures modified from the AlexNet CNN and VGGNet CNN, respectively, were developed by substantially reducing the number of network parameters used and appropriately adjusting internal network configuration settings. Then, the tracking and recognition process described above was continuously repeated to achieve immediate effect, with the execution of the system continuing until the hand is removed from the camera range. The results indicated excellent performance by both of the proposed deep CNN architectures. In particular, the modified version of the VGGNet CNN achieved better performance with a recognition rate of 99.90% for the utilized training data set and a recognition rate of 95.61% for the utilized test data set, which indicate the good feasibility of the system for practical applications.
URI: http://scholars.ntou.edu.tw/handle/123456789/22351
ISSN: 1064-1246
DOI: 10.3233/JIFS-200385
Appears in Collections:通訊與導航工程學系

Show full item record

WEB OF SCIENCETM
Citations

10
Last Week
1
Last month
checked on Jun 27, 2023

Page view(s)

149
checked on Jun 30, 2025

Google ScholarTM

Check

Altmetric

Altmetric

Related Items in TAIR


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Explore by
  • Communities & Collections
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback