Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
  • Explore by
    • Research Outputs
    • Researchers
    • Organizations
    • Projects
  • Communities & Collections
  • SDGs
  • Sign in
  • 中文
  • English
  1. National Taiwan Ocean University Research Hub
  2. 電機資訊學院
  3. 資訊工程學系
Please use this identifier to cite or link to this item: http://scholars.ntou.edu.tw/handle/123456789/7178
Title: A novel unsupervised 3D skeleton detection in RGB-D images for video surveillance
Authors: Shyi-Chyi Cheng 
Hsiao, K. F.
Yang, C. K.
Hsiao, P. F.
Yu, W. H.
Keywords: Object skeleton modeling and detection;Moment-based symmetry feature detection;RGB-D images;Part merging;Unsupervised feature learning
Issue Date: Jun-2020
Publisher: Springer Nature
Journal Volume: 79
Journal Issue: 23-24
Start page/Pages: 15829–15857
Source: Multimedia Tools and Applications 
Abstract: 
In this paper we present a novel moment-based skeleton detection for representing human objects in RGB-D videos with animated 3D skeletons. An object often consists of several parts, where each of them can be concisely represented with a skeleton. However, it remains as a challenge to detect the skeletons of individual objects in an image since it requires an effective part detector and a part merging algorithm to group parts into objects. In this paper, we present a novel fully unsupervised learning framework to detect the skeletons of human objects in a RGB-D video. The skeleton modeling algorithm uses a pipeline architecture which consists of a series of cascaded operations, i.e., symmetry patch detection, linear time search of symmetry patch pairs, part and symmetry detection, symmetry graph partitioning, and object segmentation. The properties of geometric moment-based functions for embedding symmetry features into centers of symmetry patches are also investigated in detail. As compared with the state-of-the-art deep learning approaches for skeleton detection, the proposed approach does not require tedious human labeling work on training images to locate the skeleton pixels and their associated scale information. Although our algorithm can detect parts and objects simultaneously, a pre-learned convolution neural network (CNN) can be used to locate the human object from each frame of the input video RGB-D video in order to achieve the goal of constructing real-time applications. This much reduces the complexity to detect the skeleton structure of individual human objects with our proposed method. Using the segmented human object skeleton model, a video surveillance application is constructed to verify the effectiveness of the approach. Experimental results show that the proposed method gives good performance in terms of detection and recognition using publicly available datasets.
URI: http://scholars.ntou.edu.tw/handle/123456789/7178
ISSN: 1380-7501
DOI: ://WOS:000544744600004
://WOS:000544744600004
10.1007/s11042-018-6292-y
://WOS:000544744600004
://WOS:000544744600004
://WOS:000544744600004
://WOS:000544744600004
Appears in Collections:資訊工程學系

Show full item record

Page view(s)

140
Last Week
1
Last month
1
checked on Oct 13, 2022

Google ScholarTM

Check

Altmetric

Altmetric

Related Items in TAIR


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Explore by
  • Communities & Collections
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback