Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
  • Explore by
    • Research Outputs
    • Researchers
    • Organizations
    • Projects
  • Communities & Collections
  • SDGs
  • Sign in
  • 中文
  • English
  1. National Taiwan Ocean University Research Hub
  2. 電機資訊學院
  3. 資訊工程學系
Please use this identifier to cite or link to this item: http://scholars.ntou.edu.tw/handle/123456789/24745
DC FieldValueLanguage
dc.contributor.authorWan-Hsuan Yuen_US
dc.contributor.authorChi-Han Chuangen_US
dc.contributor.authorShyi-Chyi Chengen_US
dc.date.accessioned2024-03-15T06:42:00Z-
dc.date.available2024-03-15T06:42:00Z-
dc.date.issued2013-
dc.identifier.isbn978-1-61208-265-3-
dc.identifier.issn2308-4448-
dc.identifier.urihttp://scholars.ntou.edu.tw/handle/123456789/24745-
dc.description.abstractVideo object detection is one of the most important research problems for video event detection, indexing, and retrieval. For a variety of applications such as video surveillance and event annotation, the spatial-temporal boundaries between video objects are required for annotating visual content with high-level semantics. In this paper, we define spatial-temporal sampling as a unified process of extracting video objects and computing their spatial-temporal boundaries using a learnt video object model. We first provide a learning approach to build a class-specific video object model from a set of training video clips. Then the learnt model is used to locate the video objects with precise spatial-temporal boundaries from a test video clip using graph kernels. A frame sorting process as a preprocessing is also proposed to transform the graph, modeling the shot configuration of a video clip, into a string of shots. Thus, the computation of graph kernels is simplified to be string kernels. The string kernels for support vector machine (SVM) classification are finally adopted to train the SVM classifiers from a set of training samples and detect the video objects in a test video clip by classification. A human action detection and recognition system is finally constructed to verify the performance of the proposed method. Experimental results show that the proposed method gives good performance on several publicly available datasets in terms of detection accuracy and recognition rate.en_US
dc.language.isoen_USen_US
dc.subjectvideo objectsen_US
dc.subjectstring kernelsen_US
dc.subjectdynamic programmingen_US
dc.subjectvideo object modelingen_US
dc.subjectSVM classificationen_US
dc.titleVideo Object Detection by Classification Using String Kernelsen_US
dc.typeconference paperen_US
dc.relation.pages82 to 87en_US
item.openairecristypehttp://purl.org/coar/resource_type/c_5794-
item.cerifentitytypePublications-
item.languageiso639-1en_US-
item.fulltextno fulltext-
item.grantfulltextnone-
item.openairetypeconference paper-
crisitem.author.deptCollege of Electrical Engineering and Computer Science-
crisitem.author.deptDepartment of Computer Science and Engineering-
crisitem.author.deptNational Taiwan Ocean University,NTOU-
crisitem.author.parentorgNational Taiwan Ocean University,NTOU-
crisitem.author.parentorgCollege of Electrical Engineering and Computer Science-
Appears in Collections:資訊工程學系
Show simple item record

Page view(s)

134
checked on Jun 30, 2025

Google ScholarTM

Check

Altmetric

Related Items in TAIR


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Explore by
  • Communities & Collections
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback