http://scholars.ntou.edu.tw/handle/123456789/6038
標題: | Model-based approach to spatial-temporal sampling of video clips for video object detection by classification | 作者: | Chi-Han Chuang Shyi-Chyi Cheng Chin-Chun Chang Yi-Ping Phoebe Chen |
關鍵字: | Semantic video objects;Spatial–temporal sampling;Human action detection;Video object model;Dynamic programming;Multiple alignment;Model-based tracking;Video object detetcion | 公開日期: | 七月-2014 | 卷: | 25 | 期: | 5 | 起(迄)頁: | 1018-1030 | 來源出版物: | Journal of Visual Communication and Image Representation | 摘要: | For a variety of applications such as video surveillance and event annotation, the spatial–temporal boundaries between video objects are required for annotating visual content with high-level semantics. In this paper, we define spatial–temporal sampling as a unified process of extracting video objects and computing their spatial–temporal boundaries using a learnt video object model. We first provide a computational approach for learning an optimal key-object codebook sequence from a set of training video clips to characterize the semantics of the detected video objects. Then, dynamic programming with the learnt codebook sequence is used to locate the video objects with spatial–temporal boundaries in a test video clip. To verify the performance of the proposed method, a human action detection and recognition system is constructed. Experimental results show that the proposed method gives good performance on several publicly available datasets in terms of detection accuracy and recognition rate. |
URI: | http://scholars.ntou.edu.tw/handle/123456789/6038 | ISSN: | 1047-3203 | DOI: | 10.1016/j.jvcir.2014.02.014 |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。