Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
  • Explore by
    • Research Outputs
    • Researchers
    • Organizations
    • Projects
  • Communities & Collections
  • SDGs
  • Sign in
  • 中文
  • English
  1. National Taiwan Ocean University Research Hub
  2. 電機資訊學院
  3. 電機工程學系
Please use this identifier to cite or link to this item: http://scholars.ntou.edu.tw/handle/123456789/17023
Title: Clustering via Dimension Extension and Pseudo-inverse Transformation
Authors: Yu-Chen Chen
Yu-Siang Jheng
Hong-Jie Shih
Jung-Hua Wang 
Keywords: Clustering;pseudo-inverse transformation;dimension extension;centroids;principal component analysis
Issue Date: 6-Jun-2008
Conference: 2008 National Symposium on System Science and Engineering (NSSSE' 08)
Ilan, Taiwan
Abstract: 
Partitional clustering has a major drawback in that once some data have been divided into wrong clusters, they cannot be easily adjusted into the correct one, namely the initialization problem that plagues the k-means algorithm.
This paper presents a novel approach which incorporates Dimension Extension and Pseudo-Inverse Transformation (DEPIT) to realize data clustering. Unlike k-means algorithm, DEPIT needs not pre-specify the number of clusters k, centroids locations are updated and redundant centroids eliminated automatically during iterative training process. The essence of DEPIT is that clustering is performed by pseudo-inverse transforming the input data such that each data point is represented by a linear combination of bases with extended dimension, with each basis corresponding to a centroid and its coefficient representing the closeness between the data point and the basis.
Issue of clustering validation is also addressed in this paper. First, Principal Component Analysis is applied to detect if there exists a dominated dimension, if so, the original input data will be rotated by a certain angle w.r.t. a defined center of mass, and the resulting data undergo another run of iterative training process. After plural runs of rotation and iterative process, the labeled results from various runs are compared, a data point labeled to a centroid more times than others will be labeled to the class indexed by that wining centroid.
URI: http://scholars.ntou.edu.tw/handle/123456789/17023
Appears in Collections:電機工程學系

Show full item record

Page view(s)

140
Last Week
0
Last month
0
checked on Jun 30, 2025

Google ScholarTM

Check

Related Items in TAIR


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Explore by
  • Communities & Collections
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback