http://scholars.ntou.edu.tw/handle/123456789/17020
標題: | Clustering based on Generalized Inverse Transformation | 作者: | Yu-Chen Chen Hong-Jie Shih Yu-Siang Jheng Sih-Yin Shen Meng-Di Guo Jung-Hua Wang |
關鍵字: | Euclidean distance;Clustering algorithms;Principal component analysis;Performance evaluation;data mining;Oceans;Iterative algorithms;Bioinformatics;Convergence;Robustness | 公開日期: | 12-十月-2008 | 出版社: | IEEE | 會議論文: | 2008 IEEE International Conference on Systems, Man and Cybernetics Singapore |
摘要: | This paper presents a novel approach which incorporates dimension extension and generalized inverse transformation (DEGIT) to realize data clustering. Unlike k-means algorithm, DEGIT needs not pre-specify the number of clusters k, centroid locations are updated and redundant centroids eliminated automatically during iterative training process. The essence of DEGIT is that clustering is performed by generalized inverse transforming the input data such that each data point is represented by a linear combination of bases with extended dimension, with each basis corresponding to a centroid and its coefficient representing the closeness between the data point and the basis. Issue of clustering validation is also addressed in this paper. First, principal component analysis is applied to detect if there exists a dominated dimension, if so, the original input data will be rotated by a certain angle w.r.t. a defined center of mass, and the resulting data undergo another run of iterative training process. After plural runs of rotation and iterative process, the labeled results from various runs are compared, a data point labeled to a centroid more times than others will be labeled to the class indexed by that wining centroid. |
URI: | https://ieeexplore.ieee.org/document/4811762 http://scholars.ntou.edu.tw/handle/123456789/17020 |
ISBN: | 978-1-4244-2383-5 | ISSN: | 1062-922X | DOI: | 10.1109/ICSMC.2008.4811762 |
顯示於: | 電機工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。