计算机科学与探索
計算機科學與探索
계산궤과학여탐색
JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY
2014年
11期
1365-1372
,共8页
谱聚类%有效距离%距离度量
譜聚類%有效距離%距離度量
보취류%유효거리%거리도량
spectral clustering%effective distance%distance metric
在现有多种距离度量和传统谱聚类算法的基础上,提出了一种新的基于有效距离的谱聚类算法(spec-tral clustering based on effective distance,SCED)。SCED算法通过稀疏重构系数来构建样本与样本之间的有效距离,从而代替传统谱聚类算法中的欧氏距离,进行样本之间的相似度评估。与传统距离度量相比,有效距离不仅利用了样本对之间的距离信息,同时考虑了目标样本与其他所有相关样本之间的距离信息,因而该距离度量具有全局特性。在UCI标准数据集上的实验结果表明,SCED算法能有效提高聚类效果。
在現有多種距離度量和傳統譜聚類算法的基礎上,提齣瞭一種新的基于有效距離的譜聚類算法(spec-tral clustering based on effective distance,SCED)。SCED算法通過稀疏重構繫數來構建樣本與樣本之間的有效距離,從而代替傳統譜聚類算法中的歐氏距離,進行樣本之間的相似度評估。與傳統距離度量相比,有效距離不僅利用瞭樣本對之間的距離信息,同時攷慮瞭目標樣本與其他所有相關樣本之間的距離信息,因而該距離度量具有全跼特性。在UCI標準數據集上的實驗結果錶明,SCED算法能有效提高聚類效果。
재현유다충거리도량화전통보취류산법적기출상,제출료일충신적기우유효거리적보취류산법(spec-tral clustering based on effective distance,SCED)。SCED산법통과희소중구계수래구건양본여양본지간적유효거리,종이대체전통보취류산법중적구씨거리,진행양본지간적상사도평고。여전통거리도량상비,유효거리불부이용료양본대지간적거리신식,동시고필료목표양본여기타소유상관양본지간적거리신식,인이해거리도량구유전국특성。재UCI표준수거집상적실험결과표명,SCED산법능유효제고취류효과。
Based on existing distance metrics and the traditional spectral clustering algorithm, this paper proposes a new spectral clustering based on effective distance (SCED). Specifically, the proposed SCED algorithm uses effective distance to replace conventional Euclidean distance, by considering global properties of data that are reflected by sparse reconstruction coefficients. In effective distance, the similarity of a sample pair is evaluated by using not only the distance between these two samples, but also distances between one specific sample and other related samples. Sparse reconstruction coefficients are employed to reflect such global relationship among samples. The experimental results on ten UCI benchmark datasets demonstrate the efficiency of the proposed SCED algorithm.