电子科技
電子科技
전자과기
IT AGE
2012年
2期
19-22
,共4页
聚类集成%co-occurrence矩阵%权重
聚類集成%co-occurrence矩陣%權重
취류집성%co-occurrence구진%권중
cluster ensemble%co-occurrence matrix%weighted
聚类集成是数据挖掘研究的一个热点。它是利用同一数据集的多个聚类划分集成在一起,以提高聚类分析的性能。当前相关研究大多没有考虑进行集成的聚类成员的质量,因此较差的成员会对集成结果产生不良影响。文中提出了一种基于加权co-occurrence矩阵的聚类集成算法(WCSCE)。该方法首先计算出聚类成员基于属性值的co-occurrence矩阵,然后对聚类成员的质量进行简单评价并赋予权重,生成加权co-occurrence矩阵,进而产生集成结果。最后通过实验验证了该算法的有效性,并提高了聚类质量。
聚類集成是數據挖掘研究的一箇熱點。它是利用同一數據集的多箇聚類劃分集成在一起,以提高聚類分析的性能。噹前相關研究大多沒有攷慮進行集成的聚類成員的質量,因此較差的成員會對集成結果產生不良影響。文中提齣瞭一種基于加權co-occurrence矩陣的聚類集成算法(WCSCE)。該方法首先計算齣聚類成員基于屬性值的co-occurrence矩陣,然後對聚類成員的質量進行簡單評價併賦予權重,生成加權co-occurrence矩陣,進而產生集成結果。最後通過實驗驗證瞭該算法的有效性,併提高瞭聚類質量。
취류집성시수거알굴연구적일개열점。타시이용동일수거집적다개취류화분집성재일기,이제고취류분석적성능。당전상관연구대다몰유고필진행집성적취류성원적질량,인차교차적성원회대집성결과산생불량영향。문중제출료일충기우가권co-occurrence구진적취류집성산법(WCSCE)。해방법수선계산출취류성원기우속성치적co-occurrence구진,연후대취류성원적질량진행간단평개병부여권중,생성가권co-occurrence구진,진이산생집성결과。최후통과실험험증료해산법적유효성,병제고료취류질량。
Cluster ensemble is a hot topic in data mining research.It can find a combined clustering with better quality from multiple partitions.Most of resent researches pay little attention to the qualities of cluster members.However,bad cluster members and noise may affect the ensemble result.This paper presents a clustering ensemble algorithm based on weighted co-occurrence matrix.First the co-occurrence property value matrix of the cluster members is calculated.The significance of each cluster member is evaluated through information measures of clustering evaluation.Then weighted co-occurrence matrix is generated and the final ensemble result is obtained.Experimental results show the effectiveness of the algorithms,and the clustering accuracy is improved.