计算机工程与设计
計算機工程與設計
계산궤공정여설계
COMPUTER ENGINEERING AND DESIGN
2009年
24期
5698-5700
,共3页
徐分%蒋芸%王勇%马廷斌
徐分%蔣蕓%王勇%馬廷斌
서분%장예%왕용%마정빈
粗糙集%信息增益%属性约简%值约简%分明矩阵
粗糙集%信息增益%屬性約簡%值約簡%分明矩陣
조조집%신식증익%속성약간%치약간%분명구진
rough sets%information gain%attributes reduction%value reduction%discernibility matrix
针对属性过多对于有效的数据挖掘很不利以及约简中差别矩阵的产生会占用较大存储空间的问题,提出了一种基于粗糙集和信息增益的属性约简改进算法.该算法首先采用信息增益技术对决策表属性进行相关分析,删除部分冗余属性,减小属性约简的复杂度,然后直接从决策表中提取出分明函数,求出属性约简.由于避免了分明矩阵的生成,因此该算法不仅节约了时间和空间,而且提高了效率.
針對屬性過多對于有效的數據挖掘很不利以及約簡中差彆矩陣的產生會佔用較大存儲空間的問題,提齣瞭一種基于粗糙集和信息增益的屬性約簡改進算法.該算法首先採用信息增益技術對決策錶屬性進行相關分析,刪除部分冗餘屬性,減小屬性約簡的複雜度,然後直接從決策錶中提取齣分明函數,求齣屬性約簡.由于避免瞭分明矩陣的生成,因此該算法不僅節約瞭時間和空間,而且提高瞭效率.
침대속성과다대우유효적수거알굴흔불리이급약간중차별구진적산생회점용교대존저공간적문제,제출료일충기우조조집화신식증익적속성약간개진산법.해산법수선채용신식증익기술대결책표속성진행상관분석,산제부분용여속성,감소속성약간적복잡도,연후직접종결책표중제취출분명함수,구출속성약간.유우피면료분명구진적생성,인차해산법불부절약료시간화공간,이차제고료효솔.
Aiming at the problems of too many attributes in data mining and much space acquired while generating the discernibility matrix, an improved algorithm for attribute reduction which is based on the rough sets and information gain, is put forward. The analysis of information gain technology is used to analyze the relationship between attributes to reduce the complexity of reduction. We can get the attribute reduction without generating the discernibility matrix. Less time and space complexity are acquired. And it is verified that the algorithm is effective.