系统工程理论与实践
繫統工程理論與實踐
계통공정이론여실천
Systems Engineering—Theory & Practice
2014年
9期
2432~2437
,共null页
王萌 孙树栋 杨宏安 袁宗寅
王萌 孫樹棟 楊宏安 袁宗寅
왕맹 손수동 양굉안 원종인
流形学习 等价支持子集 决策分析 降维
流形學習 等價支持子集 決策分析 降維
류형학습 등개지지자집 결책분석 강유
manifold learning;support subset;decision analysis;dimension reduction
为了有效地分析高维决策表,提出了基于流形学习降维的决策分析算法(decision analysis algorithm based on manifold learning,DAML).算法使用等距映射法(ISOMAP)对原始数据做降维处理,在得到的主坐标数据上进行决策分析.根据核主成分分析法与ISOMAP方法的关系得到主成分与主坐标的转换关系式,并计算原始数据主成分.提出了基于等价支持子集的决策算法用于计算主成分属性重要性、属性区分能力及等价支持子集.在得到等价支持子集的基础上抽取决策规则,根据决策规则预测算法预测未知数据.选取UCI数据库中标准分类数据集作为仿真实验样本,并对比C4.5决策树算法、K最近邻居算法(KNN)与提出的决策规则预测算法在Iris、Breast cancer、Wine、Spectf heart和Ionosphere数据集上的分类精度来验证算法的有效性.
為瞭有效地分析高維決策錶,提齣瞭基于流形學習降維的決策分析算法(decision analysis algorithm based on manifold learning,DAML).算法使用等距映射法(ISOMAP)對原始數據做降維處理,在得到的主坐標數據上進行決策分析.根據覈主成分分析法與ISOMAP方法的關繫得到主成分與主坐標的轉換關繫式,併計算原始數據主成分.提齣瞭基于等價支持子集的決策算法用于計算主成分屬性重要性、屬性區分能力及等價支持子集.在得到等價支持子集的基礎上抽取決策規則,根據決策規則預測算法預測未知數據.選取UCI數據庫中標準分類數據集作為倣真實驗樣本,併對比C4.5決策樹算法、K最近鄰居算法(KNN)與提齣的決策規則預測算法在Iris、Breast cancer、Wine、Spectf heart和Ionosphere數據集上的分類精度來驗證算法的有效性.
위료유효지분석고유결책표,제출료기우류형학습강유적결책분석산법(decision analysis algorithm based on manifold learning,DAML).산법사용등거영사법(ISOMAP)대원시수거주강유처리,재득도적주좌표수거상진행결책분석.근거핵주성분분석법여ISOMAP방법적관계득도주성분여주좌표적전환관계식,병계산원시수거주성분.제출료기우등개지지자집적결책산법용우계산주성분속성중요성、속성구분능력급등개지지자집.재득도등개지지자집적기출상추취결책규칙,근거결책규칙예측산법예측미지수거.선취UCI수거고중표준분류수거집작위방진실험양본,병대비C4.5결책수산법、K최근린거산법(KNN)여제출적결책규칙예측산법재Iris、Breast cancer、Wine、Spectf heart화Ionosphere수거집상적분류정도래험증산법적유효성.
A decision analysis algorithm based on manifold learning(DAML) is proposed for highdimensional decision information system.Firstly,the dimension of the original data is reduced by using isometric mapping(ISOMAP) in the algorithm.Some decision rules are extracted from the low-dimensional embedding of manifold learning.The principal component of the original data is computed by the relationship of kernel principal component analysis and ISOMAP.Then,a support subset significant algorithm is proposed for decision analysis.The support subset significant algorithm is used to computing the attribution significant,discrimination power and support subset.The decision rules are extracted by the support subset.Finally,the Iris,Breast cancer,Wine,Spectf heart and Ionosphere on UCI benchmark are selected for simulation.The classification accuracy among C4.5,k-nearest neighbor(KNN) and DAML are compared on the 5 data sets to validate the effectiveness of the DAML.