计算机工程与应用
計算機工程與應用
계산궤공정여응용
COMPUTER ENGINEERING AND APPLICATIONS
2014年
16期
135-139
,共5页
分类属性数据%特征选择%互信息
分類屬性數據%特徵選擇%互信息
분류속성수거%특정선택%호신식
nominal data%feature selection%mutual information
提出了一种针对分类属性数据特征选择的新算法。通过给出一种能够直接评价分类属性数据特征选择的评价函数新定义,重新构造能实现分类属性数据信息量、条件互信息、特征之间依赖度定义的计算公式,并在此基础上,提出了一种基于互信息较大相关、较小冗余的特征选择(MRLR)算法。MRLR算法在特征选择时不仅考虑了特征与类标签之间的相关性,而且还考虑了特征之间的冗余性。大量的仿真实验表明,MRLR算法在针对分类属性数据的特征选择时,能获得冗余度小且更具代表性的特征子集,具有较好的高效性和稳定性。
提齣瞭一種針對分類屬性數據特徵選擇的新算法。通過給齣一種能夠直接評價分類屬性數據特徵選擇的評價函數新定義,重新構造能實現分類屬性數據信息量、條件互信息、特徵之間依賴度定義的計算公式,併在此基礎上,提齣瞭一種基于互信息較大相關、較小冗餘的特徵選擇(MRLR)算法。MRLR算法在特徵選擇時不僅攷慮瞭特徵與類標籤之間的相關性,而且還攷慮瞭特徵之間的冗餘性。大量的倣真實驗錶明,MRLR算法在針對分類屬性數據的特徵選擇時,能穫得冗餘度小且更具代錶性的特徵子集,具有較好的高效性和穩定性。
제출료일충침대분류속성수거특정선택적신산법。통과급출일충능구직접평개분류속성수거특정선택적평개함수신정의,중신구조능실현분류속성수거신식량、조건호신식、특정지간의뢰도정의적계산공식,병재차기출상,제출료일충기우호신식교대상관、교소용여적특정선택(MRLR)산법。MRLR산법재특정선택시불부고필료특정여류표첨지간적상관성,이차환고필료특정지간적용여성。대량적방진실험표명,MRLR산법재침대분류속성수거적특정선택시,능획득용여도소차경구대표성적특정자집,구유교호적고효성화은정성。
In this paper, a novel feature selection approach based on mutual information called More Relevance Less Redun-dancy(MRLR)algorithm for nominal data is proposed. By reconstructing the computation method of the amount of infor-mation, the conditional mutual information, the dependence between the features so that which can be suitable for compu-tation related the nominal data, and a new definition of the evaluation function of feature selection is given, as well as a new feature selection criterion is used to evaluate the importance of each feature, which takes into account both relevance and redundancy. In MRLR, experimental results show that the relevance and redundancy respectively use mutual informa-tion to measure the dependence of features on the latent class and the dependence between features, and it also enhance the correctness and the effectiveness of MRLR algorithm.