国际自动化与计算杂志(英文版)
國際自動化與計算雜誌(英文版)
국제자동화여계산잡지(영문판)
INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING
2009年
1期
62-71
,共10页
Data mining%classification%feature selection%dimeusionality reduction%Bayes' theorem
This paper proposes one method of feature selection by using Bayes' theorem. The purpose of the proposed method is to reduce the computational complexity and increase the classification accuracy of the selected feature subsets.The dependence between two attributes (binary) is determined based on the probabilities of their joint values that contribute to positive and negative classification decisions.If opposing sets of attribute values do not lead to opposing classification decisions (zero probability),then the two attributes are considered independent of each other,otherwise dependent,and one of them can be removed and thus the number of attributes is reduced.The process must be repeated on all combinations of attributes.The paper also evaluates the approach by comparing it with existing feature selection algorithms over 8 datasets from University of California,Irvine (UCI) machine learning databases.The proposed method shows better results in terms of number of selected features,classification accuracy,and running time than most existing algorithms.