模式识别与人工智能
模式識彆與人工智能
모식식별여인공지능
Moshi Shibie yu Rengong Zhineng
2013年
10期
916-923
,共8页
吴虎胜%张凤鸣%徐显亮%张超%杜继永
吳虎勝%張鳳鳴%徐顯亮%張超%杜繼永
오호성%장봉명%서현량%장초%두계영
分形维数%离散粒子群算法%属性选择%多变量时间序列
分形維數%離散粒子群算法%屬性選擇%多變量時間序列
분형유수%리산입자군산법%속성선택%다변량시간서렬
Fractal Dimension%Discrete Particle Swarm Algorithm%Attribute Selection%Multivariate Time Series
属性选择是一种有效的数据预处理方法,可同时保留多变量时间序列重要变量的时序关系及其实际物理意义。针对很多实际数据无类别信息的问题,文中提出一种无监督属性选择算法并分析其复杂度。首先设计一种无需进行相空间重构的多变量时间序列分形维数计算方法,并将分形维数视为其本质维,利用属性子集的分形维数及其属性数目的变化作为子集优劣的评价标准。再优化离散粒子群算法以解决高维属性空间搜索的“组合爆炸”问题。最后利用典型混沌动力学系统所产生的多变量时间序列和UCI数据库的5组数据集进行仿真计算,结果表明该算法可在较短时间内找到较优的属性子集,具有较优的整体性能。
屬性選擇是一種有效的數據預處理方法,可同時保留多變量時間序列重要變量的時序關繫及其實際物理意義。針對很多實際數據無類彆信息的問題,文中提齣一種無鑑督屬性選擇算法併分析其複雜度。首先設計一種無需進行相空間重構的多變量時間序列分形維數計算方法,併將分形維數視為其本質維,利用屬性子集的分形維數及其屬性數目的變化作為子集優劣的評價標準。再優化離散粒子群算法以解決高維屬性空間搜索的“組閤爆炸”問題。最後利用典型混沌動力學繫統所產生的多變量時間序列和UCI數據庫的5組數據集進行倣真計算,結果錶明該算法可在較短時間內找到較優的屬性子集,具有較優的整體性能。
속성선택시일충유효적수거예처리방법,가동시보류다변량시간서렬중요변량적시서관계급기실제물리의의。침대흔다실제수거무유별신식적문제,문중제출일충무감독속성선택산법병분석기복잡도。수선설계일충무수진행상공간중구적다변량시간서렬분형유수계산방법,병장분형유수시위기본질유,이용속성자집적분형유수급기속성수목적변화작위자집우렬적평개표준。재우화리산입자군산법이해결고유속성공간수색적“조합폭작”문제。최후이용전형혼돈동역학계통소산생적다변량시간서렬화UCI수거고적5조수거집진행방진계산,결과표명해산법가재교단시간내조도교우적속성자집,구유교우적정체성능。
Attribute selection is an effective data preprocessing method. It can keep temporal relations of important attributes of multivariate time series and their actual physical meanings. Aiming at the problem that the actual data lacks the classified information, an unsupervised attribute selection method is proposed and its time complexity is analyzed. Firstly, a method for computing the fractal dimension of multivariate time series is proposed, and there is no need for the proposed method to reconstruct the phase space. The fractal dimension is considered as the essential dimension by the proposed method. Therefore ,the changing of the attributes number and the fractal dimension of attribute subsets are regarded as the evaluation criterion of attribute subsets. To solve the combinatorial explosion problem in high dimensional search space, the discrete particle swarm optimization algorithm is improved. Finally, the results of numerical simulations of multivariate time series from the typical chaotic dynamic system and five datasets of UCI database confirm the effectiveness of the proposed algorithm. Moreover, experimental results show the proposed algorithm finds out better attributes sets in shorter time and achieves better integrative performance.