计算机工程与设计
計算機工程與設計
계산궤공정여설계
COMPUTER ENGINEERING AND DESIGN
2010年
5期
921-924,1034
,共5页
吴海燕%朱靖君%高国柱%程志锐
吳海燕%硃靖君%高國柱%程誌銳
오해연%주정군%고국주%정지예
AprioriAll算法%序列模式%Web日志挖掘%事务%最大向前路径
AprioriAll算法%序列模式%Web日誌挖掘%事務%最大嚮前路徑
AprioriAll산법%서렬모식%Web일지알굴%사무%최대향전로경
AprioriAll algorithm%sequential pattern%web log mining%transaction%maximal forward path
为了减少AprioriAll算法挖掘过程中候选序列的生成以及对序列数据库的扫描次数,提高算法的挖掘效率,提出了一种基于改进的AprioriAll算法的Web序列模式挖掘方法.首先对数据进行预处理,然后利用经过改进的AprioriAll算法进行模式挖掘.算法的改进主要有两点:一个通过改变候选序列的连接方式来减少候选序列的产生;二是通过减少不必要的数据库扫描操作来提高算法的效率.通过实验验证了改进后算法在Web序列模式挖掘过程中的高效性和正确性.
為瞭減少AprioriAll算法挖掘過程中候選序列的生成以及對序列數據庫的掃描次數,提高算法的挖掘效率,提齣瞭一種基于改進的AprioriAll算法的Web序列模式挖掘方法.首先對數據進行預處理,然後利用經過改進的AprioriAll算法進行模式挖掘.算法的改進主要有兩點:一箇通過改變候選序列的連接方式來減少候選序列的產生;二是通過減少不必要的數據庫掃描操作來提高算法的效率.通過實驗驗證瞭改進後算法在Web序列模式挖掘過程中的高效性和正確性.
위료감소AprioriAll산법알굴과정중후선서렬적생성이급대서렬수거고적소묘차수,제고산법적알굴효솔,제출료일충기우개진적AprioriAll산법적Web서렬모식알굴방법.수선대수거진행예처리,연후이용경과개진적AprioriAll산법진행모식알굴.산법적개진주요유량점:일개통과개변후선서렬적련접방식래감소후선서렬적산생;이시통과감소불필요적수거고소묘조작래제고산법적효솔.통과실험험증료개진후산법재Web서렬모식알굴과정중적고효성화정학성.
To reduce the generation of candidate sequences and the scans to sequence database for AprioriAll algorithm, an efficient sequential pattern mining method based on improved AprioriAll algorithm is presented. Firstly, data are preprocessed. Then the sequential pattern mining is finished by improved AprioriAll algorithm. The improvements of AprioriAll algorithm are mainly two points: one is to change the connection of candidate sequences to reduce the generation of candidate sequences; the other is to reduce the needless database scans to improve the efficiency of algorithm. Finally, the efficiency and validity of improved AprioriAll algorithm is validated by experiments.