计算机与现代化
計算機與現代化
계산궤여현대화
COMPUTER AND MODERNIZATION
2015年
5期
17-20
,共4页
网络热点事件%数据挖掘%半结构化数据%特征分割
網絡熱點事件%數據挖掘%半結構化數據%特徵分割
망락열점사건%수거알굴%반결구화수거%특정분할
network hot event%data mining%semi-structured data%feature segmentation
为有效从网络中挖掘出民众关注的热点事件和话题,提高数据分类能力、热点追踪和检测正确率,在分析目前采用非结构化传统挖掘算法所存在问题的基础上,提出一种基于结构化分割的挖掘算法。首先通过分析热点事件挖掘处理流程,设计一种对热点事件数据挖掘的半结构化特征提取算法,对半结构化数据进行特征分割,生成大量请求,进而得到热点事件数据的分配因子,从而提高挖掘性能。仿真结果表明,该算法运行效率较高,精度较好,具有较高的稳健性。
為有效從網絡中挖掘齣民衆關註的熱點事件和話題,提高數據分類能力、熱點追蹤和檢測正確率,在分析目前採用非結構化傳統挖掘算法所存在問題的基礎上,提齣一種基于結構化分割的挖掘算法。首先通過分析熱點事件挖掘處理流程,設計一種對熱點事件數據挖掘的半結構化特徵提取算法,對半結構化數據進行特徵分割,生成大量請求,進而得到熱點事件數據的分配因子,從而提高挖掘性能。倣真結果錶明,該算法運行效率較高,精度較好,具有較高的穩健性。
위유효종망락중알굴출민음관주적열점사건화화제,제고수거분류능력、열점추종화검측정학솔,재분석목전채용비결구화전통알굴산법소존재문제적기출상,제출일충기우결구화분할적알굴산법。수선통과분석열점사건알굴처리류정,설계일충대열점사건수거알굴적반결구화특정제취산법,대반결구화수거진행특정분할,생성대량청구,진이득도열점사건수거적분배인자,종이제고알굴성능。방진결과표명,해산법운행효솔교고,정도교호,구유교고적은건성。
For effectively mining the hot issues and topics concerned by people in network , improving the capabilities of data classification and the correct rate of hot tracking and detection , basing on analyzing the problems existing in the traditional un-structured mining algorithms used currently , we proposed a mining algorithm based on structured segmentation .First, by analy-zing the hot events mining process , we designed a semi-structured features extraction algorithm for hot events data mining , in or-der to make feature segmentation for semi-structured data , generate a lot of requests , and then get hot event data allocation fac-tors, thereby improve mining properties .Simulation results show that the algorithm is running with high efficiency , good accuracy and high robustness .