计算机与现代化
計算機與現代化
계산궤여현대화
Computer and Modernization
2015年
10期
107-111
,共5页
水利普查%数据挖掘%决策树%C4.5算法%Map/Reduce技术
水利普查%數據挖掘%決策樹%C4.5算法%Map/Reduce技術
수리보사%수거알굴%결책수%C4.5산법%Map/Reduce기술
water census%data mining%decision-making tree%C4.5 algorithm%Map/Reduce
随着第一次全国水利普查的结束,海量的水利普查数据随之产生。将云计算技术应用在水利普查数据挖掘领域,可以更加快速、高效和低成本地为水利决策提供科学、合理的支持。本文提出基于Map/Reduce的水利普查数据决策树分类挖掘方法MRC4.5算法,并将该算法应用于全国水利普查地下水取水井数据挖掘中。实验结果表明,与传统的C4.5算法相比,MRC4.5算法在处理大规模数据集时具有更高的执行效率和良好的加速比。
隨著第一次全國水利普查的結束,海量的水利普查數據隨之產生。將雲計算技術應用在水利普查數據挖掘領域,可以更加快速、高效和低成本地為水利決策提供科學、閤理的支持。本文提齣基于Map/Reduce的水利普查數據決策樹分類挖掘方法MRC4.5算法,併將該算法應用于全國水利普查地下水取水井數據挖掘中。實驗結果錶明,與傳統的C4.5算法相比,MRC4.5算法在處理大規模數據集時具有更高的執行效率和良好的加速比。
수착제일차전국수리보사적결속,해량적수리보사수거수지산생。장운계산기술응용재수리보사수거알굴영역,가이경가쾌속、고효화저성본지위수리결책제공과학、합리적지지。본문제출기우Map/Reduce적수리보사수거결책수분류알굴방법MRC4.5산법,병장해산법응용우전국수리보사지하수취수정수거알굴중。실험결과표명,여전통적C4.5산법상비,MRC4.5산법재처리대규모수거집시구유경고적집행효솔화량호적가속비。
With the end of first nation water census, massive water census data have been generated.To use the cloud computing technology in the area of water census data mining can provide scientific, reasonable supports for the decision of water conservan-cy in a quick, efficient and economical way.This paper proposes water census data decision tree classified mining algorithm MRC4.5 based on Map/Reduce and water census data of groundwater wells is applied to data mining with the algorithm.The ex-perimental results indicate that compared with the traditional algorithm C4.5, MRC4.5 algorithm has higher efficiency and good speedup when dealing with massive data sets execution.