计算机技术与发展
計算機技術與髮展
계산궤기술여발전
COMPUTER TECHNOLOGY AND DEVELOPMENT
2015年
5期
56-59
,共4页
条件函数依赖%数据质量%数据清洗%CTANE算法
條件函數依賴%數據質量%數據清洗%CTANE算法
조건함수의뢰%수거질량%수거청세%CTANE산법
conditional functional dependency%data quality%data cleaning%CTANE algorithm
由于采用函数依赖( Functional Dependency,FD)对数据库的检测和修复还不够充分,现提出了条件函数依赖( Con-ditional Functional Dependency,CFD),其是在FD的基础上加入了语义约束。条件函数依赖的挖掘是一种重要的数据库分析技术,CFD挖掘是在FD挖掘的基础上通过条件分析进行更细粒度的信息挖掘,其时间复杂度较高。文中主要介绍了CFD的相关概念及CFD经典挖掘算法之一—CTANE,并对该算法效率进行改进。改进后的算法不仅可以提高数据挖掘过程中操作的效率,同时也将节省数据的存储空间。
由于採用函數依賴( Functional Dependency,FD)對數據庫的檢測和脩複還不夠充分,現提齣瞭條件函數依賴( Con-ditional Functional Dependency,CFD),其是在FD的基礎上加入瞭語義約束。條件函數依賴的挖掘是一種重要的數據庫分析技術,CFD挖掘是在FD挖掘的基礎上通過條件分析進行更細粒度的信息挖掘,其時間複雜度較高。文中主要介紹瞭CFD的相關概唸及CFD經典挖掘算法之一—CTANE,併對該算法效率進行改進。改進後的算法不僅可以提高數據挖掘過程中操作的效率,同時也將節省數據的存儲空間。
유우채용함수의뢰( Functional Dependency,FD)대수거고적검측화수복환불구충분,현제출료조건함수의뢰( Con-ditional Functional Dependency,CFD),기시재FD적기출상가입료어의약속。조건함수의뢰적알굴시일충중요적수거고분석기술,CFD알굴시재FD알굴적기출상통과조건분석진행경세립도적신식알굴,기시간복잡도교고。문중주요개소료CFD적상관개념급CFD경전알굴산법지일—CTANE,병대해산법효솔진행개진。개진후적산법불부가이제고수거알굴과정중조작적효솔,동시야장절성수거적존저공간。
Because the detection and repair of the database is not sufficient by Functional Dependency ( FD) ,Conditional Functional De-pendency ( CFD) is proposed,which is an extension of FD adding semantic constraints. The discovery of CFD is an important database a-nalysis technique,CFD mining do the more fine-grained information mines which based on FD mining,so the time complexity of CFD mining is higher than the latter. Introduce the related concept of CFD and one of the CFD classical mining algorithm—CTANE in this pa-per,and improve the efficiency of this algorithm. The improved algorithm can not only enhance the operating efficiency of the data min-ing process,but also save the data storage space.