山东大学学报(工学版)
山東大學學報(工學版)
산동대학학보(공학판)
JOURNAL OF SHANDONG UNIVERSITY(ENGINEERING SCIENCE)
2015年
1期
24-29,36
,共7页
黄泗勇%陈婷婷%卢清%吴英杰%叶少珍
黃泗勇%陳婷婷%盧清%吳英傑%葉少珍
황사용%진정정%로청%오영걸%협소진
隐私保护%差分隐私%数据划分发布%kd-树%二维空间数据
隱私保護%差分隱私%數據劃分髮佈%kd-樹%二維空間數據
은사보호%차분은사%수거화분발포%kd-수%이유공간수거
privacy preserving%differential privacy%data partitioning publication%kd-tree%two-dimensional dataset
为解决现有基于网格结构的差分隐私二维空间数据划分发布方法可能引起局部划分过细导致查询精度低的问题,提出了基于kd-树的差分隐私二维空间数据划分发布方法—kd-PPDP算法(differentially privacy partitio-ning publication algorithm based on kd-tree)。算法采用了kd-树算法思想,通过启发式地识别网格化后数据分布情况并合并相邻近似网格单元来防止局部划分过细问题,从而减少所添加的噪声,提高查询精度。通过实验对比分析了kd-PPDP算法与现有基于网格结构的划分发布方法的查询误差以及时间效率,结果表明了该算法的有效性和可行性。
為解決現有基于網格結構的差分隱私二維空間數據劃分髮佈方法可能引起跼部劃分過細導緻查詢精度低的問題,提齣瞭基于kd-樹的差分隱私二維空間數據劃分髮佈方法—kd-PPDP算法(differentially privacy partitio-ning publication algorithm based on kd-tree)。算法採用瞭kd-樹算法思想,通過啟髮式地識彆網格化後數據分佈情況併閤併相鄰近似網格單元來防止跼部劃分過細問題,從而減少所添加的譟聲,提高查詢精度。通過實驗對比分析瞭kd-PPDP算法與現有基于網格結構的劃分髮佈方法的查詢誤差以及時間效率,結果錶明瞭該算法的有效性和可行性。
위해결현유기우망격결구적차분은사이유공간수거화분발포방법가능인기국부화분과세도치사순정도저적문제,제출료기우kd-수적차분은사이유공간수거화분발포방법—kd-PPDP산법(differentially privacy partitio-ning publication algorithm based on kd-tree)。산법채용료kd-수산법사상,통과계발식지식별망격화후수거분포정황병합병상린근사망격단원래방지국부화분과세문제,종이감소소첨가적조성,제고사순정도。통과실험대비분석료kd-PPDP산법여현유기우망격결구적화분발포방법적사순오차이급시간효솔,결과표명료해산법적유효성화가행성。
The existing differentially privacy two-dimensional dataset partitioning publication approaches might result in over-partitioning of certain local regions thus lower the query accuracy.To solve this problem,a new differentially pri-vacy partitioning publication algorithm based on kd-tree for two-dimensional dataset (kd-PPDP)was proposed.To re-duce the noise generated by the Laplace mechanism and improve the accuracy of query,the new approach which was in-spired by the core thought of kd-tree took the gridding data distribution into account and merged adjacent grid units with similar information heuristically to prevent local over-partitioning.The new approach was compared with the existing differentially privacy partitioning publication approachs based on grid structure in terms of query error and time complex-ity.Experimental results showed that the approach was feasible and effective.