计算机工程与应用
計算機工程與應用
계산궤공정여응용
COMPUTER ENGINEERING AND APPLICATIONS
2013年
7期
98-101
,共4页
垃圾邮件%邮件过滤%概率%阈值%分类决策
垃圾郵件%郵件過濾%概率%閾值%分類決策
랄급유건%유건과려%개솔%역치%분류결책
spam email%email filter%probability%threshold%classify decision
探讨了基于概率阈值的贝叶斯邮件过滤模型的局限性:由于很少考虑所设定阈值的适用性和实用性,损失了一定的召回率.改进贝叶斯决策,提出了基于随机变量的较小错误分类决策方法;针对邮件处理的特殊性,进一步提出了基于随机变量的较小风险分类决策方法.实验结果表明,处理普通文本分类问题时,前者的分类决策效果更好;而后者在处理邮件问题时性能更优,能够在保持较小误判风险的同时,提高贝叶斯邮件过滤器的召回率以及 F 值.
探討瞭基于概率閾值的貝葉斯郵件過濾模型的跼限性:由于很少攷慮所設定閾值的適用性和實用性,損失瞭一定的召迴率.改進貝葉斯決策,提齣瞭基于隨機變量的較小錯誤分類決策方法;針對郵件處理的特殊性,進一步提齣瞭基于隨機變量的較小風險分類決策方法.實驗結果錶明,處理普通文本分類問題時,前者的分類決策效果更好;而後者在處理郵件問題時性能更優,能夠在保持較小誤判風險的同時,提高貝葉斯郵件過濾器的召迴率以及 F 值.
탐토료기우개솔역치적패협사유건과려모형적국한성:유우흔소고필소설정역치적괄용성화실용성,손실료일정적소회솔.개진패협사결책,제출료기우수궤변량적교소착오분류결책방법;침대유건처리적특수성,진일보제출료기우수궤변량적교소풍험분류결책방법.실험결과표명,처리보통문본분류문제시,전자적분류결책효과경호;이후자재처리유건문제시성능경우,능구재보지교소오판풍험적동시,제고패협사유건과려기적소회솔이급 F 치.
@@@@This paper confers in depth to the limitations of the traditional Bayesian anti-spam mechanism. It seldom thinks about whether the threshold is suitable or not, so the recalling is reduced. Aiming at this question, the paper proposes a lower-error policy decision based on chance variable; and considering the particularity of email classification, a lower-risk policy decision based on chance variable is proposed. The experimental results show that the former one maybe a better way to classify the common text;and the latter one makes better performance on recalling and F value when dealing with emails, at the same time it keeps a lower risk of error judging.