微计算机信息
微計算機信息
미계산궤신식
CONTROL & AUTOMATION
2010年
3期
204-206
,共3页
元搜索引擎%文本聚类%后缀树
元搜索引擎%文本聚類%後綴樹
원수색인경%문본취류%후철수
meta search engine%Text Clustering%Suffix tree
元搜索引擎结果覆盖面广,易于维护,实现简单,能够提供比较全面的结果给用户.后缀树聚类算法(STC)充分考虑了文本集合的语言学特征,并引入了短语特性,从而产生了较好的聚类效果.本文将后缀树聚类算法应用到元搜索引擎中,从而增强了结果的可浏览性,提高了搜索的精度.实验结果表明,STC算法在查准率和时间性能方面都高于传统的聚类算法.
元搜索引擎結果覆蓋麵廣,易于維護,實現簡單,能夠提供比較全麵的結果給用戶.後綴樹聚類算法(STC)充分攷慮瞭文本集閤的語言學特徵,併引入瞭短語特性,從而產生瞭較好的聚類效果.本文將後綴樹聚類算法應用到元搜索引擎中,從而增彊瞭結果的可瀏覽性,提高瞭搜索的精度.實驗結果錶明,STC算法在查準率和時間性能方麵都高于傳統的聚類算法.
원수색인경결과복개면엄,역우유호,실현간단,능구제공비교전면적결과급용호.후철수취류산법(STC)충분고필료문본집합적어언학특정,병인입료단어특성,종이산생료교호적취류효과.본문장후철수취류산법응용도원수색인경중,종이증강료결과적가류람성,제고료수색적정도.실험결과표명,STC산법재사준솔화시간성능방면도고우전통적취류산법.
Meta search engine has many advantage,its retrieve results corver a wide range, it easy to maintain and aehieve, it also can provide a more comprehensive results to users. Suffix tree clustering algorithm (STC) does not treat a document as a set of word but rather as a string,making use of proximity information between words,so it has a good clustering effect. Suffix tree clustering al-gorithm applied to the meta search engine, thus enhancing the browsing of retrieve results, improving the accuracy of the retrieve re-sults.The experimental results show that STC algorithm has a better performance than traditional clustering algorithm.