JOURNAL OF KUNMING UNIVERSITY (昆明学院学报)
2011
Issue 3
pp. 60-63
4 pages in total
self-index%backward searching%text data%Burrows-Wheeler Compression (BWC)
A considerable portion of an XML document consists of text data. To address two difficulties, namely that XML text data occupies a large amount of space and that effective search over compressed text data is inefficient, an indexing technique for compressed XML text data based on BWC is proposed. The technique constructs a full-text data model and stores the text data of the XML document in a global compressed self-index. Experimental results show that the technique not only supports text search in the XPath query language effectively, but also consumes relatively little memory, making in-memory search feasible for small and medium-scale data.
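To illustrate the backward searching named in the keywords, the following is a minimal Python sketch of pattern matching over a Burrows-Wheeler transformed text. It is not the paper's implementation: the function names, the `\x00` end-of-text marker, the naive rotation-free suffix sort, and the uncompressed occurrence tables are all assumptions made for illustration, and a real self-index would store these structures in compressed form.

```python
from collections import Counter

SENTINEL = "\x00"  # assumed end-of-text marker; must not occur in the text


def bwt_and_suffix_array(text):
    """Return (BWT string, suffix array) of text + SENTINEL, built naively."""
    s = text + SENTINEL
    sa = sorted(range(len(s)), key=lambda i: s[i:])
    # The BWT character of a suffix is the character preceding it (cyclically).
    bwt = "".join(s[i - 1] for i in sa)
    return bwt, sa


def build_tables(bwt):
    """Build C (count of smaller symbols) and Occ (prefix counts per symbol)."""
    counts = Counter(bwt)
    c_table, total = {}, 0
    for ch in sorted(counts):
        c_table[ch] = total
        total += counts[ch]
    occ = {ch: [0] * (len(bwt) + 1) for ch in counts}
    for i, ch in enumerate(bwt):
        for sym, row in occ.items():
            row[i + 1] = row[i] + (1 if sym == ch else 0)
    return c_table, occ


def backward_search(pattern, c_table, occ, n):
    """Return the half-open suffix-array range [lo, hi) matching pattern."""
    lo, hi = 0, n
    for ch in reversed(pattern):  # process the pattern right to left
        if ch not in c_table:
            return 0, 0
        lo = c_table[ch] + occ[ch][lo]
        hi = c_table[ch] + occ[ch][hi]
        if lo >= hi:
            return 0, 0
    return lo, hi


def count_and_locate(text, pattern):
    """Count occurrences of pattern in text and list their start offsets."""
    bwt, sa = bwt_and_suffix_array(text)
    c_table, occ = build_tables(bwt)
    lo, hi = backward_search(pattern, c_table, occ, len(bwt))
    return hi - lo, sorted(sa[lo:hi])


if __name__ == "__main__":
    print(count_and_locate("abracadabra", "abra"))
```

Each step of `backward_search` narrows the range of sorted suffixes using only the C and Occ tables, so the number of steps depends on the pattern length rather than the text length. In this sketch the suffix array is kept in full to report positions; a compressed self-index would typically sample it instead.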