计算机科学与探索
計算機科學與探索
계산궤과학여탐색
JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY
2010年
2期
145-152
,共8页
付强%王腾蛟%李红燕%杨冬青%唐世渭
付彊%王騰蛟%李紅燕%楊鼕青%唐世渭
부강%왕등교%리홍연%양동청%당세위
可扩展标记语言%压缩%文档类型定义%数据流
可擴展標記語言%壓縮%文檔類型定義%數據流
가확전표기어언%압축%문당류형정의%수거류
extensible markup language(XML)%compression%document type definition(DTD)%data stream
XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream layerecl-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境.实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法.
XML(extensible mallkup language)文檔已經被廣汎用作應用程序的一箇數據交換格式,針對XML數據的壓縮技術也逐漸成為新的研究領域.提齣XSLC(XML stream layerecl-coding compression)算法,通過預先掃描DTD對數據模式進行分析,繼而根據元素的父子關繫進行子元素層麵的編碼;同時根據數據類型進行數據壓縮,能夠在壓縮之後的文檔上進行查詢,因為僅需一遍壓縮掃描所以可以應用于數據流環境.實驗錶明:XSLC算法的壓縮比率和壓縮時間均優于傳統算法.
XML(extensible mallkup language)문당이경피엄범용작응용정서적일개수거교환격식,침대XML수거적압축기술야축점성위신적연구영역.제출XSLC(XML stream layerecl-coding compression)산법,통과예선소묘DTD대수거모식진행분석,계이근거원소적부자관계진행자원소층면적편마;동시근거수거류형진행수거압축,능구재압축지후적문당상진행사순,인위부수일편압축소묘소이가이응용우수거류배경.실험표명:XSLC산법적압축비솔화압축시간균우우전통산법.
XML documents have been widely used as a data exchange format. XML (extensible markup language) data compression technology has become a new field of research. A compression methyl called XSLC (XML stream layered-coding compression) is proposed to compress and decompress XML stream in real time. When DTD (document type definition) is available, XSLC can analyze the data model and encode elements according to the relationship of father node and son node, compress data part according to its type, and support query operations applied on compressed files, as for only one time of scanning data is needed, all the processes can be implemented in XML data stream environment. Experimental results show that XSLC outperforms other methods in compression ratio and compression efficiency.