Journal of Integration Technology
2014
Issue 4
pp. 18-30
13 pages in total
big data engine; data storage; row-columnar hybrid; clustered index
Big data computing faces three challenges that traditional IT technologies cannot handle: volume, variety, and velocity. Benefiting from the practical applications and continuous code contributions of numerous domestic and overseas Internet companies, Apache Hadoop has become a mature software stack and the de facto standard for petabyte-scale data processing, and software ecosystems have grown up around the different types of big data processing requirements. This paper introduces three research works in big data computing systems, covering data placement, index construction, and hardware acceleration of compression and decompression: RCFile, CCIndex, and SwiftFS, respectively. Together they address the storage space and query performance problems of big data computing systems. These results have been integrated as key technologies into the Golaxy big data engine software stack and have directly supported multiple production applications at Taobao and Tencent.