电脑知识与技术
COMPUTER KNOWLEDGE AND TECHNOLOGY
2013
Issue 7
pp. 1491-1493
3 pages in total
Key-Value; cloud computing; clusters
Hadoop, an open-source distributed computing framework from the Apache Software Foundation, can efficiently compute over and process massive data sets and can cope with tens of millions of concurrent requests from the Internet, but it does not support real-time reading, writing, and modification of data. Cassandra is a powerful column-oriented key-value distributed database with good real-time read/write performance and scalability, but it lacks the ability to analyze and compute over massive data. Combining Hadoop with Cassandra lets each offset the other's weakness and provides an efficient, practical approach to implementing a cloud computing model. This paper first explains why integrating Hadoop with Cassandra is necessary for processing massive data, then presents a concrete integration scheme and its implementation, and finally summarizes the main problems encountered during the integration.