计算机科学与探索
計算機科學與探索
계산궤과학여탐색
JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY
2015年
5期
546-554
,共9页
宋宝燕%李雪城%任才%丁琳琳
宋寶燕%李雪城%任纔%丁琳琳
송보연%리설성%임재%정림림
云计算%中间数据%副本%容错算法
雲計算%中間數據%副本%容錯算法
운계산%중간수거%부본%용착산법
cloud computing%intermediate data%replication%fault-tolerant algorithm
通常在云计算框架的处理过程中会产生大量的、短暂的,同时又非常重要的中间数据。一旦有服务器失效,将会导致中间数据失效,进而影响整个任务的计算。现有的数据容错处理方法仅仅采用简单的复制策<br> 略,没有考虑中间数据的特点,会带来庞大的网络开销。因此,提出了一种有效的分层中间数据容错方法,即IDF_Support(intermediate data fault-tolerance_support)方法。通过将计算任务划分为不同类别,IDF_Support方法能够有效地处理中间数据失效。提出了分层的中间数据容错算法,分别是用于解决一个任务内部容错的中间数据容错算法(Inner_Task IDF)和用于解决任务间容错的中间数据容错算法(Outer_Task IDF)。实验结果表明,这些算法在机器出现故障的情况下提高了作业响应时间,保证了系统的可靠性。
通常在雲計算框架的處理過程中會產生大量的、短暫的,同時又非常重要的中間數據。一旦有服務器失效,將會導緻中間數據失效,進而影響整箇任務的計算。現有的數據容錯處理方法僅僅採用簡單的複製策<br> 略,沒有攷慮中間數據的特點,會帶來龐大的網絡開銷。因此,提齣瞭一種有效的分層中間數據容錯方法,即IDF_Support(intermediate data fault-tolerance_support)方法。通過將計算任務劃分為不同類彆,IDF_Support方法能夠有效地處理中間數據失效。提齣瞭分層的中間數據容錯算法,分彆是用于解決一箇任務內部容錯的中間數據容錯算法(Inner_Task IDF)和用于解決任務間容錯的中間數據容錯算法(Outer_Task IDF)。實驗結果錶明,這些算法在機器齣現故障的情況下提高瞭作業響應時間,保證瞭繫統的可靠性。
통상재운계산광가적처리과정중회산생대량적、단잠적,동시우비상중요적중간수거。일단유복무기실효,장회도치중간수거실효,진이영향정개임무적계산。현유적수거용착처리방법부부채용간단적복제책<br> 략,몰유고필중간수거적특점,회대래방대적망락개소。인차,제출료일충유효적분층중간수거용착방법,즉IDF_Support(intermediate data fault-tolerance_support)방법。통과장계산임무화분위불동유별,IDF_Support방법능구유효지처리중간수거실효。제출료분층적중간수거용착산법,분별시용우해결일개임무내부용착적중간수거용착산법(Inner_Task IDF)화용우해결임무간용착적중간수거용착산법(Outer_Task IDF)。실험결과표명,저사산법재궤기출현고장적정황하제고료작업향응시간,보증료계통적가고성。
Cloud computing frameworks usually generate large amounts of intermediate data which are short-lived, yet are important for the completion of job. Once there are server failures, it will lead to the failures of intermediate data, and affects the computation of the whole job. However, the existing fault-tolerant processing approaches only adopt simple replication strategies which can incur significant network overhead, and have no considering of the characteristics of intermediate data. Therefore, this paper proposes an efficient layered intermediate data fault-tolerant approach, named IDF_Support (intermediate data fault-tolerance_support) approach. By dividing the computing tasks into different classi-fications, IDF_Support approach can effectively process the intermediate data failures. Then, this paper proposes two layered intermediate data fault-tolerant algorithms, respectively the inner task intermediate data fault-tolerant algo-rithm (Inner_Task IDF) which resolves the fault-tolerance within a task and the outer task intermediate data fault-tolerant algorithm (Outer_Task IDF) which resolves the fault-tolerance among tasks. The experimental results show that the pro-posed algorithms can improve the response time in the case of machine failure, and keep the reliability of the whole system.