###

计算机系统应用英文版:2017,26(2):112-117

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

云存储系统管理节点故障自恢复算法

马玮骏¹, 王强¹, 何晓晖¹, 张舒², 张庆³

(1.解放军理工大学野战工程学院, 南京 210014;2.西部战区空军气象中心, 成都 610000;3.东部战区空军气象中心, 南京 210018)

Fault Self-Recovery Algorithm for Management Node in Cloud Storage System

MA Wei-Jun¹, WANG Qiang¹, HE Xiao-Hui¹, ZHANG Shu², ZHANG Qing³

(1.Institute of Field Engineering, PLA University of Science and Technology, Nanjing 210014, China;2.Western Theater Air Meteorological Center, Chengdu 610000, China;3.Eastern Theater Air Meteorological Center, Nanjing 210018, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 1582次下载 1730次
Received:May 10, 2016 Revised:June 30, 2016

中文摘要: 为了解决大规模云存储系统中管理节点发生故障导致存储服务不可用的问题，建立了管理节点故障影响分析模型，提出了一种基于消息的管理节点动态自我恢复算法FRA-M.该算法通过基于负载均衡的元数据备份更新控制方法完成多个管理节点之间相互协作、透明接管和故障自我恢复.测试结果表明，FRA-M算法能够使得管理节点发生故障时自动进行切换，并且能够合理地分配资源达到良好的负载均衡状态.通过控制TCP超时时限、故障检测周期以及故障检测超时，能够使得FRA-M算法的性能保持在相对稳定的区间，随失效时刻的适应性也比较强.当管理节点发生故障时，FRA-M能够较好地保证存储服务可用性、数据可用性和数据可靠性.

中文关键词: 云存储系统管理节点自我恢复元数据负载均衡动态切换

Abstract:In order to solve the storage service unavailable problem on account of the management node fault in huge cloud storage system, an analysis model for fault effect of management node is built and a dynamic self-recovery algorithm for management node based on message called FRA-M is presented. FRA-M implements the cooperation, transparent take-over and self-recovery of management nodes by metadata update control based on load balance. Experiment shows FRA-M can provide management nodes auto switching when fault occurs and achieve good load balance by favorable resource allocation. The performance of FRA-M is also maintained in a relatively stable interval by reasonable control of TCP timeout, fault detection cycle and fault detection timeout. The storage service availability, data usability and data reliability are guaranteed by FRA-M during the breakdown of management nodes.

keywords: cloud storage system management node self-recovery metadata load balancing dynamic switching

文章编号： 中图分类号： 文献标志码：

基金项目:国家自然科学基金（61371119）

引用文本：
马玮骏,王强,何晓晖,张舒,张庆.云存储系统管理节点故障自恢复算法.计算机系统应用,2017,26(2):112-117
MA Wei-Jun,WANG Qiang,HE Xiao-Hui,ZHANG Shu,ZHANG Qing.Fault Self-Recovery Algorithm for Management Node in Cloud Storage System.COMPUTER SYSTEMS APPLICATIONS,2017,26(2):112-117

Author Name	Affiliation
MA Wei-Jun	Institute of Field Engineering, PLA University of Science and Technology, Nanjing 210014, China
WANG Qiang	Institute of Field Engineering, PLA University of Science and Technology, Nanjing 210014, China
HE Xiao-Hui	Institute of Field Engineering, PLA University of Science and Technology, Nanjing 210014, China
ZHANG Shu	Western Theater Air Meteorological Center, Chengdu 610000, China
ZHANG Qing	Eastern Theater Air Meteorological Center, Nanjing 210018, China