首 页 本刊概况 出 版 人 发行统计 在线订阅 欢迎投稿 市场分析 1 组织交流 1 关于我们
 
1
   通信短波
1
   新品之窗
1
   优秀论文
1
   通信趋势
1
   特别企划
1
   运营商动态
1
   技术前沿
1
   市场聚焦
1
   通信视点
1
   信息化论坛
1
当前位置:首页 > 优秀论文
一种基于正常读带宽的数据修复方法
作者:郑涵+,陈海涛,卢宇彤
来源:本站原创
更新时间:2013/9/18 9:37:00
正文:

                    (国防科学技术大学 计算机学院,长沙 410073)
A novel way to repair data based on normal read bandwidth
ZHENG Han+,CHENG Hai-Tao, LU Yu-Tong
(Department of Computer Science and Technology, National University of Defense Technology, Changsha, Hunan, 410073, China)

Abstract:Cloud storage systems typically usereplication to providereliability. Recently, erasure codeshave been used to reduce the large storage overhead of three-replicated systems. Reed-Solomon codes are the standarddesign choice. But their high repair cost is often consideredan unavoidable price to pay for high storage efficiency andhigh reliability. This paper presents a new scheme based on normal read bandwidth to repair data to reduce the repair cost. We implement our algorithm in an open source storage system and make some test on it. The result shows this new way saved almost half repair cost compared with previous way.
Key words: data repair, cloud storage, erasure codes, repair cost optimization

摘要:云存储系统主流采用副本的方式提供数据可靠性保证。当前,纠删码的方式正逐渐被采用取代传统的三副本方式,以节省存储开销。业界目前主要选择Reed-Solomoncodes作为纠删码的实现方式,但是,Reed-Solomoncodes较高的修复开销对于高效率和高可用性的存储系统而言往往是不可避免的。本文针对这一问题,提出一种基于正常读带宽的数据修复方法,减少数据修复过程中的开销。我们在一个开源的存储系统上实现了我们的算法,并进行测试比较,与之前的实现相比,要节约大约一半的修复开销。
关键词:数据修复,云存储,纠删码,修复开销优化

 

 


 References:
[1]田敬, 代亚非. P2P 持久存储研究[J]. 软件学报(Journal of Software). 2007, 18 (6):1379–1399.
[2]Cheng Huang et al.Erasure Coding in Windows Azure Storage. USENIX ATC’12, 2012
[3]Andrew Fikes. Storage Architecture and Challenges. In Faculty Summit, 2010.
[4]沈志荣,易乐天,舒继武.大规模数据中心的数据存储可靠性.国计算机学会通讯, 2012,8(10):8−16.
[5]Dabek F et al. Wide-Area cooperative storage with CFS.In Proc. of SOSP, 2001.
[6]黄震. 规模分布式存储系统中数据冗余技术研究. 博士, 国防科学技术大学研究生院, 2012.
[7]Sit Eet al. Proactive replication for data durability [C]. InInternational Workshop on Peer-to-Peer Systems (IPTPS). 2006.
[8]Duminuco A et al. Proactive Replication in Distributed StorageSystems Using Machine Availability Estimation [C]. In Proc. CoNEXT’07.December 2007.
[9]Osama Khan et al.Rethinking Erasure Codes for Cloud File Systems:Minimizing I/O for Recovery and Degraded Reads.
[10]James S. Plank. Erasure Codes forStorage Applications.In Proc.Of the 4th USENIX Conference on FAST, 2005.
[11]James S. Plank et al. Optimizing Cauchy Reed-Solomon Codesfor Fault-Tolerant Network Storage Applications. In Proc. of NCA, 2006
[12]Maheswaran et al. XORing Elephants: Novel Erasure Codes for Big Data. The 39th International Conference on Very Large Data Bases,August 26th 30th2013, Riva del Garda, Trento, Italy.
[13]Tahoe-LAFS project.https://tahoe-lafs.org.
[14]EnoThereska et al. Sierra: Practical Power-proportionality for Data Center Storage. EuroSys’11, 2011, Salzburg, Austria
[15]Plank J. A tutorial on reed-solomon coding for fault-tolerance in RAID-like systems. Software Practice and Experience, 1997,27(9):995-1012.


作者简介:
郑涵(1987-),男,硕士生,主要研究分布式存储,数据生存性。

 
 
   
《通信市场》 中国·北京·复兴路49号通信市场(100036) 点击查看具体位置
电话:86-10-6820 7724, 6820 7726
京ICP备05037146号-8
建议使用 Microsoft IE4.0 以上版本 800*600浏览 如果您有什么建议和意见请与管理员联系