Modeling the fault tolerance consequences of deduplication

Eric W D Rozier, William H. Sanders, Pin Zhou, Nagapramod Mandagere, Sandeep M. Uttamchandani, Mark L. Yakushev

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

15 Citations (Scopus)

Abstract

Modern storage systems are employing data deduplication with increasing frequency. Often the storage systems on which these techniques are deployed contain important data, and utilize fault-tolerant hardware and software to improve the reliability of the system and reduce data loss. We suggest that data deduplication introduces inter-file relationships that may have a negative impact on the fault tolerance of such systems by creating dependencies that can increase the severity of data loss events. We present a framework composed of data analysis methods and a model of data deduplication that is useful in studying the reliability impact of data deduplication. The framework is useful for determining a deduplication strategy that is estimated to satisfy a set of reliability constraints supplied by a user.

Original language: English
Title of host publication: Proceedings of the IEEE Symposium on Reliable Distributed Systems
Pages: 75-84
Number of pages: 10
DOI: https://doi.org/10.1109/SRDS.2011.18
State: Published - Dec 14 2011
Event: 2011 30th IEEE International Symposium on Reliable Distributed Systems, SRDS 2011 - Madrid, Spain
Duration: Oct 4 2011 to Oct 7 2011

Other

Other: 2011 30th IEEE International Symposium on Reliable Distributed Systems, SRDS 2011
Country: Spain
City: Madrid
Period: 10/4/11 to 10/7/11

Fingerprint

Fault tolerance
Modeling
Storage System
Hardware
Fault-tolerant
Data analysis
Software

Keywords

  • deduplication
  • fault tolerance
  • reliability
  • storage systems
  • UDE

ASJC Scopus subject areas

  • Hardware and Architecture
  • Computer Networks and Communications
  • Software
  • Theoretical Computer Science

Cite this

Rozier, E. W. D., Sanders, W. H., Zhou, P., Mandagere, N., Uttamchandani, S. M., & Yakushev, M. L. (2011). Modeling the fault tolerance consequences of deduplication. In Proceedings of the IEEE Symposium on Reliable Distributed Systems (pp. 75-84). [6076764] https://doi.org/10.1109/SRDS.2011.18

@inproceedings{640a0687f90d469cae3a278c253d1a05,
title = "Modeling the fault tolerance consequences of deduplication",
abstract = "Modern storage systems are employing data deduplication with increasing frequency. Often the storage systems on which these techniques are deployed contain important data, and utilize fault-tolerant hardware and software to improve the reliability of the system and reduce data loss. We suggest that data deduplication introduces inter-file relationships that may have a negative impact on the fault tolerance of such systems by creating dependencies that can increase the severity of data loss events. We present a framework composed of data analysis methods and a model of data deduplication that is useful in studying the reliability impact of data deduplication. The framework is useful for determining a deduplication strategy that is estimated to satisfy a set of reliability constraints supplied by a user.",
keywords = "deduplication, fault tolerance, reliability, storage systems, UDE",
author = "Rozier, {Eric W D} and Sanders, {William H.} and Pin Zhou and Nagapramod Mandagere and Uttamchandani, {Sandeep M.} and Yakushev, {Mark L.}",
year = "2011",
month = "12",
day = "14",
doi = "10.1109/SRDS.2011.18",
language = "English",
isbn = "9780769544502",
pages = "75--84",
booktitle = "Proceedings of the IEEE Symposium on Reliable Distributed Systems",
}

TY - GEN
T1 - Modeling the fault tolerance consequences of deduplication
AU - Rozier, Eric W D
AU - Sanders, William H.
AU - Zhou, Pin
AU - Mandagere, Nagapramod
AU - Uttamchandani, Sandeep M.
AU - Yakushev, Mark L.
PY - 2011/12/14
Y1 - 2011/12/14
AB - Modern storage systems are employing data deduplication with increasing frequency. Often the storage systems on which these techniques are deployed contain important data, and utilize fault-tolerant hardware and software to improve the reliability of the system and reduce data loss. We suggest that data deduplication introduces inter-file relationships that may have a negative impact on the fault tolerance of such systems by creating dependencies that can increase the severity of data loss events. We present a framework composed of data analysis methods and a model of data deduplication that is useful in studying the reliability impact of data deduplication. The framework is useful for determining a deduplication strategy that is estimated to satisfy a set of reliability constraints supplied by a user.

KW - deduplication
KW - fault tolerance
KW - reliability
KW - storage systems
KW - UDE
UR - http://www.scopus.com/inward/record.url?scp=83155189006&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=83155189006&partnerID=8YFLogxK
U2 - 10.1109/SRDS.2011.18
DO - 10.1109/SRDS.2011.18
M3 - Conference contribution
AN - SCOPUS:83155189006
SN - 9780769544502
SP - 75
EP - 84
BT - Proceedings of the IEEE Symposium on Reliable Distributed Systems
ER -