You are previewing Reliability Assurance of Big Data in the Cloud.
O'Reilly logo
Reliability Assurance of Big Data in the Cloud

Book Description

With the rapid growth of Cloud computing, the size of Cloud data is expanding at a dramatic speed. A huge amount of data is generated and processed by Cloud applications, putting a higher demand on cloud storage. While data reliability should already be a requirement, data in the Cloud needs to be stored in a highly cost-effective manner. This book focuses on the trade-off between data storage cost and data reliability assurance for big data in the Cloud. Throughout the whole Cloud data lifecycle, four major features are presented: first, a novel generic data reliability model for describing data reliability in the Cloud; second, a minimum replication calculation approach for meeting a given data reliability requirement to facilitate data creation; third, a novel cost-effective data reliability assurance mechanism for big data maintenance, which could dramatically reduce the storage space needed in the Cloud; fourth, a cost-effective strategy for facilitating data creation and recovery, which could significantly reduce the energy consumption during data transfer.

  • Captures data reliability with variable disk rates and compares virtual to physical disks
  • Offers methods for reducing cloud-based storage cost and energy consumption
  • Presents a minimum replication benchmark for data reliability requirements to evaluate various replication-based data storage approaches

Table of Contents

  1. Cover
  2. Title page
  3. Table of Contents
  4. Copyright
  5. About the Authors
  6. Preface
  7. Acknowledgments
  8. 1: Introduction
    1. Abstract
    2. 1.1. Data reliability in the Cloud
    3. 1.2. Background of Cloud storage
    4. 1.3. Key issues of research
    5. 1.4. Book overview
  9. 2: Literature review
    1. Abstract
    2. 2.1. Data reliability assurance in hardware
    3. 2.2. Data reliability assurance in software
    4. 2.3. Data transfer for distributed systems
    5. 2.4. Summary
  10. 3: Motivating example and problem analysis
    1. Abstract
    2. 3.1. Motivating example
    3. 3.2. Problem analysis
    4. 3.3. Summary
  11. 4: Generic data reliability model in the cloud
    1. Abstract
    2. 4.1. Properties of the data reliability model
    3. 4.2. Generic data reliability model
    4. 4.3. Summary
  12. 5: Minimum replication for meeting the data reliability requirement
    1. Abstract
    2. 5.1. The minimum replication calculation approach
    3. 5.2. Minimum replication benchmark
    4. 5.3. Evaluation of the minimum replication calculation approach
    5. 5.4. Summary
  13. 6: Cost-effective data reliability assurance for data maintenance
    1. Abstract
    2. 6.1. Proactive replica checking
    3. 6.2. Overview of PRCR
    4. 6.3. Working process of PRCR
    5. 6.4. Optimization algorithms in PRCR
    6. 6.5. Evaluation of PRCR
    7. 6.6. Summary
  14. 7: Cost-effective data transfer for data creation and data recovery
    1. Abstract
    2. 7.1. Determining the deadline for data creation and data recovery
    3. 7.2. Cloud network model
    4. 7.3. Energy consumption model for Cloud data transfer
    5. 7.4. Novel cost-effective data transfer strategy LRCDT
    6. 7.5. Evaluation of LRCDT
    7. 7.6. Summary
  15. 8: Conclusions and future work
    1. Abstract
    2. 8.1. Summary of this book
    3. 8.2. Key contributions of this book
    4. 8.3. Further discussion and future work
  16. Bibliography
  17. Appendix
  18. Index