What is RAID? data reliability with performance

Transcription

1 What is RAID? RAID is the use of multiple disks and data distribution techniques to get better Resilience and/or Performance RAID stands for: Redundant Array of Inexpensive / Independent Disks RAID can be implemented in Software or Hardware or any combination of both This presentation is a simple introduction to the RAID levels with some information on Caching and different I/O profiles

2 RAID0 - Striping Simple striping of data across disks in the array No RAID resilience - if one disk fails, then the entire array fails Good for load balancing & elimination of hot spindles Usually implemented in conjunction with RAID1 to get resilience Array capacity = disk capacity x n where n is the number of disks in the array

3 RAID1 - Mirroring Mirroring, shadowing, duplexing for resilience with 2nd copy of data Array capacity = disk capacity x n/2 where n is the number of disks in the RAID1 set (or n/3 for triple mirroring) Can be used to mirror RAID0 RAID sets with other identical RAID0 sets (RAID01 or RAID0+1) Can enable simultaneous reads on both parts of mirror

5 RAID5 Data striping with ECC based disk redundancy All disks have round-robin share of ECC data Good small block read I/O performance but complex write process Array Capacity = disk capacity x (n-1) where n is the number of disks in the array

6 RAID6 Same as RAID5 but with additional spindle and double ECC redundancy Allows two disks to fail in the same RAID set Very complex write process Array capacity = disk capacity x (n-2) where n is the number of disks in the array

7 RAID Application Theory The 1988 RAID paper came up with the following observations (without any commercial RAID systems to prove or disprove and without consideration for any enhancements such as caching) RAID5 was promoted as best for TP and database workloads RAID3 was considered best for scientific workloads Other applications fall between the two extremes in terms of typical I/O size

8 Using Cache to boost RAID5 Cache can be used to temporarily buffer write-data from the host to allow faster I/O completion However, RAID5 requires lots of cache to overcome the basic complexity of the write process. This reduces the amount of cache available for write buffering and allows the cache to become saturated in heavy load scenarios. Caching does not significantly boost throughput as small I/O s benefit most - large transfers are typically ignored by cache This means that RAID5 based systems can never be fully suited to the entire range of applications

9 Using Cache to boost RAID3 Just as with RAID5, cache can be used to buffer writes for fast I/O completion Unlike RAID5, parallel RAID systems do not need to use cache to overcome a complex write process This means that a much greater percentage of cache space is available for genuine work and as a result a RAID3 system with cache is not saturated as easily as an equivalent RAID5 system The extra cache space can also be used for hot read data Small I/O s benefit most from caching, extending the suitability of a RAID3 system with cache into the OLTP and DBMS range

10 Summary This presentation just covers the relationship between disks in the basic RAID setups, and introduces the concept of caching. Data integrity issues are not covered

Click on the diagram to see RAID 0 in action RAID Level 0 requires a minimum of 2 drives to implement RAID 0 implements a striped disk array, the data is broken down into blocks and each block is written

RAID The basic idea of RAID (Redundant Array of Independent Disks) is to combine multiple inexpensive disk drives into an array of disk drives to obtain performance, capacity and reliability that exceeds

technology brief RAID Levels March 1997 Introduction RAID is an acronym for Redundant Array of Independent Disks (originally Redundant Array of Inexpensive Disks) coined in a 1987 University of California

Hard Disk Drives and RAID Janaka Harambearachchi (Engineer/Systems Development) INTERFACES FOR HDD A computer interfaces is what allows a computer to send and retrieve information for storage devices such

RAID Overview 91.520 1 The Motivation for RAID Computing speeds double every 3 years Disk speeds can t keep up Data needs higher MTBF than any component in system IO Performance and Availability Issues!

4411 Database Management Systems Acknowledgements and copyrights: these slides are a result of combination of notes and slides with contributions from: Michael Kiffer, Arthur Bernstein, Philip Lewis, Anestis

Chapter 1 Storage Devices Summary Dependability is vital Suitable measures Latency how long to the first bit arrives Bandwidth/throughput how fast does stuff come through after the latency period Obvious

200 Chapter 7 (This observation is reinforced and elaborated in Exercises 7.5 and 7.6, and the reader is urged to work through them.) 7.2 RAID Disks are potential bottlenecks for system performance and

RAID 0 (striping) RAID 0 uses the read/write capabilities of two or more hard drives working together to maximize storage performance. Data in a RAID 0 volume is arranged into blocks that are spread across

BY Shashwath Veerappa Devaru CS615 Aspects of System Administration Using Multiple Hard Drives for Performance and Reliability RAID is the term used to describe a storage systems' resilience to disk failure

NK YORK COLLEGE OF PENNSYLVANIA HG OK 2 RAID YORK COLLEGE OF PENNSYLVAN James Moscola Department of Physical Sciences York College of Pennsylvania Based on Operating System Concepts, 9th Edition by Silberschatz,

Overview of RAID Let's first address, "What is RAID and what does RAID stand for?" RAID, an acronym for "Redundant Array of Independent Disks, is a storage technology that links or combines multiple hard

People often ask: Should I RAID my disks? The question is simple, unfortunately the answer is not. So here is a guide to help you decide when a RAID array is advantageous and how to go about it. This guide

An Introduction to RAID Giovanni Stracquadanio stracquadanio@dmi.unict.it www.dmi.unict.it/~stracquadanio Outline A definition of RAID An ensemble of RAIDs JBOD RAID 0...5 Configuring and testing a Linux

RAID Levels and Components Explained Page 1 of 23 What's RAID? The purpose of this document is to explain the many forms or RAID systems, and why they are useful, and their disadvantages. RAID - Redundant

Spot server problems before they are noticed The system s really slow today! How often have you heard that? Finding the solution isn t so easy. The obvious questions to ask are why is it running slowly

9916 Brooklet Drive Houston, Texas 77099 Phone 832-327-0316 www.safinatechnolgies.com RAID Made Easy By Jon L. Jacobi, PCWorld What is RAID, why do you need it, and what are all those mode numbers that

Dell Systems Getting Started With RAID www.dell.com support.dell.com Notes, Notices, and Cautions NOTE: A NOTE indicates important information that helps you make better use of your computer. NOTICE: A

RAID Performance Analysis We have six 500 GB disks with 8 ms average seek time. They rotate at 7200 RPM and have a transfer rate of 20 MB/sec. The minimum unit of transfer to each disk is a 512 byte sector.

Redundant Array of Inexpensive Disks (RAID) March 23, 2008 Databases: RAID 1 Announcement A delegation of Swedish Students will be visiting the University today. The Dean, Students Affairs, has extended

Firebird and RAID Choosing the right RAID configuration for Firebird. Paul Reeves IBPhoenix mail: preeves@ibphoenix.com Introduction Disc drives have become so cheap that implementing RAID for a firebird

Operating Systems RAID Redundant Array of Independent Disks Submitted by Ankur Niyogi 2003EE20367 YOUR DATA IS LOST@#!! Do we have backups of all our data???? - The stuff we cannot afford to lose?? How

BrightStor ARCserve Backup for Windows Tape RAID Option Guide r11.5 D01183-1E This documentation and related computer software program (hereinafter referred to as the "Documentation") is for the end user's

What is RAID--BASICS? Mylex RAID Primer A simple guide to understanding RAID Let's look at a hard disk... Several platters stacked on top of each other with a little space in between. One to n platters

ioscale: The Holy Grail for Hyperscale The New World of Hyperscale Hyperscale describes new cloud computing deployments where hundreds or thousands of distributed servers support millions of remote, often

To ensure the functioning of the site, we use cookies. We share information about your activities on the site with our partners and Google partners: social networks and companies engaged in advertising and web analytics. For more information, see the Privacy Policy and Google Privacy &amp Terms.
Your consent to our cookies if you continue to use this website.