Tradeoffs in Resiliency: Managing the Burden of Data Recoverability

Friday, 2018, August 31 - 11:00–11:45

Kristina Bennett, Google

Abstract:

Almost every service has critical data somewhere, whether it's large-scale blob storage or minimalistic index tables or just the service's own production configuration. The data's sizes and shapes and storage technologies vary widely; and yet, the possibilities for data loss remain, and the same obstacles to recovery consistently appear. This talk reviews the practices that can prepare a service for practical data recoveries, highlights some of the hidden dangers waiting to ambush a recovery attempt, and examines some of the risk/cost tradeoffs that inevitably dominate data integrity coverage, based on the lessons of five years of data integrity tooling and consulting across Google.

Kristina Bennett has worked at Google since 2009. Although she recently joined the Customer Reliability Engineering team in their mission to apply the principles and lessons of SRE at Google towards customers, prior to that she spent five years working on data integrity across Google.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.