The Richter Scale of Reliability in Highly Scalable Infrastructure

Anyone who operates highly scalable infrastructure will know that there is one maxim that they must abide by:

Assume everything fails

It may seem like a rather morbid approach to the design and operation of infrastructure, but it becomes obvious once you shift from an “aim for 100% uptime” mindset to the Site Reliability Engineer (SRE) approach.

What I mean by the shift in approach is that we stop building on the architect’s classic assumption that we can design for total reliability. The SRE approach instead assumes that every layer of your application infrastructure is failing and recovering continuously, and relies on that ability to recover.

Scale-Out Fails By Design

I once read something from Kelly Sommers (@kellabyte on Twitter) about how she operated database infrastructure at such a scale that about 10% of the nodes were in a failed state at any given time, due to the load put on them and other operational impact.
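To put a rough number on why a system can tolerate that, here is a minimal sketch in Python. It is not taken from the original article: the function names are hypothetical, and it assumes each attempt lands on an independently chosen node and that failures are uncorrelated, which real clusters often violate. Under those assumptions, a request that retries on another node only fails if every node it tries happens to be down.

```python
import random


def analytic_success_rate(failed_fraction, attempts):
    """Probability that at least one of `attempts` independent tries hits a healthy node."""
    return 1.0 - failed_fraction ** attempts


def request_succeeds(failed_fraction, attempts, rng):
    """Try up to `attempts` independently chosen nodes; stop at the first healthy one."""
    for _ in range(attempts):
        # Each chosen node is down with probability `failed_fraction` (an assumption).
        if rng.random() >= failed_fraction:
            return True
    return False


def simulated_success_rate(failed_fraction, attempts, trials=100_000, seed=42):
    """Estimate the same probability by simulating many requests."""
    rng = random.Random(seed)
    ok = sum(request_succeeds(failed_fraction, attempts, rng) for _ in range(trials))
    return ok / trials


if __name__ == "__main__":
    # 10% of nodes failed at any given time, as in the anecdote above.
    for attempts in (1, 2, 3):
        print(
            attempts,
            round(analytic_success_rate(0.10, attempts), 4),
            round(simulated_success_rate(0.10, attempts), 4),
        )
```

With 10% of nodes down, a single attempt succeeds about 90% of the time, while allowing up to three attempts brings the analytic figure to 1 - 0.1^3 = 99.9%. The point is that the reliability comes from the retry path, not from any individual node staying up.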

Read the entire article here: The Richter Scale of Reliability in Highly Scalable Infrastructure

via the fine folks at Turbonomic!
