Access and Feeds

Chaos Engineering: Learning How to be Resilient by Breaking Things

By Dick Weisinger

Chaos Engineering is a preemptive approach to creating resilient IT infrastructures. Development teams purposely try to throw as many problems at their systems to see if they can break it as a technique to identify system weaknesses and to devise ways to respond and fix the systems so that they are better able to withstand future problems.

Chaos Engineering was a technique developed by Netflix which helped it to grow to be able to support a customer base of 100 million users in 190 countries. The Netflix engineering team created a tool called Chaos Monkey that could create random failures in a system at random times. Based on the results, engineers can check the resiliency of their system when confronted with unplanned failures.

Tanusree McCabe, Cloud Architect at Capital One, wrote for Medium that “Chaos engineering allows for proactive remediation of potential issues prior to adverse impact, and increases confidence in system reliability. To effectively wield chaos engineering requires broad comprehension of failure modes and cascading failures, as well as a well defined monitoring baseline of system behavior. The investment into chaos engineering is non-trivial, and so should be employed for the most impactful outcomes.”

Emrah Samdan, vice president at Thundra, said that “Chaos engineering tests the resiliency of other parts of your system when one part is having some problems due to latency or any type of failures. Considering the distributed systems serverless paradigm implies, running chaos experiments become a no-brainer to reveal the hidden traps before customers reveal them on production.”

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Shout it
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter
Google Buzz (aka. Google Reader)
One comment on “Chaos Engineering: Learning How to be Resilient by Breaking Things
  1. praveen kumar says:

    Thanks for providing brief information about chaos engineering

Leave a Reply

Your email address will not be published. Required fields are marked *

*