The most popular and comprehensive Open Source ECM platform
Chaos Engineering: Learning How to be Resilient by Breaking Things
Chaos Engineering is a preemptive approach to creating resilient IT infrastructures. Development teams purposely try to throw as many problems at their systems to see if they can break it as a technique to identify system weaknesses and to devise ways to respond and fix the systems so that they are better able to withstand future problems.
Chaos Engineering was a technique developed by Netflix which helped it to grow to be able to support a customer base of 100 million users in 190 countries. The Netflix engineering team created a tool called Chaos Monkey that could create random failures in a system at random times. Based on the results, engineers can check the resiliency of their system when confronted with unplanned failures.
Tanusree McCabe, Cloud Architect at Capital One, wrote for Medium that “Chaos engineering allows for proactive remediation of potential issues prior to adverse impact, and increases confidence in system reliability. To effectively wield chaos engineering requires broad comprehension of failure modes and cascading failures, as well as a well defined monitoring baseline of system behavior. The investment into chaos engineering is non-trivial, and so should be employed for the most impactful outcomes.”
Emrah Samdan, vice president at Thundra, said that “Chaos engineering tests the resiliency of other parts of your system when one part is having some problems due to latency or any type of failures. Considering the distributed systems serverless paradigm implies, running chaos experiments become a no-brainer to reveal the hidden traps before customers reveal them on production.”
Thanks for providing brief information about chaos engineering