Reduce error budget depletion with Chaos Engineering

ChaosWheel

About the speaker

Vishnu V Chikoti

Vishnu V Chikoti

Sr Manager - SRE,

Fanatics

Vishnu Vardhan Chikoti has diverse experience in the areas of Application and Database design and development, Micro-services & Micro-frontends, DevOps, Site Reliability Engineering and Machine Learning. He is the co-author for the book “Hands-on Site Reliability Engineering”. And he has created enterprise SRE adoption framework - Arctic. With an ability to conduct deep analysis, strong execution skills and an innovative mindset, he has successfully led R&D teams to build engineering solutions to improve reliability of applications. He also has deep expertise in building high volume transaction processing applications for middle & back office functions at Investment Banks using a variety of architectures. He has successfully led teams for Agile transformation and Site Reliability Engineering transformation.

About the talk

This talk is about modern complex architectures and how Chaos experiments help in keeping error budgets in control by proactively discovering and preventing failures. When we look at the modern full stack, the number of layers is only increasing. There is usage PaaS, IaaS and SaaS, all in the mix. There are more and more technologies/tools/services being offered to either improve the way we are working today or solve an unsolved problem earlier in an innovative way. At one end, problems are being solved and at other end, more and more dependencies are being added. As per Murphy’s law, “Anything that can go wrong will go wrong” and when we look at it technically, it becomes “Anything that can fail will fail”. When we apply this law to modern multi-layered architectures with multiple dependencies, there is a need to do many things to prevent as many potential failures as possible and detect those that remain. Chaos Engineering helps in prevention of failures and early identification on any missed out detections.

Team Resilience Score: a mechanism to encourage teams’ preparedness to avoid failures
Team Resilience Score: a mechanism to encourage teams’ preparedness to avoid failures
GitOps meets Chaos Engineering
GitOps meets Chaos Engineering

Sign Up

for our Newsletter

Get tips, best practices and updates on Chaos engineering in cloud native.

Videos

by Experts

Checkout our videos from the latest conferences and events

Our Videos

Related Blog

Litmus 2.0

Read

Aug 15, 2021

6 Min Read

Uma's Blog on Litmus 2