Put your systems to the test and improve your operational readiness with a wide variety of experiments. Identify all of your reliability gaps and system limitations.
Our Advice feature provides you with a list of recommended experiments
Create new experiments fast by selecting from over 50 pre-built templates
Build experiments from scratch and add your own custom faults
When it’s easy to build experiments, reliability can be an inclusive cross-team effort.
Learn how to create reliability tests quickly with the Steadybit experiment editor.e exact experiment you want in minutes.
Read MoreBuild new experiments fast with hundreds of no-code actions.
Read MoreUse a library of 80+ templates to generate and review ready-to-run experiments.
Read MoreWatch experiments run in real-time to see how each step impacts your system.
Read MoreRun future experiments as one-offs or recurring tests.
Read MoreCreate automated workflows with the Steadybit API, CLI, and MCP Server.
Read More
It’s important to start with a small blast radius when running experiments for the first time. For example, you may target only 10% of the pods in a cluster with a given attack. As you build confidence in how your system will respond, you can expand your blast radius and take on more risk.
Targeting in Steadybit uses an intuitive query language based on discovered metadata. It’s easy to be specific and safe, so you know exactly what your experiment will impact. Each action has a blast radius you can adjust with a simple toggle control. There is always an emergency stop button close by to hit the brakes and rollback changes.
When you start an experiment, you will be able to watch it run in real-time as each step is executed and review a summary of your system’s behavior. If your target is a Kubernetes cluster for example, you’ll see the Kubernetes event log so you can see each change and the results of health checks.
You can also watch to see if your observability tool is raising an alert when expected. Just install the relevant extension and view these real-time events in Steadybit.
Create reliability tests across your tech stack with a wide range of pre-built actions and templates. View the full library in the Reliability Hub.
Explore More Actions
Browse open source actions that you can easily add to experiments.
Explore More Templates
Browse the full list of open source experiment templates in the Reliability Hub.
Our open source extension framework makes it easy to add custom components to Steadybit. Build your own custom actions using our language-agnostic ActionKit and create any experiment that would be useful for your organization.
You can run experiments manually, on a schedule, or with automation. Many teams will incorporate Steadybit experiments into their CI/CD workflow so they can continually verify experiments and ensure that new deployments meet a certain reliability standard.
With the Steadybit API and CLI, it’s easy to incorporate experiments into your development lifecycle to on your terms.
As you run experiments across teams, you can track your progress with reports in Steadybit. See what types of attacks are being used most often, count experiment runs, and see how many issues you have found and fixed.
See what types of actions, targets, and templates are waiting for you and your team in our open source library.