I've recently been looking deeply into the Chaos Engineering tooling space. I am trying to talk to more SREs who have implemented Chaos Engineering practices within their company whether via in-house tools, or via external solutions like Gremlin or the various open-source projects such as Chaos Toolkit, Chaos Blade, etc.
For context, I work at an early-stage venture capital firm, Innovation Endeavors
, that primarily invests in deeply technical teams going after large industries. I generally think this type of systems-level approach to identifying emergent failures is only going to become more prominent over time, but I have some questions about the right way to truly build a scalable product/service in this space.