Best for Kubernetes-native CNCF OSS
Try Chaos MeshChaos Mesh Open Source is Apache 2 free for Kubernetes-native chaos engineering as a CNCF incubating project; Chaos Mesh Cloud is a free SaaS tier with limits; PingCAP Enterprise covers self-hosted enterprise plus SSO with custom integrations plus dedicated CSM. The differentiator vs Gremlin is the CNCF Kubernetes-native model: where Gremlin treats hosts as the unit, Chaos Mesh treats Kubernetes resources as the unit (pods, nodes, network policies, persistent volume claims). For Kubernetes-first SRE teams, Chaos Mesh fits the cluster model where Gremlin's host-based model adds friction. The trade vs Gremlin: smaller commercial support ecosystem, less polished reliability scoring.
Strengths
- +CNCF Apache 2 OSS for K8s-native chaos
- +Pod, network, IO, kernel chaos primitives
- +Standard chaos experiments + workflows
- +Strong fit for K8s-first teams
Trade-offs
- −Smaller commercial support than Gremlin
- −Less polished reliability scoring
- −K8s-only (no general infrastructure)
- OSS
- Free, Apache 2 + CNCF
- Cloud Free
- Limited free SaaS
- PingCAP Enterprise
- Custom (~$2K/mo)
- Strength
- K8s-native CNCF OSS
Migration steps
- Install Chaos Mesh on Kubernetes cluster via Helm chart.
- Configure RBAC and chaos experiment templates.
- Migrate Gremlin experiments to Chaos Mesh equivalents.
- Run parallel for 30-60 days.
- Cancel Gremlin when Chaos Mesh covers your K8s-native chaos.
Not for: Chaos Mesh is the wrong fit for teams running non-Kubernetes infrastructure (bare-metal, EC2 directly, etc.); staying with Gremlin or Steadybit is correct for those.
Paid plans from $2,000.00/mo