Without GradeX
- Changes ship on intuition
- Few ideas get tested
- No reliable baseline
- Experiments disappear
- Engineering decisions are hard to verify
Product updates, engineering notes, and practical guides for running AI-driven experiments on real codebases.
Evidence-driven engineering replaces intuition-only shipping.
GradeX is an open-source CLI for autonomous code optimization. It runs parallel AI agents in isolated git worktrees, scores every patch against a benchmark, applies correctness gates, and keeps only changes that prove measurable improvement.
Most code changes are evaluated by intuition. A developer believes a refactor is faster, a model believes a patch is cleaner, or a team ships an optimization because it looks plausible. GradeX changes the loop: every candidate patch needs a number, a baseline, and a gate.
Run gradex discover to identify a target and capture a baseline. Then run gradex optimize. GradeX spawns subagents, gives each a brief, evaluates their patches, and updates the dashboard as experiments complete.
Next we are focusing on better multi-language discovery, hosted team dashboards, richer telemetry, and deeper integrations with coding agents.
Try GradeX free →A practical guide to deterministic benchmarks, warmups, gate selection, and avoiding false wins.
Read the docs →Parallel exploration only matters when each change is measured, gated, and recorded.
See the product →