This is an interesting direction for agent frameworks. What stood out to me is the shift from simple tool orchestration to agents that can reason, call other agents, and self-manage workflows. That’s something we’ve been thinking about a lot while building SalesPlay — especially around how autonomous sales agents need clear evaluation, guardrails, and accountability to actually be useful in real GTM teams. The built-in grading/evaluation angle here feels like a practical step toward making agents less brittle and more production-ready. Curious to see how this evolves in real-world use cases.
So I look at something like Mastra (or LangChain) as agent orchestration, where you do computing tasks to line up things for an LLM to execute against.
I look at Gambit as more of an "agent harness", meaning you're building agents that can decide what to do more than you're orchestrating pipelines.
Basically, if we're successful, you should be able to chain agents together to accomplish things extremely simply (using markdown). Mastra, as far as I'm aware, is focused on helping people use programming languages (typescript) to build pipelines and workflows.
So yes it's an alternative, but more like an alternative approach rather than a direct competitor if that makes sense.
omg thank you so much. We're working on the file system stuff, that's an easier lift for us than the initial work, so we wanted to start with the big stuff and work backward. Claude Code and Codex are obviously really great at that stuff, and we'd like to be able to support a lot of that out of the box.
How would it compare?
I look at Gambit as more of an "agent harness", meaning you're building agents that can decide what to do more than you're orchestrating pipelines.
Basically, if we're successful, you should be able to chain agents together to accomplish things extremely simply (using markdown). Mastra, as far as I'm aware, is focused on helping people use programming languages (typescript) to build pipelines and workflows.
So yes it's an alternative, but more like an alternative approach rather than a direct competitor if that makes sense.
[see https://news.ycombinator.com/item?id=45988611 for explanation]
are things like file system baked in?
fan of the design of the system. looks great architecturally