Anthropic did something that sounds like a sci-fi short story: they built a classified marketplace where AI agents represented both sides of every transaction. Buyers, sellers, negotiators, all bots. And the deals were for actual physical goods, paid for with actual money.
This wasn’t a simulation or a toy. The agents had to figure out pricing, haggle over terms, and close the deal. Real products changed hands. Real dollars moved. The only human involvement was setting up the rules and maybe watching the chaos unfold.
I’ve seen plenty of “AI agents doing X” demos that turn out to be heavily scripted or sandboxed to the point of irrelevance. This one feels different. Anthropic isn’t claiming the agents were perfect — they probably weren’t — but the fact that they let agents negotiate real economic value without a human in the loop for every decision is a meaningful step.
The implications are pretty wild. If you can trust an agent to buy or sell on your behalf in a controlled marketplace, what stops you from scaling that to something bigger? A fleet of procurement agents negotiating with a fleet of supplier agents. No purchase orders, no email chains, just back-and-forth offers until a deal sticks.
Of course, there are obvious risks. What happens when two agents get stuck in a loop, or one exploits a loophole in the other’s instructions? Anthropic presumably accounted for some of that, but real-world deployment would be a different beast. The experiment is still interesting precisely because it raises those questions instead of pretending they don’t exist.
I also wonder about the economic dynamics. If agents are purely rational and optimized for cost or profit, do markets behave differently than when humans are involved? Less emotion, maybe, but also less creativity in deal-making. A human seller might throw in a discount for a repeat buyer; an agent would need that explicitly programmed.
Anthropic hasn’t released full details on how the agents were structured or what safeguards they used, but the fact they ran this at all suggests they’re thinking seriously about agent-to-agent commerce as a real use case. It’s one thing to have an agent book a flight for you. It’s another to have two agents negotiate a bulk order of office supplies and actually pay for it.
I’d like to see more transparency about the failures — what deals fell through, how often agents had to escalate to a human, whether any agent accidentally bought something it shouldn’t have. That’s where the real lessons live. But as a proof of concept, this is more interesting than most AI demos I’ve seen this year.
We’re not at the point where you can set your procurement AI loose on the open internet and expect it to behave. But a controlled marketplace with clear rules? That’s a sandbox worth watching.
Comments (0)
Login Log in to comment.
Be the first to comment!