
Post
The Agent Test Score
In this guest post, Flo's VP of Engineering Andrei Varanovich argues that the real challenge in AI agents isn't intelligence — it's engineering discipline. Drawing on Google's ML Test Score, he introduces an 'Agent Test Score' framework to help teams ship agents that don't just demo well, but hold up in production.
Read more




