I tracked what AI agents actually do when nobody’s watching. Built a tool that replays every decision.
Been building AI agents for about a year now and the thing that always drove me crazy is you deploy an agent, it runs for hours, and you have absolutely no idea what it did. The logs say "task complete" 47 times but did it actually do 4…