How to Test Microsoft AutoGen Multi-Agent Systems
Your AutoGen pipeline worked fine in dev. The Planner agent handed off to the Executor, the Executor called the right tool, the Critic reviewed the output, and everything terminated cleanly. Then you deployed it. Now the Executor occasionally ignores the Planner's instructions, the conversation sometimes loops 40 times