Talk

How to Automate AI Agent Testing: Benchmarks, Metrics, LLM-as-a-Judge

In Russian

LLM-as-a-Judge in AI agent testing automation — let's discuss why this practice is vital, which models to choose, why LLM-as-a-Judge will deceive you with its simplicity, and how to start writing regression automation for your AI agents.

Speakers

Talks