Talk
Date: 27.04 / Start: 00:00 – Finish: 00:00

How to Automate AI Agent Testing: Benchmarks, Metrics, LLM-as-a-Judge

AI

Hall 1
In Russian

LLM-as-a-Judge in AI agent testing automation: we'll discuss why this practice is vital, which models to choose for the judge, how LLM-as-a-Judge can deceive you with its apparent simplicity, and how to start writing automated regression tests for your AI agents.

Speakers

Artem Simeshin
Sber
