Skip to main content

Module 6 - LLM Evaluation

Benchmarks, human evaluation, LLM-as-judge, hallucination detection, and production quality metrics.