โThe goal is not to do evals perfectly, it's to actionably improve your product.โ
โEvals is a way to systematically measure and improve an AI application.โ
โI find myself sometimes just encountering arguments on the internet, like this race to eval debates and really think, 'Okay, put myself in their shoes.'โ