Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Jonathan Kwan is an Assistant Professor of Philosophy at New York University Abu Dhabi and was previously the Markkula Center’s Inclusive Excellence Postdoctoral Fellow in Immigration Ethics. Views ...
Sarah D. Sparks is a reporter and data journalist for Education Week who covers the teaching profession and pedagogy for Education Week. She has covered education research and the science of learning ...