teaching_llm_agents

Evals

OpenAI

Rethinking evals, software engineering, and testing