Practical guides, real evaluation frameworks, and a growing community for people who test AI systems for a living — not for the leaderboard.
Free rubric templates, a 40-case test library, and a verdict framework — everything you need to run your first structured human evaluation this week.
Get free access →I'm an independent AI evaluation consultant. I help teams understand how their models behave in real-world use through structured human testing — and I started AI Testing Lab to share what that work actually looks like.
Everything here comes from real engagements: the rubrics that worked, the failure modes that surprised everyone, and the methods that turn "it feels off" into something you can fix.
Read my essays on Medium →Peer reviews of your eval setups, shared test libraries, live sessions, and honest conversations about what actually breaks. Join the early list — founding members get in first, free.