Category: AI/LLM
-
Building a Quality Flywheel for LLM Applications
Context Whenever you work on agentic applications you need to be able to navigate non-deterministic behavior by building against samples of positive data. This lets you test whether your application works as expected under known scenarios. This is harder than it sounds. Your initial dataset may not cover cases you…