LITMUS_

Turn customer feedback into tests

> Customers flag issues in your bot

> Litmus auto-generates test cases

> Tests sync to Braintrust & LangSmith

> Your evals stay current with reality

EVAL RUNNER
> Processing customer feedback from bot
> Tests generated: 0
> Issues flagged & added to eval set: 0
Tests run: 0Pass rate: 0%

EXPORT_

Integrates with your eval tools

> Push tests to Braintrust & LangSmith

> One-click setup, auto-sync forever

> Your eval tools stay current

Braintrust
LangSmith
EXPORT ENGINE
> Syncing to eval platforms
> Exported: 0
In queue: 0Success rate: 100%

AUTO-CLEAN_

Smart dataset maintenance

> Auto-remove outdated tests

> Detect when features change

> Keep only what's relevant

AUTO-CLEANUP
> Analyzing eval relevance
> Kept: 0 | > Removed: 0
Active evals: 0Cleanup rate: 0%

Start testing what matters

Let your customers write your tests.

> Get started with real-world evals in minutes