LITMUS_

Turn customer feedback into tests

> Customers flag issues in your bot

> Litmus auto-generates test cases

> Tests sync to Braintrust & LangSmith

> Your evals stay current with reality

EVAL RUNNER

> Processing customer feedback from bot

> Tests generated: 0

> Issues flagged & added to eval set: 0

Tests run: 0Pass rate: 0%

EXPORT_

Integrates with your eval tools

> Push tests to Braintrust & LangSmith

> One-click setup, auto-sync forever

> Your eval tools stay current

EXPORT ENGINE

> Syncing to eval platforms

> Exported: 0

In queue: 0Success rate: 100%

AUTO-CLEAN_

Smart dataset maintenance

> Auto-remove outdated tests

> Detect when features change

> Keep only what's relevant

AUTO-CLEANUP

> Analyzing eval relevance

> Kept: 0 | > Removed: 0

Active evals: 0Cleanup rate: 0%

Start testing what matters

Let your customers write your tests.

> Get started with real-world evals in minutes