OpenEvalOps

Customer Support Bot v2

End-to-end evaluation for the customer support chatbot including policy compliance, accuracy, and faithfulness checks.

Status

PASS

Cases

128

Visibility

Public

Baseline

run-001

Target Setup

OpenAI
Auth: Server KeyModel: gpt-4oHealth: Not Tested

Thresholds

Pass Rate Min90%
Faithfulness Min0.85
PII Max0
Secrets Max0
Jailbreak Max0

Policy Configuration

PIIBlocking
SecretsBlocking
JailbreakBlocking
ToxicityBlocking

Case Uploads

StatusFileDate
SUCCESSsupport_cases_batch_1.csvDec 5, 2025

Recent Runs

ResultRun IDStarted
PASSrun-0012d ago
PASSrun-0023d ago
WARNrun-0096d ago