Customer Support Bot v2
End-to-end evaluation for the customer support chatbot including policy compliance, accuracy, and faithfulness checks.
Status
PASSCases
128
Visibility
Public
Baseline
run-001
Target Setup
OpenAI
Auth: Server KeyModel: gpt-4oHealth: Not Tested
Thresholds
Pass Rate Min90%
Faithfulness Min0.85
PII Max0
Secrets Max0
Jailbreak Max0
Policy Configuration
PIIBlocking
SecretsBlocking
JailbreakBlocking
ToxicityBlocking
Case Uploads
| Status | File | Date |
|---|---|---|
| SUCCESS | support_cases_batch_1.csv | Dec 5, 2025 |