OpenEvalOps

Run run-008

Summarization — Earnings Reports — main — gpt-4o

Result

PASSManual

Total Cases

96

Pass Rate

94.8%

Failed

5

Violations

0

Deltas vs Baseline

Accuracy+2.0%
Faithfulness+0.0
PII0.0
Jailbreak0.0
StartedDec 9, 9:00 AM
FinishedDec 9, 9:40 AM
Modelgpt-4o
Executionsimulated
TargetOpenAI (Server Key)

Suite Upload Batches

No upload batches are linked to this suite.

Case Results (0 total)

No case results available for this filter