HyperAIHyperAI

Command Palette

Search for a command to run...

Performance results of various models on this benchmark

Metrics

Exact Match Accuracy (Dev)
Exact Match Accuracy (Test)
Execution Accuracy (Test)
20 rows total