Command Palette
Search for a command to run...
Performance results of various models on this benchmark
Metrics
Consistency
Contextual Understanding
Correctness of Information
Dense Captioning
Detail Orientation
Reasoning
Spatial Understanding
Temporal Understanding
mean
6 rows total