Search for a command to run...
ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing