Getting models to generate funny captions seemed unlikely, so Hessel and his collaborators designed a benchmark challenging models to match a ...