Generating funny captions seemed unlikely, so Hessel and his collaborators designed a benchmark challenging models to match a caption to its corresponding illustration, distinguish a finalist from a ...