Large language models (LLMs) are crucial in the field of natural language processing (NLP). However, their performance in tasks requiring visual and spatial reasoning is generally poor. Researchers from Columbia University have proposed a new approach to tackle this issue. Their method, called Whiteboard-of-Thought (WoT) prompting, aims to enhance the visual reasoning abilities of multimodal…
