Recent advancements in Large Language Models (LLMs) have seen impressive accomplishments in various tasks, such as question-answering, captioning, and segmentation, thanks to their integration with visual encoders for multimodal tasks. However, these LLMs, despite their prowess, face limitations when dealing with video inputs due to their context length and constraints with GPU memory. Existing models…
Justin Solomon, based at MIT's Department of Electrical Engineering and Computer Science and a member of the Computer Science and Artificial Intelligence Laboratory, is using modern geometric techniques to solve a wide range of mathematical and AI problems. Drawing on the principles of ancient geometry, Solomon's work has applications from autonomous vehicles identifying pedestrians using…
Over 2,000 years ago, Greek mathematician Euclid, often called the father of geometry, revolutionized the understanding of shapes. In today's technological era, a 21st-century geometer, Justin Solomon, uses sophisticated techniques to solve complex problems related to shapes but often unrelated to them. Solomon applies geometry to study datasets for comparing their effectiveness in machine learning…
In the field of Artificial Intelligence (AI), "zero-shot" capabilities refer to the ability of an AI system to recognize any object, comprehend any text, and generate realistic images without being explicitly trained on those concepts. Companies like Google and OpenAI have made advances in multi-modal AI models such as CLIP and DALL-E, which perform well…
Over 2,000 years ago, Euclid, the Greek mathematician, laid the foundation of geometry and altered our perception of shapes. Justin Solomon, inspired by Euclid's work, applies modern geometric techniques to resolve challenging problems that may not appear related to shapes. As an associate professor at the MIT Department of Electrical Engineering and Computer Science and…
The field of semantic segmentation in artificial intelligence (AI) has seen significant progress, but it still faces distinct challenges, especially imaging in problematic conditions such as poor lighting or obstructions. To help bridge these gaps, researchers are looking into various multi-modal semantic segmentation techniques that combine traditional visual data with additional information sources like thermal…
Justin Solomon, an Associate Professor in the Massachusetts Institute of Technology (MIT)'s Department of Electrical Engineering and Computer Science (EECS) and member of the Computer Science and Artificial Intelligence Laboratory (CSAIL), is leveraging geometric techniques to tackle complex problems in data science. Quite often, these problems are seemingly unrelated to shapes. For example, when a…
Latent diffusion models (LDMs) are at the forefront of the rapid advancements in image generation. Despite their ability to generate incredibly realistic and detailed images, they often struggle with efficiency. The quality images they create necessitate several steps and can slow down the process, limiting their utility in real-time applications. Consequently, researchers are relentlessly exploring…
Using the principles of geometry, Justin Solomon, an associate professor in MIT's Department of Electrical Engineering and Computer Science, is tackling complex problems in data science and computer graphics. Building on Euclid’s ancient foundations of geometry, Solomon is leveraging geometric techniques to solve problems that are seemingly unrelated to shapes. He asserts that the language…
