Skip to content Skip to sidebar Skip to footer

Computer vision

A computer expert is advancing the limits of geometry.

Over 2,000 years ago, Euclid, a Greek mathematician often referred to as the father of geometry, fundamentally transformed our understanding of shapes. Today, Justin Solomon, a professor at MIT's Department of Electrical Engineering and Computer Science and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL), employs contemporary geometric approaches to tackle complex…

Read More

A new architecture named ViTAR (Vision Transformer with Any Resolution) is introduced in this AI research paper from China.

The Transformer architecture has been highly beneficial in natural language processing (NLP) sparking an increased interest in its application within the computer vision (CV) community. Vision Transformers (ViTs), which apply the Transformer's architecture to vision tasks, have shown great promise across a variety of applications including image classification, object detection, and video recognition. However, ViTs…

Read More

A computer scientist is advancing the limits of geometry.

Over 2000 years since Greek mathematician Euclid revolutionized geometry, Justin Solomon of MIT's Electrical Engineering and Computer Science Department and the Computer Science and Artificial Intelligence Laboratory, is using modern geometric techniques to solve complex problems in various fields. From machine learning to autonomous vehicle recognition systems, Solomon integrates geometry with statistical research for groundbreaking…

Read More

Neural Network with Condition Awareness (CAN): A Novel AI Technique for Incorporating Control into Image-Creating Models

A new method for manipulating and improving control levels in image generative models has been introduced by researchers from MIT, Tsinghua University, and NVIDIA. The technique, known as Condition-Aware Neural Network (CAN), enhances the image generation process by variably adjusting the neural network's weight. This is achieved via a condition-aware weight generation module which generates…

Read More

A computer scientist pioneers advancements in the field of geometry.

Justin Solomon, an associate professor in the MIT Department of Electrical Engineering and Computer Science (EECS) and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL), is employing modern techniques based on ancient geometric principles to solve a wide array of complex problems, specifically in the field of data science. Drawing upon the geometric…

Read More

This AI Article Presents a New and Crucial Test for Vision Language Models (VLMs) Named Intractable Problem Detection (UPD)

The fast-paced evolution of artificial intelligence, particularly Vision Language Models (VLMs), presents challenges in ensuring their reliability and trustworthiness. These VLMs integrate visual and textual understanding, however, their increasing sophistication has brought into focus their ability to detect and not respond to unsolvable or irrelevant questions— an aspect known as Unsolvable Problem Detection (UPD). UPD…

Read More

Apple’s ReALM perceives visuals displayed on the screen more effectively than GPT-4.

Apple engineers have developed an artificial intelligence (AI) system capable of better understanding and responding to contextual references in user interactions. This new development could possibly enhance on-device virtual assistants, making them more efficient and responsive. Understanding references within a conversation comes naturally to humans. Phrases such as "the bottom one" or "him" are easily…

Read More

A computer scientist is pushing the limits in the field of geometry.

Justin Solomon, an Associate Professor in the MIT Department of Electrical Engineering and Computer Science, is revolutionizing geometry by applying it to myriad cutting-edge fields including machine learning, computer graphics, and artificial intelligence (AI). Drawing on the ancient geometric principles of Greek mathematician Euclid, Solomon explores how geometric structures could be used to understand and…

Read More

A computer scientist is expanding the limits of geometry.

More than two millennia ago, Greek mathematician Euclid, seen by many as the originator of geometry, significantly influenced our comprehension of shapes. Fast forward to today, Justin Solomon, using advanced geometric techniques, is resolving challenging issues that appear unrelated to shapes on the surface. Solomon, an associate professor in the MIT Department of Electrical Engineering…

Read More