Skip to content Skip to sidebar Skip to footer

Computer vision

Introducing Gen4Gen: A Partially Automated Process for Creating Datasets Utilizing Generative Models

Text-to-image diffusion models are arguably some of the greatest advancements in Artificial Intelligence (AI). However, personalizing these models with diverse concepts has proven challenging due to issues predominantly rooted in mismatches between the simplified text descriptions of pre-training datasets and the complexities of real-world scenarios. One significant hurdle in the field is the absence of…

Read More

A technology expert in computing is extending the limits of geometrical studies.

Justin Solomon, an associate professor in the Department of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology (MIT), is making use of modern geometric techniques to address intricate problems, many of which don't appear to be linked to shapes. He extrapolates from the foundations laid more than 2,000 years ago by the…

Read More

The unseen hurdle of today’s AI: Accuracy in image identification.

MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) researchers, in collaboration with the MIT-IBM Watson AI Lab, have developed a new metric, the "minimum viewing time" (MVT), to measure the difficulty of recognizing an image. The researchers aimed to close the gap between the performance of deep learning-based AI models and humans in recognizing and…

Read More

A versatile approach to assist artists in enhancing animation.

MIT researchers have developed a new tool that provides better control to animators in shaping their characters. The new technique works by generating mathematical functions, known as barycentric coordinates, that describe how 2D and 3D shapes in animations can move, stretch, and deform in space. By using these functions, an animator can tailor the movement…

Read More

Numerous AI systems assist robots in carrying out intricate strategies with greater clarity.

MIT's Improbable AI Lab has developed a novel multimodal framework for artificial intelligence (AI) called the Compositional Foundation Models for Hierarchical Planning (HiP). The aim of this system is to help robots conduct complex tasks that involve numerous smaller steps, from household chores to more elaborate industrial processes. Traditionally, AI systems have required paired visual,…

Read More

Logical thinking and dependability in Artificial Intelligence

MIT PhD students interning at the MIT-IBM Watson AI Lab are researching ways to improve the efficiency and accuracy of AI systems in understanding and communicating through natural language. The team, including Athul Paul Jacob, Maohao Shen, Victor Butoi, and Andi Peng, aims to enhance each stage of the process involving natural language models, from…

Read More