Researchers at the Massachusetts Institute of Technology (MIT) have developed an image dataset that simulates peripheral vision in artificial intelligence (AI) models. The work aims to help such models detect approaching hazards more effectively and to predict whether a human driver would notice an oncoming object.
Peripheral vision in humans allows us to see…
The recent misuse of audio deepfakes, including a robocall in New Hampshire impersonating Joe Biden and spear-phishing campaigns, has prompted questions about the ethical considerations and potential benefits of this emerging technology. Nauman Dawalatabad, a postdoctoral researcher, discussed these concerns in a Q&A prepared for MIT News.
According to Dawalatabad, the attempt to obscure…
The use of artificial intelligence (AI) in healthcare dates back to the 1970s, but its transformative potential has only been widely recognized in the last decade. Numerous players in the healthcare sector are expanding the scope and capabilities of AI. Despite the promising advances, observers are questioning the true impact…
Artificial intelligence (AI) systems built on large language models (LLMs) have made major strides in several sophisticated applications, yet they struggle with tasks that require complex, multi-step reasoning, such as solving mathematical problems. Improving their reasoning abilities is vital to their performance on such tasks. LLMs often fail when dealing with tasks requiring logical steps and intermediate-step…
Robotic manipulation policies are currently limited by their inability to extrapolate beyond their training data. While these policies can adapt to new situations, such as different object positions or lighting, they struggle with unfamiliar objects or tasks, and require assistance to process unseen instructions.
Promisingly, vision and language foundation models such as CLIP, SigLIP, and Llama…
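To give a sense of why such foundation models are promising for recognizing unfamiliar objects, the sketch below shows the standard zero-shot image-text matching that CLIP-style models provide, using the Hugging Face transformers API. The image file and candidate labels are hypothetical; this is an illustration, not code from the work described above.

```python
# Minimal zero-shot image-text matching with CLIP (illustrative only).
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("scene.jpg")  # hypothetical photo of a tabletop scene
candidate_labels = ["a photo of a coffee mug", "a photo of a screwdriver"]

inputs = processor(text=candidate_labels, images=image,
                   return_tensors="pt", padding=True)
outputs = model(**inputs)

# logits_per_image holds image-text similarity scores; softmax turns them
# into a distribution over the candidate labels.
probs = outputs.logits_per_image.softmax(dim=-1)
print(dict(zip(candidate_labels, probs[0].tolist())))
```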
The evaluation of Large Language Models (LLMs) requires a systematic and multi-layered approach to accurately identify limitations and areas for improvement. As these models advance and become more intricate, their assessment becomes more challenging due to the diversity of tasks they are required to execute. Current benchmarks often employ imprecise, simplistic criteria such as "helpfulness"…
Machine unlearning refers to the efficient removal of specific training data's influence from a trained AI model. It addresses legal, privacy, and safety issues arising from large, data-dependent AI models. The primary challenge is to remove that data's influence without the expensive and time-consuming step of retraining the model from scratch, especially for complex deep neural…
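The excerpt does not name a particular algorithm, but one common approximate-unlearning baseline is to take gradient-ascent steps on the data to be forgotten instead of retraining from scratch. A minimal PyTorch sketch of that idea, assuming a generic classifier and a hypothetical `forget_loader` over the data to remove:

```python
import torch
import torch.nn.functional as F

def unlearn_by_gradient_ascent(model, forget_loader, lr=1e-4, steps=1):
    """Approximate unlearning baseline: push the model *away* from the
    forget set by ascending (negating) its loss. Illustrative only."""
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)
    model.train()
    for _ in range(steps):
        for inputs, targets in forget_loader:
            optimizer.zero_grad()
            loss = F.cross_entropy(model(inputs), targets)
            # Negate the loss so the optimizer *increases* error on the
            # forget set, weakening the influence of that data.
            (-loss).backward()
            optimizer.step()
    return model
```

In practice such a step is usually paired with continued training on retained data so that overall accuracy does not collapse.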
The Allen Institute for AI has recently launched the Tulu 2.5 suite, a notable advance in model training that employs Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO). The suite encompasses an array of models trained on several datasets to strengthen their reward and value models, with the goal of significantly enhancing…
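For context on what DPO training optimizes: the core loss compares the policy's and a frozen reference model's log-probabilities on paired preferred/rejected responses. A minimal sketch of that loss (an illustration, not code from the Tulu 2.5 release):

```python
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Direct Preference Optimization loss for a batch of preference pairs.

    Each argument is a tensor of summed token log-probabilities of the
    chosen/rejected response under the policy or the frozen reference model.
    """
    # Log-ratio of policy to reference for each response.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # Encourage the policy to prefer the chosen response more strongly
    # than the reference model does.
    logits = beta * (chosen_logratio - rejected_logratio)
    return -F.logsigmoid(logits).mean()
```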
DeepMind researchers have presented TransNAR, a new hybrid architecture that pairs the language comprehension capabilities of Transformers with the robust algorithmic abilities of pre-trained graph neural networks (GNNs) known as neural algorithmic reasoners (NARs). This combination is designed to enhance the reasoning capabilities of language models while preserving their generalization capacities.
The routine issue faced by…
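As a loose illustration of the hybrid idea, in which language tokens attend to embeddings produced by a pretrained graph-based reasoner, the sketch below uses a single cross-attention block. It is a simplified stand-in under that assumption, not the TransNAR architecture itself.

```python
import torch.nn as nn

class TokenToNodeCrossAttention(nn.Module):
    """Illustrative block: language-token embeddings attend to node
    embeddings produced by a pretrained graph-based reasoner."""

    def __init__(self, dim=256, num_heads=4):
        super().__init__()
        self.cross_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, token_embeddings, node_embeddings):
        # Queries come from the Transformer's token stream; keys and values
        # come from the neural algorithmic reasoner's node embeddings.
        attended, _ = self.cross_attn(token_embeddings,
                                      node_embeddings, node_embeddings)
        return self.norm(token_embeddings + attended)
```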
Machine learning is a crucial domain where differential privacy (DP) and selective classification (SC) play pivotal roles in safeguarding sensitive data. DP adds random noise to protect individual privacy while retaining the overall utility of the data; SC abstains from making predictions in cases of uncertainty to enhance model reliability. These components…
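A minimal sketch of the selective-classification side of this pairing: the model abstains whenever its top predicted probability falls below a confidence threshold. The threshold and toy probabilities below are illustrative.

```python
import numpy as np

def selective_predict(probabilities, threshold=0.9):
    """Selective classification: return a class index when the model is
    confident enough, otherwise abstain (return None)."""
    predictions = []
    for p in probabilities:
        if p.max() >= threshold:
            predictions.append(int(p.argmax()))
        else:
            predictions.append(None)  # abstain rather than risk an error
    return predictions

# Toy example: the second input is too uncertain, so the model abstains.
probs = np.array([[0.97, 0.03],
                  [0.55, 0.45]])
print(selective_predict(probs))  # [0, None]
```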
Generative AI, renowned for its capability to autonomously produce text and images, plays a crucial role in creating realistic synthetic data for diverse scenarios, helping organizations optimize operations. A notable initiative in the field is the Synthetic Data Vault (SDV), developed by DataCebo, an MIT spinoff. This generative system aids organizations in creating synthetic data…
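As a rough illustration of how the open-source SDV library is typically used, the sketch below fits a synthesizer on a small real table and samples synthetic rows. The example data is made up, and the class names follow SDV's 1.x API, which may differ across versions.

```python
import pandas as pd
from sdv.metadata import SingleTableMetadata
from sdv.single_table import GaussianCopulaSynthesizer

# A small real table to imitate (hypothetical example data).
real_data = pd.DataFrame({
    "age": [34, 45, 23, 51, 38],
    "income": [52000, 87000, 31000, 94000, 61000],
    "segment": ["A", "B", "A", "C", "B"],
})

# Describe the table, fit a synthesizer, and sample synthetic rows.
metadata = SingleTableMetadata()
metadata.detect_from_dataframe(data=real_data)

synthesizer = GaussianCopulaSynthesizer(metadata)
synthesizer.fit(real_data)
synthetic_data = synthesizer.sample(num_rows=100)
print(synthetic_data.head())
```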