Algorithms, Artificial Intelligence, Computer Science and Artificial Intelligence Laboratory (CSAIL), Computer science and technology, Defense Advanced Research Projects Agency (DARPA), Electrical Engineering & Computer Science (eecs), Human-computer interaction, Machine learning, MIT Schwarzman College of Computing, MIT-IBM Watson AI Lab, Research, School of Engineering, UncategorizedJuly 17, 202437Views0Likes0Comments
Artificial intelligence (AI) advancements have led to the creation of large language models, like those used in AI chatbots. These models learn and generate responses through exposure to substantial data inputs, opening the potential for unsafe or undesirable outputs. One current solution is "red-teaming" where human testers generate potentially toxic prompts to train chatbots to…
Methods for evaluating the dependability of a multi-functional AI model prior to its implementation.
Foundation models, or large-scale deep-learning models, are becoming increasingly prevalent, particularly in powering prominent AI services such as DALL-E, or ChatGPT. These models are trained on huge quantities of general-purpose, unlabeled data, which is then repurposed for various uses, such as image generation or customer service tasks. However, the complex nature of these AI tools…
MIT researchers have developed a technique for improving the accuracy of uncertainty estimates in machine-learning models. This is especially important in situations where these models are used for critical tasks such as diagnosing diseases from medical imaging or filtering job applications. The new method works more efficiently and is scalable enough to apply to large…
A group of New England Innovation Academy students have developed a mobile app that highlights deforestation trends in Massachusetts as part of a project for the Day of AI, a curriculum developed by the MIT Responsible AI for Social Empowerment and Education (RAISE) initiative. The TreeSavers app aims to educate users about the effects of…
As robots are increasingly being deployed for complex household tasks, engineers at MIT are trying to equip them with common-sense knowledge allowing them to swiftly adapt when faced with disruptions. A newly developed method by the researchers merges robot motion data and common-sense knowledge from extensive language models (LLMs).
The new approach allows a robot to…
Large language models (LLMs), such as those which power AI chatbots like ChatGPT, are highly complex. While these powerful tools are used in diverse applications like customer support, code generation, and language translation, they remain somewhat of a mystery to the scientists who work with them. To develop a deeper understanding of their inner workings,…
Large language models (LLMs) that power artificial intelligence chatbots like ChatGPT are extremely complex and their functioning isn't fully understood. These LLMs are used in a variety of areas such as customer support, code generation and language translation. However, researchers from MIT and other institutions have made strides in understanding how these models retrieve stored…
Tomás Vega, MIT alum and CEO of Augmental, found technology a great equalizer when he dealt with a stuttering disorder at a young age. Vega began programming at age 12 and continued to use technology to augment human abilities throughout high school and college. He made it his mission to help people with disabilities live…
When engaging in continuous dialogues, powerful language machine-learning models that drive chatbot technologies such as ChatGPT can struggle to cope, often leading to a decline in performance. Now, a team of researchers from MIT and elsewhere believe they have found a solution to this issue, which ensures chatbots can continue a conversation without crashing or…