Natural Language Processing (NLP) faces major challenges in addressing the limitations of decoder-only Transformers, which are the backbone of large language models (LLMs). These models contend with issues like representational collapse and over-squashing, which severely hinder their functionality. Representational collapse happens when different sequences produce nearly the same results, while over-squashing occurs when the model…
In 2010, while studying at MIT Media Lab, Karthik Dinakar and Birago Jones developed a tool to assist in content moderation for social media platforms like Twitter and YouTube. The project, aimed at identifying concerning posts and potential cyberbullying, sparked enough interest to receive an invitation to a cyberbullying summit at the White House. However,…
MIT researchers have designed an artificial intelligence solution to help robotic warehouses operate more efficiently. Automated warehouses, which employ hundreds of robots to pick and deliver goods, are becoming more commonplace, especially in industries such as e-commerce and automotive production. However, coordinating this robot workforce to avoid collisions, while also maintaining a high operational pace,…
This paper delves into the realm of uncertainty quantification in large language models (LLMs), aiming to pinpoint scenarios where uncertainty in responses to queries is significant. The study delves into both epistemic and aleatoric uncertainties. Epistemic uncertainty arises from inadequate knowledge or data about reality, while aleatoric uncertainty originates from inherent randomness in prediction problems.…
Machine learning (ML) has been instrumental in advancing healthcare, especially in the realm of medical imaging. However, current models often fall short in explaining how visual changes impact ML decisions, creating a need for transparent models that not only classify medical imagery accurately but also elucidate the signals and patterns they learn. Google's new framework,…
Researchers from Exscientia and the University of Oxford have developed an advanced predictive model called ABodyBuilder3 for antibody structures. This new tool is key for creating monoclonal antibodies, which are integral in immune responses and therapeutic applications. The novel model improves upon the previous ABodyBuilder2 by enhancing the accuracy of predicting Complementarity Determining Region (CDR)…
As companies are becoming increasingly dependent on Artificial Intelligence (AI) for efficiency, automation, and customization, learning AI has become pivotal. However, not everyone is an expert in the domain. Salesforce offers a series of short AI-training courses on its Trailhead platform to equip individuals with essential AI skills, promoting them towards new opportunities and career…
Founded by MIT graduates Karthik Dinakar and Birago Jones, Pienso is a platform that allows non-specialists to build machine-learning models to address societal issues. The idea stemmed from their 2010 study at MIT's Media Lab, where they developed a content moderation tool for companies like Twitter and YouTube. The innovation got companies excited and the…
Fusion oncoproteins, proteins formed by chromosome translocations, play a critical role in many cancers, especially those found in children. However, due to their large and disordered structures, they are difficult to target with traditional drug design methods. To tackle this challenge, researchers at Duke University have developed FusOn-pLM, a novel protein language model specifically tailored…
In the Artificial Intelligence (AI) world, the proper selection of Large Language Models (LLMs) is essential for maximizing efficiency and accuracy in various tasks. The following is a guide to choosing LLMs for several AI-related activities based on their specialized capabilities.
For tasks demanding deep comprehension and interpretation of hard documents such as scientific papers,…