Administration, Aeronautical and astronautical engineering, Alumni/ae, Artificial Intelligence, Computer Science and Artificial Intelligence Laboratory (CSAIL), Electrical Engineering & Computer Science (EECS), Faculty, Leadership, MIT Schwarzman College of Computing, Robotics, School of Engineering, Uncategorized
July 28, 2024
Large Language Models (LLMs) face several deployment challenges including latency issues triggered by memory bandwidth constraints. To mitigate such problems, researchers have resorted to applying weight-only quantization, a technique that compresses the parameters of LLMs to lower precision. Nevertheless, to effectively implement weight-only quantization, it is necessary to employ mixed-type matrix-multiply kernels that can manage,…
In the dynamic and complex field of robotics, decision-making often involves managing continuous action spaces and processing high volumes of data. This scenario demands sophisticated methodologies to handle the information efficiently and translate it into meaningful action. To address this challenge, researchers from the University of Maryland, College Park, and Microsoft Research have proposed a…
Artificial intelligence (AI) is continually evolving, and one significant challenge is creating systems that can collaborate effectively in dynamic environments. One area of focus in this regard is multi-agent reinforcement learning (MARL), which aims to teach agents to interact and adapt in these settings. However, these methods struggle with complexity and adaptability, especially…
Deep learning has proven exceptionally effective across a wide range of scientific fields and applications. However, these models often have many parameters and require substantial computational power for training and testing. The improvement of these models has been a primary focus of advancement in the field,…
Language models have grown tremendously thanks to transformative scaling efforts and applications such as OpenAI's GPT series. Innovations like Transformer-XL have broadened context windows, while models like Mistral, Falcon, Yi, DeepSeek, DBRX, and Gemini have extended the reach of these capabilities. In parallel, visual language models (VLMs) have also seen similar…
At the Data + AI Summit 2024, Databricks unveiled the public preview of the Mosaic AI Agent Framework and Agent Evaluation, aimed at helping developers build and deploy superior Agentic and Retrieval Augmented Generation (RAG) applications on the Databricks Data Intelligence Platform.
Building quality generative AI applications poses distinct challenges for developers, such as selecting the…
Google DeepMind's AI systems AlphaProof and AlphaGeometry 2 have achieved a silver-medal-level score at the 2024 International Mathematical Olympiad (IMO), a highly prestigious competition for budding mathematicians worldwide. Measured against 609 contestants, the AI systems ranked among the top 58 by solving four of the six difficult math problems, earning 28…
Researchers at MIT and the University of Washington have developed a computational model that can predict an intelligent agent's behaviors based on its "inference budget" (i.e. the limits on its computational resources). This was accomplished by using an algorithm that recorded all the decisions made by the agent within a given period of time. They…
Health-monitoring applications have become pivotal in managing chronic diseases and tracking fitness goals, thanks largely to the advent of machine-learning-powered tools. However, these applications are often slow and energy-inefficient, mainly because the massive machine-learning models must be shuttled between a smartphone and a central memory server. Despite the development of machine-learning accelerators that…
Julie Shah, a renowned leader in the field of aeronautics and astronautics, has been named the new head of the Department of Aeronautics and Astronautics (AeroAstro) at the Massachusetts Institute of Technology (MIT). The appointment, effective May 1, was lauded by MIT's chief innovation and strategy officer, Anantha Chandrakasan, who highlighted Shah's substantial technical…