Artificial Intelligence Archives - Page 114 of 233

A novel method has been developed to allow AI chatbots to engage in conversation all day without experiencing any system breakdowns.

Algorithms, Artificial Intelligence, Computer science and technology, Electrical Engineering & Computer Science (eecs), Human-computer interaction, Machine learning, MIT Schwarzman College of Computing, MIT-IBM Watson AI Lab, National Science Foundation (NSF), Research, School of Engineering, UncategorizedMay 26, 202476Views 0Likes 0Comments

A team of researchers from MIT and other institutions has developed a method to stop the performance deterioration in AI language models involved in continuous dialogue, like the AI chatbot, ChatGPT. Named StreamingLLM, the solution revolves around a modification in the machine’s key-value cache, acting as a conversation memory. Conventionally, when the cache overflows, the…

This small, secure identification tag has the ability to verify nearly anything.

Artificial Intelligence, Computer chips, Computer science and technology, Electrical Engineering & Computer Science (eecs), Electronics, Internet of things, Machine learning, MIT Schwarzman College of Computing, National Science Foundation (NSF), Research, Research Laboratory of Electronics, School of Engineering, Sensors, Supply chains, UncategorizedMay 26, 202468Views 0Likes 0Comments

MIT researchers have developed a small, affordable, and secure cryptographic ID tag that improves upon traditional radio frequency identification (RFID) tags by using terahertz waves, which are smaller and have higher frequencies than radio waves. Traditional RFIDs are often attacked by counterfeiters who take them off genuine items and reattach to a fake one; the…

The new method pinpoints medications that should not be combined.

Artificial Intelligence, Computer science and technology, Drug delivery, Machine learning, Mechanical engineering, Medicine, National Institutes of Health (NIH), Research, School of Engineering, UncategorizedMay 26, 202471Views 0Likes 0Comments

Researchers from MIT, Brigham and Women’s Hospital, and Duke University have developed a strategy to identify how different drugs are transported through the digestive tract. This new multipronged strategy combines the use of tissue models and machine learning algorithms to comprehend which transporters help various drugs to pass through the digestive tract. This is an important…

Implementing artificial intelligence for individuals seeking problem-solving solutions.

Algorithms, Alumni/ae, Artificial Intelligence, Data, Innovation and Entrepreneurship (I&E), Machine learning, Media Lab, School of Architecture and Planning, Startups, UncategorizedMay 26, 202468Views 0Likes 0Comments

In 2010, Media Lab students Karthik Dinakar SM ’12, Ph.D.’17, and Birago Jones SM ’12 developed a tool intended to assist content moderation teams for platforms such as Twitter and YouTube. The tool aimed to flag harmful content, with a key focus on posts that could be linked to cyberbullying. The project was warmly received,…

DIAMOND (Dissemination as a Framework of Environmental Dreams): A Training Method for Reinforcement Learning Agents within a Diffusion-Based World Model.

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Machine learning, Staff, Tech News, Technology, UncategorizedMay 25, 202470Views 0Likes 0Comments

Reinforcement Learning (RL) involves learning decision-making through interactions with an environment and has been used effectively in games, robotics, and autonomous systems. RL agents aim to maximize their results and increase their efficiency by improving performance through continually adapting to new data. However, the RL agent's sample inefficiency impedes its practical application by necessitating comprehensive…

Octo: A Publicly-Available, Advanced Transformer-based Universal Robotic Policy, Trained on 800,000 Trajectories from the Open X-Embodiment Dataset

AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Staff, Tech News, Technology, UncategorizedMay 25, 202469Views 0Likes 0Comments

Robotic learning typically involves training datasets tailored to specific robots and tasks, necessitating extensive data collection for each operation. The goal is to create a “general-purpose robot model”, which could control a range of robots using data from previous machines and tasks, ultimately enhancing performance and generalization capabilities. However, these universal models face challenges unique…

AmbientGPT: A Free-to-Use Multi-Functional MacOS Foundation Model GUI

AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Open Source Projects, Staff, Tech News, Technology, UncategorizedMay 25, 202467Views 0Likes 0Comments

Foundation models are powerful tools that have revolutionized the field of AI by providing improved accuracy and complexity in analysis and interpretation of data. These models use large datasets and complex neural networks to execute intricate tasks such as natural language processing and image recognition. However, seamlessly integrating these models into everyday workflows remains a…

Revealing the Concealed Parallelism in Transformer Decoders: Fresh Perspectives for Effective Trimming and Improved Efficiency

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Machine learning, Staff, Tech News, Technology, UncategorizedMay 25, 202473Views 0Likes 0Comments

Researchers from various institutions have recently unveiled a unique linear property of transformer decoders in natural language processing models such as GPT, LLaMA, OPT, and BLOOM. This discovery could have significant implications for future advancements in the field. These researchers discovered that there is a nearly perfect linear relationship in the embedding transformations between sequential…

Improving Safety and Productivity: The Essential Function of AI in Sophisticated Cryptocurrency Systems

AI Shorts, Applications, Artificial Intelligence, Editors Pick, Staff, Tech News, Technology, UncategorizedMay 25, 202464Views 0Likes 0Comments

Since Bitcoin's launch in 2009, artificial intelligence (AI) has played an increasingly essential role in the evolution of cryptocurrency systems, proving instrumental in enhancing security and efficiency. With a wealth of expertise in data analysis, pattern recognition, and predictive modelling, AI is uniquely equipped to address the diverse challenges posed by advanced cryptocurrency systems. One prominent…

Researchers from MIT have suggested a change known as Cross-Layer Attention (CLA) to the Transformer Architecture, which leads to a shrinkage in the Key-Value KV Cache size through an integrated approach to KV activations across different layers.

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Machine learning, Staff, Tech News, Technology, UncategorizedMay 25, 202463Views 0Likes 0Comments

Managing large language models (LLMs) often entails dealing with issues related to the size of key-value (KV) cache, given that it scales with both the sequence length and the batch size. While techniques have been employed to reduce the KV cache size, such as Multi-Query Attention (MQA) and Grouped-Query Attention (GQA), they have only managed…

Researchers from MIT suggest a method called Cross-Layer Attention (CLA), which is a modification of Transformer Architecture aimed at decreasing the size of Key-Value KV cache by distributing KV activations over different layers.

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Machine learning, Staff, Tech News, Technology, UncategorizedMay 25, 202469Views 0Likes 0Comments

MIT researchers have developed a method known as Cross-Layer Attention (CLA) to alleviate the memory footprint bottleneck of the key-value (KV) cache in large language models (LLMs). As more applications demand longer input sequences, the KV cache's memory requirements limit batch sizes and necessitate costly offloading techniques. Additionally, persistently storing and retrieving KV caches to…

A novel method allows AI chatbots to engage in conversations throughout the day without experiencing a system crash.

Algorithms, Artificial Intelligence, Computer science and technology, Electrical Engineering & Computer Science (eecs), Human-computer interaction, Machine learning, MIT Schwarzman College of Computing, MIT-IBM Watson AI Lab, National Science Foundation (NSF), Research, School of Engineering, UncategorizedMay 25, 202460Views 0Likes 0Comments

Researchers from MIT have devised a method called StreamingLLM which enables chatbots to maintain long, uninterrupted dialogues without crashing or performance dips. It involves a modification to the key-value cache at the core of many large language models which serves as a conversation memory, ensuring the initial data points remain present. The method facilitates a…

All Categories

Artificial Intelligence(2794)

Computer science and technology(559)

Data(164)

Electrical Engineering & Computer Science (eecs)(430)

Machine learning(1188)

News(748)

Research(613)

School of Engineering(648)

All Categories

Artificial Intelligence(2794)

Computer science and technology(559)

Data(164)

Electrical Engineering & Computer Science (eecs)(430)

Machine learning(1188)

News(748)

Research(613)

School of Engineering(648)

All Categories

News(748)

Research(613)

School of Engineering(648)

Artificial Intelligence(2794)

Computer science and technology(559)

Data(164)

Electrical Engineering & Computer Science (eecs)(430)

Machine learning(1188)

News(748)

Research(613)

School of Engineering(648)

Artificial Intelligence(2794)

Computer science and technology(559)

Data(164)

All
Categories

All
Categories

All
Categories