Large language AI models are notorious for crashing or slowing down during lengthy human-AI dialogues, posing a major barrier to the effective use of chatbots in many applications. Now, a team of researchers from MIT and other institutions propose a novel solution - by modifying the key-value cache, or the 'conversation memory', they improved the…
MIT researchers have developed a new tiny cryptographic ID tag with revolutionary terahertz technology, making it smaller, cheaper, and more secure than traditional radio frequency tags (RFIDs). The latter are often attached to products to verify authenticity but can easily be compromised by counterfeiters who remove and reattach them to fake products. To combat this,…
Researchers from Anthropic have successfully identified millions of concepts within an advanced large language model (LLM), Claude Sonnet. The knowledge structure of AI models is often likened to a ‘black box,’ emphasizing the mystery behind their internal workings. This complex model architecture makes identifying individual concepts challenging, a problem Anthropic addressed using a technique called…
Prometheus-Eval is an innovative repository that offers tools for training, evaluating, and using Natural Language Processing (NLP) models. Developed by researchers from several institutes including the KAIST AI, MIT, and the University of Illinois Chicago, the tool is particularly adept at evaluating other language models. Using the Prometheus-eval Python package, users can effectively evaluate instruction-response…
Hugging Face has released LeRobot, an open-source machine learning model developed specifically for use in practical robotics. LeRobot is aimed at increasing the usability and accessibility of robots across a wide range of users and is based on the PyTorch platform. It is designed to merge advanced methods with practical applications in real-world settings, with…
MIT researchers have discovered a fault in the design of language machine-learning models that can cause AI chatbots' performance to drastically deteriorate during lengthy conversations. Essentially, when data stored in a chatbot's "memory" (known as the key-value cache) exceeds its capacity, the earliest data is removed, sometimes causing the chatbot to malfunction or slow down.…
Researchers at MIT have developed a cryptographic ID tag that is significantly smaller, cheaper, and more secure than traditional radio frequency identification (RFID) tags. The new tag is based on the use of terahertz waves which are smaller and have much higher frequencies than radio waves.
The innovation overcomes a major security flaw common with…
Researchers from MIT and other institutions have developed a method that enables a chatbot to carry on unbroken conversation without crashing or losing performance. This method, named StreamingLLM, involves a tweak to the key-value cache, a form of "conversation memory", that helps AI operate. The team found when the cache became too full, the first…
MIT researchers have created a secure and portable ID tag, utilizing terahertz waves, a technology considered superior compared to traditional RFIDs due to the improved security it offers. The team has bypassed the significant vulnerability of both RFID and terahertz tags, where a counterfeiter can remove the tag from an original item and transfer it…
Researchers at The University of Texas at Austin have devised a new framework for training Artificial Intelligence (AI) models using severely distorted images. This novel technique, named Ambient Diffusion, allows AI to generate ideas from images without directly reproducing them—an issue which poses the risk of copyright violation in prevalent AI models. Traditional AI models,…
Reinforcement learning (RL) is a method of machine learning where agents are trained to make decisions by interacting with their environment. This interaction involves taking action and receiving feedback via rewards or penalties. RL has been crucial in developing complex technologies such as advanced robotics, autonomous vehicles, and strategic game-playing mechanisms and has been instrumental…