Skip to content Skip to sidebar Skip to footer

Machine learning

A novel approach allows AI chatbots to engage in conversation throughout the day without experiencing a system failure.

Large language AI models are notorious for crashing or slowing down during lengthy human-AI dialogues, posing a major barrier to the effective use of chatbots in many applications. Now, a team of researchers from MIT and other institutions propose a novel solution - by modifying the key-value cache, or the 'conversation memory', they improved the…

Read More

This small, secure identification label can verify nearly everything.

MIT researchers have developed a new tiny cryptographic ID tag with revolutionary terahertz technology, making it smaller, cheaper, and more secure than traditional radio frequency tags (RFIDs). The latter are often attached to products to verify authenticity but can easily be compromised by counterfeiters who remove and reattach them to fake products. To combat this,…

Read More

Unraveling the AI psyche: Anthropomorphic scientists delve into the “mysterious box”

Researchers from Anthropic have successfully identified millions of concepts within an advanced large language model (LLM), Claude Sonnet. The knowledge structure of AI models is often likened to a ‘black box,’ emphasizing the mystery behind their internal workings. This complex model architecture makes identifying individual concepts challenging, a problem Anthropic addressed using a technique called…

Read More

Prometheus-Eval and Prometheus 2: Raising the Bar in LLM Evaluation and Open-Source Creativity with Cutting-Edge Evaluator Language Model

Prometheus-Eval is an innovative repository that offers tools for training, evaluating, and using Natural Language Processing (NLP) models. Developed by researchers from several institutes including the KAIST AI, MIT, and the University of Illinois Chicago, the tool is particularly adept at evaluating other language models. Using the Prometheus-eval Python package, users can effectively evaluate instruction-response…

Read More

Hugging Face unveiled LeRobot: this open-source robotics-focused machine learning (ML) model is their latest public offering.

Hugging Face has released LeRobot, an open-source machine learning model developed specifically for use in practical robotics. LeRobot is aimed at increasing the usability and accessibility of robots across a wide range of users and is based on the PyTorch platform. It is designed to merge advanced methods with practical applications in real-world settings, with…

Read More

A novel method allows AI chatbots to engage in conversation continuously without breaking down.

MIT researchers have discovered a fault in the design of language machine-learning models that can cause AI chatbots' performance to drastically deteriorate during lengthy conversations. Essentially, when data stored in a chatbot's "memory" (known as the key-value cache) exceeds its capacity, the earliest data is removed, sometimes causing the chatbot to malfunction or slow down.…

Read More

This small, secure identification label can verify almost anything.

Researchers at MIT have developed a cryptographic ID tag that is significantly smaller, cheaper, and more secure than traditional radio frequency identification (RFID) tags. The new tag is based on the use of terahertz waves which are smaller and have much higher frequencies than radio waves. The innovation overcomes a major security flaw common with…

Read More

An innovative approach allows AI chatbots to engage in conversation throughout the day without experiencing a system failure.

Researchers from MIT and other institutions have developed a method that enables a chatbot to carry on unbroken conversation without crashing or losing performance. This method, named StreamingLLM, involves a tweak to the key-value cache, a form of "conversation memory", that helps AI operate. The team found when the cache became too full, the first…

Read More

This small, secure identification label can verify nearly everything.

MIT researchers have created a secure and portable ID tag, utilizing terahertz waves, a technology considered superior compared to traditional RFIDs due to the improved security it offers. The team has bypassed the significant vulnerability of both RFID and terahertz tags, where a counterfeiter can remove the tag from an original item and transfer it…

Read More

Researchers teach a model to generate images, exposing it only to non-copyrighted content.

Researchers at The University of Texas at Austin have devised a new framework for training Artificial Intelligence (AI) models using severely distorted images. This novel technique, named Ambient Diffusion, allows AI to generate ideas from images without directly reproducing them—an issue which poses the risk of copyright violation in prevalent AI models. Traditional AI models,…

Read More

KAUST and Purdue University’s AI Paper introduces effective likelihood methods for vast discrete activity areas.

Reinforcement learning (RL) is a method of machine learning where agents are trained to make decisions by interacting with their environment. This interaction involves taking action and receiving feedback via rewards or penalties. RL has been crucial in developing complex technologies such as advanced robotics, autonomous vehicles, and strategic game-playing mechanisms and has been instrumental…

Read More