Researchers from MIT and the University of Washington have developed a model to predict the behavior of human and artificial intelligence (AI) agents, taking into account computational constraints. The model automatically deduces these constraints by processing previous actions of the agent. This "inference budget" can help predict future behavior of the agent; for instance, it…
Researchers at the Massachusetts Institute of Technology (MIT) and the University of Washington have developed a model that accounts for the computational constraints often experienced by decision-making agents, both human and machine. This model auto-infers an agent's computational restrictions by analysing traces of past actions, which, in turn, can be used to predict future behaviour.
In…
Researchers from MIT and the MIT-IBM Watson AI Lab have developed a machine-learning accelerator that provides security against the two most common types of attacks. This chip can keep sensitive data, such as health records or financial information, private while allowing AI models to run efficiently on devices. The increased security doesn't affect the accuracy…
Springtime in the Northern Hemisphere marks the onset of tornado season, and while the dust and debris-filled spiral of a tornado may seem an unmistakable sight, these violent weather phenomena often evade detection until it's too late. Recognizing the need for better ways of predicting these occurrences, researchers at MIT Lincoln Laboratory have compiled a…
Representational similarity measures are essential instruments in machine learning as they facilitate the comparison of internal representations of neural networks, aiding researchers in understanding how various neural network layers and architectures process information. These measures are vital for understanding the performance, behavior, and learning dynamics of a model. However, the development and application of these…
The recent release of Meta’s AI model, Llama 3.1, has made waves in the artificial intelligence community. Particularly notable for its high performance and open-source accessibility, the 405B variant surpasses even top-tier closed models. This article presents ten diverse and innovative applications of Llama 3.1.
One key use of Llama 3.1 405B is in efficient task…
Advancements in Large Language Models (LLMs) have notably benefitted the development of artificial intelligence, particularly in creating agent-based systems. These systems are designed to interact with various environments and carry out actions to meet specific goals. One of the significant challenges includes the creation of elaborate planning environments and tasks, most of which currently rely…
The Argilla team has debuted Magpie-ultra, a cutting-edge dataset used for supervised fine-tuning. The highlight of this release is its 50,000 instruction-response pairs, produced using the sophisticated Llama 3.1 405B-Instruct model, as well as other versions like Llama-Guard-3-8B and Meta-Llama-3.1-8B-Instruct. This synthetic dataset encompasses a variety of tasks like coding, mathematics, data analysis, creative writing,…
Kolmogorov-Arnold Networks (KANs) are a recent development that offer an alternative to Multi-Layer Perceptrons (MLPs) in machine learning. Using the Kolmogorov-Arnold representation theorem, KANs use neurons that carry out simple addition operations. Nonetheless, current models of KANs can pose challenges in real-world application, prompting researchers to explore other multivariate functions that could boost its use…
MIT and University of Washington researchers have created a model to efficiently predict human behavior, which could potentially improve the effectiveness of AI systems working with human collaborators. Humans tend to behave suboptimally when making decisions due to computational constraints and researchers have created this model to account for these human processing limitations. The model…
Researchers from the MIT-IBM Watson AI Lab and MIT have developed a secure machine-learning accelerator that can efficiently run large AI models while protecting user data. The device keeps user medical records, personal finance information, and other sensitive data confidential, and it is currently resistant to two of the most common security threats. The team…