Kolmogorov-Arnold Networks (KANs) are a recent development that offer an alternative to Multi-Layer Perceptrons (MLPs) in machine learning. Using the Kolmogorov-Arnold representation theorem, KANs use neurons that carry out simple addition operations. Nonetheless, current models of KANs can pose challenges in real-world application, prompting researchers to explore other multivariate functions that could boost its use…
MIT and University of Washington researchers have created a model to efficiently predict human behavior, which could potentially improve the effectiveness of AI systems working with human collaborators. Humans tend to behave suboptimally when making decisions due to computational constraints and researchers have created this model to account for these human processing limitations. The model…
Researchers from the MIT-IBM Watson AI Lab and MIT have developed a secure machine-learning accelerator that can efficiently run large AI models while protecting user data. The device keeps user medical records, personal finance information, and other sensitive data confidential, and it is currently resistant to two of the most common security threats. The team…
The arrival of spring in the Northern Hemisphere brings with it the commencement of tornado season. Meteorologists use radar to track these dangerous natural phenomena, but understanding exactly when a tornado has formed or why can be a challenge. However, a new dataset may provide some answers.
Known as TorNet, this dataset compiled by researchers from…
Multi-layer perceptrons (MLPs) are integral to modern deep learning models for their versatility in replicating nonlinear functions across various tasks. However, interpretation and scalability challenges and reliance on fixed activation functions have raised concerns about their adaptability and scalability. Researchers have explored alternative architectures to overcome these issues, such as Kolmogov-Arnold Networks (KANs).
KANs have…
Multimodal AI models, which integrate diverse data types like text and images, are pivotal for tasks such as answering visual questions and generating descriptive text for images. However, optimizing model efficiency remains a significant challenge. Traditional methods, which fuse modality-specific encoders or decoders, often limit the model's ability to combine information across different data types…
Scientists from the Massachusetts Institute of Technology (MIT) and the University of Washington have developed an approach to mechanically infer the computational weaknesses of an AI or human agent by observing prior activities. This perceptible agent’s "inference budget" can be used to predict future behavior. Used in forthcoming AI structures, the technique could allow them…
Health-monitoring apps that use machine learning can be helpful in managing chronic diseases and fitness goals; however, they can also be slow and use a lot of energy. This is mainly due to machine learning models being shuttled between a smartphone and a central memory server. While machine-learning accelerators are often used to streamline computations,…
Every spring, tornado season returns to the Northern Hemisphere. While the twisted funnel of a tornado may seem like an easily recognizable sight, it remains difficult for radar -- the primary tool of meteorologists -- to detect as they form. Predicting tornadoes also remains challenging, due to an unclear understanding of why they form. Over…
Researchers at MIT and the University of Washington have developed a method to effectively model human behavior, accounting for the computational constraints that limit our decision-making abilities. This model, known as the "inference budget," enables predictions of an individual’s future actions based on their past behaviors. This is particularly useful in AI development, allowing machines…
Researchers from MIT and the MIT-IBM Watson AI Lab have developed a machine-learning accelerator that provides strong data protection while allowing massive AI models to run effectively on individual devices. The innovations applied in developing the chip help protect sensitive information such as health records or financial data against common cyber-attacks, without negatively affecting the…