Black Forest Labs has entered the field of generative artificial intelligence (AI), seeking to transform this sector with their advanced suite of models known as FLUX.1. The company's primary focus is on pushing the boundaries of generative deep learning models for media, like images and videos, while also promoting the safe use of these revolutionary…
Nixtla has announced the launch of NeuralForecast, an advanced library of neural forecasting models set to revolutionise the forecasting community. The library addresses long-standing issues such as usability, accuracy, and computational efficiency, providing a bridge between neural networks' complexity and their practical use.
NeuralForecast comprises multiple neural network architectures, from Multi-Layer Perceptrons (MLP) and Recurrent Neural…
Researchers from Harvard and Stanford universities have developed a new meta-algorithm known as SPRITE (Spatial Propagation and Reinforcement of Imputed Transcript Expression) to improve predictions of spatial gene expression. This technology serves to overcome current limitations in single-cell transcriptomics, which can currently only measure a limited number of genes.
SPRITE works by refining predictions from existing…
Deep learning has transformed the field of pathological voice classification, particularly in the evaluation of the GRBAS (Grade, Roughness, Breathiness, Asthenia, Strain) scale. Unlike traditional methods that involve manual feature extraction and subjective analysis, deep learning leverages 1D convolutional neural networks (1D-CNNs) to autonomously extract relevant features from raw audio data. However, background noise can…
Video captioning is crucial for content understanding, retrieval, and training foundational models for video-related tasks. However, it's a challenging field due to issues like a lack of high-quality data, the complexity of captioning videos compared to images, and the absence of established benchmarks.
Despite these challenges, recent advancements in visual language models have improved video…
In developing AI-based applications, developers often grapple with memory management challenges. High costs, restricted access due to closed-source tools, and poor support for external integration have posed barriers to creating robust applications such as AI-driven dating or health diagnostics platforms.
Typically, memory management for AI applications can be expensive, closed-sourced, or lack comprehensive support for external…
MIT and the University of Washington researchers have devised a method to model human or machine agent behaviour incorporating unknown computational constraints limiting problem-solving abilities. The technique generates an "inference budget" by observing a few previous actions, effectively predicting future behaviour. Lead author Athul Paul Jacob believes the work could help AI systems better understand…
Researchers at MIT and the IBM Watson AI lab have developed a machine-learning accelerator chip which is more resilient to common types of cyber attacks. The chip is designed to protect sensitive user data, such as health records or financial information, whilst also enabling large-scale AI models to run efficiently on devices. The design of…
Julie Shah ’04, SM ’06, PhD ’11, already an esteemed professor in Aeronautics and Astronautics, has been named the new department head for the same field at MIT starting May 1. A renowned figure in the field of robotics and AI, Shah has a reputation for significant technical contributions to this sector, especially in the…
Researchers at MIT Lincoln Laboratory have developed a new open-source dataset, named TorNet, to detect and predict tornadoes. By using artificial intelligence (AI) models trained on TorNet, researchers hope to improve tornado forecasts and warning accuracy, potentially saving lives and minimizing damage.
Tornadoes are challenging to predict, and this represents a high false alarm rate…
Speech recognition technology, a rapidly evolving area of machine learning, allows computers to understand and transcribe human languages. This technology is pivotal for services including virtual assistants, automated transcription, and language translation tools. Despite recent advancements, developing universal speech recognition systems that cater to all languages, particularly those that are less common and understudied, remains…