Skip to content Skip to sidebar Skip to footer

Artificial Intelligence

Searching for a particular activity in a video? This method, powered by artificial intelligence, can locate it for you.

Researchers from MIT and the MIT-IBM Watson AI Lab have introduced an efficient method to train machine-learning models to identify specific actions in videos by making use of the video's automatically generated transcripts. The method, known as spatio-temporal grounding, helps the model intricately understand the video by dissecting it and analysing it through the lens…

Read More

Addressing the Issue of Gradient Inversion in Federated Learning: The DAGER Algorithm for Precise Text Reconstruction

Federated learning is a way to train models collaboratively using data from multiple clients, maintaining data privacy. Yet, this privacy can become compromised by gradient inversion attacks that reconstruct original data from shared gradients. To address this threat and specifically tackle the challenge of text recovery, researchers from INSAIT, Sofia University, ETH Zurich, and LogicStar.ai…

Read More

Mistral-finetune: A streamlined code structure for resource-effective and high-performing refinements of Mistral’s Models.

Fine-tuning large language models is a common challenge for many developers and researchers in the AI field. It is a critical process in adapting models to specific tasks or enhancing their performance. But it often necessitates significant computational resources and time. Conventional solutions, such as adjusting all model weights, are resource-intensive, requiring substantial memory and…

Read More

NV-Embed: NVIDIA’s Innovative Embedding Model Excels in MTEB Benchmarks

NVIDIA, a leader in artificial intelligence (AI) and graphic processing units (GPUs), has recently launched NV-Embed, an advanced embedding model built on the large language model (LLM) architecture. NV-Embed is set to transform the field of natural language processing (NLP) and has already demonstrated high performance results in the Massive Text Embedding Benchmark (MTEB). Its…

Read More

This Artificial Intelligence research study from Cornell University deciphers the intricate factors in estimating interventional probability.

Causal models play a vital role in establishing the cause-and-effect associations between variables in complex systems, though they struggle to estimate probabilities associated with multiple interventions and conditions. Two main types of causal models have been the focus of AI research - functional causal models and causal Bayesian networks (CBN). Functional causal models make it…

Read More

Investigations at Arizona State University Assess ReAct Prompting: The Importance of Similar Examples in Boosting Extensive Language Model Logic

Large language models (LLMs) have rapidly improved over time, proving their prowess in text generation, summarization, translation, and question-answering tasks. These advancements have led researchers to explore their potential in reasoning and planning tasks. Despite this growth, evaluating the effectiveness of LLMs in these complex tasks remains a challenge. It's difficult to assess if any performance…

Read More

An innovative method allows AI chatbots to engage in conversations all day long without experiencing any system failures.

When engaging in continuous dialogues, powerful language machine-learning models that drive chatbot technologies such as ChatGPT can struggle to cope, often leading to a decline in performance. Now, a team of researchers from MIT and elsewhere believe they have found a solution to this issue, which ensures chatbots can continue a conversation without crashing or…

Read More

This small, secure ID label can verify nearly everything.

A few years ago, MIT researchers created an innovative cryptographic ID tag several times smaller and much cheaper than traditional RFIDs (radio frequency tags) commonly used to authenticate products. Despite the significant improvements in size, cost, and security this new ID tag brought, it shared a major security vulnerability with RFIDs, where a counterfeiter could…

Read More

The new system pinpoints medicines that should not be combined.

Researchers from MIT, Brigham and Women’s Hospital and Duke University have developed a multipronged strategy to identify which transporter proteins drugs use to pass through the GI tract. This could not only improve patient treatment by revealing which drugs might interact unfavorably with each other, but also enhance the development of new drugs by informing…

Read More

Empowering individuals facing challenges with the use of Artificial Intelligence.

In 2010, Media Lab students Karthik Dinakar SM ’12, PhD ’17, and Birago Jones SM ’12 wanted to develop a tool to assist content moderation teams at companies like Twitter and YouTube. The project prompted excitement, earning them a demo at a White House cyberbullying summit. When Dinakar struggled to create a working demo, Jones…

Read More

The diffusion control model can alter the characteristics of the material present in pictures.

A team of researchers from the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL) and Google Research have developed an image-to-image diffusion model called Alchemist, which allows users to modify the material properties of objects in photos. The system adjusts aspects such as roughness, metallicity, innate color (albedo), and transparency, and can be applied to…

Read More