The standard method for aligning large language models (LLMs) is Reinforcement Learning from Human Feedback (RLHF). However, new developments in offline alignment methods, such as Direct Preference Optimization (DPO), challenge RLHF's reliance on on-policy sampling. Unlike online methods, offline algorithms learn from existing preference datasets, making them simpler, cheaper, and often more…
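As an illustration of the offline setup, here is a minimal sketch of the DPO objective in PyTorch, assuming we already have per-sequence log-probabilities for a preferred and a rejected completion from both the policy and a frozen reference model (the tensor names below are hypothetical):

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Minimal DPO objective: push the policy to prefer the chosen completion
    over the rejected one, relative to a frozen reference model.

    All arguments are summed token log-probabilities per sequence, shape
    (batch,). `beta` controls how far the policy may drift from the reference."""
    # Log-ratio of policy vs. reference for each completion
    chosen_logratio = policy_chosen_logps - ref_chosen_logps
    rejected_logratio = policy_rejected_logps - ref_rejected_logps
    # Bradley-Terry style preference loss on the difference of log-ratios
    logits = beta * (chosen_logratio - rejected_logratio)
    return -F.logsigmoid(logits).mean()

# Example with random log-probabilities for a batch of 4 preference pairs
loss = dpo_loss(torch.randn(4), torch.randn(4), torch.randn(4), torch.randn(4))
```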
Artificial Intelligence (AI) systems have demonstrated a fascinating trend of converging data representations across different architectures, training objectives, and modalities. Researchers propose the "Platonic Representation Hypothesis" to explain this phenomenon: in essence, it posits that diverse AI models are converging toward a shared representation of the underlying reality that gives rise to the data they observe.…
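One simple way to probe this kind of convergence, sketched below under the assumption that we already have feature matrices from two different models run on the same inputs, is a mutual nearest-neighbor alignment score: for each input, measure how much the two models agree on which other inputs are its closest neighbors. This is an illustrative metric, not the paper's exact procedure.

```python
import numpy as np

def mutual_knn_alignment(feats_a, feats_b, k=10):
    """Average overlap of k-nearest-neighbor sets between two representations
    of the same n inputs.

    feats_a, feats_b: (n, d_a) and (n, d_b) feature matrices from two models
    evaluated on the same (hypothetical) inputs."""
    def knn_indices(feats):
        # Cosine-similarity nearest neighbors, excluding each point itself
        normed = feats / np.linalg.norm(feats, axis=1, keepdims=True)
        sim = normed @ normed.T
        np.fill_diagonal(sim, -np.inf)
        return np.argsort(-sim, axis=1)[:, :k]

    nn_a, nn_b = knn_indices(feats_a), knn_indices(feats_b)
    overlap = [len(set(a) & set(b)) / k for a, b in zip(nn_a, nn_b)]
    return float(np.mean(overlap))

# Example with random features: alignment near chance for unrelated models
print(mutual_knn_alignment(np.random.randn(200, 64), np.random.randn(200, 32)))
```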
A team of researchers from MIT and other institutions has identified a surprising cause of performance deterioration in chatbots and found a simple fix that enables persistent, uninterrupted dialogue. The problem arises when a human-AI interaction runs over many continuous rounds of conversation, which can overwhelm the large language models that power chatbots like ChatGPT.
The researchers have…
Researchers from MIT and other institutions have developed a method that prevents large language models from crashing during lengthy dialogues. The solution, known as StreamingLLM, tweaks the key-value cache (a sort of conversation memory) of large language models so that the first few tokens remain in memory. Typically, once the cache's capacity is…
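A minimal sketch of the idea, using a toy list of token entries rather than a real model's key-value tensors: when the cache overflows, keep the first few "sink" tokens plus a window of the most recent tokens, dropping only the middle instead of the beginning.

```python
def evict_keep_sinks(cache, max_size, num_sinks=4):
    """Toy version of a StreamingLLM-style eviction policy.

    `cache` is a list of per-token entries (stand-ins for key-value pairs).
    When it exceeds `max_size`, keep the first `num_sinks` entries plus the
    most recent entries, rather than naively dropping the oldest ones."""
    if len(cache) <= max_size:
        return cache
    recent = cache[-(max_size - num_sinks):]
    return cache[:num_sinks] + recent

# Example: a 12-entry cache trimmed to 8 entries keeps tokens 0-3 and the last 4
tokens = list(range(12))
print(evict_keep_sinks(tokens, max_size=8))  # [0, 1, 2, 3, 8, 9, 10, 11]
```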
Artificial neural networks (ANNs) exhibit remarkable capabilities when trained on natural data. Regardless of their exact initialization, dataset, or training objective, neural networks trained on the same data domain tend to converge to similar patterns. Across different image models, for example, the initial layer weights typically converge to Gabor filters and color-contrast detectors, which underlie a sort of "universal"…
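This first-layer structure is easy to inspect directly. The sketch below plots the initial convolution filters of a pretrained torchvision ResNet-18, chosen here purely as an example; any trained image backbone would show similar edge- and color-selective patterns.

```python
import torchvision
import matplotlib.pyplot as plt

# Load a pretrained image model; the choice of ResNet-18 is just an example.
model = torchvision.models.resnet18(weights="IMAGENET1K_V1")
filters = model.conv1.weight.detach()          # shape: (64, 3, 7, 7)

# Normalize each filter to [0, 1] so it can be displayed as an RGB patch.
filters = (filters - filters.amin()) / (filters.amax() - filters.amin())

fig, axes = plt.subplots(8, 8, figsize=(6, 6))
for ax, f in zip(axes.flat, filters):
    ax.imshow(f.permute(1, 2, 0))              # (H, W, C) layout for imshow
    ax.axis("off")
plt.suptitle("First-layer filters: Gabor-like edges and color contrasts")
plt.show()
```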
Researchers from MIT and other institutions have found a solution to an issue that causes chatbots powered by machine-learning models to malfunction during long, continuous dialogues. They found that significant slowdowns or crashes occur when the key-value cache, essentially the conversation memory, becomes overloaded, causing early data to be evicted and the model to fail. The researchers…
Pre-training large models on time series data remains a persistent challenge due to the absence of a large, cohesive public time series repository, the diversity of time series characteristics, and the immaturity of benchmarks for evaluating such models. Despite this, time series analysis remains integral to many fields, including weather forecasting, heart-rate irregularity detection, and anomaly detection in software deployments.…
Researchers from MIT and the University of Basel in Switzerland have developed a new machine-learning framework that can automatically map out phase diagrams for novel physical systems. By applying generative artificial intelligence models, the team created a more efficient way to track and understand phase transitions in water and other complex physical systems, which offers…
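As a loose illustration of the underlying idea, and not the team's actual models, one can fit a simple generative model (here a Gaussian) to measurements from each known phase and then locate a transition by watching which model assigns higher likelihood as a control parameter is swept. The phases, temperatures, and data below are all hypothetical.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)

# Hypothetical measurements of an order parameter in two known phases.
low_T_samples = rng.normal(loc=1.0, scale=0.1, size=500)    # ordered phase
high_T_samples = rng.normal(loc=0.0, scale=0.1, size=500)   # disordered phase

# "Generative models": one Gaussian fitted per known phase.
phase_models = {
    "ordered": norm(low_T_samples.mean(), low_T_samples.std()),
    "disordered": norm(high_T_samples.mean(), high_T_samples.std()),
}

def classify(samples):
    """Assign measurements to whichever phase model gives them higher likelihood."""
    return max(phase_models, key=lambda p: phase_models[p].logpdf(samples).sum())

# Sweep a control parameter and watch the classification flip at the transition.
for temperature in [0.2, 0.6, 1.0, 1.4, 1.8]:
    order_param = rng.normal(loc=max(0.0, 1.0 - 0.7 * temperature), scale=0.1, size=50)
    print(f"T={temperature:.1f} -> {classify(order_param)}")
```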
In traffic management and urban planning, understanding which routes are most efficient with respect to multiple variables has significant potential benefits. The approach assumes that when individuals choose a route, they are trying to minimize certain costs related to travel time, comfort, tolls, and distance. Understanding these costs can help improve traffic flow and…
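To make the "minimize a combination of costs" idea concrete, the sketch below runs Dijkstra's algorithm over a small hypothetical road network whose edge costs are a weighted sum of travel time, toll, and distance; the weights stand in for a traveler's preferences, which are exactly the quantities such methods try to recover.

```python
import heapq

# Hypothetical road network: each edge carries (time_min, toll_usd, distance_km).
graph = {
    "A": {"B": (10, 0.0, 5.0), "C": (4, 2.5, 3.0)},
    "B": {"D": (6, 0.0, 4.0)},
    "C": {"D": (3, 1.0, 2.0)},
    "D": {},
}

def edge_cost(attrs, weights):
    """Combine time, toll, and distance into a single scalar cost."""
    return sum(w * a for w, a in zip(weights, attrs))

def cheapest_route(graph, start, goal, weights):
    """Dijkstra's algorithm under a weighted-sum cost; weights encode preferences."""
    queue = [(0.0, start, [start])]
    best = {}
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == goal:
            return cost, path
        if cost > best.get(node, float("inf")):
            continue
        for nxt, attrs in graph[node].items():
            new_cost = cost + edge_cost(attrs, weights)
            if new_cost < best.get(nxt, float("inf")):
                best[nxt] = new_cost
                heapq.heappush(queue, (new_cost, nxt, path + [nxt]))
    return float("inf"), []

# A toll-averse traveler (high toll weight) picks a different route than a hurried one.
print(cheapest_route(graph, "A", "D", weights=(1.0, 10.0, 0.1)))  # avoids tolls via B
print(cheapest_route(graph, "A", "D", weights=(1.0, 0.1, 0.1)))   # minimizes time via C
```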