In late 2019, Hector (Haofeng) Xu, whilst pursuing his PhD in MIT’s aeronautics and astronautics department, took to learning to fly helicopters. After some near misses, he was inspired to make helicopter flight safer. This would be the foundation for his autonomous helicopter company, Rotor Technologies, Inc., established in 2021. The aim was to mitigate…
A team of researchers from MIT and other institutions has discovered a key issue with large-scale machine learning models causing chatbot performance to degrade. When engaged in extensive dialogues, the huge language models behind bots like ChatGPT sometimes begin to fail. However, the team devised a solution enabling nonstop conversation without deterioration or lag. The…
Data mapping, which involves linking fields from one database to another, is a crucial part of data management, particularly in transforming and integrating data from varying sources into a cohesive format. An innovative perspective on this process frames it as a search problem. The efficacy of viewing data mapping as a search problem provides useful…
Machine learning (ML) has become a fundamental part of several industries worldwide due to its wide range of applications. However, understanding and interpreting complex ML models continues to be a challenge. These models, often comprising multiple layers and intricate connections, require precise graph visualization tools to understand how data travels across the model and how…
Machine Learning (ML) models are increasingly becoming an integral part of various sectors globally, with their extensive applications and growing reliance on their capabilities. As these models grow in complexity, understanding and interpreting them becomes more challenging. Visualizing how data flows through the model and how the different parts interact is crucial to debug and…
The world of artificial intelligence (AI) and machine learning continues to evolve at a rapid pace, with OpenAI leading the charge. Their latest development is the introduction of GPT-4o, an optimized version of the widely used GPT-4, part of the Generative Pre-trained Transformer model series renowned for its natural language processing capabilities.
GPT-4 boasts enhanced contextual…
The world of Artificial Intelligence (AI) has taken another step forward with the introduction of the recent Yi-1.5-34B model by 01.AI. This model is considered a significant upgrade over prior versions, providing a bridge between the capabilities of the Llama 3 8B and the 70B models.
The distinguishing features of the Yi-1.5-34B include improvements in multimodal…
In 2021, Haofeng (Hector) Xu, a student well-versed in aviation and aerospace engineering, established Rotor Technologies Inc., an autonomous helicopter company. This decision rooted from his personal experiences in flying helicopters along with the realization of the life risks pilots undertake daily in the US while operating small aircraft, many of which result in fatalities.…
When engaging in lengthy dialogues, advanced AI-powered chatbots often become inept, resulting in a significant performance downturn. A team of researchers from MIT alongside others have deduced a reason for this issue and devised a straightforward solution to prevent the bot from crashing or slowing down. The method, StreamingLLM, effectively ensures a continuous discussion irrespective…
Large language models (LLMs) have been successful in areas like natural language tasks and following instructions, yet they have limitations when dealing with non-textual data such as images and audio. But presently, an approach integrating textual LLMs with speech encoders in one training setup could revolutionize this. One option is multimodal audio-language models, proving advantageous…
The standard method for aligning Language Learning Models (LLMs) is known as RLHF, or Reinforcement Learning from Human Feedback. However, new developments in offline alignment methods - such as Direct Preference Optimization (DPO) - challenge RLHF's reliance on on-policy sampling. Unlike online methods, offline algorithms use existing datasets, making them simpler, cheaper, and often more…
Natural Language Processing (NLP) is a revolutionary field that allows machines to understand, interpret, and generate human language. It is widely used in various sectors, including language translation, text summarization, sentiment analysis, and the creation of conversational agents. Large language models (LLMs), which have greatly improved these applications, require huge computational and energy demands for…