The race to develop powerful, efficient artificial intelligence (AI) hardware took a significant step forward this week, with Intel and Google both revealing new chips aimed at reducing their dependence on NVIDIA’s technology. While NVIDIA’s GPUs currently power a majority of cloud computing data centers used for AI model training, both tech giants are looking…
Large Language Models (LLMs) have taken center stage in many intelligent agent tasks due to their cognitive abilities and quick responses. Even so, existing models often fail to meet demands when negotiating and navigating the multitude of complexities on webpages. Factors such as versatility of actions, HTML text-processing constraints, and the intricacy of on-the-spot decision-making…
The field of semantic segmentation in artificial intelligence (AI) has seen significant progress, but it still faces distinct challenges, especially imaging in problematic conditions such as poor lighting or obstructions. To help bridge these gaps, researchers are looking into various multi-modal semantic segmentation techniques that combine traditional visual data with additional information sources like thermal…
Natural Language Processing (NLP) has traditionally centered around English language models, thereby excluding a significant portion of the global population. However, this status quo is being challenged by the Chinese Tiny LLM (CT-LLM), a groundbreaking development aimed at a more inclusive era of language models. CT-LLM, innovatively trained on the Chinese language, one of the…
Tech giant Meta is pushing the boundaries of artificial intelligence (AI) by introducing the latest version of the Meta Training and Inference Accelerator (MTIA) chip. This move is significant in Meta’s commitment to enhance AI-driven experiences across its products and services.
The new MTIA chip shows remarkable performance enhancements compared to its predecessor, MTIA v1, particularly…
In an industry where large corporations like OpenAI, Meta, and Google dominate, Paris-based AI startup Mistral has recently launched its open-source language model, Mixtral 8x22B. This bold venture establishes Mistral as a notable contender in the field of AI, while simultaneously challenging established models with its commitment to open-source development.
Mixtral 8x22B impressively features an advanced…
A group of leaders and scholars from MIT has released a set of policy briefs aimed at developing a framework for the governance of artificial intelligence (AI) in the United States. The goal of this framework is to enhance US leadership in AI while mitigating potential risks and exploring the benefits of AI deployment.
The main…
Justin Solomon, an Associate Professor in the Massachusetts Institute of Technology (MIT)'s Department of Electrical Engineering and Computer Science (EECS) and member of the Computer Science and Artificial Intelligence Laboratory (CSAIL), is leveraging geometric techniques to tackle complex problems in data science. Quite often, these problems are seemingly unrelated to shapes. For example, when a…
Researchers from Massachusetts Institute of Technology (MIT) and the Chinese University of Hong Kong have developed a digital simulator that mimics the photolithography process, a technique used to manufacture computer chips and optical devices. The project marks the first use of actual data from a photolithography system in the construction of a simulator.
This advancement could…
A study by Massachusetts Institute of Technology (MIT) researchers has indicated that computational models that perform auditory tasks could speed up the development of improved hearing aids, cochlear implants, and brain-machine interfaces. In the study, the largest ever conducted into deep neural network-based models trained to perform hearing-related functions, it was found that most mimic…
A team of researchers at the Massachusetts Institute of Technology (MIT) has developed a machine learning-based method to swiftly calculate the structures of transition states, crucial moments in chemical reactions. This state, at which molecules attain the necessary energy for a reaction, is important but fleetingly transient and difficult to experimentally observe. Calculating these structures…