Language models (LMs) are key components in the realm of artificial intelligence as they facilitate the understanding and generation of human language. In recent times, there has been a significant emphasis on scaling up these models to perform more complex tasks. However, a common challenge stands in the way: understanding how a language model's performance…
Large language models (LLMs) such as GPT-4 have been proven to excel at language comprehension, however, they struggle with high GPU memory usage during inference. This is a significant limitation for real-time applications, such as chatbots, due to scalbility issues. To illustrate, present methods reduce memory by compressing the KV cache, a prevalent memory consumer…
As AI models become increasingly vital for computing functionality and user experience, the challenge lies in effectively integrating them into smaller devices like personal computers without major resource utilization. Microsoft has developed a solution to this challenge with the introduction of Phi Silica, a small language model (SLM) designed to work with the Neural Processing…
With an alarming rise in the use of machine learning (ML) models in high-stakes societal applications come growing concerns about their fairness and transparency. Instances of biased decision-making have caused an increase in distrust among consumers who are subject to decisions based on these models. The demand for technology that allows public verification of fairness…
Large language models (LLMs) play a crucial role in a range of applications, however, their significant memory consumption, particularly the key-value (KV) cache, makes them challenging to deploy efficiently. Researchers from the ShanghaiTech University and Shanghai Engineering Research Center of Intelligent Vision and Imaging offered an efficient method to decrease memory consumption in the KV…
The increasing sophistication of artificial intelligence and large language models (LLMs) like GPT-4 and LLaMA2-70B has sparked interest in their potential to display a theory of mind. Researchers from the University Medical Center Hamburg-Eppendorf, the Italian Institute of Technology, Genoa, and the University of Trento are studying these models to assess their capabilities against human…
Artificial Intelligence (AI) is increasingly transforming many areas of modern life, significantly advancing fields such as technology, healthcare, and finance. Within the AI landscape, there has been significant interest and progress regarding Reinforcement Learning (RL) and Generative Adversarial Networks (GANs). They represent key facilitators of major changes in the AI area, enabling advanced decision-making processes…
Natural Language Processing (NLP) is a critical field that allows computers to comprehend, interpret, and generate human language. This translates to tasks such as language translation, sentiment analysis, and text generation, creating systems that can interact effectively with humans through language. However, carrying out these tasks demands complex models able to cope with aspects of…
Natural language processing (NLP) refers to a field of computer science concerned with enabling computers to understand, interpret, and generate human language. Tasks encompassed in this area include language translation, sentiment analysis, and text generation. The primary objective is creating systems capable of interacting with humans using language fluently. However, achieving this requires developing complex…
Creating high-quality, diverse media from text is often a challenging task for existing models. Such models either generate low-quality outcomes, are slow, or need a significant level of computational power. Current solutions that resolve individual tasks such as text-to-image or text-to-video generation need to be merged with other models to achieve the desired effect. Moreover,…
Generating high-quality, diverse media content from textual input is a complex task. Traditional models have suffered from several limitations such as poor output quality, slow processing or high computational resource requirements, making them less efficient and widespread. Even for individual tasks like text-to-image or text-to-video, these models often need to be used in conjunction with…
Genomic research, which seeks to understand the structure and function of genomes, plays a significant role in a variety of sectors, including medicine, biotechnology, and evolutionary biology. It provides valuable insights into potential therapies for genetic disorders and fundamental life processes. However, the field also faces major challenges, particularly when it comes to modelling and…