Microsoft's AI courses offer robust education in AI and machine learning across a range of skill levels. By emphasizing practical usage, advanced techniques, and ethical AI practices, students learn how to develop and deploy AI solutions effectively and responsibly.
The "Fundamentals of machine learning" course provides a grounding in machine learning's core concepts along with deep…
Matrix multiplication (MatMul) is a fundamental process in most neural network topologies. It is commonly used in vector-matrix multiplication (VMM) by dense layers in neural networks, and in matrix-matrix multiplication (MMM) by self-attention mechanisms. Significant reliance on MatMul can be attributed to GPU optimization for these tasks. Libraries like cuBLAS and the Compute Unified Device…
Researchers have identified cultural accumulation as a crucial aspect of human success. This practice refers to our capacity to learn skills and accumulate knowledge over generations. However, currently used artificial learning systems, like deep reinforcement learning, frame the learning question as happening within a single "lifetime." This approach does not account for the generational and…
Language Learning Models (LLMs) can come up with good answers and even be honest about their mistakes. However, they often provide simplified estimations when they haven't seen certain questions before, and it's crucial to develop ways to draw reliable confidence estimations from them. Traditionally, both training-based and prompting-based approaches have been used, but these often…
Stanford University researchers have developed a new method called Demonstration ITerated Task Optimization (DITTO) designed to align language model outputs directly with users' demonstrated behaviors. This technique was introduced to address the challenges language models (LMs) face - including the need for big data sets for training, generic responses, and mismatches between universal style and…
Large language models (LLMs) have significantly advanced code generation, but they develop code in a linear fashion without access to a feedback loop that allows for corrections based on the previous outputs. This creates challenges in correcting mistakes or suggesting edits. Now, researchers at the University of California, Berkeley, have developed a new approach using…
The field of multimodal learning, which involves training models to understand and generate content in multiple formats such as text and images, is evolving rapidly. Current models have inefficiencies in dealing with text-only and text-image tasks, often excelling in one domain but underperforming in the other. This necessitates distinct systems to retrieve different forms of…
LLM or Language Model-based systems have shown potential to accelerate scientific discovery, especially in the biomedical research field. These systems are able to leverage a large bank of background information to conduct and interpret experiments, particularly useful for identifying drug targets through CRISPR-based genetic modulation. Despite the promise they show, their usage in designing biological…
Research teams from the University of Cambridge, University of Oxford, and the Massachusetts Institute of Technology have developed a dynamic evaluation method called CheckMate. The aim is to enhance the evaluation of Large Language Models (LLMs) like GPT-4 and ChatGPT, especially when used as problem-solving tools. These models are capable of generating text effectively, but…
OpenAI has developed a new feature known as Generic Pre-trained Transformers (GPTs) that allows users to create a custom version of ChatGPT, a sophisticated artificial intelligence text generation technology. These versions can be specialized in any topic ranging from writing to research, productivity, education, lifestyle, and more. The goal of these versions is to assist…