Tsinghua University's Knowledge Engineering Group (KEG) has introduced GLM-4 9B, an innovative open-source language model that surpasses models such as GPT-4 and Gemini on several benchmark tests. Developed by the THUDM team, GLM-4 9B marks an important advance in natural language processing.
At its core, GLM-4 9B is a colossal…
The advancement of natural language processing (NLP) capabilities has been, to a large extent, dependent on the development of large language models (LLMs). Although these models deliver high performance, their need for immense computational resources and the associated costs make them hard to scale without incurring substantial expense.
These challenges, therefore, create…
Snowflake recently introduced the Polaris Catalog, a new open-source catalog for Apache Iceberg designed to boost data interoperability across multiple engines and cloud services. The release illustrates Snowflake's commitment to granting businesses more control, flexibility, and security in their data management.
The data sector has grown increasingly fond of open-source file and table formats due to…
A research team from IEIT Systems has recently developed a new model, Yuan 2.0-M32, which uses a Mixture of Experts (MoE) architecture. The model is built on the same foundation as Yuan 2.0-2B but employs 32 experts, only two of which are active at any given time, resulting in its unique…
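The routing scheme described above — many experts, only a few active per token — can be sketched in a few lines. This is a minimal, illustrative toy of top-2 MoE gating (32 experts, 2 active), not IEIT Systems' actual implementation; all names, sizes, and the gating function here are hypothetical.

```python
# Toy sketch of top-2 Mixture-of-Experts routing (32 experts, 2 active per
# token), loosely following the scheme described for Yuan 2.0-M32.
# Hypothetical names and sizes throughout; not the model's real code.
import numpy as np

rng = np.random.default_rng(0)

NUM_EXPERTS = 32   # total experts in the layer
TOP_K = 2          # experts active per token
D_MODEL = 8        # toy hidden size

# Each "expert" is a small linear map; a gate scores all experts per token.
experts = [rng.standard_normal((D_MODEL, D_MODEL)) for _ in range(NUM_EXPERTS)]
gate_w = rng.standard_normal((D_MODEL, NUM_EXPERTS))

def moe_layer(x):
    """Route each token through its top-2 experts, weighted by softmax scores."""
    out = np.zeros_like(x)
    for t, token in enumerate(x):
        scores = token @ gate_w                  # one score per expert
        top = np.argsort(scores)[-TOP_K:]        # indices of the 2 best experts
        weights = np.exp(scores[top])
        weights /= weights.sum()                 # softmax over the selected 2
        for w, e in zip(weights, top):
            out[t] += w * (token @ experts[e])   # only 2 of 32 experts run
    return out

tokens = rng.standard_normal((4, D_MODEL))       # a batch of 4 toy tokens
y = moe_layer(tokens)
print(y.shape)  # (4, 8)
```

The point of the design is compute sparsity: the layer holds the capacity of 32 experts, but each token pays the cost of only two forward passes plus a cheap gating step.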
Artificial Intelligence (AI) is increasingly being used in legal research and document drafting with the aim of improving efficiency and accuracy. However, concerns about the reliability of these tools persist, especially given their potential to produce false or misleading information, referred to as "hallucinations". This issue is of particular concern given the high-stakes nature of…
Large language model (LLM) research has shifted focus to steerability and congruity with complex personas, moving beyond earlier work based on one-dimensional personas or multiple-choice formats. It is now recognized that a persona's intricacy can amplify biases in LLM simulations when the persona is misaligned with typical demographic views.
Recent research by…