
Small Language Model

Maximalists AI Researcher Unveils BRAG: Small Language Models (SLMs) Optimized for RAG Tasks, Available for Under $25 Each.

The BRAG series is a set of high-performance Retrieval Augmented Generation (RAG) models developed by Maximalists AI Researcher. These small language models are designed as a low-cost alternative for AI-driven language processing, delivering strong results at a fraction of the usual cost. They were created to meet the need for more powerful…

Read More

Improving Text Embeddings in Compact Language Models: A Contrastive Fine-Tuning Approach Evaluated on MiniCPM.

Researchers from Tsinghua University have developed an approach to improve the performance of smaller language models such as MiniCPM, Phi-2, and Gemma by enhancing their text embeddings. By applying contrastive fine-tuning using the NLI dataset, the researchers significantly improved text embedding quality across various benchmarks. In particular, MiniCPM showed a 56.33% performance improvement,…
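Contrastive fine-tuning of the kind described above typically optimizes an InfoNCE-style objective over NLI triplets, pulling a premise embedding toward its entailment and pushing it away from contradictions. The following toy sketch uses hypothetical two-dimensional embeddings rather than any real model's outputs, and is only an illustration of the loss, not the paper's training setup:

```python
import numpy as np

def info_nce_loss(anchor, positive, negatives, temperature=0.05):
    """InfoNCE loss for one NLI triplet: the anchor (premise embedding)
    should be closer to the positive (entailment) than to any negative
    (contradiction) under cosine similarity."""
    def cos(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
    pos = np.exp(cos(anchor, positive) / temperature)
    negs = sum(np.exp(cos(anchor, n) / temperature) for n in negatives)
    return -np.log(pos / (pos + negs))

# Toy embeddings: the positive is nearly aligned with the anchor,
# the negative is orthogonal to it.
anchor = np.array([1.0, 0.0])
positive = np.array([0.9, 0.1])
negative = np.array([0.0, 1.0])

loss_good = info_nce_loss(anchor, positive, [negative])
loss_bad = info_nce_loss(anchor, negative, [positive])
print(loss_good < loss_bad)  # prints True: aligned pairs yield lower loss
```

Minimizing this loss over many triplets reshapes the embedding space so that semantically equivalent sentences cluster together, which is what drives the benchmark gains reported for MiniCPM.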

Read More

Gemma 2 2B Launched: Advanced Text Generation with 2.6 Billion Parameters, Enhanced Safety Measures, and On-Device Deployment.

Google's AI research team, DeepMind, has unveiled Gemma 2 2B, its new, sophisticated language model. This version, with 2.6 billion parameters, is optimized for on-device use and is a top choice for applications demanding high performance and efficiency. It includes enhancements for handling large text generation tasks with greater precision and efficiency…

Read More

OuteAI Introduces New Lite-Oute-1 Variants: Lite-Oute-1-300M and Lite-Oute-1-65M, Compact Yet Robust AI Models.

OuteAI has released two new models in its Lite series, Lite-Oute-1-300M and Lite-Oute-1-65M, designed to balance efficiency and performance for deployment across a range of devices. The Lite-Oute-1-300M model is based on the Mistral architecture and features 300 million parameters, while the Lite-Oute-1-65M, based on the LLaMA architecture, hosts around…

Read More

Neural Magic has launched a fully quantized FP8 version of Meta's Llama 3.1 405B model, offering both FP8 Dynamic and Static Quantization.

Neural Magic, an AI solutions provider, has announced a breakthrough in AI model compression with the introduction of a fully quantized FP8 version of Meta's Llama 3.1 405B model. This is significant because it allows the massive model to fit on any 8xH100 or 8xA100 system without the…
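The difference between the dynamic and static FP8 schemes mentioned above comes down to how the quantization scale is chosen: dynamic computes it from each tensor at runtime, while static fixes it offline from calibration data. The toy sketch below is a conceptual simulation, not Neural Magic's implementation; in particular, FP8 E4M3's non-uniform value spacing is approximated here with uniform rounding:

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3

def dynamic_scale(tensor):
    """Dynamic quantization: the scale is computed from each tensor at runtime."""
    return np.abs(tensor).max() / FP8_E4M3_MAX

def static_scale(calibration_batches):
    """Static quantization: one fixed scale derived offline from calibration data."""
    return max(np.abs(b).max() for b in calibration_batches) / FP8_E4M3_MAX

def fake_quantize(tensor, scale):
    """Simulate quantization: scale, clip to the FP8 range, round, dequantize."""
    q = np.round(np.clip(tensor / scale, -FP8_E4M3_MAX, FP8_E4M3_MAX))
    return q * scale

weights = np.array([0.5, -1.2, 3.0, -0.7])
s_dyn = dynamic_scale(weights)
deq = fake_quantize(weights, s_dyn)
print(np.abs(deq - weights).max() <= s_dyn)  # prints True: error bounded by the scale
```

Static scales avoid the runtime cost of computing per-tensor maxima but depend on the calibration set covering the activation range; dynamic scales adapt to each input at a small latency cost. Halving each weight from 16 to 8 bits is what lets the 405B model fit on a single 8-GPU node.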

Read More

Arcee AI has unveiled Arcee-Nova, a new open-source language model. Based on Qwen2-72B, it approaches the performance level of GPT-4.

Arcee AI, known for its innovation in open-source artificial intelligence, has launched Arcee-Nova. The model has quickly gained recognition as the highest-performing model in the open-source arena, nearly on par with GPT-4 as of its May 2023 benchmark version. Arcee-Nova is an…

Read More