Skip to content Skip to sidebar Skip to footer

ETH Zurich

Scientists from ETH Zurich, EPFL, and Microsoft have presented QuaRot, a new machine learning technique that facilitates 4-bit inference of Latent Linear Models (LLMs) by eliminating unconventional features.

Large language models (LLMs) have substantially impacted various applications across sectors by offering excellent natural language processing capabilities. They help generate, interpret, and understand the human language, opening routes for new technological advancements. However, LLMs demand considerable computational, memory, and energy resources, particularly during the inference phase, which restricts operational efficiency and their deployment. The extensive…

Read More