Large language models (LLMs), exemplified by dense transformer models like GPT-2 and PaLM, have revolutionized natural language processing thanks to their vast number of parameters, leading to record levels of accuracy and essential roles in data management tasks. However, these models are incredibly large and power-intensive, overwhelming the capabilities of even the strongest Graphic Processing…
