Large language AI models are notorious for crashing or slowing down during lengthy human-AI dialogues, posing a major barrier to the effective use of chatbots in many applications. Now, a team of researchers from MIT and other institutions propose a novel solution - by modifying the key-value cache, or the 'conversation memory', they improved the…
