Large-scale language models (LLMs) have made substantial progress in understanding language by absorbing information from their environment. However, while they excel in areas like historical knowledge and insightful responses, they struggle when it comes to real-time comprehension. Embodied AI, integrated into items like smart glasses or home robots, aims to interact with humans using everyday…
