Skip to content Skip to sidebar Skip to footer

Applications

This Artificial Intelligence research investigates how greatly Language Models can enhance their performance as agents in lengthy tasks within a complex environment using the WebArena Benchmark.

Large Language Models (LLMs) have shown great potential in natural language processing tasks such as summarization and question answering, using zero-shot and few-shot prompting approaches. However, these prompts are insufficient for enabling LLMs to operate as agents navigating environments to carry out complex, multi-step tasks. One reason for this is the lack of adequate training…

Read More