
Researchers at the University of Auckland have presented ChatLogic, a framework that augments large language models for multi-step reasoning, improving accuracy on complex reasoning tasks by more than 50% in some settings.

Large language models (LLMs) are exceptional at generating content and solving complex problems across various domains. Nevertheless, they struggle with multi-step deductive reasoning — a process requiring coherent and logical thinking over extended interactions. The existing training methodologies for LLMs, based on next-token prediction, do not equip them to apply logical rules effectively or maintain deep contextual understanding, limiting their ability to produce logically consistent responses in tasks that require multi-step reasoning.

Existing techniques for improving LLMs’ reasoning capabilities, such as integrating external memory databases or the Recurrent Memory Transformer (RMT), bring their own challenges: retrieval models can embed their biases into the LLM’s outputs, and memory-based approaches still struggle with long sequences in multi-turn dialogues.

Researchers from the University of Auckland developed ChatLogic to overcome these limitations. The framework strengthens LLMs’ deductive reasoning by converting logic problems into symbolic representations the models can process, leveraging their situational understanding and augmenting them with a symbolic memory.
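At a high level, this is a translate-then-execute loop: the LLM drafts a symbolic program from the natural-language problem, correction passes repair it, and a logic engine executes it. Below is a minimal, hypothetical Python sketch of that flow; the `call_llm` helper, the prompt wording, and the assumption that the generated program stores its result in an `answer` variable are all illustrative, not the authors’ implementation.

```python
def call_llm(prompt: str) -> str:
    """Placeholder for a chat-completion API call (e.g., GPT-3.5 or GPT-4)."""
    raise NotImplementedError


def solve(premises: str, question: str, max_repairs: int = 3) -> str:
    # Step 1: ask the LLM to translate the natural-language problem
    # into an executable pyDatalog program.
    program = call_llm(
        "Translate these premises and question into pyDatalog code.\n"
        f"Premises: {premises}\nQuestion: {question}"
    )
    # Step 2: execute the program, feeding any error back to the LLM,
    # a rough analogue of the semantic and syntax correction modules.
    for _ in range(max_repairs):
        try:
            namespace = {}
            exec(program, namespace)  # run the generated logic program
            return namespace.get("answer", "unknown")  # assumed output variable
        except Exception as err:
            program = call_llm(
                f"This pyDatalog program failed with: {err}\n"
                f"{program}\nReturn a corrected version."
            )
    return "unknown"
```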

ChatLogic introduces a prompting method called ‘Mix-shot Chain of Thought’ (CoT), which blends several prompt engineering techniques to guide LLMs through logical reasoning steps. The framework translates natural language queries into executable logic programs using pyDatalog, and its semantic and syntax correction modules iteratively repair the generated programs so they can actually run.
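To make the translation step concrete, here is a small pyDatalog program of the kind such a pipeline might generate for a toy PARARULE-Plus-style problem (“The bear is big. If something is big then it is strong. If something is strong then it is heavy. Is the bear heavy?”). The predicate names are invented for illustration; the paper’s actual prompts and generated code may differ.

```python
from pyDatalog import pyDatalog

# Declare a logic variable and the predicates used below.
pyDatalog.create_terms('X, big, strong, heavy')

# Fact: "The bear is big."
+big('bear')

# Rule: "If something is big then it is strong."
strong(X) <= big(X)
# Rule: "If something is strong then it is heavy."
heavy(X) <= strong(X)

# Query: "Is the bear heavy?" Answering requires chaining both rules,
# i.e., exactly the multi-step deduction the symbolic engine performs.
print(heavy(X))  # prints a one-row table containing 'bear'
```

Because the deduction runs in the logic engine rather than in the LLM’s token stream, each inference step is exact, which is the core benefit the framework claims.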

Experimental results show that LLMs integrated with ChatLogic significantly outperform standard models in multi-step reasoning tasks. On the PARARULE-Plus dataset, GPT-3.5 with ChatLogic achieved an accuracy of 0.5275, compared with 0.344 for the standard model, a relative improvement of roughly 53%. For GPT-4, ChatLogic delivered an accuracy of 0.73 against the base model’s 0.555, a substantial gain in high-precision scenarios where the accuracy and reliability of reasoning are paramount.

Further analysis of the CONCEPTRULES datasets shows the effectiveness of ChatLogic. GPT-3.5 with ChatLogic achieved an accuracy of 0.69, compared to 0.57 for the standard model. For GPT-4, the accuracy with ChatLogic was 0.96, slightly better than the standard model’s 0.95.

ChatLogic addresses the multi-step reasoning limitations of LLMs and markedly improves their accuracy and reliability in complex reasoning tasks. Such gains matter in domains like customer service, healthcare and education, where precise, logically consistent responses are vital, and they highlight ChatLogic’s potential as a valuable contribution to artificial intelligence and natural language processing research.
