LLM Survey 2024

Source

Learning From Correctness Without Prompting Makes LLM Efficient Reasoner

Summary pending...

High Quality

Summary pending...

Exhaustive Review on [Search Workflows](https://github.com/xinzhel/LLM-Search)

Summary pending...

LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback

Summary pending...

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Summary pending...

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Summary pending...

Augmented Language Models: a Survey

Summary pending...

The Rise and Potential of Large Language Model Based Agents: A Survey

Summary pending...

A Survey on the Memory Mechanism of Large Language Model based Agents

Summary pending...

Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models

Summary pending...

ART: Automatic multi-step reasoning and tool-use for large language models

Summary pending...

TALM: Tool Augmented Language Models

Summary pending...

On the Tool Manipulation Capability of Open-source Large Language Models

Summary pending...

Large Language Models as Tool Makers

Summary pending...

LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models

Summary pending...

Understanding the planning of LLM agents: A survey

This survey paper explores the planning mechanisms employed by large language model (LLM) agents, highlighting their capabilities and limitations. Understanding these planning strategies is crucial for improving LLM performance in complex tasks and applications.

LLM, planning, survey

GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution

Summary pending...

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Summary pending...

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Summary pending...

Large Language Model Guided Tree-of-Thought

Summary pending...

Tree Search for Language Model Agents

Summary pending...

Q\*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

Summary pending...

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Summary pending...

Making Large Language Models into World Models with Precondition and Effect Knowledge

Summary pending...

A Survey on Large Language Model based Autonomous Agents

This paper surveys the development and application of large language model-based autonomous agents, highlighting their capabilities and challenges. It is important as it provides a comprehensive overview of how these agents are transforming various fields and identifies future research directions.

large language models, autonomous agents, survey

TaskBench: Benchmarking Large Language Models for Task Automation

Summary pending...

ISR-LLM: Iterative Self-Refined Large Language Model for Long-Horizon Sequential Task Planning

Summary pending...

TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents

Summary pending...

TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems

Summary pending...

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Summary pending...

MetaTool Benchmark: Deciding Whether to Use Tools and Which to Use

Summary pending...

Learning From Mistakes Makes LLM Better Reasoner

Summary pending...

CoLing2025

A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning

Summary pending...

2025

A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning

This paper reviews prominent paradigms for LLM-based agents, covering tool use (including retrieval-augmented generation), planning, and feedback learning.

data coherence, information retrieval, knowledge management

ACL2024

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator

Summary pending...

ICML2024

AlphaZero-Like Tree-Search Can Guide Large Language Model Decoding and Training

Summary pending...

ICML2024

Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models

Summary pending...

ACL2024

Can Language Models Serve as Text-Based World Simulators?

Summary pending...

ACL2024

Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation

Summary pending...

ACL2024

Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs

Summary pending...

ICLR2024

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Summary pending...

ICLR2024

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Summary pending...

ICLR2024

ToolChain: Efficient Action Space Navigation in Large Language Models with A\* Search

Summary pending...

EMNLP2024

LLM-A\*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning

This paper presents LLM-A*, an approach that integrates large language models into incremental heuristic search for path planning. It aims to enhance the efficiency and effectiveness of finding optimal paths in complex environments, which is crucial for applications in robotics and AI navigation.

path planning, heuristic search, large language models

NeurIPS2023

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

This paper introduces HuggingGPT, a framework that integrates ChatGPT with various models from Hugging Face to tackle a range of AI tasks. It highlights the potential of combining different AI models to enhance task performance and flexibility.

AI Integration, ChatGPT, Hugging Face

NeurIPS2023

On the Planning Abilities of Large Language Models -- A Critical Investigation

Summary pending...

NeurIPS2023

LLM-MCTS: Large Language Models as Commonsense Knowledge for Large-Scale Task Planning

Summary pending...

NeurIPS2023

Self-Evaluation Guided Beam Search for Reasoning

Summary pending...

EMNLP2023

RAP: Reasoning with Language Model is Planning with World Model

Summary pending...

EMNLP2023

Prompt-Based Monte-Carlo Tree Search for Goal-oriented Dialogue Policy Planning

Summary pending...

EMNLP2023

Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design

Summary pending...

EMNLP2023

ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models

The paper introduces ChatCoT, a method that enhances chat-based large language models by integrating tool-augmented reasoning through chain-of-thought techniques. This approach improves the models' ability to perform complex tasks by leveraging external tools for better decision-making and problem-solving.

Tool Use, Large Language Models, Reasoning Techniques

NeurIPS2023

PathFinder: Guided Search over Multi-Step Reasoning Paths

Summary pending...

NeurIPS2023

Least-to-Most Prompting Enables Complex Reasoning in Large Language Models

Summary pending...

ACL2023

MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting

Summary pending...

NeurIPS2023

Toolformer: Language Models Can Teach Themselves to Use Tools

Summary pending...

ICLR2023

ReAct: Synergizing Reasoning and Acting in Language Models

Summary pending...

NeurIPS2023

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings

This paper introduces ToolkenGPT, a method that enhances frozen language models by integrating them with a variety of external tools through the use of tool embeddings. This advancement allows for improved task performance by leveraging specialized functionalities of these tools, making language models more versatile.

Tool Use, Language Models, NeurIPS 2023

EMNLP2023

API-Bank: A Benchmark for Tool-Augmented LLMs

Summary pending...

NeurIPS2023

Reflexion: Language Agents with Verbal Reinforcement Learning

Summary pending...

NeurIPS2023

PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change

Summary pending...

NeurIPS2023

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Summary pending...

ICLR2023

Planning with Large Language Models for Code Generation

Summary pending...

EMNLP2023

Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts

Summary pending...

EMNLP2023

ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games

Summary pending...

NeurIPS2023

Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning

Summary pending...

NeurIPS2023

Self-Refine: Iterative Refinement with Self-Feedback

Summary pending...

NeurIPS2023

AdaPlanner: Adaptive Planning from Feedback with Language Models

Summary pending...

NeurIPS2023

Large Language Models Still Can't Plan (A Benchmark for LLMs on Planning and Reasoning about Change)

Summary pending...