Papers

12094 papers

NeurIPS2023

Toolformer: Language Models Can Teach Themselves to Use Tools

Summary pending...

Paper

Augmented Language Models: a Survey

Summary pending...

The Rise and Potential of Large Language Model Based Agents: A Survey

Summary pending...

A Survey on the Memory Mechanism of Large Language Model based Agents

Summary pending...

CoLing2025

A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning

Summary pending...

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Summary pending...

Paper

ART: Automatic multi-step reasoning and tool-use for large language models

Summary pending...

Paper

TALM: Tool Augmented Language Models

Summary pending...

Paper

On the Tool Manipulation Capability of Open-source Large Language Models

Summary pending...

Paper

Large Language Models as Tool Makers

Summary pending...

Paper

LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models

Summary pending...

ACL2024

When is Tree Search Useful for {LLM} Planning? It Depends on the Discriminator

Summary pending...

Paper

GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution

Summary pending...

Paper

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Summary pending...

Paper

NeurIPS2023

On the Planning Abilities of Large Language Models - A Critical Investigation

Summary pending...

Paper

NeurIPS2023

PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change

Summary pending...

Paper

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Summary pending...

Paper

Large Language Model Guided Tree-of-Thought

Summary pending...

Paper

NeurIPS2023

Reflexion: Language Agents with Verbal Reinforcement Learning

Summary pending...

Paper

ICLR2024

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Summary pending...