Papers

12094 papers

ICLR2025

Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control

Summary pending...

Decision FrequencyAction Sequence GenerationModel-Based Training
ICLR2025

Agent-Oriented Planning in Multi-Agent Systems

Summary pending...

Multi-Agent System; Planning
ICLR2025

Reasoning of Large Language Models over Knowledge Graphs with Super-Relations

Summary pending...

Knowledge GraphsLarge Language ModelsQuestion Answering
ICLR2025

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Summary pending...

pretraininglanguage modelerror correction
ICLR2025

Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition

Summary pending...

knowledge entropyknowledge acquisition and forgettingevolving behavior during LLM pretraining
ICLR2025

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Summary pending...

AttentionQuantizationquantized attention
ICLR2025

FreDF: Learning to Forecast in the Frequency Domain

Summary pending...

Time seriesLong-term Forecast
ICLR2025

You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning

Summary pending...

Model CompressionLarge Language ModelsStructured Pruning
ICLR2025

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs

Summary pending...

Multi-modal ModelVisual-Language ModelSycophancy
ICLR2025

Decentralized Optimization with Coupled Constraints

Summary pending...

decentralized optimizationconvex optimizationaffine constraints
ICLR2025

Diffusion-Based Planning for Autonomous Driving with Flexible Guidance

Summary pending...

diffusion planningautonomous driving
ICLR2025

Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass

Summary pending...

language model; efficient adaptation; fine-tuning; prompting
ICLR2025

Neural Multi-Objective Combinatorial Optimization via Graph-Image Multimodal Fusion

Summary pending...

Neural Multi-Objective Combinatorial OptimizationMultimodal FusionDeep Reinforcement Learning
ICLR2025

Searching for Optimal Solutions with LLMs via Bayesian Optimization

Summary pending...

searchoptimizationLLMs
ICLR2025

PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations

Summary pending...

Generative adversarial imitation learningimperfect demonstrationsreinforcement learning
ICLR2025

Frame-Voyager: Learning to Query Frames for Video Large Language Models

Summary pending...

Video-LLMAdaptive Frame Sampling
ICLR2025

MQuAKE-Remastered: Multi-Hop Knowledge Editing Can Only Be Advanced with Reliable Evaluations

Summary pending...

knowledge editmodel editmulti-hop
ICLR2025

Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining

Summary pending...

sample reweighinglarge language modelspretraining
ICLR2025

Deep Distributed Optimization for Large-Scale Quadratic Programming

Summary pending...

Learning-to-OptimizeDistributed OptimizationLarge-Scale Quadratic Programming
ICLR2025

Hyperbolic Genome Embeddings

Summary pending...

genomicsrepresentation learninghyperbolic geometry