Papers
12094 papers
ICLR2025
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Summary pending...
Decision FrequencyAction Sequence GenerationModel-Based Training
ICLR2025
Agent-Oriented Planning in Multi-Agent Systems
Summary pending...
Multi-Agent System; Planning
ICLR2025
Reasoning of Large Language Models over Knowledge Graphs with Super-Relations
Summary pending...
Knowledge GraphsLarge Language ModelsQuestion Answering
ICLR2025
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems
Summary pending...
pretraininglanguage modelerror correction
ICLR2025
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
Summary pending...
knowledge entropyknowledge acquisition and forgettingevolving behavior during LLM pretraining
ICLR2025
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Summary pending...
AttentionQuantizationquantized attention
ICLR2025
FreDF: Learning to Forecast in the Frequency Domain
Summary pending...
Time seriesLong-term Forecast
ICLR2025
You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning
Summary pending...
Model CompressionLarge Language ModelsStructured Pruning
ICLR2025
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Summary pending...
Multi-modal ModelVisual-Language ModelSycophancy
ICLR2025
Decentralized Optimization with Coupled Constraints
Summary pending...
decentralized optimizationconvex optimizationaffine constraints
ICLR2025
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance
Summary pending...
diffusion planningautonomous driving
ICLR2025
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
Summary pending...
language model; efficient adaptation; fine-tuning; prompting
ICLR2025
Neural Multi-Objective Combinatorial Optimization via Graph-Image Multimodal Fusion
Summary pending...
Neural Multi-Objective Combinatorial OptimizationMultimodal FusionDeep Reinforcement Learning
ICLR2025
Searching for Optimal Solutions with LLMs via Bayesian Optimization
Summary pending...
searchoptimizationLLMs
ICLR2025
PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations
Summary pending...
Generative adversarial imitation learningimperfect demonstrationsreinforcement learning
ICLR2025
Frame-Voyager: Learning to Query Frames for Video Large Language Models
Summary pending...
Video-LLMAdaptive Frame Sampling
ICLR2025
MQuAKE-Remastered: Multi-Hop Knowledge Editing Can Only Be Advanced with Reliable Evaluations
Summary pending...
knowledge editmodel editmulti-hop
ICLR2025
Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining
Summary pending...
sample reweighinglarge language modelspretraining
ICLR2025
Deep Distributed Optimization for Large-Scale Quadratic Programming
Summary pending...
Learning-to-OptimizeDistributed OptimizationLarge-Scale Quadratic Programming
ICLR2025
Hyperbolic Genome Embeddings
Summary pending...
genomicsrepresentation learninghyperbolic geometry