Papers

12094 papers

ICLR2025

RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph

AI Software Engineering, Large Language Models, Code Intelligence
ICLR2025

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

reinforcement learning from human feedback, efficient llm finetuning, off-policy RL
ICLR2025

MGDA Converges under Generalized Smoothness, Provably

Multi-Objective Optimization, Generalized Smoothness, Convergence Analysis
ICLR2025

PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization

neural combinatorial optimization, learning to optimize, reinforcement learning
ICLR2025

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Video Understanding, Benchmark
ICLR2025

On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions

Game theory, no-regret dynamics, recommendation systems
ICLR2025

Scalable Bayesian Learning with posteriors

Bayesian deep learning, PyTorch, Variational Inference
ICLR2025

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

GUI Agents, Visual Grounding, Multimodal Large Language Models
ICLR2025

MetaOOD: Automatic Selection of OOD Detection Models

Out-of-distribution Detection, Meta-learning, Language Modeling
ICLR2025

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

Model merging, consistency model, diffusion model
ICLR2025

TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes

geometry representation, 3D modeling
ICLR2025

Multi-Robot Motion Planning with Diffusion Models

Multi-Agent Planning, Robotics, Generative Models
ICLR2025

Learning from Imperfect Human Feedback: A Tale from Corruption-Robust Dueling

online learning, dueling bandit
ICLR2025

Grokking at the Edge of Numerical Stability

grokking, deep learning, learning theory
ICLR2025

Is Large-scale Pretraining the Secret to Good Domain Generalization?

Domain Generalization, Robustness, CLIP
ICLR2025

NextBestPath: Efficient 3D Mapping of Unseen Environments

3D reconstruction, active mapping
ICLR2025

Improved Algorithms for Kernel Matrix-Vector Multiplication Under Sparsity Assumptions

Algorithms, Kernel Matrix, Kernel Density Estimation
ICLR2025

Balancing Act: Diversity and Consistency in Large Language Model Ensembles

LLM, ensembling, diversity
ICLR2025

Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning

in-context reinforcement learning, policy evaluation, temporal difference learning
ICLR2025

SBSC: Step-by-Step Coding for Improving Mathematical Olympiad Performance

math AI, LLM math reasoning