Papers

12094 papers

ICLR2025

RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph

AI Software Engineering, Large Language Models, Code Intelligence

ICLR2025

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

reinforcement learning from human feedback, efficient llm finetuning, off-policy RL

ICLR2025

MGDA Converges under Generalized Smoothness, Provably

Multi-Objective Optimization, Generalized Smoothness, Convergence Analysis

ICLR2025

PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization

neural combinatorial optimization, learning to optimize, reinforcement learning

ICLR2025

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Video Understanding, Benchmark

ICLR2025

On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions

Game theory, no-regret dynamics, recommendation systems

ICLR2025

Scalable Bayesian Learning with posteriors

Bayesian deep learning, PyTorch, Variational Inference

ICLR2025

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

GUI Agents, Visual Grounding, Multimodal Large Language Models

ICLR2025

MetaOOD: Automatic Selection of OOD Detection Models

Out-of-distribution Detection, Meta-learning, Language Modeling

ICLR2025

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

Model merging, consistency model, diffusion model

ICLR2025

TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes

geometry representation, 3D modeling

ICLR2025

Multi-Robot Motion Planning with Diffusion Models

Multi-Agent Planning, Robotics, Generative Models

ICLR2025

Learning from Imperfect Human Feedback: A Tale from Corruption-Robust Dueling

online learning, dueling bandit

ICLR2025

Grokking at the Edge of Numerical Stability

grokking, deep learning, learning theory

ICLR2025

Is Large-scale Pretraining the Secret to Good Domain Generalization?

Domain Generalization, Robustness, CLIP

ICLR2025

NextBestPath: Efficient 3D Mapping of Unseen Environments

3D reconstruction, active mapping

ICLR2025

Improved Algorithms for Kernel Matrix-Vector Multiplication Under Sparsity Assumptions

Algorithms, Kernel Matrix, Kernel Density Estimation

ICLR2025

Balancing Act: Diversity and Consistency in Large Language Model Ensembles

LLM, ensembling, diversity

ICLR2025

Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning

in-context reinforcement learning, policy evaluation, temporal difference learning

ICLR2025

SBSC: Step-by-Step Coding for Improving Mathematical Olympiad Performance

math AI, LLM math reasoning
