Papers
12094 papers
ICLR2025
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
Summary pending...
AI Software EngineeringLarge Language ModelsCode Intelligence
ICLR2025
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Summary pending...
reinforcement learning from human feedbackefficient llm finetuningoff-policy RL
ICLR2025
MGDA Converges under Generalized Smoothness, Provably
Summary pending...
Multi-Objective OptimizationGeneralized SmoothnessConvergence Analysis
ICLR2025
PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization
Summary pending...
neural combinatorial optimizationlearning to optimizereinforcement learning
ICLR2025
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Summary pending...
Video UnderstandingBenchmark
ICLR2025
On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions
Summary pending...
Game theoryno-regret dynamicsrecommendation systems
ICLR2025
Scalable Bayesian Learning with posteriors
Summary pending...
Bayesian deep learningPyTorchVariational Inference
ICLR2025
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Summary pending...
GUI AgentsVisual GroundingMultimodal Large Language Models
ICLR2025
MetaOOD: Automatic Selection of OOD Detection Models
Summary pending...
Out-of-distribution DetectionMeta-learningLanguage Modeling
ICLR2025
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
Summary pending...
Model mergingconsistency modeldiffusion model
ICLR2025
TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes
Summary pending...
geometry representation3D modeling
ICLR2025
Multi-Robot Motion Planning with Diffusion Models
Summary pending...
Multi-Agent PlanningRoboticsGenerative Models
ICLR2025
Learning from Imperfect Human Feedback: A Tale from Corruption-Robust Dueling
Summary pending...
online learningdueling bandit
ICLR2025
Grokking at the Edge of Numerical Stability
Summary pending...
grokkingdeep learninglearning theory
ICLR2025
Is Large-scale Pretraining the Secret to Good Domain Generalization?
Summary pending...
Domain GeneralizationRobustnessCLIP
ICLR2025
NextBestPath: Efficient 3D Mapping of Unseen Environments
Summary pending...
3D reconstructionactive mapping
ICLR2025
Improved Algorithms for Kernel Matrix-Vector Multiplication Under Sparsity Assumptions
Summary pending...
AlgorithmsKernel MatrixKernel Density Estimation
ICLR2025
Balancing Act: Diversity and Consistency in Large Language Model Ensembles
Summary pending...
LLMensemblingdiversity
ICLR2025
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
Summary pending...
in-context reinforcement learningpolicy evaluationtemporal difference learning
ICLR2025
SBSC: Step-by-Step Coding for Improving Mathematical Olympiad Performance
Summary pending...
math AILLM math reasoning