Papers

12094 papers

ICML2025

Convergence of Consistency Model with Multistep Sampling under General Data Assumptions

Summary pending...

Consistency models, diffusion models, learning theory

Paper

ICML2025

Robust Reward Alignment via Hypothesis Space Batch Cutting

Summary pending...

Learning from Human Feedback, Inverse Reinforcement Learning, Preference Based Reinforcement Learning

Paper

ICML2025

High-Dimensional Prediction for Sequential Decision Making

Summary pending...

online decision making, combinatorial optimization, multicalibration

Paper

ICML2025

Improving Model Alignment Through Collective Intelligence of Open-Source Models

Summary pending...

Alignment, Open-Source Model, Mixture of Agents

Paper

ICML2025

GSM-$\infty$: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?

Summary pending...

Long Context, Reasoning, Understanding

Paper

ICML2025

No Free Lunch from Random Feature Ensembles: Scaling Laws and Near-Optimality Conditions

Summary pending...

Ensemble Learning, Regression, Scaling Laws

Paper

ICML2025

What can large language models do for sustainable food?

Summary pending...

large language models, sustainability, climate

Paper

ICML2025

Do Vision-Language Models Really Understand Visual Language?

Summary pending...

vision-language model, visual language, evaluation

Paper

ICML2025

Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow

Summary pending...

Flow-matching, few-shot generation, equivariance

Paper

ICML2025

Training a Generally Curious Agent

Summary pending...

LLM Agent, Synthetic Data, Multiturn finetuning

Paper

ICML2025

Putnam-AXIOM: A Functional & Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs

Summary pending...

Benchmarks, Large Language Models, Mathematical Reasoning

Paper

ICML2025

Hardware and Software Platform Inference

Summary pending...

ML security, ML governance

Paper

ICML2025

Value-Based Deep RL Scales Predictably

Summary pending...

scaling laws, online reinforcement learning, q-learning

Paper

ICML2025

DeepCrossAttention: Supercharging Transformer Residual Connections

Summary pending...

residual network, cross attention, resnet

Paper

ICML2025

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

Summary pending...

mathematical reasoning, benchmark, robustness

Paper

ICML2025

Policy-Regret Minimization in Markov Games with Function Approximation

Summary pending...

policy regret, Markov games, strategic opponents

Paper

ICML2025

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Summary pending...

software engineering, benchmark, evals

Paper

ICML2025

Universal Neural Optimal Transport

Summary pending...

Optimal Transport, Neural Operators, Meta Learning

Paper

ICML2025

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Summary pending...

large language models, multi-turn code generation, reinforcement learning

Paper

ICML2025

PENCIL: Long Thoughts with Short Memory

Summary pending...

Large language models, chain-of-thought

Paper