Papers

12094 papers

ICML2025

The Lock-in Hypothesis: Stagnation by Algorithm

Summary pending...

Value lock-inHuman-AI interactionAI Alignment
ICML2025

Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs

Summary pending...

Large Language ModelKey-Value Cache Compression
ICML2025

A New Concentration Inequality for Sampling Without Replacement and Its Application for Transductive Learning

Summary pending...

Concentration InequalitySampling Without ReplacementTransductive Local Rademacher Complexity
ICML2025

LensLLM: Unveiling Fine-Tuning Dynamics for LLM Selection

Summary pending...

LLM SelectionPAC-Bayesian TheoryGeneralization Bound
ICML2025

Return Capping: Sample Efficient CVaR Policy Gradient Optimisation

Summary pending...

Reinforcement LearningMachine LearningCVaR
ICML2025

Skip the Equations: Learning Behavior of Personalized Dynamical Systems Directly From Data

Summary pending...

dynamical systemsdifferential equationsODE discovery
ICML2025

Memorization Sinks: Isolating Memorization during LLM Training

Summary pending...

MemorizationLocalizationUnlearning
ICML2025

Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization

Summary pending...

Diffusion Models; Efficiency; Quantization; Caching Method
ICML2025

Online Laplacian-Based Representation Learning in Reinforcement Learning

Summary pending...

Reinforcement LearningRepresentation learningOnline Learning
ICML2025

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Summary pending...

Large language modelsMultimodal modelsLLMs
ICML2025

KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors

Summary pending...

DELsmall moleculedataset
ICML2025

Near-optimal Sketchy Natural Gradients for Physics-Informed Neural Networks

Summary pending...

Sci-MLPhysics Informed Neural NetworksNatural Gradients
ICML2025

Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization

Summary pending...

Concept Bottleneck ModelsInterpretable AIXAI
ICML2025

AlphaPO: Reward Shape Matters for LLM Alignment

Summary pending...

llmlarge language modelsdeep learning
ICML2025

What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models

Summary pending...

world modelsfoundation modelsinductive bias
ICML2025

Test-Time Graph Neural Dataset Search With Generative Projection

Summary pending...

Test-Time Adaption (TTA); Graph Neural Networks (GNNs); Distribution Shift; Generative Models
ICML2025

Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion

Summary pending...

diffusionmemorizationcorrupted data
ICML2025

Hierarchical Equivariant Policy via Frame Transfer

Summary pending...

robot learningimitation learningrobotic manipulation
ICML2025

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling

Summary pending...

jailbreakinglong-context LLMsin-context learning
ICML2025

Adversarial Reasoning at Jailbreaking Time

Summary pending...

LLMsJailbreakingReasoning