Papers

12094 papers

ICML2025

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

Summary pending...

Reward learningRLHFRL
ICML2025

Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization

Summary pending...

Inverse game; Quantal Response Equilibrium; Reward recovery; confidence sets
ICML2025

Efficient Multivariate Robust Mean Estimation Under Mean-Shift Contamination

Summary pending...

mean estimationhigh-dimensional inferencerobust statistics
ICML2025

Accelerating Spectral Clustering under Fairness Constraints

Summary pending...

fairnessspectral clusteringdifference of convex
ICML2025

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation

Summary pending...

Data contaminationlarge language models(LLMs)machine translation
ICML2025

PEAKS: Selecting Key Training Examples Incrementally via Prediction Error Anchored by Kernel Similarity

Summary pending...

data pruningcoresetsdata selection
ICML2025

Geometric and Physical Constraints Synergistically Enhance Neural PDE Surrogates

Summary pending...

Geometric and Physical ConstraintsNeural PDE SurrogatesSymmetry Equivariance
ICML2025

Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design

Summary pending...

Diffusion models
ICML2025

Pareto-Optimality, Smoothness, and Stochasticity in Learning-Augmented One-Max-Search

Summary pending...

Learning-augmented algorithmsalgorithms with predictionsonline algorithms
ICML2025

Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

Summary pending...

mechanistic interpretabilitytransformerlarge language model
ICML2025

AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders

Summary pending...

LLMsInterpretabilitySparse Autoencoders
ICML2025

Multiaccuracy and Multicalibration via Proxy Groups

Summary pending...

multicalibrationmultiaccuracyfairness
ICML2025

ADIOS: Antibody Development via Opponent Shaping

Summary pending...

Opponent ShapingAntibody DesignMeta Learning
ICML2025

Universal Approximation of Mean-Field Models via Transformers

Summary pending...

Mean Field EquationsTransformersUniversal Approximation
ICML2025

NeuronTune: Towards Self-Guided Spurious Bias Mitigation

Summary pending...

spurious correlationrobustnessbias mitigation
ICML2025

Improving Rationality in the Reasoning Process of Language Models through Self-playing Game

Summary pending...

Self-playLarge Language ModelsLLM Reasoning
ICML2025

Model-Based Exploration in Monitored Markov Decision Processes

Summary pending...

Exploration-ExploitationModel-Based Interval EstimationMonitored Markov Decision Processes
ICML2025

LASER: Attention with Exponential Transformation

Summary pending...

Large language modelingdeep learningtransformer
ICML2025

Scaling Test-Time Compute Without Verification or RL is Suboptimal

Summary pending...

LLMstest-time computeverification
ICML2025

Censor Dependent Variational Inference

Summary pending...

survival analysisvariational inferencevariational autoencoders