Papers
12094 papers
ICML2025
The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret
Reward learning; RLHF; RL
ICML2025
Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization
Inverse game; Quantal Response Equilibrium; Reward recovery; confidence sets
ICML2025
Efficient Multivariate Robust Mean Estimation Under Mean-Shift Contamination
mean estimation; high-dimensional inference; robust statistics
ICML2025
Accelerating Spectral Clustering under Fairness Constraints
fairness; spectral clustering; difference of convex
ICML2025
Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation
Data contamination; large language models (LLMs); machine translation
ICML2025
PEAKS: Selecting Key Training Examples Incrementally via Prediction Error Anchored by Kernel Similarity
data pruning; coresets; data selection
ICML2025
Geometric and Physical Constraints Synergistically Enhance Neural PDE Surrogates
Geometric and Physical Constraints; Neural PDE Surrogates; Symmetry Equivariance
ICML2025
Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design
Diffusion models
ICML2025
Pareto-Optimality, Smoothness, and Stochasticity in Learning-Augmented One-Max-Search
Learning-augmented algorithms; algorithms with predictions; online algorithms
ICML2025
Looking Beyond the Top-1: Transformers Determine Top Tokens in Order
mechanistic interpretability; transformer; large language model
ICML2025
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
LLMs; Interpretability; Sparse Autoencoders
ICML2025
Multiaccuracy and Multicalibration via Proxy Groups
multicalibration; multiaccuracy; fairness
ICML2025
ADIOS: Antibody Development via Opponent Shaping
Opponent Shaping; Antibody Design; Meta Learning
ICML2025
Universal Approximation of Mean-Field Models via Transformers
Mean Field Equations; Transformers; Universal Approximation
ICML2025
NeuronTune: Towards Self-Guided Spurious Bias Mitigation
spurious correlation; robustness; bias mitigation
ICML2025
Improving Rationality in the Reasoning Process of Language Models through Self-playing Game
Self-play; Large Language Models; LLM Reasoning
ICML2025
Model-Based Exploration in Monitored Markov Decision Processes
Exploration-Exploitation; Model-Based Interval Estimation; Monitored Markov Decision Processes
ICML2025
LASER: Attention with Exponential Transformation
Large language modeling; deep learning; transformer
ICML2025
Scaling Test-Time Compute Without Verification or RL is Suboptimal
LLMs; test-time compute; verification
ICML2025
Censor Dependent Variational Inference
survival analysis; variational inference; variational autoencoders