Papers

12094 papers

ICML2025

Monte-Carlo Tree Search with Uncertainty Propagation via Optimal Transport

Summary pending...

Monte-Carlo Tree SearchPlanning under Uncertainty
ICML2025

Contextual Bandits for Unbounded Context Distributions

Summary pending...

Contextual banditsnonparametric statistics
ICML2025

PiD: Generalized AI-Generated Images Detection with Pixelwise Decomposition Residuals

Summary pending...

AIGC detectionGenerative modelsDeepfakes
ICML2025

CAT: Contrastive Adversarial Training for Evaluating the Robustness of Protective Perturbations in Latent Diffusion Models

Summary pending...

Latent Diffusion ModelsProtective PerturbationsAdversarial Training
ICML2025

Online Robust Reinforcement Learning Through Monte-Carlo Planning

Summary pending...

Monte-carlo tree searchdistributionally robust reinforcement learningonline reinforcement learning
ICML2025

Learning Condensed Graph via Differentiable Atom Mapping for Reaction Yield Prediction

Summary pending...

Yield PredictionGNNAtom Mapping
ICML2025

What Limits Bidirectional Model's Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning

Summary pending...

Large Language ModelBidirectional ModelingCausal Attention
ICML2025

The Noisy Laplacian: a Threshold Phenomenon for Non-Linear Dimension Reduction

Summary pending...

Dimension ReductionDenoisingDiffusion Maps
ICML2025

GTR: A General, Multi-View, and Dynamic Framework for Trajectory Representation Learning

Summary pending...

Trajectory representation learningmobility learningspatio-temporal learning
ICML2025

BounDr.E: Predicting Drug-likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization

Summary pending...

Drug-likenessExpectation-MaximizationMulti-modal learning
ICML2025

Private Federated Learning using Preference-Optimized Synthetic Data

Summary pending...

Differential privacylarge language modelssynthetic data
ICML2025

Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries

Summary pending...

Reinforcement Learning from Human FeedbackPreference-based Reinforcement LearningHuman-in-the-loop Machine Learning
ICML2025

OR-Bench: An Over-Refusal Benchmark for Large Language Models

Summary pending...

safetyllmover-refusal
ICML2025

Provable Benefit of Random Permutations over Uniform Sampling in Stochastic Coordinate Descent

Summary pending...

OptimizationStochastic OptimizationCoordinate Descent
ICML2025

Generalization Analysis for Supervised Contrastive Representation Learning under Non-IID Settings

Summary pending...

Contrastive LearningGeneralization Analysis
ICML2025

A Theoretical Justification for Asymmetric Actor-Critic Algorithms

Summary pending...

Partially Observable EnvironmentAsymmetric LearningPrivileged Information
ICML2025

Optimal and Practical Batched Linear Bandit Algorithm

Summary pending...

linear banditbatched banditexploration-exploitation
ICML2025

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

Summary pending...

interpretabilityalignmentmultimodal large language model
ICML2025

AtlasD: Automatic Local Symmetry Discovery

Summary pending...

Local symmetry discoverysymmetry discoveryequivariance
ICML2025

Reducing Confounding Bias without Data Splitting for Causal Inference via Optimal Transport

Summary pending...

Causal inferencecontinuous treatmentoptimal transport