Papers
12094 papers
ICML2025
Monte-Carlo Tree Search with Uncertainty Propagation via Optimal Transport
Summary pending...
Monte-Carlo Tree SearchPlanning under Uncertainty
ICML2025
Contextual Bandits for Unbounded Context Distributions
Summary pending...
Contextual banditsnonparametric statistics
ICML2025
PiD: Generalized AI-Generated Images Detection with Pixelwise Decomposition Residuals
Summary pending...
AIGC detectionGenerative modelsDeepfakes
ICML2025
CAT: Contrastive Adversarial Training for Evaluating the Robustness of Protective Perturbations in Latent Diffusion Models
Summary pending...
Latent Diffusion ModelsProtective PerturbationsAdversarial Training
ICML2025
Online Robust Reinforcement Learning Through Monte-Carlo Planning
Summary pending...
Monte-carlo tree searchdistributionally robust reinforcement learningonline reinforcement learning
ICML2025
Learning Condensed Graph via Differentiable Atom Mapping for Reaction Yield Prediction
Summary pending...
Yield PredictionGNNAtom Mapping
ICML2025
What Limits Bidirectional Model's Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning
Summary pending...
Large Language ModelBidirectional ModelingCausal Attention
ICML2025
The Noisy Laplacian: a Threshold Phenomenon for Non-Linear Dimension Reduction
Summary pending...
Dimension ReductionDenoisingDiffusion Maps
ICML2025
GTR: A General, Multi-View, and Dynamic Framework for Trajectory Representation Learning
Summary pending...
Trajectory representation learningmobility learningspatio-temporal learning
ICML2025
BounDr.E: Predicting Drug-likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization
Summary pending...
Drug-likenessExpectation-MaximizationMulti-modal learning
ICML2025
Private Federated Learning using Preference-Optimized Synthetic Data
Summary pending...
Differential privacylarge language modelssynthetic data
ICML2025
Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries
Summary pending...
Reinforcement Learning from Human FeedbackPreference-based Reinforcement LearningHuman-in-the-loop Machine Learning
ICML2025
OR-Bench: An Over-Refusal Benchmark for Large Language Models
Summary pending...
safetyllmover-refusal
ICML2025
Provable Benefit of Random Permutations over Uniform Sampling in Stochastic Coordinate Descent
Summary pending...
OptimizationStochastic OptimizationCoordinate Descent
ICML2025
Generalization Analysis for Supervised Contrastive Representation Learning under Non-IID Settings
Summary pending...
Contrastive LearningGeneralization Analysis
ICML2025
A Theoretical Justification for Asymmetric Actor-Critic Algorithms
Summary pending...
Partially Observable EnvironmentAsymmetric LearningPrivileged Information
ICML2025
Optimal and Practical Batched Linear Bandit Algorithm
Summary pending...
linear banditbatched banditexploration-exploitation
ICML2025
SAE-V: Interpreting Multimodal Models for Enhanced Alignment
Summary pending...
interpretabilityalignmentmultimodal large language model
ICML2025
AtlasD: Automatic Local Symmetry Discovery
Summary pending...
Local symmetry discoverysymmetry discoveryequivariance
ICML2025
Reducing Confounding Bias without Data Splitting for Causal Inference via Optimal Transport
Summary pending...
Causal inferencecontinuous treatmentoptimal transport