Papers

12094 papers

ICML2025

Towards Cost-Effective Reward Guided Text Generation

LLM, RLHF, Alignment

ICML2025

Efficient Optimization with Orthogonality Constraint: a Randomized Riemannian Submanifold Method

Riemannian manifold, Stiefel manifold, Efficient optimization

ICML2025

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

language models, mathematical reasoning, arithmetics

ICML2025

Can We Predict Performance of Large Models across Vision-Language Tasks?

Large Vision-Language Models (LVLMs), Benchmarking, Probabilistic Matrix Factorization (PMF)

ICML2025

Towards a Mechanistic Explanation of Diffusion Model Generalization

Diffusion Models, Generalization, Diffusion

ICML2025

Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond

Large Language Model, Machine Unlearning, Robustness

ICML2025

Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning

Multimodal Large Language Models, Fine-Tuning, Catastrophic Forgetting

ICML2025

Improving the Variance of Differentially Private Randomized Experiments through Clustering

causal inference, differential privacy, clustering

ICML2025

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

Indirect prompt injection, Agent system for tool use, Large language models

ICML2025

Automated Hypothesis Validation with Agentic Sequential Falsifications

LLM agent, hypothesis testing, sequential decision making

ICML2025

Leveraging Offline Data in Linear Latent Contextual Bandits

bandits, latent bandits, hybrid RL

ICML2025

Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach

Mixture of Experts, Adversarial Defense, Adversarial Robustness

ICML2025

Explainable Concept Generation through Vision-Language Preference Learning for Understanding Neural Networks' Internal Representations

Concept based Explainable AI, TCAV, Vision-Language Models

ICML2025

An Asymptotically Optimal Approximation Algorithm for Multiobjective Submodular Maximization at Scale

submodular maximization, multiobjective, max-min fairness

ICML2025

Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

evolutionary algorithm, model adaptation, swarm intelligence

ICML2025

OrcaLoca: An LLM Agent Framework for Software Issue Localization

LLM, LLM Agent, Software Engineering

ICML2025

WOMD-Reasoning: A Large-Scale Dataset for Interaction Reasoning in Driving

Language Q&A Dataset, Autonomous Driving, Interaction Reasoning

ICML2025

SEAD: Unsupervised Ensemble of Streaming Anomaly Detectors

anomaly detection, online learning, streaming

ICML2025

Concurrent Reinforcement Learning with Aggregated States via Randomized Least Squares Value Iteration

worst-case regret bound, RL theory, randomized least squares value iteration

ICML2025

Contrastive Localized Language-Image Pre-Training

CLIP, MLLM, Foundation Models
