Papers

12094 papers

ICML2025

Revisiting Cooperative Off-Policy Multi-Agent Reinforcement Learning

Summary pending...

cooperative multi-agent reinforcement learningoff-policy multi-agent reinforcement learningvalue factorization
ICML2025

Semantics-aware Test-time Adaptation for 3D Human Pose Estimation

Summary pending...

Test-time Adaptation3D Human Pose Estimation
ICML2025

Maximal Update Parametrization and Zero-Shot Hyperparameter Transfer for Fourier Neural Operators

Summary pending...

Fourier Neural OperatorsHyperparameter transfer
ICML2025

Robot-Gated Interactive Imitation Learning with Adaptive Intervention Mechanism

Summary pending...

Imitation LearningHuman-in-the-loop Reinforcement LearningShared Autonomy
ICML2025

COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning

Summary pending...

machine learning for systemstransfer learninghardware accelerators
ICML2025

Reflection-Bench: Evaluating Epistemic Agency in Large Language Models

Summary pending...

large language modelsautonomous agentcognitive psychology
ICML2025

Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies

Summary pending...

Speculative DecodingLarge Language ModelsVocabulary Alignment
ICML2025

Aligning Protein Conformation Ensemble Generation with Physical Feedback

Summary pending...

Proteingenerative modelsmolecular dynamics
ICML2025

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models

Summary pending...

model quantizationquantization-aware trainingfine-tuning
ICML2025

TIMING: Temporality-Aware Integrated Gradients for Time Series Explanation

Summary pending...

Time SeriesXAIExplainability
ICML2025

O-MAPL: Offline Multi-agent Preference Learning

Summary pending...

Multi-agent Reinforcement LearningPreference Learning
ICML2025

NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits

Summary pending...

Code-LMscode-editingcode-generation
ICML2025

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Summary pending...

Linear AttentionTransformerTime Series Forecasting
ICML2025

Contextual Optimization Under Model Misspecification: A Tractable and Generalizable Approach

Summary pending...

Contextual OptimizationMachine LearningConstrained Optimization
ICML2025

DyCodeEval: Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

Summary pending...

benchmarkingcode generationlarge language model
ICML2025

µnit Scaling: Simple and Scalable FP8 LLM Training

Summary pending...

LLMFP8Transformer
ICML2025

Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection

Summary pending...

AI-Generated Image DetectionFace Forgery DetectionDeepfake Detection
ICML2025

Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models

Summary pending...

Machine LearningRoboticsLanguage
ICML2025

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

Summary pending...

Dueling BanditsAdversarial feedbackoptimal
ICML2025

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Summary pending...

evaluationsbenchmarksscaling laws