Papers
12094 papers
ICML2025
Revisiting Cooperative Off-Policy Multi-Agent Reinforcement Learning
Summary pending...
cooperative multi-agent reinforcement learningoff-policy multi-agent reinforcement learningvalue factorization
ICML2025
Semantics-aware Test-time Adaptation for 3D Human Pose Estimation
Summary pending...
Test-time Adaptation3D Human Pose Estimation
ICML2025
Maximal Update Parametrization and Zero-Shot Hyperparameter Transfer for Fourier Neural Operators
Summary pending...
Fourier Neural OperatorsHyperparameter transfer
ICML2025
Robot-Gated Interactive Imitation Learning with Adaptive Intervention Mechanism
Summary pending...
Imitation LearningHuman-in-the-loop Reinforcement LearningShared Autonomy
ICML2025
COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning
Summary pending...
machine learning for systemstransfer learninghardware accelerators
ICML2025
Reflection-Bench: Evaluating Epistemic Agency in Large Language Models
Summary pending...
large language modelsautonomous agentcognitive psychology
ICML2025
Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies
Summary pending...
Speculative DecodingLarge Language ModelsVocabulary Alignment
ICML2025
Aligning Protein Conformation Ensemble Generation with Physical Feedback
Summary pending...
Proteingenerative modelsmolecular dynamics
ICML2025
RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models
Summary pending...
model quantizationquantization-aware trainingfine-tuning
ICML2025
TIMING: Temporality-Aware Integrated Gradients for Time Series Explanation
Summary pending...
Time SeriesXAIExplainability
ICML2025
O-MAPL: Offline Multi-agent Preference Learning
Summary pending...
Multi-agent Reinforcement LearningPreference Learning
ICML2025
NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits
Summary pending...
Code-LMscode-editingcode-generation
ICML2025
Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting
Summary pending...
Linear AttentionTransformerTime Series Forecasting
ICML2025
Contextual Optimization Under Model Misspecification: A Tractable and Generalizable Approach
Summary pending...
Contextual OptimizationMachine LearningConstrained Optimization
ICML2025
DyCodeEval: Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination
Summary pending...
benchmarkingcode generationlarge language model
ICML2025
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection
Summary pending...
AI-Generated Image DetectionFace Forgery DetectionDeepfake Detection
ICML2025
Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models
Summary pending...
Machine LearningRoboticsLanguage
ICML2025
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
Summary pending...
Dueling BanditsAdversarial feedbackoptimal
ICML2025
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Summary pending...
evaluationsbenchmarksscaling laws