Papers
12094 papers
ICML2025
Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
inference time alignment; best of n; pessimism
ICML2025
Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration
Cooperative Multi-Agent Reinforcement Learning; State Modelling; Sparse Reward
ICML2025
TANGO: Clustering with Typicality-Aware Nonlocal Mode-Seeking and Graph-Cut Optimization
clustering; density-based clustering; mode-seeking
ICML2025
CALM: Consensus-Aware Localized Merging for Multi-Task Learning
model merging; multi-task learning; global task consensus; localized merging
ICML2025
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
Language Models; Alignment; Preference Optimization
ICML2025
DRAG: Data Reconstruction Attack using Guided Diffusion
Data Reconstruction Attack; Privacy; Diffusion Model
ICML2025
VCT: Training Consistency Models with Variational Noise Coupling
Consistency Models; Generative Models
ICML2025
The Missing Alignment Link of In-context Learning on Sequences
Model adaptation; few-shot learning; sequence to sequence learning
ICML2025
Sparse Autoencoders, Again?
autoencoders; sparse representations; variational autoencoders
ICML2025
Unveiling Markov heads in Pretrained Language Models for Offline Reinforcement Learning
decision transformer; cross-domain
ICML2025
Integrating Intermediate Layer Optimization and Projected Gradient Descent for Solving Inverse Problems with Diffusion Models
diffusion model; inverse problems
ICML2025
Latent Variable Causal Discovery under Selection Bias
causal discovery; latent variables; selection bias
ICML2025
Learning Efficient Robotic Garment Manipulation with Standardization
Deformable Object Manipulation; Bimanual Manipulation; Self-supervised learning
ICML2025
The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models
Function Calling Evaluation; Tool use; Agentic Evaluation
ICML2025
Deep Fuzzy Multi-view Learning for Reliable Classification
Multi-view Learning; Trustworthy Machine Learning; Fuzzy Deep Learning
ICML2025
LipsNet++: Unifying Filter and Controller into a Policy Network
Deep Reinforcement Learning; Policy Network Design; Action Fluctuation
ICML2025
BOPO: Neural Combinatorial Optimization via Best-anchored and Objective-guided Preference Optimization
Neural Combinatorial Optimization; Preference Optimization; Machine Learning
ICML2025
A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models
randomized subspace optimization; large language model training; stochastic optimization
ICML2025
Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization
Deep Reinforcement Learning; Multi-objective Combinatorial Optimization; Dynamic Algorithm Configuration
ICML2025
Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search
Text-to-SQL; Databases; LLMs