Papers
12094 papers
ICML2025
Simple Policy Optimization
Summary pending...
Model-Free Reinforcement LearningPolicy Optimization
ICML2025
Prices, Bids, Values: One ML-Powered Combinatorial Auction to Rule Them All
Summary pending...
Combinatorial AuctionsAuction DesignAuctions
ICML2025
TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer
Summary pending...
spiking neural networksANN-SNN conversiontime-to-first spike
ICML2025
On the Training Convergence of Transformers for In-Context Classification of Gaussian Mixtures
Summary pending...
In-context learningTransformer
ICML2025
Graph4MM: Weaving Multimodal Learning with Structural Information
Summary pending...
Multi-modal LearningLarge Language ModelsGraph Neural Networks
ICML2025
FlatQuant: Flatness Matters for LLM Quantization
Summary pending...
flatnesspost-training-quantiationaffine transformation
ICML2025
Learnable Spatial-Temporal Positional Encoding for Link Prediction
Summary pending...
Positional EncodingLink PredictionTransformer
ICML2025
Revisiting the Predictability of Performative, Social Events
Summary pending...
performative predictiononline learningmulticalibration
ICML2025
SPRI: Aligning Large Language Models with Context-Situated Principles
Summary pending...
Large Language ModelsAlignmentScalable Context-Situated Oversight
ICML2025
Design Considerations in Offline Preference-based RL
Summary pending...
Reinforcement Learning from Human FeedbackRLHFDPO
ICML2025
EVOLvE: Evaluating and Optimizing LLMs For In-Context Exploration
Summary pending...
ExplorationIn-Context Reinforcement LearningBandit
ICML2025
Haste Makes Waste: A Simple Approach for Scaling Graph Neural Networks
Summary pending...
Graph Neural NetworksLarge Scale Machine LearningHistorical Embeddings
ICML2025
Federated Disentangled Tuning with Textual Prior Decoupling and Visual Dynamic Adaptation
Summary pending...
Federated LearningParameter-Efficient Fine-TuningVision-Language Model
ICML2025
Learngene Tells You How to Customize: Task-Aware Parameter Initialization at Flexible Scales
Summary pending...
model initializationLearngenehypernetwork
ICML2025
Linear Bandits with Partially Observable Features
Summary pending...
Linear BanditsPartially Observable FeaturesDoubly Robust
ICML2025
CAN: Leveraging Clients As Navigators for Generative Replay in Federated Continual Learning
Summary pending...
Federated LearningContinual Learning
ICML2025
Contextual Online Decision Making with Infinite-Dimensional Functional Regression
Summary pending...
Infinite-Dimensional Functional RegressionContextual Decision-MakingOnline learning
ICML2025
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
Summary pending...
Multimodal Large Language Models (MLLMs)Theory of Mind (ToM)Interpretability
ICML2025
A Model of Place Field Reorganization During Reward Maximization
Summary pending...
Reinforcement learningTemporal Difference errorHippocampus