Papers
12094 papers
ICML2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Summary pending...
LLM reasoningLLM planningLatent reasoning
ICML2025
Design Considerations in Offline Preference-based RL
Summary pending...
Reinforcement Learning from Human FeedbackRLHFDPO
ICML2025
Graph4MM: Weaving Multimodal Learning with Structural Information
Summary pending...
Multi-modal LearningLarge Language ModelsGraph Neural Networks
ICML2025
FlatQuant: Flatness Matters for LLM Quantization
Summary pending...
flatnesspost-training-quantiationaffine transformation
ICML2025
EVOLvE: Evaluating and Optimizing LLMs For In-Context Exploration
Summary pending...
ExplorationIn-Context Reinforcement LearningBandit
ICML2025
TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer
Summary pending...
spiking neural networksANN-SNN conversiontime-to-first spike
ICML2025
Prices, Bids, Values: One ML-Powered Combinatorial Auction to Rule Them All
Summary pending...
Combinatorial AuctionsAuction DesignAuctions
ICML2025
Learnable Spatial-Temporal Positional Encoding for Link Prediction
Summary pending...
Positional EncodingLink PredictionTransformer
ICML2025
On the Training Convergence of Transformers for In-Context Classification of Gaussian Mixtures
Summary pending...
In-context learningTransformer
ICML2025
SPRI: Aligning Large Language Models with Context-Situated Principles
Summary pending...
Large Language ModelsAlignmentScalable Context-Situated Oversight
ICML2025
Revisiting the Predictability of Performative, Social Events
Summary pending...
performative predictiononline learningmulticalibration
ICML2025
Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle
Summary pending...
LLMForecastingContinuous Evaluation
ICML2025
Scaling Trends in Language Model Robustness
Summary pending...
ai safetylanguage modelsscaling laws
ICML2025
Understanding Nonlinear Implicit Bias via Region Counts in Input Space
Summary pending...
implicit biasregion countsnon-linear neural network
ICML2025
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding
Summary pending...
Visual Chain of ThoughtVisual PromptingTable
ICML2025
Agent-Centric Actor-Critic for Asynchronous Multi-Agent Reinforcement Learning
Summary pending...
Multi-Agent Reinforcement LearningAsynchronous Multi-Agent Reinforcement LearningMacDec-POMDP
ICML2025
Efficient Noise Calculation in Deep Learning-based MRI Reconstructions
Summary pending...
MRI reconstructiondeep learningnoise
ICML2025
Nesterov Method for Asynchronous Pipeline Parallel Optimization
Summary pending...
Asynchronous OptimizationPipeline ParallelismNesterov Method
ICML2025
Flow Matching for Few-Trial Neural Adaptation with Stable Latent Dynamics
Summary pending...
Brain-Computer InterfaceNeural DecodingFlow Matching
ICML2025
OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance
Summary pending...
VLMBalance Training