Papers

12094 papers

ICML2025

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Summary pending...

LLM reasoningLLM planningLatent reasoning
ICML2025

Design Considerations in Offline Preference-based RL

Summary pending...

Reinforcement Learning from Human FeedbackRLHFDPO
ICML2025

Graph4MM: Weaving Multimodal Learning with Structural Information

Summary pending...

Multi-modal LearningLarge Language ModelsGraph Neural Networks
ICML2025

FlatQuant: Flatness Matters for LLM Quantization

Summary pending...

flatnesspost-training-quantiationaffine transformation
ICML2025

EVOLvE: Evaluating and Optimizing LLMs For In-Context Exploration

Summary pending...

ExplorationIn-Context Reinforcement LearningBandit
ICML2025

TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer

Summary pending...

spiking neural networksANN-SNN conversiontime-to-first spike
ICML2025

Prices, Bids, Values: One ML-Powered Combinatorial Auction to Rule Them All

Summary pending...

Combinatorial AuctionsAuction DesignAuctions
ICML2025

Learnable Spatial-Temporal Positional Encoding for Link Prediction

Summary pending...

Positional EncodingLink PredictionTransformer
ICML2025

On the Training Convergence of Transformers for In-Context Classification of Gaussian Mixtures

Summary pending...

In-context learningTransformer
ICML2025

SPRI: Aligning Large Language Models with Context-Situated Principles

Summary pending...

Large Language ModelsAlignmentScalable Context-Situated Oversight
ICML2025

Revisiting the Predictability of Performative, Social Events

Summary pending...

performative predictiononline learningmulticalibration
ICML2025

Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle

Summary pending...

LLMForecastingContinuous Evaluation
ICML2025

Scaling Trends in Language Model Robustness

Summary pending...

ai safetylanguage modelsscaling laws
ICML2025

Understanding Nonlinear Implicit Bias via Region Counts in Input Space

Summary pending...

implicit biasregion countsnon-linear neural network
ICML2025

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Summary pending...

Visual Chain of ThoughtVisual PromptingTable
ICML2025

Agent-Centric Actor-Critic for Asynchronous Multi-Agent Reinforcement Learning

Summary pending...

Multi-Agent Reinforcement LearningAsynchronous Multi-Agent Reinforcement LearningMacDec-POMDP
ICML2025

Efficient Noise Calculation in Deep Learning-based MRI Reconstructions

Summary pending...

MRI reconstructiondeep learningnoise
ICML2025

Nesterov Method for Asynchronous Pipeline Parallel Optimization

Summary pending...

Asynchronous OptimizationPipeline ParallelismNesterov Method
ICML2025

Flow Matching for Few-Trial Neural Adaptation with Stable Latent Dynamics

Summary pending...

Brain-Computer InterfaceNeural DecodingFlow Matching
ICML2025

OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance

Summary pending...

VLMBalance Training