Papers

12094 papers

ICML2025

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Summary pending...

agentsbenchmarkcomputer agents
ICML2025

Safety-Polarized and Prioritized Reinforcement Learning

Summary pending...

Safe Reinforcement Learning
ICML2025

Test-Time Training Provably Improves Transformers as In-context Learners

Summary pending...

in-context learningtransformerstest-time training
ICML2025

Dynamical Modeling of Behaviorally Relevant Spatiotemporal Patterns in Neural Imaging Data

Summary pending...

Deep LearningDynamical ModelingNeural Imaging
ICML2025

ELoRA: Low-Rank Adaptation for Equivariant GNNs

Summary pending...

graph neural networkequivarianceparameter-efficient fine-tuning
ICML2025

Tracking Most Significant Shifts in Infinite-Armed Bandits

Summary pending...

non-stationaryinfinite-armed banditsbandits
ICML2025

Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse

Summary pending...

chain of thoughtpsychologycognitive science
ICML2025

Floating-Point Neural Networks Can Represent Almost All Floating-Point Functions

Summary pending...

Universal approximationFloating-Point Arithmetic
ICML2025

Geometry Informed Tokenization of Molecules for Language Model Generation

Summary pending...

Language models
ICML2025

Uncertainty Quantification for LLM-Based Survey Simulations

Summary pending...

synthetic datalarge language modelsuncertainty quantification
ICML2025

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Summary pending...

diffusioninterpretabilitytransformers
ICML2025

(How) Can Transformers Predict Pseudo-Random Numbers?

Summary pending...

InterpretabilityIn-context learningGrokking
ICML2025

Causal Abstraction Inference under Lossy Representations

Summary pending...

causalitycausal inferencecausal abstractions
ICML2025

The Role of Randomness in Stability

Summary pending...

ReplicabilityDifferential PrivacyPAC Learning
ICML2025

Emergence in non-neural models: grokking modular arithmetic via average gradient outer product

Summary pending...

Theory of deep learninggrokkingmodular arithmetic
ICML2025

OmiAD: One-Step Adaptive Masked Diffusion Model for Multi-class Anomaly Detection via Adversarial Distillation

Summary pending...

Anomaly DetectionDiffusion ModelDiffusion Distillation
ICML2025

The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions

Summary pending...

early traininglinear mode connectivityloss landscape
ICML2025

Accelerating Large Language Model Reasoning via Speculative Search

Summary pending...

Large Language Model ReasoningInference AccelerationTree Search
ICML2025

Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Summary pending...

Diffusion ModelDiscrete DiffusionMultimodal
ICML2025

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Summary pending...

large language modelsefficient fine-tuningsparsity