Papers

12094 papers

ICML2025

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Summary pending...

efficiencyLLMsquantization

Paper

ICML2025

Long-Short Alignment for Effective Long-Context Modeling in LLMs

Summary pending...

Large language modellength generalizationlong-short alignment

Paper

ICML2025

Conditional Diffusion Model with Nonlinear Data Transformation for Time Series Forecasting

Summary pending...

diffusion modeltime series forecastinggenerative modeling

Paper

ICML2025

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

Summary pending...

intrinsic motivationexplorationfoundation models

Paper

ICML2025

Boosting Multi-Domain Fine-Tuning of Large Language Models through Evolving Interactions between Samples

Summary pending...

curriculum learninglarge language modelsmixed data

Paper

ICML2025

Measuring Variable Importance in Heterogeneous Treatment Effects with Confidence

Summary pending...

Causal LearningInterpretable ML

Paper

ICML2025

Strategic Planning: A Top-Down Approach to Option Generation

Summary pending...

Reinforcement LearningTop-down approachStrategic Planning

Paper

ICML2025

Tree-Sliced Wasserstein Distance: A Geometric Perspective

Summary pending...

tree-sliced wasserstein distancetree wasserstein distanceoptimal transport

Paper

ICML2025

Learning Adversarial MDPs with Stochastic Hard Constraints

Summary pending...

CMDPhard constraintsonline learning

Paper

ICML2025

A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle

Summary pending...

Time series forecastingtransformerspoint-wise

Paper

ICML2025

The Relationship Between No-Regret Learning and Online Conformal Prediction

Summary pending...

Conformal predictionno-regret learningonline gradient descent

Paper

ICML2025

Fairness Overfitting in Machine Learning: An Information-Theoretic Perspective

Summary pending...

information-theoretic boundsfairnessgeneralization error bounds

Paper

ICML2025

Scalable Approximation Algorithms for $p$-Wasserstein Distance and Its Variants

Summary pending...

p-Wasserstein DistanceOptimal TransportApproximation Algorithms

Paper

ICML2025

On the Duality between Gradient Transformations and Adapters

Summary pending...

memoryefficienttraining

Paper

ICML2025

Can Large Language Models Understand Intermediate Representations in Compilers?

Summary pending...

Large Language Models (LLMs)Intermediate Representations (IRs)Code Comprehension

Paper

ICML2025

Robust Multimodal Large Language Models Against Modality Conflict

Summary pending...

Multimodal Large Language ModelsModality ConflictHallucinations

Paper

ICML2025

Pixel-level Certified Explanations via Randomized Smoothing

Summary pending...

explainability robustnessrobustness certificationexplainability

Paper

ICML2025

Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning

Summary pending...

Data mixtureLLMsleverage score

Paper

ICML2025

Optimal Auction Design in the Joint Advertising

Summary pending...

Joint AdvertisementAuction DesignBundleNet

Paper

ICML2025

Score-of-Mixture Training: One-Step Generative Model Training Made Simple via Score Estimation of Mixture Distributions

Summary pending...

one-step generationskew Jensen-Shannon Divergencediffusion models

Paper