Papers

12094 papers

ICML2025

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Summary pending...

efficiencyLLMsquantization
ICML2025

Long-Short Alignment for Effective Long-Context Modeling in LLMs

Summary pending...

Large language modellength generalizationlong-short alignment
ICML2025

Conditional Diffusion Model with Nonlinear Data Transformation for Time Series Forecasting

Summary pending...

diffusion modeltime series forecastinggenerative modeling
ICML2025

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

Summary pending...

intrinsic motivationexplorationfoundation models
ICML2025

Boosting Multi-Domain Fine-Tuning of Large Language Models through Evolving Interactions between Samples

Summary pending...

curriculum learninglarge language modelsmixed data
ICML2025

Measuring Variable Importance in Heterogeneous Treatment Effects with Confidence

Summary pending...

Causal LearningInterpretable ML
ICML2025

Strategic Planning: A Top-Down Approach to Option Generation

Summary pending...

Reinforcement LearningTop-down approachStrategic Planning
ICML2025

Tree-Sliced Wasserstein Distance: A Geometric Perspective

Summary pending...

tree-sliced wasserstein distancetree wasserstein distanceoptimal transport
ICML2025

Learning Adversarial MDPs with Stochastic Hard Constraints

Summary pending...

CMDPhard constraintsonline learning
ICML2025

A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle

Summary pending...

Time series forecastingtransformerspoint-wise
ICML2025

The Relationship Between No-Regret Learning and Online Conformal Prediction

Summary pending...

Conformal predictionno-regret learningonline gradient descent
ICML2025

Fairness Overfitting in Machine Learning: An Information-Theoretic Perspective

Summary pending...

information-theoretic boundsfairnessgeneralization error bounds
ICML2025

Scalable Approximation Algorithms for $p$-Wasserstein Distance and Its Variants

Summary pending...

p-Wasserstein DistanceOptimal TransportApproximation Algorithms
ICML2025

On the Duality between Gradient Transformations and Adapters

Summary pending...

memoryefficienttraining
ICML2025

Can Large Language Models Understand Intermediate Representations in Compilers?

Summary pending...

Large Language Models (LLMs)Intermediate Representations (IRs)Code Comprehension
ICML2025

Robust Multimodal Large Language Models Against Modality Conflict

Summary pending...

Multimodal Large Language ModelsModality ConflictHallucinations
ICML2025

Pixel-level Certified Explanations via Randomized Smoothing

Summary pending...

explainability robustnessrobustness certificationexplainability
ICML2025

Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning

Summary pending...

Data mixtureLLMsleverage score
ICML2025

Optimal Auction Design in the Joint Advertising

Summary pending...

Joint AdvertisementAuction DesignBundleNet
ICML2025

Score-of-Mixture Training: One-Step Generative Model Training Made Simple via Score Estimation of Mixture Distributions

Summary pending...

one-step generationskew Jensen-Shannon Divergencediffusion models