Papers
12094 papers
ICML2025
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations
Summary pending...
efficiencyLLMsquantization
ICML2025
Long-Short Alignment for Effective Long-Context Modeling in LLMs
Summary pending...
Large language modellength generalizationlong-short alignment
ICML2025
Conditional Diffusion Model with Nonlinear Data Transformation for Time Series Forecasting
Summary pending...
diffusion modeltime series forecastinggenerative modeling
ICML2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Summary pending...
intrinsic motivationexplorationfoundation models
ICML2025
Boosting Multi-Domain Fine-Tuning of Large Language Models through Evolving Interactions between Samples
Summary pending...
curriculum learninglarge language modelsmixed data
ICML2025
Measuring Variable Importance in Heterogeneous Treatment Effects with Confidence
Summary pending...
Causal LearningInterpretable ML
ICML2025
Strategic Planning: A Top-Down Approach to Option Generation
Summary pending...
Reinforcement LearningTop-down approachStrategic Planning
ICML2025
Tree-Sliced Wasserstein Distance: A Geometric Perspective
Summary pending...
tree-sliced wasserstein distancetree wasserstein distanceoptimal transport
ICML2025
Learning Adversarial MDPs with Stochastic Hard Constraints
Summary pending...
CMDPhard constraintsonline learning
ICML2025
A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle
Summary pending...
Time series forecastingtransformerspoint-wise
ICML2025
The Relationship Between No-Regret Learning and Online Conformal Prediction
Summary pending...
Conformal predictionno-regret learningonline gradient descent
ICML2025
Fairness Overfitting in Machine Learning: An Information-Theoretic Perspective
Summary pending...
information-theoretic boundsfairnessgeneralization error bounds
ICML2025
Scalable Approximation Algorithms for $p$-Wasserstein Distance and Its Variants
Summary pending...
p-Wasserstein DistanceOptimal TransportApproximation Algorithms
ICML2025
On the Duality between Gradient Transformations and Adapters
Summary pending...
memoryefficienttraining
ICML2025
Can Large Language Models Understand Intermediate Representations in Compilers?
Summary pending...
Large Language Models (LLMs)Intermediate Representations (IRs)Code Comprehension
ICML2025
Robust Multimodal Large Language Models Against Modality Conflict
Summary pending...
Multimodal Large Language ModelsModality ConflictHallucinations
ICML2025
Pixel-level Certified Explanations via Randomized Smoothing
Summary pending...
explainability robustnessrobustness certificationexplainability
ICML2025
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
Summary pending...
Data mixtureLLMsleverage score
ICML2025
Optimal Auction Design in the Joint Advertising
Summary pending...
Joint AdvertisementAuction DesignBundleNet
ICML2025
Score-of-Mixture Training: One-Step Generative Model Training Made Simple via Score Estimation of Mixture Distributions
Summary pending...
one-step generationskew Jensen-Shannon Divergencediffusion models