Papers

12094 papers

TMLR2025

Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information

Summary pending...

TMLR2025

SKADA-Bench: Benchmarking Unsupervised Domain Adaptation Methods with Realistic Validation On Diverse Modalities

Summary pending...

TMLR2025

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Summary pending...

TMLR2025

Reinforcement Learning from Bagged Reward

Summary pending...

TMLR2025

Link Prediction with Relational Hypergraphs

Summary pending...

TMLR2025

Towards Undistillable Models by Minimizing Conditional Mutual Information

Summary pending...

TMLR2025

Tighter sparse variational Gaussian processes

Summary pending...

TMLR2025

Piecewise Constant Spectral Graph Neural Network

Summary pending...

TMLR2025

Is What You Ask For What You Get? Investigating Concept Associations in Text-to-Image Models

Summary pending...

TMLR2025

Gradient Inversion Attack on Graph Neural Networks

Summary pending...

TMLR2025

Learning Energy-Based Generative Models via Potential Flow: A Variational Principle Approach to Probability Density Homotopy Matching

Summary pending...

TMLR2025

A Survey on Large Language Model Acceleration based on KV Cache Management

Summary pending...

TMLR2025

Offset Unlearning for Large Language Models

Summary pending...

TMLR2025

Pitfalls in Evaluating Inference-time Methods for Improving LLM Reliability

Summary pending...

TMLR2025

On the Utility of Existing Fine-Tuned Models on Data-Scarce Domains

Summary pending...

TMLR2025

Investigating Continual Pretraining in Large Language Models: Insights and Implications

Summary pending...

TMLR2025

Double Horizon Model-Based Policy Optimization

Summary pending...

TMLR2025

SEE-DPO: Self Entropy Enhanced Direct Preference Optimization

Summary pending...

TMLR2025

Efficient and Flexible Neural Network Training through Layer-wise Feedback Propagation

Summary pending...

TMLR2025

Towards Better Understanding of In-Context Learning Ability from In-Context Uncertainty Quantification

Summary pending...