Papers
12094 papers
TMLR2025
On the Low-Rank Parametrization of Reward Models for Controlled Language Generation
TMLR2025
SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks
TMLR2025
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
TMLR2025
Unified Wisdom: Harnessing Collaborative Learning to Improve Efficacy of Knowledge Distillation
TMLR2025
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
TMLR2025
Beyond ordinary Lipschitz constraints: Differentially Private optimization with TNC
TMLR2025
Beyond Grids: Multi-objective Bayesian Optimization With Adaptive Discretization
TMLR2025
Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A
TMLR2025
G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving
TMLR2025
YoooP: You Only Optimize One Prototype per Class for Non-Exemplar Incremental Learning
TMLR2025
LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning
TMLR2025
Continual learning via probabilistic exchangeable sequence modelling
TMLR2025
DNR-Pruning: Sparsity-Aware Pruning via Dying Neuron Reactivation in Convolutional Neural Networks
TMLR2025
Complementarity: Toward Better Metrics and Optimizing Data Efficiency in LLMs