Papers
12094 papers
ICLR2024
Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization
diffusion models; score-based generative models; convergence bounds
ICLR2024
Bridging State and History Representations: Understanding Self-Predictive RL
Reinforcement Learning; Representation Learning; POMDPs
ICLR2024
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate
Large Language Model; Multi-Agent Debate; LLM evaluators
ICLR2024
When should we prefer Decision Transformers for Offline Reinforcement Learning?
offline reinforcement learning; sequence modeling; reinforcement learning
ICLR2024
Scaling Laws for Sparsely-Connected Foundation Models
sparsity; scaling; optimal sparsity
ICLR2024
The Expressive Power of Transformers with Chain of Thought
The Expressive Power of Transformers with Chain of Thought
ICLR2024
DENEVIL: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning
Ethical values; Large Language Models; Alignment
ICLR2024
Function-space Parameterization of Neural Networks for Sequential Learning
Neural networks; Bayesian deep learning; deep learning
ICLR2024
On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation
Bilevel-Optimization; Penalty Methods; Landscape Analysis
ICLR2024
LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models
carbon footprint modeling; large language models
ICLR2024
Amortizing intractable inference in large language models
large language models; LLMs; Bayesian inference
ICLR2024
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Distributed; Deep Reinforcement Learning; Distributed Deep Reinforcement Learning
ICLR2024
Learning to Act from Actionless Videos through Dense Correspondences
Video-Based Policy; Video Dense Correspondence
ICLR2024
Meta Inverse Constrained Reinforcement Learning: Convergence Guarantee and Generalization Analysis
inverse reinforcement learning; meta learning
ICLR2024
DreamLLM: Synergistic Multimodal Comprehension and Creation
Multimodal Large Language Models; Large Language Models; Generative Models
ICLR2024
How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation
Attention computation; kronecker computation
ICLR2024
Structural Estimation of Partially Observed Linear Non-Gaussian Acyclic Model: A Practical Approach with Identifiability
causal discovery; latent variable model; structure learning
ICLR2024
TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series
Time Series; Embedding; LLM
ICLR2024
DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
Model-based Reinforcement Learning; Reward Shaping; Reward Smoothing
ICLR2024
How Does Unlabeled Data Provably Help Out-of-Distribution Detection?
out-of-distribution detection; learnability