Papers

12094 papers

ICLR2024

Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

Summary pending...

diffusion models; score-based generative models; convergence bounds
ICLR2024

Bridging State and History Representations: Understanding Self-Predictive RL

Summary pending...

Reinforcement Learning; Representation Learning; POMDPs
ICLR2024

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate

Summary pending...

Large Language Model; Multi-Agent Debate; LLM evaluators
ICLR2024

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Summary pending...

offline reinforcement learning; sequence modeling; reinforcement learning
ICLR2024

Scaling Laws for Sparsely-Connected Foundation Models

Summary pending...

sparsity; scaling; optimal sparsity
ICLR2024

The Expressive Power of Transformers with Chain of Thought

Summary pending...

ICLR2024

DENEVIL: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning

Summary pending...

Ethical values; Large Language Models; Alignment
ICLR2024

Function-space Parameterization of Neural Networks for Sequential Learning

Summary pending...

Neural networks; Bayesian deep learning; deep learning
ICLR2024

On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

Summary pending...

Bilevel-Optimization; Penalty Methods; Landscape Analysis
ICLR2024

LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models

Summary pending...

carbon footprint modeling; large language models
ICLR2024

Amortizing intractable inference in large language models

Summary pending...

large language models; LLMs; Bayesian inference
ICLR2024

Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform

Summary pending...

Distributed; Deep Reinforcement Learning; Distributed Deep Reinforcement Learning
ICLR2024

Learning to Act from Actionless Videos through Dense Correspondences

Summary pending...

Video-Based Policy; Video Dense Correspondence
ICLR2024

Meta Inverse Constrained Reinforcement Learning: Convergence Guarantee and Generalization Analysis

Summary pending...

inverse reinforcement learning; meta learning
ICLR2024

DreamLLM: Synergistic Multimodal Comprehension and Creation

Summary pending...

Multimodal Large Language Models; Large Language Models; Generative Models
ICLR2024

How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation

Summary pending...

Attention computation; kronecker computation
ICLR2024

Structural Estimation of Partially Observed Linear Non-Gaussian Acyclic Model: A Practical Approach with Identifiability

Summary pending...

causal discovery; latent variable model; structure learning
ICLR2024

TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series

Summary pending...

Time Series; Embedding; LLM
ICLR2024

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing

Summary pending...

Model-based Reinforcement Learning; Reward Shaping; Reward Smoothing
ICLR2024

How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

Summary pending...

out-of-distribution detection; learnability