Papers

12094 papers

ICLR2024

Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits

Summary pending...

Dueling BanditVariance-awarecontextual bandit
ICLR2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training

Summary pending...

self-supervised learningmusicaudio
ICLR2024

Harnessing Density Ratios for Online Reinforcement Learning

Summary pending...

reinforcement learning theoryonline RLoffline RL
ICLR2024

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Summary pending...

Large Language ModelsChain-of-ThoughtTextual Reasoning
ICLR2024

Group Preference Optimization: Few-Shot Alignment of Large Language Models

Summary pending...

Large Language Modelsalignmentgroup preference alignment
ICLR2024

NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks

Summary pending...

Propositional satisfiabilityGraph Neural NetworksCDCL SAT Solving
ICLR2024

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

Summary pending...

large language modelchain-of-thoughtbackdoor attack
ICLR2024

L2MAC: Large Language Model Automatic Computer for Extensive Code Generation

Summary pending...

Code GenerationMemory-augmented LLMsLarge Language Models (LLMs)
ICLR2024

Learning to design protein-protein interactions with enhanced generalization

Summary pending...

protein-protein interactionsprotein designgeneralization
ICLR2024

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Summary pending...

LLM finetuningScaling LawsFull-model finetuning
ICLR2024

Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization

Summary pending...

Stochastic optimizationsubmodular maximizationFrank-Wolfe algorithm
ICLR2024

Does CLIP’s generalization performance mainly stem from high train-test similarity?

Summary pending...

robustnessfoundation modelsCLIP
ICLR2024

Curriculum reinforcement learning for quantum architecture search under hardware errors

Summary pending...

Quantum ComputingReinforcement LearningQuantum Chemistry
ICLR2024

MT-Ranker: Reference-free machine translation evaluation by inter-system ranking

Summary pending...

Machine Translation Evaluation
ICLR2024

Forward Learning of Graph Neural Networks

Summary pending...

graph neural networksforward learningforward-forward algorithm
ICLR2024

Sufficient conditions for offline reactivation in recurrent neural networks

Summary pending...

computational neuroscienceoffline reactivationreplay
ICLR2024

Cycle Consistency Driven Object Discovery

Summary pending...

cycle consistencyobject discoverydownstream RL
ICLR2024

Provable Compositional Generalization for Object-Centric Learning

Summary pending...

compositional generalizationidentifiabilityobject-centric learning
ICLR2024

Learning to Make Adherence-aware Advice

Summary pending...

Human-AI interactionReinforcement Learning
ICLR2024

Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

Summary pending...

backdoor attackmachine learning safetyasymptotic