Papers

12094 papers

ICLR2024

Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits

Dueling Bandit, Variance-aware, contextual bandit

ICLR2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training

self-supervised learning, music, audio

ICLR2024

Harnessing Density Ratios for Online Reinforcement Learning

reinforcement learning theory, online RL, offline RL

ICLR2024

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Large Language Models, Chain-of-Thought, Textual Reasoning

ICLR2024

Group Preference Optimization: Few-Shot Alignment of Large Language Models

Large Language Models, alignment, group preference alignment

ICLR2024

NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks

Propositional satisfiability, Graph Neural Networks, CDCL SAT Solving

ICLR2024

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

large language model, chain-of-thought, backdoor attack

ICLR2024

L2MAC: Large Language Model Automatic Computer for Extensive Code Generation

Code Generation, Memory-augmented LLMs, Large Language Models (LLMs)

ICLR2024

Learning to design protein-protein interactions with enhanced generalization

protein-protein interactions, protein design, generalization

ICLR2024

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

LLM finetuning, Scaling Laws, Full-model finetuning

ICLR2024

Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization

Stochastic optimization, submodular maximization, Frank-Wolfe algorithm

ICLR2024

Does CLIP’s generalization performance mainly stem from high train-test similarity?

robustness, foundation models, CLIP

ICLR2024

Curriculum reinforcement learning for quantum architecture search under hardware errors

Quantum Computing, Reinforcement Learning, Quantum Chemistry

ICLR2024

MT-Ranker: Reference-free machine translation evaluation by inter-system ranking

Machine Translation Evaluation

ICLR2024

Forward Learning of Graph Neural Networks

graph neural networks, forward learning, forward-forward algorithm

ICLR2024

Sufficient conditions for offline reactivation in recurrent neural networks

computational neuroscience, offline reactivation, replay

ICLR2024

Cycle Consistency Driven Object Discovery

cycle consistency, object discovery, downstream RL

ICLR2024

Provable Compositional Generalization for Object-Centric Learning

compositional generalization, identifiability, object-centric learning

ICLR2024

Learning to Make Adherence-aware Advice

Human-AI interaction, Reinforcement Learning

ICLR2024

Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

backdoor attack, machine learning safety, asymptotic
