Papers
12094 papers
ICLR2024
Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits
Summary pending...
Dueling BanditVariance-awarecontextual bandit
ICLR2024
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Summary pending...
self-supervised learningmusicaudio
ICLR2024
Harnessing Density Ratios for Online Reinforcement Learning
Summary pending...
reinforcement learning theoryonline RLoffline RL
ICLR2024
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning
Summary pending...
Large Language ModelsChain-of-ThoughtTextual Reasoning
ICLR2024
Group Preference Optimization: Few-Shot Alignment of Large Language Models
Summary pending...
Large Language Modelsalignmentgroup preference alignment
ICLR2024
NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks
Summary pending...
Propositional satisfiabilityGraph Neural NetworksCDCL SAT Solving
ICLR2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
Summary pending...
large language modelchain-of-thoughtbackdoor attack
ICLR2024
L2MAC: Large Language Model Automatic Computer for Extensive Code Generation
Summary pending...
Code GenerationMemory-augmented LLMsLarge Language Models (LLMs)
ICLR2024
Learning to design protein-protein interactions with enhanced generalization
Summary pending...
protein-protein interactionsprotein designgeneralization
ICLR2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Summary pending...
LLM finetuningScaling LawsFull-model finetuning
ICLR2024
Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization
Summary pending...
Stochastic optimizationsubmodular maximizationFrank-Wolfe algorithm
ICLR2024
Does CLIP’s generalization performance mainly stem from high train-test similarity?
Summary pending...
robustnessfoundation modelsCLIP
ICLR2024
Curriculum reinforcement learning for quantum architecture search under hardware errors
Summary pending...
Quantum ComputingReinforcement LearningQuantum Chemistry
ICLR2024
MT-Ranker: Reference-free machine translation evaluation by inter-system ranking
Summary pending...
Machine Translation Evaluation
ICLR2024
Forward Learning of Graph Neural Networks
Summary pending...
graph neural networksforward learningforward-forward algorithm
ICLR2024
Sufficient conditions for offline reactivation in recurrent neural networks
Summary pending...
computational neuroscienceoffline reactivationreplay
ICLR2024
Cycle Consistency Driven Object Discovery
Summary pending...
cycle consistencyobject discoverydownstream RL
ICLR2024
Provable Compositional Generalization for Object-Centric Learning
Summary pending...
compositional generalizationidentifiabilityobject-centric learning
ICLR2024
Learning to Make Adherence-aware Advice
Summary pending...
Human-AI interactionReinforcement Learning
ICLR2024
Demystifying Poisoning Backdoor Attacks from a Statistical Perspective
Summary pending...
backdoor attackmachine learning safetyasymptotic