Papers

12094 papers

ICLR2024

Tool-Augmented Reward Modeling

Summary pending...

Reward ModelLarge Language ModelTool Learning

Paper

ICLR2024

A Differentially Private Clustering Algorithm for Well-Clustered Graphs

Summary pending...

differential privacygraph clusteringsemidefinite programming

Paper

ICLR2024

Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on HuggingFace

Summary pending...

dataset documentationdata-centric AIlarge-scale analysis

Paper

ICLR2024

Error Feedback Reloaded: From Quadratic to Arithmetic Mean of Smoothness Constants

Summary pending...

error feedbackgreedy sparsificationdistributed optimization

Paper

ICLR2024

PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation

Summary pending...

test-time adaptationdomain adaptationdomain shift

Paper

ICLR2024

Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models

Summary pending...

Retraining-freePruningCompression

Paper

ICLR2024

Scalable Monotonic Neural Networks

Summary pending...

neural networksmonotonicityscalability

Paper

ICLR2024

MixSATGEN: Learning Graph Mixing for SAT Instance Generation

Summary pending...

Combinatorial OptimizationBoolean Satisfiability ProblemGraph Generation

Paper

ICLR2024

Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making

Summary pending...

MetaFormerConvolutionReinforcement Learning

Paper

ICLR2024

AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models

Summary pending...

Large Language ModelsJailbreak AttackAdversarial Attack

Paper

ICLR2024

The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”

Summary pending...

LLMsLarge Language ModelsQuestion Answering

Paper

ICLR2024

Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings

Summary pending...

Spiking Neural NetworksDelaysNeuromorphic Computing

Paper

ICLR2024

Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes

Summary pending...

Reinforcement learningmulti-task learningbracketing number

Paper

ICLR2024

Improving Offline RL by Blending Heuristics

Summary pending...

offline RLheuristicRL

Paper

ICLR2024

Explaining Kernel Clustering via Decision Trees

Summary pending...

Kernel k-meansPrice of explainability

Paper

ICLR2024

Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors

Summary pending...

Pre TrainingTransformersState Space Models

Paper

ICLR2024

An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression

Summary pending...

kernel ridge regressioncost of overfittingbenign overfitting

Paper

ICLR2024

Expressive Losses for Verified Robustness via Convex Combinations

Summary pending...

Verified TrainingNeural Network VerificationVerified Adversarial Robustness

Paper

ICLR2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Summary pending...

audio synthesisvocoderGAN

Paper

ICLR2024

Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain

Summary pending...

human enhancementhuman-agent collaborationgame playing

Paper