Papers
12094 papers
ICLR2024
Tool-Augmented Reward Modeling
Summary pending...
Reward ModelLarge Language ModelTool Learning
ICLR2024
A Differentially Private Clustering Algorithm for Well-Clustered Graphs
Summary pending...
differential privacygraph clusteringsemidefinite programming
ICLR2024
Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on HuggingFace
Summary pending...
dataset documentationdata-centric AIlarge-scale analysis
ICLR2024
Error Feedback Reloaded: From Quadratic to Arithmetic Mean of Smoothness Constants
Summary pending...
error feedbackgreedy sparsificationdistributed optimization
ICLR2024
PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation
Summary pending...
test-time adaptationdomain adaptationdomain shift
ICLR2024
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models
Summary pending...
Retraining-freePruningCompression
ICLR2024
Scalable Monotonic Neural Networks
Summary pending...
neural networksmonotonicityscalability
ICLR2024
MixSATGEN: Learning Graph Mixing for SAT Instance Generation
Summary pending...
Combinatorial OptimizationBoolean Satisfiability ProblemGraph Generation
ICLR2024
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Summary pending...
MetaFormerConvolutionReinforcement Learning
ICLR2024
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models
Summary pending...
Large Language ModelsJailbreak AttackAdversarial Attack
ICLR2024
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
Summary pending...
LLMsLarge Language ModelsQuestion Answering
ICLR2024
Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings
Summary pending...
Spiking Neural NetworksDelaysNeuromorphic Computing
ICLR2024
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes
Summary pending...
Reinforcement learningmulti-task learningbracketing number
ICLR2024
Explaining Kernel Clustering via Decision Trees
Summary pending...
Kernel k-meansPrice of explainability
ICLR2024
Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors
Summary pending...
Pre TrainingTransformersState Space Models
ICLR2024
An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression
Summary pending...
kernel ridge regressioncost of overfittingbenign overfitting
ICLR2024
Expressive Losses for Verified Robustness via Convex Combinations
Summary pending...
Verified TrainingNeural Network VerificationVerified Adversarial Robustness
ICLR2024
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Summary pending...
audio synthesisvocoderGAN
ICLR2024
Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain
Summary pending...
human enhancementhuman-agent collaborationgame playing