Papers

12094 papers

ICLR2024

Tool-Augmented Reward Modeling

Reward Model, Large Language Model, Tool Learning
ICLR2024

A Differentially Private Clustering Algorithm for Well-Clustered Graphs

differential privacy, graph clustering, semidefinite programming
ICLR2024

Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on HuggingFace

dataset documentation, data-centric AI, large-scale analysis
ICLR2024

Error Feedback Reloaded: From Quadratic to Arithmetic Mean of Smoothness Constants

error feedback, greedy sparsification, distributed optimization
ICLR2024

PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation

test-time adaptation, domain adaptation, domain shift
ICLR2024

Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models

Retraining-free, Pruning, Compression
ICLR2024

Scalable Monotonic Neural Networks

neural networks, monotonicity, scalability
ICLR2024

MixSATGEN: Learning Graph Mixing for SAT Instance Generation

Combinatorial Optimization, Boolean Satisfiability Problem, Graph Generation
ICLR2024

Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making

MetaFormer, Convolution, Reinforcement Learning
ICLR2024

AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models

Large Language Models, Jailbreak Attack, Adversarial Attack
ICLR2024

The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”

LLMs, Large Language Models, Question Answering
ICLR2024

Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings

Spiking Neural Networks, Delays, Neuromorphic Computing
ICLR2024

Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes

Reinforcement learning, multi-task learning, bracketing number
ICLR2024

Improving Offline RL by Blending Heuristics

offline RL, heuristic, RL
ICLR2024

Explaining Kernel Clustering via Decision Trees

Kernel k-means, Price of explainability
ICLR2024

Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors

Pre Training, Transformers, State Space Models
ICLR2024

An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression

kernel ridge regression, cost of overfitting, benign overfitting
ICLR2024

Expressive Losses for Verified Robustness via Convex Combinations

Verified Training, Neural Network Verification, Verified Adversarial Robustness
ICLR2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

audio synthesis, vocoder, GAN
ICLR2024

Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain

human enhancement, human-agent collaboration, game playing