Papers

12094 papers

ICLR2024

Tree Search-Based Policy Optimization under Stochastic Execution Delay

Summary pending...

Reinforcement LearningDelayEfficientZero
ICLR2024

Designing Skill-Compatible AI: Methodologies and Frameworks in Chess

Summary pending...

Skill-AI compatibilityAgent SystemsDecision-making
ICLR2024

A Flexible Generative Model for Heterogeneous Tabular EHR with Missing Modality

Summary pending...

Generative ModelSynthetic EHR
ICLR2024

Differentiable Learning of Generalized Structured Matrices for Efficient Deep Neural Networks

Summary pending...

Structured MatrixBlock Low RankLow Rank
ICLR2024

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Summary pending...

Covariate shift; Maximum Likelihood Estimation; Out-of-Distribution generalization;
ICLR2024

Predictive, scalable and interpretable knowledge tracing on structured domains

Summary pending...

knowledge tracinginterpretable representationsknowledge graphs
ICLR2024

Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Summary pending...

diffusion modelspreference-based learning
ICLR2024

Llemma: An Open Language Model for Mathematics

Summary pending...

reasoninglanguage modelspretraining
ICLR2024

Task structure and nonlinearity jointly determine learned representational geometry

Summary pending...

representational geometrykernel target alignmentdisentanglement
ICLR2024

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Summary pending...

Accelerated stochastic gradient descentexcess risklinear regression
ICLR2024

Layer-wise linear mode connectivity

Summary pending...

linear mode connectivitylayer-wisefederated averaging
ICLR2024

Understanding Certified Training with Interval Bound Propagation

Summary pending...

Certified RobustnessAdversarial RobustnessNeural Network Verification
ICLR2024

Offline RL with Observation Histories: Analyzing and Improving Sample Complexity

Summary pending...

offline reinforcement learningPOMDPsrepresentation learning
ICLR2024

Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations

Summary pending...

Spatio-temporal datatime seriesvirtual sensing
ICLR2024

Two-stage LLM Fine-tuning with Less Specialization and More Generalization

Summary pending...

language modelGeneralization
ICLR2024

NEFTune: Noisy Embeddings Improve Instruction Finetuning

Summary pending...

Instruction Finetuning
ICLR2024

An operator preconditioning perspective on training in physics-informed machine learning

Summary pending...

physics-informed machine learningoperator preconditioningdeep learning
ICLR2024

Stochastic Gradient Descent for Gaussian Processes Done Right

Summary pending...

Gaussian processstochastic gradient descent
ICLR2024

Predictive auxiliary objectives in deep RL mimic learning in the brain

Summary pending...

hippocampusneurosciencecognitive science
ICLR2024

A Discretization Framework for Robust Contextual Stochastic Optimization

Summary pending...

Robust OptimizationStochastic OptimizationEnd-to-End learning