Papers

12094 papers

ICLR2024

Tree Search-Based Policy Optimization under Stochastic Execution Delay

Reinforcement Learning, Delay, EfficientZero

ICLR2024

Designing Skill-Compatible AI: Methodologies and Frameworks in Chess

Skill-AI compatibility, Agent Systems, Decision-making

ICLR2024

A Flexible Generative Model for Heterogeneous Tabular EHR with Missing Modality

Generative Model, Synthetic EHR

ICLR2024

Differentiable Learning of Generalized Structured Matrices for Efficient Deep Neural Networks

Structured Matrix, Block Low Rank, Low Rank

ICLR2024

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Covariate shift, Maximum Likelihood Estimation, Out-of-Distribution generalization

ICLR2024

Predictive, scalable and interpretable knowledge tracing on structured domains

knowledge tracing, interpretable representations, knowledge graphs

ICLR2024

Directly Fine-Tuning Diffusion Models on Differentiable Rewards

diffusion models, preference-based learning

ICLR2024

Llemma: An Open Language Model for Mathematics

reasoning, language models, pretraining

ICLR2024

Task structure and nonlinearity jointly determine learned representational geometry

representational geometry, kernel target alignment, disentanglement

ICLR2024

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Accelerated stochastic gradient descent, excess risk, linear regression

ICLR2024

Layer-wise linear mode connectivity

linear mode connectivity, layer-wise, federated averaging

ICLR2024

Understanding Certified Training with Interval Bound Propagation

Certified Robustness, Adversarial Robustness, Neural Network Verification

ICLR2024

Offline RL with Observation Histories: Analyzing and Improving Sample Complexity

offline reinforcement learning, POMDPs, representation learning

ICLR2024

Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations

Spatio-temporal data, time series, virtual sensing

ICLR2024

Two-stage LLM Fine-tuning with Less Specialization and More Generalization

language model, Generalization

ICLR2024

NEFTune: Noisy Embeddings Improve Instruction Finetuning

Instruction Finetuning

ICLR2024

An operator preconditioning perspective on training in physics-informed machine learning

physics-informed machine learning, operator preconditioning, deep learning

ICLR2024

Stochastic Gradient Descent for Gaussian Processes Done Right

Gaussian process, stochastic gradient descent

ICLR2024

Predictive auxiliary objectives in deep RL mimic learning in the brain

hippocampus, neuroscience, cognitive science

ICLR2024

A Discretization Framework for Robust Contextual Stochastic Optimization

Robust Optimization, Stochastic Optimization, End-to-End learning
