Papers
12094 papers
ICLR2024
Tree Search-Based Policy Optimization under Stochastic Execution Delay
Reinforcement Learning, Delay, EfficientZero
ICLR2024
Designing Skill-Compatible AI: Methodologies and Frameworks in Chess
Skill-AI compatibility, Agent Systems, Decision-making
ICLR2024
A Flexible Generative Model for Heterogeneous Tabular EHR with Missing Modality
Generative Model, Synthetic EHR
ICLR2024
Differentiable Learning of Generalized Structured Matrices for Efficient Deep Neural Networks
Structured Matrix, Block Low Rank, Low Rank
ICLR2024
Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift
Covariate shift, Maximum Likelihood Estimation, Out-of-Distribution generalization
ICLR2024
Predictive, scalable and interpretable knowledge tracing on structured domains
knowledge tracing, interpretable representations, knowledge graphs
ICLR2024
Directly Fine-Tuning Diffusion Models on Differentiable Rewards
diffusion models, preference-based learning
ICLR2024
Llemma: An Open Language Model for Mathematics
reasoning, language models, pretraining
ICLR2024
Task structure and nonlinearity jointly determine learned representational geometry
representational geometry, kernel target alignment, disentanglement
ICLR2024
Risk Bounds of Accelerated SGD for Overparameterized Linear Regression
Accelerated stochastic gradient descent, excess risk, linear regression
ICLR2024
Layer-wise linear mode connectivity
linear mode connectivity, layer-wise, federated averaging
ICLR2024
Understanding Certified Training with Interval Bound Propagation
Certified Robustness, Adversarial Robustness, Neural Network Verification
ICLR2024
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity
offline reinforcement learning, POMDPs, representation learning
ICLR2024
Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations
Spatio-temporal data, time series, virtual sensing
ICLR2024
Two-stage LLM Fine-tuning with Less Specialization and More Generalization
language model, Generalization
ICLR2024
NEFTune: Noisy Embeddings Improve Instruction Finetuning
Instruction Finetuning
ICLR2024
An operator preconditioning perspective on training in physics-informed machine learning
physics-informed machine learning, operator preconditioning, deep learning
ICLR2024
Stochastic Gradient Descent for Gaussian Processes Done Right
Gaussian process, stochastic gradient descent
ICLR2024
Predictive auxiliary objectives in deep RL mimic learning in the brain
hippocampus, neuroscience, cognitive science
ICLR2024
A Discretization Framework for Robust Contextual Stochastic Optimization
Robust Optimization, Stochastic Optimization, End-to-End learning