Papers

12094 papers

ICLR2024

Towards Generative Abstract Reasoning: Completing Raven’s Progressive Matrix via Rule Abstraction and Selection

Summary pending...

Deep Latent Variable Models, Generative Models, Raven’s Progressive Matrix
ICLR2024

When can transformers reason with abstract symbols?

Summary pending...

transformers, language models, reasoning
ICLR2024

Nougat: Neural Optical Understanding for Academic Documents

Summary pending...

Visual Document Understanding, Optical Character Recognition, Mathematical Expression Recognition
ICLR2024

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning

Summary pending...

Data Cleaning, CLIP, Image-caption
ICLR2024

Evaluating Representation Learning on the Protein Structure Universe

Summary pending...

Protein, Representation, Learning
ICLR2024

Quadratic models for understanding catapult dynamics of neural networks

Summary pending...

quadratic models, wide neural networks, catapult phase
ICLR2024

LLM Augmented LLMs: Expanding Capabilities through Composition

Summary pending...

Large Language Models, Model Composition, Knowledge Augmentation
ICLR2024

Estimating Conditional Mutual Information for Dynamic Feature Selection

Summary pending...

dynamic feature selection, adaptive, feature selection
ICLR2024

Reconciling Spatial and Temporal Abstractions for Goal Representation

Summary pending...

Hierarchical Reinforcement Learning, Goal Representation, Reachability Analysis
ICLR2024

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

Summary pending...

Imitation Learning, Benchmark, Datasets
ICLR2024

Transformers can optimally learn regression mixture models

Summary pending...

transformers, mixture models, linear regression
ICLR2024

Optimal Sample Complexity of Contrastive Learning

Summary pending...

learning theory, sample complexity, vc dimension
ICLR2024

Mixture of Weak and Strong Experts on Graphs

Summary pending...

Graph Neural Networks, Mixture of experts, Node classification
ICLR2024

Learning Polynomial Problems with $SL(2, \mathbb{R})$-Equivariance

Summary pending...

equivariance, invariance, polynomials
ICLR2024

Rethinking Backdoor Attacks on Dataset Distillation: A Kernel Method Perspective

Summary pending...

Backdoor, Trigger, Dataset Condensation
ICLR2024

Towards Principled Representation Learning from Videos for Reinforcement Learning

Summary pending...

Reinforcement Learning, Representation Learning
ICLR2024

The Marginal Value of Momentum for Small Learning Rate SGD

Summary pending...

momentum, SGD, dynamics
ICLR2024

Language Model Cascades: Token-Level Uncertainty And Beyond

Summary pending...

Cascades, Efficient Inference, Language Models
ICLR2024

Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning

Summary pending...

Reinforcement learning, Posterior sampling, Causality
ICLR2024

Grokking as the transition from lazy to rich training dynamics

Summary pending...

Grokking, Feature Learning, Neural Tangent Kernel