Papers

12094 papers

ICLR2024

YaRN: Efficient Context Window Extension of Large Language Models

Summary pending...

transformersnlpfine-tuning

Paper

ICLR2024

Privately Aligning Language Models with Reinforcement Learning

Summary pending...

Large Language ModelsRLHFAlignment

Paper

ICLR2024

Sharpness-Aware Data Poisoning Attack

Summary pending...

Data poisoning attack; generalization; deep learning

Paper

ICLR2024

Post-hoc bias scoring is optimal for fair classification

Summary pending...

group fairnesspost-hoc fair classificationBayes optimal classifier

Paper

ICLR2024

Talk like a Graph: Encoding Graphs for Large Language Models

Summary pending...

Graph problemslarge language modelsencoding graphs

Paper

ICLR2024

Think before you speak: Training Language Models With Pause Tokens

Summary pending...

LLM training and inferenceDownstream finetuning

Paper

ICLR2024

A Characterization Theorem for Equivariant Networks with Point-wise Activations

Summary pending...

Geometric Deep LearningEquivariant Neural NetworksCharacterization Theorem

Paper

ICLR2024

Towards Generative Abstract Reasoning: Completing Raven’s Progressive Matrix via Rule Abstraction and Selection

Summary pending...

Deep Latent Variable ModelsGenerative ModelsRaven’s Progressive Matrix

Paper

ICLR2024

When can transformers reason with abstract symbols?

Summary pending...

transformerslanguage modelsreasoning

Paper

ICLR2024

Nougat: Neural Optical Understanding for Academic Documents

Summary pending...

Visual Document UnderstandingOptical Character RecognitionMathematical Expression Recognition

Paper

ICLR2024

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning

Summary pending...

Data CleaningCLIPImage-caption

Paper

ICLR2024

Evaluating Representation Learning on the Protein Structure Universe

Summary pending...

ProteinRepresentationLearning

Paper

ICLR2024

Quadratic models for understanding catapult dynamics of neural networks

Summary pending...

quadratic modelswide neural networkscatapult phase

Paper

ICLR2024

LLM Augmented LLMs: Expanding Capabilities through Composition

Summary pending...

Large Language ModelsModel CompositionKnowledge Augmentation

Paper

ICLR2024

Estimating Conditional Mutual Information for Dynamic Feature Selection

Summary pending...

dynamic feature selectionadaptivefeature selection

Paper

ICLR2024

Reconciling Spatial and Temporal Abstractions for Goal Representation

Summary pending...

Hierarchical Reinforcement LearningGoal RepresentationReachability Analysis

Paper

ICLR2024

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

Summary pending...

Imitation LearningBenchmarkDatasets

Paper

ICLR2024

Transformers can optimally learn regression mixture models

Summary pending...

transformersmixture modelslinear regression

Paper

ICLR2024

Optimal Sample Complexity of Contrastive Learning

Summary pending...

learning theorysample complexityvc dimension

Paper

ICLR2024

Mixture of Weak and Strong Experts on Graphs

Summary pending...

Graph Neural NetworksMixture of expertsNode classification

Paper