Papers

12094 papers

ICLR2024

YaRN: Efficient Context Window Extension of Large Language Models

Summary pending...

transformersnlpfine-tuning
ICLR2024

Privately Aligning Language Models with Reinforcement Learning

Summary pending...

Large Language ModelsRLHFAlignment
ICLR2024

Sharpness-Aware Data Poisoning Attack

Summary pending...

Data poisoning attack; generalization; deep learning
ICLR2024

Post-hoc bias scoring is optimal for fair classification

Summary pending...

group fairnesspost-hoc fair classificationBayes optimal classifier
ICLR2024

Talk like a Graph: Encoding Graphs for Large Language Models

Summary pending...

Graph problemslarge language modelsencoding graphs
ICLR2024

Think before you speak: Training Language Models With Pause Tokens

Summary pending...

LLM training and inferenceDownstream finetuning
ICLR2024

A Characterization Theorem for Equivariant Networks with Point-wise Activations

Summary pending...

Geometric Deep LearningEquivariant Neural NetworksCharacterization Theorem
ICLR2024

Towards Generative Abstract Reasoning: Completing Raven’s Progressive Matrix via Rule Abstraction and Selection

Summary pending...

Deep Latent Variable ModelsGenerative ModelsRaven’s Progressive Matrix
ICLR2024

When can transformers reason with abstract symbols?

Summary pending...

transformerslanguage modelsreasoning
ICLR2024

Nougat: Neural Optical Understanding for Academic Documents

Summary pending...

Visual Document UnderstandingOptical Character RecognitionMathematical Expression Recognition
ICLR2024

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning

Summary pending...

Data CleaningCLIPImage-caption
ICLR2024

Evaluating Representation Learning on the Protein Structure Universe

Summary pending...

ProteinRepresentationLearning
ICLR2024

Quadratic models for understanding catapult dynamics of neural networks

Summary pending...

quadratic modelswide neural networkscatapult phase
ICLR2024

LLM Augmented LLMs: Expanding Capabilities through Composition

Summary pending...

Large Language ModelsModel CompositionKnowledge Augmentation
ICLR2024

Estimating Conditional Mutual Information for Dynamic Feature Selection

Summary pending...

dynamic feature selectionadaptivefeature selection
ICLR2024

Reconciling Spatial and Temporal Abstractions for Goal Representation

Summary pending...

Hierarchical Reinforcement LearningGoal RepresentationReachability Analysis
ICLR2024

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

Summary pending...

Imitation LearningBenchmarkDatasets
ICLR2024

Transformers can optimally learn regression mixture models

Summary pending...

transformersmixture modelslinear regression
ICLR2024

Optimal Sample Complexity of Contrastive Learning

Summary pending...

learning theorysample complexityvc dimension
ICLR2024

Mixture of Weak and Strong Experts on Graphs

Summary pending...

Graph Neural NetworksMixture of expertsNode classification