Papers
12094 papers
ICLR2024
YaRN: Efficient Context Window Extension of Large Language Models
transformers, nlp, fine-tuning
ICLR2024
Privately Aligning Language Models with Reinforcement Learning
Large Language Models, RLHF, Alignment
ICLR2024
Sharpness-Aware Data Poisoning Attack
Data poisoning attack; generalization; deep learning
ICLR2024
Post-hoc bias scoring is optimal for fair classification
group fairness, post-hoc fair classification, Bayes optimal classifier
ICLR2024
Talk like a Graph: Encoding Graphs for Large Language Models
Graph problems, large language models, encoding graphs
ICLR2024
Think before you speak: Training Language Models With Pause Tokens
LLM training and inference, Downstream finetuning
ICLR2024
A Characterization Theorem for Equivariant Networks with Point-wise Activations
Geometric Deep Learning, Equivariant Neural Networks, Characterization Theorem
ICLR2024
Towards Generative Abstract Reasoning: Completing Raven’s Progressive Matrix via Rule Abstraction and Selection
Deep Latent Variable Models, Generative Models, Raven’s Progressive Matrix
ICLR2024
When can transformers reason with abstract symbols?
transformers, language models, reasoning
ICLR2024
Nougat: Neural Optical Understanding for Academic Documents
Visual Document Understanding, Optical Character Recognition, Mathematical Expression Recognition
ICLR2024
T-MARS: Improving Visual Representations by Circumventing Text Feature Learning
Data Cleaning, CLIP, Image-caption
ICLR2024
Evaluating Representation Learning on the Protein Structure Universe
ProteinRepresentationLearning
ICLR2024
Quadratic models for understanding catapult dynamics of neural networks
quadratic models, wide neural networks, catapult phase
ICLR2024
LLM Augmented LLMs: Expanding Capabilities through Composition
Large Language Models, Model Composition, Knowledge Augmentation
ICLR2024
Estimating Conditional Mutual Information for Dynamic Feature Selection
dynamic feature selection, adaptive, feature selection
ICLR2024
Reconciling Spatial and Temporal Abstractions for Goal Representation
Hierarchical Reinforcement Learning, Goal Representation, Reachability Analysis
ICLR2024
Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations
Imitation Learning, Benchmark, Datasets
ICLR2024
Transformers can optimally learn regression mixture models
transformers, mixture models, linear regression
ICLR2024
Optimal Sample Complexity of Contrastive Learning
learning theory, sample complexity, vc dimension
ICLR2024
Mixture of Weak and Strong Experts on Graphs
Graph Neural Networks, Mixture of experts, Node classification