Papers

12094 papers

ICLR2024

Fast-ELECTRA for Efficient Pre-training

Summary pending...

Language model Pre-trainingELECTRAEfficiency

Paper

ICLR2024

Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning

Summary pending...

Causal ReasoningCausal DiscoveryStructural Causal Models

Paper

ICLR2024

Toward Student-oriented Teacher Network Training for Knowledge Distillation

Summary pending...

Knowledge distillationTeacher-student trainingEmpirical risk minimization

Paper

ICLR2024

An Emulator for Fine-tuning Large Language Models using Small Language Models

Summary pending...

pre-trainingfine-tuningdecouple

Paper

ICLR2024

Pre-training LiDAR-based 3D Object Detectors through Colorization

Summary pending...

3D object detectionLiDAR point cloudpre-training

Paper

ICLR2024

Efficient Modulation for Vision Networks

Summary pending...

EfficientModEfficient Networks

Paper

ICLR2024

Language Models Represent Space and Time

Summary pending...

Interpretabilityworld modelsprobing

Paper

ICLR2024

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Summary pending...

convolutionsGPUshardware-efficient algorithms

Paper

ICLR2024

Independent-Set Design of Experiments for Estimating Treatment and Spillover Effects under Network Interference

Summary pending...

Causal inferenceDesign of experimentsInterference

Paper

ICLR2024

Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

Summary pending...

graph neural networksDatasetsmolecules

Paper

ICLR2024

AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents

Summary pending...

Meta-RLGeneralizationLong-Term Memory

Paper

ICLR2024

From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction

Summary pending...

atomic property predictionpre-training3D atomic pre-training

Paper

ICLR2024

PlaSma: Procedural Knowledge Models for Language-based Planning and Re-Planning

Summary pending...

language-based planningprocedural/script knowledgedistillation

Paper

ICLR2024

Are Models Biased on Text without Gender-related Language?

Summary pending...

Large language modelsbias evaluationgender bias

Paper

ICLR2024

Counting Graph Substructures with Graph Neural Networks

Summary pending...

graph neural networksexpressive powerrepresentation learning

Paper

ICLR2024

Reverse Diffusion Monte Carlo

Summary pending...

posterior Samplingmulti-modal samplingdiffusion process

Paper

ICLR2024

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Summary pending...

SketchingResidual errorLow-rank approximation

Paper

ICLR2024

Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Summary pending...

Pure Differential PrivacyMonte Carlo samplingGaussian Differential Privacy

Paper

ICLR2024

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Summary pending...

Chain of thoughtlanguage modelingcircuit complexity

Paper

ICLR2024

The Trickle-down Impact of Reward Inconsistency on RLHF

Summary pending...

Large language modelreward modelRLHF

Paper