Papers

12094 papers

ICLR2024

Fast-ELECTRA for Efficient Pre-training

Summary pending...

Language model Pre-training, ELECTRA, Efficiency
ICLR2024

Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning

Summary pending...

Causal Reasoning, Causal Discovery, Structural Causal Models
ICLR2024

Toward Student-oriented Teacher Network Training for Knowledge Distillation

Summary pending...

Knowledge distillation, Teacher-student training, Empirical risk minimization
ICLR2024

An Emulator for Fine-tuning Large Language Models using Small Language Models

Summary pending...

pre-training, fine-tuning, decouple
ICLR2024

Pre-training LiDAR-based 3D Object Detectors through Colorization

Summary pending...

3D object detection, LiDAR point cloud, pre-training
ICLR2024

Efficient Modulation for Vision Networks

Summary pending...

EfficientMod, Efficient Networks
ICLR2024

Language Models Represent Space and Time

Summary pending...

Interpretability, world models, probing
ICLR2024

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Summary pending...

convolutions, GPUs, hardware-efficient algorithms
ICLR2024

Independent-Set Design of Experiments for Estimating Treatment and Spillover Effects under Network Interference

Summary pending...

Causal inference, Design of experiments, Interference
ICLR2024

Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

Summary pending...

graph neural networks, Datasets, molecules
ICLR2024

AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents

Summary pending...

Meta-RL, Generalization, Long-Term Memory
ICLR2024

From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction

Summary pending...

atomic property prediction, pre-training, 3D atomic pre-training
ICLR2024

PlaSma: Procedural Knowledge Models for Language-based Planning and Re-Planning

Summary pending...

language-based planning, procedural/script knowledge, distillation
ICLR2024

Are Models Biased on Text without Gender-related Language?

Summary pending...

Large language models, bias evaluation, gender bias
ICLR2024

Counting Graph Substructures with Graph Neural Networks

Summary pending...

graph neural networks, expressive power, representation learning
ICLR2024

Reverse Diffusion Monte Carlo

Summary pending...

posterior Sampling, multi-modal sampling, diffusion process
ICLR2024

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Summary pending...

Sketching, Residual error, Low-rank approximation
ICLR2024

Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Summary pending...

Pure Differential Privacy, Monte Carlo sampling, Gaussian Differential Privacy
ICLR2024

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Summary pending...

Chain of thought, language modeling, circuit complexity
ICLR2024

The Trickle-down Impact of Reward Inconsistency on RLHF

Summary pending...

Large language model, reward model, RLHF