Papers
12094 papers
ICLR2024
Fast-ELECTRA for Efficient Pre-training
Summary pending...
Language model Pre-trainingELECTRAEfficiency
ICLR2024
Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning
Summary pending...
Causal ReasoningCausal DiscoveryStructural Causal Models
ICLR2024
Toward Student-oriented Teacher Network Training for Knowledge Distillation
Summary pending...
Knowledge distillationTeacher-student trainingEmpirical risk minimization
ICLR2024
An Emulator for Fine-tuning Large Language Models using Small Language Models
Summary pending...
pre-trainingfine-tuningdecouple
ICLR2024
Pre-training LiDAR-based 3D Object Detectors through Colorization
Summary pending...
3D object detectionLiDAR point cloudpre-training
ICLR2024
Efficient Modulation for Vision Networks
Summary pending...
EfficientModEfficient Networks
ICLR2024
Language Models Represent Space and Time
Summary pending...
Interpretabilityworld modelsprobing
ICLR2024
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Summary pending...
convolutionsGPUshardware-efficient algorithms
ICLR2024
Independent-Set Design of Experiments for Estimating Treatment and Spillover Effects under Network Interference
Summary pending...
Causal inferenceDesign of experimentsInterference
ICLR2024
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets
Summary pending...
graph neural networksDatasetsmolecules
ICLR2024
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Summary pending...
Meta-RLGeneralizationLong-Term Memory
ICLR2024
From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction
Summary pending...
atomic property predictionpre-training3D atomic pre-training
ICLR2024
PlaSma: Procedural Knowledge Models for Language-based Planning and Re-Planning
Summary pending...
language-based planningprocedural/script knowledgedistillation
ICLR2024
Are Models Biased on Text without Gender-related Language?
Summary pending...
Large language modelsbias evaluationgender bias
ICLR2024
Counting Graph Substructures with Graph Neural Networks
Summary pending...
graph neural networksexpressive powerrepresentation learning
ICLR2024
Reverse Diffusion Monte Carlo
Summary pending...
posterior Samplingmulti-modal samplingdiffusion process
ICLR2024
Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms
Summary pending...
SketchingResidual errorLow-rank approximation
ICLR2024
Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy
Summary pending...
Pure Differential PrivacyMonte Carlo samplingGaussian Differential Privacy
ICLR2024
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Summary pending...
Chain of thoughtlanguage modelingcircuit complexity
ICLR2024
The Trickle-down Impact of Reward Inconsistency on RLHF
Summary pending...
Large language modelreward modelRLHF