Papers

12094 papers

ICLR2025

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Summary pending...

Realtime EnvironmentsAsynchronous AlgorithmsTime Discretization
ICLR2025

From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question-Answering

Summary pending...

microtheorytextual entailmentknowledge representation
ICLR2025

Language Agents Meet Causality -- Bridging LLMs and Causal World Models

Summary pending...

Large Language ModelsCausalityCausal Representation Learning
ICLR2025

Tracing Representation Progression: Analyzing and Enhancing Layer-Wise Similarity

Summary pending...

Representation SimilaritySaturation EventEarly Exit
ICLR2025

Unlocking Point Processes through Point Set Diffusion

Summary pending...

Generative ModelDiffusion ModelSet Model
ICLR2025

Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness

Summary pending...

Dynamic Sparse TrainingImage Corruption Robustness
ICLR2025

Probing the Latent Hierarchical Structure of Data via Diffusion Models

Summary pending...

data structurehierarchical compositionalitydiffusion models
ICLR2025

SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Summary pending...

Gradient SpikesSpike-Aware AdamLLMs
ICLR2025

OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees

Summary pending...

large language modelsreward modelsopen-domain instruction following
ICLR2025

IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking

Summary pending...

LLMGrammarFormal Languages
ICLR2025

End-to-end Learning of Gaussian Mixture Priors for Diffusion Sampler

Summary pending...

Variational InferenceSamplingDiffusion Models
ICLR2025

FlowDec: A flow-based full-band general audio codec with high perceptual quality

Summary pending...

audioaudio codecgenerative models
ICLR2025

Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study

Summary pending...

transformerattentionstick-breaking
ICLR2025

Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians

Summary pending...

machine learning force fieldsgraph neural networksknowledge distillation
ICLR2025

Model-Agnostic Knowledge Guided Correction for Improved Neural Surrogate Rollout

Summary pending...

deep learningknowledge guided machine learningscientific machine learning
ICLR2025

Meta-Dynamical State Space Models for Integrative Neural Data Analysis

Summary pending...

neural dynamicsstate-space modelmeta learning
ICLR2025

Optimizing Posterior Samples for Bayesian Optimization via Rootfinding

Summary pending...

Bayesian optimizationglobal optimizationacquisition function
ICLR2025

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

Summary pending...

Automated Data AnalysisData Analytics BenchmarkLLM agents
ICLR2025

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

Summary pending...

large language modelsmemorizationdata extraction
ICLR2025

Scalable Extraction of Training Data from Aligned, Production Language Models

Summary pending...

privacylanguage modelsdata extraction