Papers

12094 papers

ICLR2025

Value-aligned Behavior Cloning for Offline Reinforcement Learning via Bi-level Optimization

Summary pending...

offline reinforcement learning;bi-level optimization;value alignment
ICLR2025

One Hundred Neural Networks and Brains Watching Videos: Lessons from Alignment

Summary pending...

representational alignmentRepresentational Similarity AnalysisRSA
ICLR2025

Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs

Summary pending...

diffusion modelconditional generationinverse problems
ICLR2025

Bridging the Data Provenance Gap Across Text, Speech, and Video

Summary pending...

training dataauditspeech
ICLR2025

Understanding Factual Recall in Transformers via Associative Memories

Summary pending...

transformersassociative memoriesfactual recall
ICLR2025

The Belief State Transformer

Summary pending...

representation learningtransformersnext-token prediction
ICLR2025

Diffusion Transformers for Tabular Data Time Series Generation

Summary pending...

tabular data generationtime seriesdiffusion models
ICLR2025

Gaussian Differentially Private Human Faces Under a Face Radial Curve Representation

Summary pending...

differential privacyshape analysisfunctional data analysis
ICLR2025

Interpreting Emergent Planning in Model-Free Reinforcement Learning

Summary pending...

reinforcement learninginterpretabilityplanning
ICLR2025

On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback

Summary pending...

manipulationdeceptionalignment
ICLR2025

Non-Equilibrium Dynamics of Hybrid Continuous-Discrete Ground-State Sampling

Summary pending...

Combinatorial optimizationDegenerate ground-state samplingMetropolis-Hastings algorithm
ICLR2025

Utility-Directed Conformal Prediction: A Decision-Aware Framework for Actionable Uncertainty Quantification

Summary pending...

Decision-focused learningdecision makinguncertainty quantification
ICLR2025

ScImage: How good are multimodal large language models at scientific text-to-image generation?

Summary pending...

LLMsmultimodalityscience
ICLR2025

Semantic Temporal Abstraction via Vision-Language Model Guidance for Efficient Reinforcement Learning

Summary pending...

Reinforcement Learning; Vision-Language Models; Temporal Abstraction
ICLR2025

Generating Freeform Endoskeletal Robots

Summary pending...

co-designagent designrobots
ICLR2025

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models

Summary pending...

Mathematical BenchmarkLLM EvaluationOlympic
ICLR2025

Learning Color Equivariant Representations

Summary pending...

Equivariant Neural NetworkGeometric Deep LearningGroup Convolution
ICLR2025

Surprising Effectiveness of pretraining Ternary Language Model at Scale

Summary pending...

Large Language Modelslow-bit language modelsquantization-aware training
ICLR2025

Connectome Mapping: Shape-Memory Network via Interpretation of Contextual Semantic Information

Summary pending...

neural representation
ICLR2025

Restructuring Vector Quantization with the Rotation Trick

Summary pending...

Vector QuantizationVQ-VAE