Papers

12094 papers

ICLR2025

Feedback Favors the Generalization of Neural ODEs

Summary pending...

Neural ODEsfeedbackgeneralization
ICLR2025

Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment

Summary pending...

LLMContinual Pre-trainingknowledge distillation
ICLR2025

An Effective Manifold-based Optimization Method for Distributionally Robust Classification

Summary pending...

robustnessoptimizationrepresentation learning
ICLR2025

Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning

Summary pending...

Imitation LearningPolicy DiversityOffline Learning
ICLR2025

Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer

Summary pending...

LLM; deep learning; zeroth order optimizer
ICLR2025

Visually Consistent Hierarchical Image Classification

Summary pending...

Hierarchical classificationvisual grounding
ICLR2025

Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation

Summary pending...

mean field gamelinear function approximationstochastic semi-gradient descent
ICLR2025

Learning vector fields of differential equations on manifolds with geometrically constrained operator-valued kernels

Summary pending...

Dynamics on manifoldsOperator-valued kernelGeometry-preserving time integration
ICLR2025

System 1.x: Learning to Balance Fast and Slow Planning with Language Models

Summary pending...

Large Language ModelsPlanning
ICLR2025

Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation

Summary pending...

image compressionvqgangenerative compression model
ICLR2025

MuseGNN: Forming Scalable, Convergent GNN Layers that Minimize a Sampling-Based Energy

Summary pending...

Graph Neural NetworksEnergy-based ModelsScalable Training
ICLR2025

Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning

Summary pending...

Multi-Objective Reinforcement LearningReinforcement LearningDimension Reduction
ICLR2025

On Large Language Model Continual Unlearning

Summary pending...

Continual UnlearningLarge Language Models
ICLR2025

An Empirical Analysis of Uncertainty in Large Language Model Evaluations

Summary pending...

Large Language ModelModel-based LLM EvaluationLLM-as-a-Judge
ICLR2025

Graph Neural Preconditioners for Iterative Solutions of Sparse Linear Systems

Summary pending...

General-purpose preconditionerlinear systemsgraph neural networks
ICLR2025

The "Law'' of the Unconscious Contrastive Learner: Probabilistic Alignment of Unpaired Modalities

Summary pending...

theorycontrastive learningprobabilistic graphical models
ICLR2025

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Summary pending...

AI SafetyFine-tuning AttacksOpen-weight LLMs
ICLR2025

Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models

Summary pending...

LLM agentslanguage agentstheory of mind
ICLR2025

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Summary pending...

personalizationbenchmarkLarge language models
ICLR2025

h4rm3l: A Language for Composable Jailbreak Attack Synthesis

Summary pending...

LLM safetyprogram synthesiscompositional modeling