Papers

12094 papers

ICLR2025

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

Summary pending...

Large language ModelsMulti-turn Code GenerationChain-of-Thought
ICLR2025

Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs

Summary pending...

Machine UnlearningLarge Language ModelsLow-rank Adaptation
ICLR2025

Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions

Summary pending...

Clarifying QuestionsQAAmbiguity
ICLR2025

GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation

Summary pending...

Idea EvaluationView-graphLightweight model
ICLR2025

Exact Certification of (Graph) Neural Networks Against Label Poisoning

Summary pending...

graph neural networksrobustnesscertificates
ICLR2025

A Generalist Hanabi Agent

Summary pending...

Multi-Agent Reinforcement Learning (MARL)Cooperative gameMulti Agent Text-based game
ICLR2025

Revealing and Mitigating Over-Attention in Knowledge Editing

Summary pending...

model editingmechanistic interpretabilityNLP
ICLR2025

GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation

Summary pending...

Robot Manipulation; Vision Language Action Model
ICLR2025

Neural Sampling from Boltzmann Densities: Fisher-Rao Curves in the Wasserstein Geometry

Summary pending...

SamplingBoltzmann densitiesFisher-Rao Curves
ICLR2025

Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space

Summary pending...

Drug DesignComputational BiologyMolecule Generation
ICLR2025

Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective

Summary pending...

text-to-image synthesissemanticsevaluation
ICLR2025

Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization

Summary pending...

Reinforcement Learning TheoryOffline Reinforcement Learningsingle-policy concentrability
ICLR2025

Protecting against simultaneous data poisoning attacks

Summary pending...

backdoorsbackdoor defensesdata poisoning
ICLR2025

DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback

Summary pending...

iterative data generationllm agentlifelong learning
ICLR2025

LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh

Summary pending...

Generalizable human renderingerror feedbackdual representation
ICLR2025

High-Dimensional Bayesian Optimisation with Gaussian Process Prior Variational Autoencoders

Summary pending...

Variational autoencodersGaussian processesBayesian optimisation
ICLR2025

RB-Modulation: Training-Free Stylization using Reference-Based Modulation

Summary pending...

Inverse ProblemsGenerative ModelingDiffusion Models
ICLR2025

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Summary pending...

complexitycausal reasoningPearl's Causal Hierarchy
ICLR2025

RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction

Summary pending...

Jailbreak AttackLarge Language ModelKV cache optimization
ICLR2025

Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models

Summary pending...

LLMInstruction-FollowingRetrieval Model