Papers
12094 papers
ICLR2025
What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Large language Models, Multi-turn Code Generation, Chain-of-Thought
ICLR2025
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs
Machine Unlearning, Large Language Models, Low-rank Adaptation
ICLR2025
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions
Clarifying Questions, QA, Ambiguity
ICLR2025
GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation
Idea Evaluation, View-graph, Lightweight model
ICLR2025
Exact Certification of (Graph) Neural Networks Against Label Poisoning
graph neural networks, robustness, certificates
ICLR2025
A Generalist Hanabi Agent
Multi-Agent Reinforcement Learning (MARL), Cooperative game, Multi Agent Text-based game
ICLR2025
Revealing and Mitigating Over-Attention in Knowledge Editing
model editing, mechanistic interpretability, NLP
ICLR2025
GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation
Robot Manipulation; Vision Language Action Model
ICLR2025
Neural Sampling from Boltzmann Densities: Fisher-Rao Curves in the Wasserstein Geometry
Sampling, Boltzmann densities, Fisher-Rao Curves
ICLR2025
Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space
Drug Design, Computational Biology, Molecule Generation
ICLR2025
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective
text-to-image synthesis, semantics, evaluation
ICLR2025
Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization
Reinforcement Learning Theory, Offline Reinforcement Learning, single-policy concentrability
ICLR2025
Protecting against simultaneous data poisoning attacks
backdoors, backdoor defenses, data poisoning
ICLR2025
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
iterative data generation, llm agent, lifelong learning
ICLR2025
LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh
Generalizable human rendering, error feedback, dual representation
ICLR2025
High-Dimensional Bayesian Optimisation with Gaussian Process Prior Variational Autoencoders
Variational autoencoders, Gaussian processes, Bayesian optimisation
ICLR2025
RB-Modulation: Training-Free Stylization using Reference-Based Modulation
Inverse Problems, Generative Modeling, Diffusion Models
ICLR2025
From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy
complexity, causal reasoning, Pearl's Causal Hierarchy
ICLR2025
RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Jailbreak Attack, Large Language Model, KV cache optimization
ICLR2025
Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models
LLM, Instruction-Following, Retrieval Model