Papers

12094 papers

ICLR2025

Constraint-Conditioned Actor-Critic for Offline Safe Reinforcement Learning

Summary pending...

Offline Safe Reinforcement LearningConstraint-conditioned Actor-CriticData Generation
ICLR2025

Conformal Language Model Reasoning with Coherent Factuality

Summary pending...

language modelsreasoningconformal prediction
ICLR2025

NeSyC: A Neuro-symbolic Continual Learner For Complex Embodied Tasks In Open Domains

Summary pending...

Embodied AINeuro-symbolic AI
ICLR2025

ACC-Collab: An Actor-Critic Approach to Multi-Agent LLM Collaboration

Summary pending...

Multi-Agent CollaborationLLM AgentsPreference Optimization
ICLR2025

Dissecting Adversarial Robustness of Multimodal LM Agents

Summary pending...

LM agentsmultimodal agentssafety
ICLR2025

SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators

Summary pending...

Model CompressionLarge Language ModelsPost-Training Quantization
ICLR2025

CL-MFAP: A Contrastive Learning-Based Multimodal Foundation Model for Molecular Property Prediction and Antibiotic Screening

Summary pending...

Contrastive LearningMultimodal Foundation ModelAntibiotic Property Prediction
ICLR2025

Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control

Summary pending...

mechanistic interpretabilitysparse autoencodersevaluations
ICLR2025

Commit0: Library Generation from Scratch

Summary pending...

code generationlanguage modelevaluation
ICLR2025

VVC-Gym: A Fixed-Wing UAV Reinforcement Learning Environment for Multi-Goal Long-Horizon Problems

Summary pending...

Reinforcement Learning EnvironmentDemonstrationsGoal-Conditioned Reinforcement Learning
ICLR2025

MrT5: Dynamic Token Merging for Efficient Byte-level Language Models

Summary pending...

NLPByT5T5
ICLR2025

Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

Summary pending...

Knowledge GraphsLarge Language ModelsRetrieval-Augmented Generation
ICLR2025

RazorAttention: Efficient KV Cache Compression Through Retrieval Heads

Summary pending...

LLMsKV cache compressionLLM inference acceleration
ICLR2025

Efficient Top-m Data Values Identification for Data Selection

Summary pending...

data selectiondata valuationtop-m arms identification
ICLR2025

Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment

Summary pending...

Active online assortment optimizationPreference feedbackSubsetwise utility maximization
ICLR2025

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Summary pending...

AlignmentDecodingRLHF
ICLR2025

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

Summary pending...

language modelmoral dilemmamodel alignment
ICLR2025

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

Summary pending...

Retrieval-Augmented GenerationSecurityPrivacy
ICLR2025

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Summary pending...

Language modelsNatural language processingSoftware engineering
ICLR2025

PEARL: Towards Permutation-Resilient LLMs

Summary pending...

In-Context LearningLarge Language ModelsInstruction Tuning